2021-code4lib-org-1940 ---- code4lib 2021
Code4Lib 2021, March 22 - 26 • Online. The conference for people who code for libraries. An annual gathering of technologists from around the world, who largely work for and with libraries, archives, and museums and have a commitment to open technologies.
Conference Recordings: View the full recordings of the conference from the livestream on YouTube. Presentation Slides: View the Open Science Framework repository with slides and materials from the presentations. What Comes Next? Code4Lib 2022. Attendees, fill out the post-conference survey! See you next year! Thanks to our sponsors!
Welcome to Code4Lib 2021
code4lib is everything to me. In the community I feel like my work and knowledge is appreciated, so I feel very comfortable and motivated to volunteer, give talks, teach workshops, participate in conferences, host events. It's a great support network, I've never felt as comfortable as I do in this library group! Kim Pham, University of Denver
The confluence of technology and libraries drew me to Code4Lib when I was a young librarian straddling the areas of library metadata and technology. After eleven years in the community I am still amazed and humbled by the people I meet in the community and the work they do. There isn't another space that seamlessly combines libraries, technology, and the human aspect quite like Code4Lib in the library world. Becky Yoose
I came away from Code4lib wanting to invite most of the people I met into my office and ask all of the questions about what everyone is doing and how they're doing it and how can I do those things and what would they change about their tools; what's better is many of them would gladly help. 7 years on I keep coming back for more because over the years technical excellence isn't the only metric used in this community's continued growth. I have made friends in the Code4lib community. Francis Kayiwa, Princeton University Libraries
Code4Lib offers the space to be self-aware, outwardly conscious, and vastly creative. The community seems to be expanding and learning without ego, and I feel lucky to have been welcomed into the group with open arms. The conference is a place where one can look holistically at technology alongside thoughtful practitioners, while building lasting friendships. Alyssa Loera, Cal Poly Pomona
Code4Lib has been transformative for me. When I first learned of Code4Lib, I was considering leaving libraryland. Attending the first Code4Lib conference opened my eyes to the community I never knew I had.
Code4Lib continues to humble, to inspire, and to anchor; our collective work is grounded in the cultural heritage mission and in the value of working inclusively in the open for the collective good. Here's to another twelve years, Code4Lib---and then some! Michael Giarlo, Stanford University
I attended Code4Lib 2018 on a diversity scholarship and I will always be grateful for that opportunity. It was free of buzzwords, full of welcoming people, and the sessions were interesting and accessible even though I don't work closely with technology or coding. I'm more motivated to explore new areas of technology and librarianship, I've started volunteering with the web committee, and I'm looking forward to attending the conference again! Laura Chuang
Attending my first Code4Lib allowed me to explore the potential of technology, while upholding the importance of empathy and community-building. The connections I made at Code4Lib have continued to deepen over the last year, and it has been fantastic to see how we have implemented ideas that were shaped by conversations there. Code4Lib has modeled accountability and care, including publicly standing up against harassment and organizing support for our community. Nicky Andrews, North Carolina State University
Code4Lib has been a great conference for me as a metadata person interested in gaining computer science skills and insights. Presentations and topics are selected by the community. As such, I find a greater portion of presentations at this conference to be interesting, relevant, and educational than at other conferences in which presentations are jury selected. They also offer generous scholarships to underrepresented folks in the Code4Lib community. Yay! Sonoe Nakasone, North Carolina State University Libraries
At Code4Lib, you really get the sense that people are there to share with and learn from one another — to advance their work individually and collectively — and have fun while they're at it. I left the conference reminded of the widespread passion for libraries as critical features of our society, the passion that draws interesting, creative people to library work, and found I had a renewed sense of purpose in my job. Hannah Frost, Stanford University
Except where otherwise noted, content on this site is licensed under a Creative Commons Attribution 4.0 International license.
accesa-org-4815 ---- Home - ACCESA
ACCESA, Centro Ciudadano de Estudios para una Sociedad Abierta (Citizen Center for Studies for an Open Society). Contact: info@accesa.org. At ACCESA we seek to improve the relationship between the State and society, transforming State structures into more open and transparent ones that can meet citizens' demands, and promoting society's active involvement in solving its own problems.
Transparency: We promote all practices that facilitate citizen oversight and clear accountability regarding decisions, actions, and matters of public interest.
Access to Information: We work to guarantee the human right to access all information of public interest in a broad, free, and equal manner, without any discrimination.
Citizen Participation: We promote the right and duty of the population to take part in processes of collective deliberation and debate in order to influence decision-making on matters of public interest.
Projects: We work to improve the quality of democracy in Costa Rica. Learn about our projects.
Revista Sinergias: Sinergias is our editorial project for outreach, reflection, and analysis on government openness, citizen participation, civic technology, …
Support for the co-creation of the 2019-2022 Open State Action Plan: As part of its obligations as a member of the Alianza para un Gobierno Abierto (Open Government Partnership) …
Collective construction of a Citizen Participation Policy and Regulation for the canton of Osa from an Open Government perspective: During 2019 ACCESA, with the support and funding of the Fundación Trust for the …
The open society is a humanist, inclusive, diverse, plural society oriented toward the common good, one that seeks the realization of the freedom and rights of all people.
Blog: Mapping rural development with open data (24 March 2021): For the past three years our organization has sought to take advantage of the international celebration of Open Data … The co-creation process for the Open State Plan: our reflections (26 January 2021): We consider this to have been the most transparent, participatory, and rigorous co-creation process that … Citizen participation and Open Government: antidotes to the crisis of democracy? (22 December 2020): Actions aimed at putting Open Government policies into practice remain incipient … Our assessment of the Multisectoral Dialogue (15 December 2020): We support collaborative and deliberative initiatives, but we identified errors and shortcomings in this process of …
acrl-ala-org-5792 ---- ACRL TechConnect
Broken Links in the Discovery Layer—Pt. II: Towards an Ethnography of Broken Links: This post continues where my last one left off, investigating broken links in our discovery layer. Be forewarned—most of it will be a long, dry list of all the mundane horrors of librarianship. Metadata mismatches, EZproxy errors, and OpenURL resolvers, oh my! What does it mean when we say a link is broken? The simplest … Continue reading "Broken Links in the Discovery Layer—Pt. II: Towards an Ethnography of Broken Links"
Broken Links in the Discovery Layer—Pt. I: Researching a Problem: Like many administrators of discovery layers, I'm constantly baffled and frustrated when users can't access full text results from their searches. After implementing Summon, we heard a few reports of problems and gradually our librarians started to stumble across them on their own. At first, we had no formal system for tracking these errors. Eventually, … Continue reading "Broken Links in the Discovery Layer—Pt. I: Researching a Problem"
ORCID for System Interoperability in Scholarly Communication Workflows: What is ORCID? If you work in an academic library or otherwise provide support for research and scholarly communication, you have probably heard of ORCID (Open Researcher and Contributor ID) in terms of "ORCID iD," a unique 16-digit identifier that represents an individual in order to mitigate name ambiguity.
The ORCID iD number is presented … Continue reading "ORCID for System Interoperability in Scholarly Communication Workflows" Creating Presentations with Beautiful.AI Updated 2018-11-12 at 3:30PM with accessibility information. Beautiful.AI is a new website that enables users to create dynamic presentations quickly and easily with “smart templates” and other design optimized features. So far the service is free with a paid pro tier coming soon. I first heard about Beautiful.AI in an advertisement on NPR and was … Continue reading "Creating Presentations with Beautiful.AI" National Forum on Web Privacy and Web Analytics We had the fantastic experience of participating in the National Forum on Web Privacy and Web Analytics in Bozeman, Montana last month. This event brought together around forty people from different areas and types of libraries to do in-depth discussion and planning about privacy issues in libraries. Our hosts from Montana State University, Scott Young, … Continue reading "National Forum on Web Privacy and Web Analytics" The Ex Libris Knowledge Center and Orangewashing Two days after ProQuest completed their acquisition of Ex Libris in December 2015, Ex Libris announced the launch of their new online Customer Knowledge Center. In the press release for the Knowledge Center, the company describes it as “a single gateway to all Ex Libris knowledge resources,” including training materials, release notes, and product manuals. … Continue reading "The Ex Libris Knowledge Center and Orangewashing" Managing ILS Updates We’ve done a few screencasts in the past here at TechConnect and I wanted to make a new one to cover a topic that’s come up this summer: managing ILS updates. Integrated Library Systems are huge, unwieldy pieces of software and it can be difficult to track what changes with each update: new settings are … Continue reading "Managing ILS Updates" Blockchain: Merits, Issues, and Suggestions for Compelling Use Cases Blockchain holds a great potential for both innovation and disruption. The adoption of blockchain also poses certain risks, and those risks will need to be addressed and mitigated before blockchain becomes mainstream. A lot of people have heard of blockchain at this point. But many are unfamiliar with how this new technology exactly works and … Continue reading "Blockchain: Merits, Issues, and Suggestions for Compelling Use Cases" Introducing Our New Best Friend, GDPR You’ve seen the letters GDPR in every single email you’ve gotten from a vendor or a mailing list lately, but you might not be exactly sure what it is. With GDPR enforcement starting on May 25, it’s time for a crash course in what GDPR is, and why it could be your new best friend … Continue reading "Introducing Our New Best Friend, GDPR" Names are Hard A while ago I stumbled onto the post “Falsehoods Programmers Believe About Names” and was stunned. Personal names are one of the most deceptively difficult forms of data to work with and this article touched on so many common but unaddressed problems. Assumptions like “people have exactly one canonical name” and “My system will never … Continue reading "Names are Hard" activitypub-rocks-4532 ---- ActivityPub Rocks! Don't you miss the days when the web really was the world's greatest decentralized network? Before everything got locked down into a handful of walled gardens? So do we. Enter ActivityPub! ActivityPub is a decentralized social networking protocol based on the ActivityStreams 2.0 data format. 
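To make the ActivityStreams 2.0 format concrete, here is a minimal sketch in Python of a "Create" activity wrapping a "Note", delivered to an actor's outbox the way the client-to-server API described below is intended to work. The server domain, actor IRI, outbox URL, and bearer token are hypothetical placeholders, and real servers vary in how they handle authentication.

```python
"""Minimal sketch of an ActivityStreams 2.0 activity and a client-to-server
delivery. The server at social.example, the actor IRI, and the bearer token
are hypothetical; this is not a complete ActivityPub client."""
import json
import requests  # third-party: pip install requests

# A "Create" activity wrapping a "Note", expressed in the ActivityStreams 2.0
# vocabulary that ActivityPub builds on.
activity = {
    "@context": "https://www.w3.org/ns/activitystreams",
    "type": "Create",
    "actor": "https://social.example/users/alice",           # hypothetical actor IRI
    "to": ["https://www.w3.org/ns/activitystreams#Public"],
    "object": {
        "type": "Note",
        "content": "Hello, federated world!",
    },
}

# In the client-to-server protocol, the client POSTs the activity to the
# actor's outbox; the server then federates it to followers' inboxes via the
# server-to-server protocol. Authentication is left to each implementation.
response = requests.post(
    "https://social.example/users/alice/outbox",              # hypothetical outbox URL
    data=json.dumps(activity),
    headers={
        "Content-Type": 'application/ld+json; profile="https://www.w3.org/ns/activitystreams"',
        "Authorization": "Bearer YOUR_TOKEN_HERE",             # placeholder credential
    },
)
print(response.status_code, response.headers.get("Location"))
```

If the sketch matches the spec's expectations, a conforming server would answer with 201 Created and put the new activity's id in the Location header, which is why the last line prints it.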
ActivityPub is an official W3C recommended standard published by the W3C Social Web Working Group. It provides a client to server API for creating, updating and deleting content, as well as a federated server to server API for delivering notifications and subscribing to content. Sounds exciting? Dive in! ==> Latest published version <== ==> Latest editor's draft <== Or, are you a user looking for ActivityPub software to use? Check out this guide for ActivityPub users (community edited)!
~= Hey, Implementers! =~ We're so stoked to have you implementing ActivityPub! To make sure ActivityPub implementations work together, we have:
Guide for new ActivityPub implementers -- Community edited and unofficial, but useful!
A test suite -- Make sure your application works right according to the ActivityPub standard.
Implementation reports -- See the implementation coverage of applications which implemented ActivityPub during the standardization process.
Looking to discuss implementing ActivityPub? You can join the #social IRC channel on irc.w3.org! See also SocialHub, a community-run forum to discuss ActivityPub developments and ideas, and the Social CG, a W3C Community Group to continue the work of advancing the federated social web... including ActivityPub!
-=* ActivityPub News *=-
Some (long overdue) site updates (Mon 04 January 2021)
Let us meet on SocialHub! (Thu 26 December 2019)
ActivityPub reaches W3C Recommendation status! Everybody party! (Tue 20 March 2018)
ActivityPub reaches Proposed Recommendation status! (Fri 08 December 2017)
Test suite up, implementation reports page up... let's get more reports in! (Mon 06 November 2017)
Mastodon launches their ActivityPub support, and a new CR! (Sun 10 September 2017)
New tutorial, new logo! (Tue 09 May 2017)
Help submit implementation reports! (Sun 09 April 2017)
ActivityPub reaches Candidate Recommendation status! (Thu 17 November 2016)
activitypub.rocks launches! (Mon 14 November 2016)
Site contents dual licensed under Creative Commons Attribution-Sharealike 4.0 International and the GNU GPL, version 3 or any later version. ActivityPub logo by mray, released into public domain under CC0 1.0.
acrl-ala-org-4948 ---- ACRL TechConnect
Broken Links in the Discovery Layer—Pt. II: Towards an Ethnography of Broken Links
This post continues where my last one left off, investigating broken links in our discovery layer. Be forewarned—most of it will be a long, dry list of all the mundane horrors of librarianship. Metadata mismatches, EZproxy errors, and OpenURL resolvers, oh my! What does it mean when we say a link is broken? The simplest definition would be: when a link that claims to lead to full text does not. But the way that many discovery layers work is by translating article metadata into a query in a separate database, which leads to some gray areas. What if the link leads to a search with only a single result, the resource in question? What if the link leads to a search with two results, a dozen, a hundred…and the resource is among them? What if the link leads to a journal index and it takes some navigation to get to the article's full text? Where do we draw the line? The user's expectation is that selecting something that says "full text" leads to the source itself. I think all of the above count as broken links, though they obviously range in severity. Some mean that the article simply cannot be accessed while others mean that the user has to perform a little more work.
For the purposes of this study, I am primarily concerned with the first case: when the full text is nowhere near the link’s destination. As we discuss individual cases reported by end users, it will solidify our definition. Long List I’m going to enumerate some types of errors I’ve seen, providing a specific example and detailing its nature as much as possible to differentiate the errors from each other. 1. The user selects a full text link but is taken to a database query that doesn’t yield the desired result. We had someone report this with an article entitled “LAND USE: U.S. Soil Erosion Rates–Myth and Reality” in Summon which was translated into a query on the article’s ISSN, publication title, and an accidentally truncated title (just “LAND USE”).1 The query fails to retrieve the article but does show 137 other results. The article is present in the database and can be retrieved by editing the query, for instance by changing the title parameter to “U.S. soil erosion rates”. Indeed, the database has the title as “U.S. soil erosion rates–myth and reality”. The article appears to be part of a recurring column and is labelled “POLICY FORUM: LAND USE” which explains the discovery layer’s representation of the title. Fundamentally, the problem is a disagreement about the title between the discovery layer and database. As another example, I’ve seen this problem occur with book reviews where one side prefixes the title with “Review:” while the other does not. In a third instance of this, I’ve seen a query title = "Julia Brannen Peter Moss "and" Ann Mooney Working "and" Caring over the Twentieth Century Palgrave Macmillan Basingstoke Hampshire 2004 234 pp hbk £50 ISBN 1 4039 2059 1" where a lot of ancillary text spilled into the title. 2. The user is looking for a specific piece except the destination database combines this piece with similar ones into a single record with a generic title such that incoming queries fail. So, for instance, our discovery layer’s link might become a title query for Book Review: Bad Feminist by Roxane Gay in the destination, which only has an article named “Book Reviews” in the same issue of the host publication. In my experience, this is one of the more common discovery layer problems and can be described as a granularity mismatch. The discovery layer and subscription database disagree about what the fundamental unit of the publication is. While book reviews often evince this problem, so too do letters to the editor, opinion pieces, and recurring columns. 3. An article present in one of our subscription databases is not represented in the discovery layer, despite the database being correctly selected in the knowledgebase that informs the discovery system’s index. We’re able to read the article “Kopfkino: Julia Phillips’ sculptures beyond the binary” in an EBSCO database that provides access to the journal Flash Art International but no query in Summon can retrieve it as a result. I suppose this is not technically a broken link as a non-existent link but it falls under the general umbrella of discovery layer content problems. 4. The exact inverse of the above: an article is correctly represented by the discovery layer index as being part of a database subscription that the user should have access to, but the article does not actually exist within the source database due to missing content. This occurred with an interview of Howard Willard in American Artist from 1950. 
While our subscription to Art & Architecture Source does indeed include the issue of American Artist in question, and one can read other articles from it, there was no record for the interview itself in EBSCOHost nor are its pages present in any of the PDF scans of the issue. 5. The user is looking for an article that is combined with another, even though the source seems to agree that they should be treated separately. For instance, one of our users was looking for the article "Musical Curiosities in Athanasius Kircher's Antiquarian Visions" in the journal Music in Art but Summon's link lands on a broken link resolver page in the destination EBSCO database. It turns out, upon closer inspection, that the pages for this article are appended to the PDF of the article that appears before it. All other articles for the issue have their own record. This is an interesting hybrid metadata/content problem similar to granularity mismatch: while there is no record for the article itself in the database, the article's text is present. Yet unlike some granularity mismatches it is impossible to circumvent via search; you have to know to browse the issue and utilize page numbers to locate it. 6. The user selects a link to an article published within the past year in a journal with a year-long embargo. The discovery layer shows a "full text online" link but because the source's link resolver doesn't consider an embargoed article to be a valid destination, the link lands on an error page. This is an instance where Summon would, ideally, at least take you to the article's citation page but in any case the user won't be able to retrieve the full text. 7. The user selects an article that is in a journal not contained within any of the library's database subscriptions. This is usually a simple knowledge base error where the journal lists for a database changed without being updated in the discovery layer index. Still, it's quite common because not all subscription changes are published in a machine-readable manner that would allow discovery layers to automate their ingestion. 8. The user selects an article listed as being published in 2016 in the discovery layer, while the source database has 2017 so the OpenURL fails to resolve properly. Upon investigation, this date mismatch can be traced back to the journal's publisher which lists the individual articles as being published in 2016 while the issue in which they are contained comes from 2017. The Summon support staff rightly points out to me that they can't simply change the article dates to match one source; while it might fix some links, it will break others, and this date mismatch is a fundamentally unsolvable disagreement. This issue highlights the brittleness of real world metadata; publishers, content aggregators, and discovery products do not live in harmony.
Reviewing the list of problems, this dual organization seems to helpfully group like issues:
Metadata & linking problems
- Metadata mismatch (1, 5, 8)
- Granularity mismatch (2)
- Link resolver error (6)
Index problems
- Article not in database/journal/index (3, 4, 5, 6)
- Journal not in database (7)
Of these three, the first category accounts for the vast majority of problems according to my anecdata. It's notable that issues overlap and their classification is inexact. When a link to an embargoed article fails, should we say that is due to the article being "missing" or a link resolver issue? Whatever the case, it is often clear when a link is broken even if we could argue endlessly about how exactly.
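Most of the metadata and linking failures cataloged above come down to the discovery layer serializing its own copy of the metadata into a link (often an OpenURL) that the destination then re-parses into a query. The sketch below, in Python, is a toy illustration of that handoff; it assumes nothing about how Summon or EBSCO actually build or resolve their links. The key names echo OpenURL 1.0 KEV fields, the resolver logic is a stand-in, and the records are invented to mirror the truncated-title example in item 1.

```python
"""Toy illustration of how a discovery layer might build an OpenURL-style link
from its metadata and how the target database's lookup can then miss.
The field names mirror OpenURL 1.0 KEV keys; the resolver and records are
invented for illustration, not how any real product works."""
from urllib.parse import urlencode

# Metadata as the discovery layer has it (title accidentally truncated at the
# colon, as in example 1 above).
discovery_record = {"atitle": "LAND USE", "jtitle": "Science", "issn": "0036-8075"}

# Metadata as the destination database has it.
database_records = [
    {"atitle": "U.S. soil erosion rates--myth and reality",
     "jtitle": "Science", "issn": "0036-8075"},
]

def build_openurl(base, md):
    """Serialize article metadata into an OpenURL-like query string."""
    params = {"url_ver": "Z39.88-2004", "rft_val_fmt": "info:ofi/fmt:kev:mtx:journal"}
    params.update({f"rft.{key}": value for key, value in md.items()})
    return base + "?" + urlencode(params)

def resolve(md, records):
    """Toy link resolver: require a matching ISSN and a title that contains
    the incoming title string. Real resolvers are fancier, but the failure
    mode is the same: the two sides disagree about the title."""
    return [r for r in records
            if r["issn"] == md["issn"]
            and md["atitle"].lower() in r["atitle"].lower()]

link = build_openurl("https://resolver.example/openurl", discovery_record)  # hypothetical resolver base URL
print(link)                                          # what the user clicks
print(resolve(discovery_record, database_records))   # [] -> the dreaded "no results" page
```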
There are also a host of problems that we, as librarians, cause. We might misconfigure EZproxy for a database or fail to keep our knowledge base holdings up to date. The difference with these problems is that they tend to happen once and then be resolved forever; I fix the EZproxy stanza, I remove access to the database we unsubscribed from. So the proportion of errors we account for is vanishingly low, while these other errors are eternal. No matter how many granularity mismatches or missing articles I point out, there are always millions more waiting to cause problems for our users.
Notes
This sort of incredibly poor handling of punctuation in queries is sadly quite common. Even though, in this instance, the source database and discovery layer are made by the same company the link between them still isn't prepared to handle a colon in a text string. Consider how many academic articles have colons in their title. This is not good. ↩
Author: Eric Phetteplace. Posted on July 11, 2019. Categories: discovery, metadata.
Broken Links in the Discovery Layer—Pt. I: Researching a Problem
Like many administrators of discovery layers, I'm constantly baffled and frustrated when users can't access full text results from their searches. After implementing Summon, we heard a few reports of problems and gradually our librarians started to stumble across them on their own. At first, we had no formal system for tracking these errors. Eventually, I added a script which inserted a "report broken link" form into our discovery layer's search results. 1 I hoped that collecting reported problems and then reporting them would identify certain systemic issues that could be resolved, ultimately leading to fewer problems. Pointing out patterns in these errors to vendors should lead to actual progress in terms of user experience. From the broken links form, I began to cull some data on the problem. I can tell you, for instance, which destination databases experience the most problems or what the character of the most common problems is. The issue is the sample bias—are the problems that are reported really the most common? Or are they just the ones that our most diligent researchers (mostly our librarians, graduate students, and faculty) are likely to report? I long for quantifiable evidence of the issue without this bias.
How I classify the broken links that have been reported via our form. N = 57
Select Searches & Search Results
So how would one go about objectively studying broken links in a discovery layer? The first issue to solve is what searches and search results to review. Luckily, we have data on this—we can view in our analytics what the most popular searches are. But a problem becomes apparent when one goes to review those search terms: artstor, hours, jstor, kanopy. Of course, the most commonly occurring searches tend to be single words. These searches all trigger "best bet" or database suggestions that send users directly to other resources. If their result lists do contain broken links, those links are unlikely to ever be visited, making them a poor choice for our study. If I go a little further into the set of most common searches, I see single-word subject searches for "drawing" followed by some proper nouns ("suzanne lacy", "chicago manual of style").
These are better since it’s more likely users actually select items from their results but still aren’t a great representation of all the types of searches that occur. Why are these types of single-word searches not the best test cases? Because search phrases necessarily have a long tail distribution; the most popular searches aren’t that popular in the context of the total quantity of searches performed 2. There are many distinct search queries that were only ever executed once. Our most popular search of “artstor”? It was executed 122 times over the past two years. Yet we’ve had somewhere near 25,000 searches in the past six months alone. This supposedly popular phrase has a negligible share of that total. Meanwhile, just because a search for “How to Hack it as a Working Parent. Jaclyn Bedoya, Margaret Heller, Christina Salazar, and May Yan. Code4Lib (2015) iss. 28″ has only been run once doesn’t mean it doesn’t represent a type of search—exact citation search—that is fairly common and worth examining, since broken links during known item searches are more likely to be frustrating. Even our 500 most popular searches evince a long tail distribution. So let’s say we resolve the problem of which searches to choose by creating a taxonomy of search types, from single-word subjects to copy-pasted citations. 3 We can select a few real world samples of each type to use in our study. Yet we still haven’t decided which search results we’re going to examine! Luckily, this proves much easier to resolve. People don’t look very far down in the search results 4, rarely scrolling past the first “page” listed (Summon has an infinite scroll so there technically are no pages, but you get the idea). Only items within the first ten results are likely to be selected. Once we have our searches and know that we want to examine only the first ten or so results, my next thought is that it might be worth filtering our results that are unlikely to have problems. But does skipping the records from our catalog, institutional repository, LibGuides, etc. make other problems abnormally more apparent? After all, these sorts of results are likely to work since we’re providing direct links to the Summon link. Also, our users do not heavily employ facets—they would be unlikely to filter out results from the library catalog. 5 In a way, by focusing a study on search results that are the most likely to fail and thus give us information about underlying linking issues, we’re diverging away from the typical search experience. In the end, I think it’s worthwhile to stay true to more realistic search patterns and not apply, for instance, a “Full Text Online” filter which would exclude our library catalog. Next Time on Tech Connect—oh how many ways can things go wrong?!? I’ll start investigating broken links and attempt to enumerate their differing natures. Notes This script was largely copied from Robert Hoyt of Fairfield University, so all credit due to him. ↩ For instance, see: Beitzel, S. M., Jensen, E. C., Chowdhury, A., Frieder, O., & Grossman, D. (2007). Temporal analysis of a very large topically categorized web query log. Journal of the American Society for Information Science and Technology, 58(2), 166–178. “… it is clear that the vast majority of queries in an hour appear only one to five times and that these rare queries consistently account for large portions of the total query volume” ↩ Ignore, for the moment, that this taxonomy’s constitution is an entire field of study to itself. 
↩ Pan, B., Hembrooke, H., Joachims, T., Lorigo, L., Gay, G., & Granka, L. (2007). In Google we trust: Users' decisions on rank, position, and relevance. Journal of Computer-Mediated Communication, 12(3), 801–823. ↩ In fact, the most common facet used in our discovery layer is "library catalog" showing that users often want only bibliographic records; the precise opposite of a search aimed at only retrieving article database results. ↩
Author: Eric Phetteplace. Posted on March 11, 2019. Categories: data, discovery.
ORCID for System Interoperability in Scholarly Communication Workflows
What is ORCID? If you work in an academic library or otherwise provide support for research and scholarly communication, you have probably heard of ORCID (Open Researcher and Contributor ID) in terms of "ORCID iD," a unique 16-digit identifier that represents an individual in order to mitigate name ambiguity. The ORCID iD number is presented as a URI (Uniform Resource Identifier) that serves as the link to a corresponding ORCID record, where disambiguating data about an individual is stored. For example, https://orcid.org/0000-0002-9079-593X is the ORCID iD for the late Stephen Hawking, and clicking on this link will take you to Hawking's ORCID record. Data within ORCID records can include things like name(s) and other identifiers, biographical information, organizational affiliations, and works.
Figure 1: This screenshot shows the types of data that can be contained in an ORCID record.
Anyone can register for an ORCID iD for free, and individuals have full control over what data appears in their record, the visibility of that data, and whether other individuals or organizations are authorized to add data to their ORCID record on their behalf. Individuals can populate information in their ORCID record themselves, or they can grant permission to organizations, like research institutions, publishers, and funding agencies, to connect with their ORCID record as trusted parties, establishing an official affiliation between the individual and the organization. For example, Figures 2 and 3 illustrate an authenticated ORCID connection between an individual author and the University of Virginia (UVA) as represented in LibraOpen, the UVA Library's Samvera institutional repository.
Figure 2: The University of Virginia Library's LibraOpen Institutional Repository is configured to make authenticated connections with authors' ORCID records, linking the author to their contributions and to the institution. Once an author authenticates/connects their ORCID iD in the system, ORCID iD URIs are displayed next to the authors' names. Image source: doi.org/10.18130/V3FB8T
Figure 3: By clicking on the author's ORCID iD URI in LibraOpen, we can see the work listed on the individual's ORCID record, with "University of Virginia" as the source of the data, which means that the author gave permission for UVA to write to their ORCID record. This saves time for the author, ensures integrity of metadata, and contributes trustworthy data back to the scholarly communication ecosystem that can then be used by other systems connected with ORCID. Image courtesy of Sherry Lake, UVA https://orcid.org/0000-0002-5660-2970
ORCID Ecosystem & Interoperability
These authenticated connections are made possible by configuring software systems to communicate with the ORCID registry through the ORCID API, which is based on OAuth 2.0.
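As a small illustration of the read side of that exchange, the sketch below uses Python and the requests library to pull a public ORCID record through ORCID's public API. It assumes the v3.0 REST endpoint and JSON field names as I understand them, so treat the exact paths and keys as approximations; the member API adds OAuth 2.0 access tokens and write scopes on top of the same record structure.

```python
"""Minimal sketch: read a public ORCID record via the public API, using the
ORCID iD from the Stephen Hawking example above. Endpoint and field names
assume the v3.0 record schema and may need adjusting; the member API
(read-limited/write) additionally requires OAuth 2.0 tokens."""
import requests  # pip install requests

ORCID_ID = "0000-0002-9079-593X"
url = f"https://pub.orcid.org/v3.0/{ORCID_ID}/record"

resp = requests.get(url, headers={"Accept": "application/json"})
resp.raise_for_status()
record = resp.json()

# Pull a couple of disambiguating fields out of the nested record, guarding
# against sections an individual may have chosen not to make public.
name = record.get("person", {}).get("name", {}) or {}
given = (name.get("given-names") or {}).get("value", "")
family = (name.get("family-name") or {}).get("value", "")
works = (record.get("activities-summary", {}).get("works", {}) or {}).get("group", [])

print(f"{given} {family} ({ORCID_ID}) lists {len(works)} work(s) publicly.")
```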
With individual researchers/contributors at the center, and their affiliated organizations connecting with them through the ORCID API, all participating organizations’ systems can also communicate with each other. In this way, ORCID not only serves as a mechanism for name disambiguation, it also provides a linchpin for system interoperability in the research and scholarly communication ecosystem. Figure 4: ORCID serves as a mechanism for interoperability between systems and data in the scholarly communication ecosystem. Graphic courtesy of the ORCID organization. Publishers, funders, research institutions (employers), government agencies, and other stakeholders have been adopting and using ORCID increasingly in their systems over the past several years. As a global initiative, over 5 million individuals around the world have registered for an ORCID iD, and that number continues to grow steadily as more organizations start to require ORCID iDs in their workflows. For example, over 65 publishers have signed on to an open letter committing to use ORCID in their processes, and grant funders are continuing to come on board with ORCID as well, having recently released their own open letter demonstrating commitment to ORCID. A full list of participating ORCID member organizations around the globe can be found at https://orcid.org/members. ORCID Integrations ORCID can be integrated into any system that touches the types of data contained within an ORCID record, including repositories, publishing and content management platforms, data management systems, central identity management systems, human resources, grants management, and Current Research Information Systems (CRIS). ORCID integrations can either be custom built into local systems, such as the example from UVA above, or made available through a vendor system out of the box. Several vendor-hosted CRIS such as Pure, Faculty 180, Digital Measures, and Symplectic Elements, already have built-in support for authenticated ORCID connections that can be utilized by institutional ORCID members, which provides a quick win for pulling ORCID data into assessment workflows with no development required. While ORCID has a public API that offers limited functionality for connecting with ORCID iDs and reading public ORCID data, the ORCID member API allows organizations to read from, write to, and auto-update ORCID data for their affiliated researchers. The ORCID institutional membership model allows organizations to support the ORCID initiative and benefit from the more robust functionality that the member API provides. ORCID can be integrated with disparate systems, or with one system from which data flows into others, as illustrated in Figure 5. Figure 5: This graphic from the Czech Technical University in Prague illustrates how a central identity management system is configured to connect with the ORCID registry via the ORCID API, with ORCID data flowing internally to other institutional systems. Image Source: Czech Technical University in Prague Central Library & Computing and Information Centre , 2016: Solving a Problem of Authority Control in DSpace During ORCID Implementation ORCID in US Research Institutions In January of 2018, four consortia in the US – the NorthEast Research Libraries (NERL), the Greater Western Library Alliance (GWLA), the Big Ten Academic Alliance (BTAA), and LYRASIS – joined forces to form a national partnership for a consortial approach to ORCID membership among research institutions in the US, known as the ORCID US Community. 
The national partnership allows non-profit research institutions to become premium ORCID member organizations for a significantly discounted fee and employs staff to provide dedicated technical and community support for its members. As of December 1, 2018, there are 107 member organizations in the ORCID US Community. In addition to encouraging adoption of ORCID, a main goal of the consortium approach is to build a community of practice around ORCID in the US. Prior to 2018, any institutions participating in ORCID were essentially going it alone and there were no dedicated communication channels or forums for discussion and sharing around ORCID at a national level. However, with the formation of the ORCID US Community, there is now a website with community resources for ORCID adoption specific to the US, dedicated communication channels, and an open door to collaboration between member institutions. Among ORCID US Community member organizations, just under half have integrated ORCID with one or more systems, and the other slightly more than half are either in early planning stages or technical development. (See the ORCID US Community 2018 newsletter for more information.) As an ecosystem, ORCID relies not only on organizations but also the participation of individual researchers, so all members have also been actively reaching out to their affiliated researchers to encourage them to register for, connect, and use their ORCID iD.
Getting Started with ORCID
ORCID can benefit research institutions by mitigating confusion caused by name ambiguity, providing an interoperable data source that can be used for individual assessment and aggregated review of institutional impact, and allowing institutions to assert authority over their institutional name and verify affiliations with researchers, ultimately saving time and reducing administrative burden for both organizations and individuals. To get the most value from ORCID, research institutions should consider the following three activities as outlined in the ORCID US Planning Guide:
1. Forming a cross-campus ORCID committee or group with stakeholders from different campus units (libraries, central IT, research office, graduate school, grants office, human resources, specific academic units, etc.) to strategically plan ORCID system integration and outreach efforts
2. Assessing all of the current systems used on campus to determine which workflows could benefit from ORCID integration
3. Conducting outreach and education around research impact and ORCID to encourage researchers to register for and use their ORCID iD
The more people and organizations/systems using ORCID, the more all stakeholders can benefit from ORCID by maintaining a record of an individual's scholarly and cultural contributions throughout their career, mitigating confusion caused by name ambiguity, assessing individual contributions as well as institutional impact, and enabling trustworthy and efficient sharing of data across scholarly communication workflows. Effectively, ORCID represents a paradigm shift from siloed, repetitive workflows to the ideal of being able to "enter once, re-use often" by using ORCID to transfer data between systems, workflows, and individuals, ultimately making everyone's lives easier.
Sheila Rabun is the ORCID US Community Specialist at LYRASIS, providing technical and community support for 100+ institutional members of the ORCID US Community.
In prior roles, she managed community and communication for the International Image Interoperability Framework (IIIF) Consortium, and served as a digital project manager for several years at the University of Oregon Libraries’ Digital Scholarship Center. Learn more at https://orcid.org/0000-0002-1196-6279 Author Sheila RabunPosted on December 18, 2018December 17, 2018Categories digital scholarship, publication, Scholarly Communication Creating Presentations with Beautiful.AI Updated 2018-11-12 at 3:30PM with accessibility information. Beautiful.AI is a new website that enables users to create dynamic presentations quickly and easily with “smart templates” and other design optimized features. So far the service is free with a paid pro tier coming soon. I first heard about Beautiful.AI in an advertisement on NPR and was immediately intrigued. The landscape of presentation software platforms has broadened in recent years to include websites like Prezi, Emaze, and an array of others beyond the tried and true PowerPoint. My preferred method of creating presentations for the past couple of years has been to customize the layouts available on Canva and download the completed PDFs for use in PowerPoint. I am also someone who enjoys tinkering with fonts and other design elements until I get a presentation just right, but I know that these steps can be time consuming and overwhelming for many people. With that in mind, I set out to put Beautiful.AI to the test by creating a short “prepare and share” presentation about my first experience at ALA’s Annual Conference this past June for an upcoming meeting. A title slide created with Beautiful.AI. Features To help you get started, Beautiful.AI includes an introductory “Design Tips for Beautiful Slides” presentation. It is also fully customizable so you can play around with all of of the features and options as you explore, or you can click on “create new presentation” to start from scratch. You’ll then be prompted to choose a theme, and you can also choose a color palette. Once you start adding slides you can make use of Beautiful.AI’s template library. This is the foundation of the site’s usefulness because it helps alleviate guesswork about where to put content and that dreaded “staring at the blank slide” feeling. Each individual slide becomes a canvas as you create a presentation, similar to what is likely familiar in PowerPoint. In fact, all of the most popular PowerPoint features are available in Beautiful.AI, they’re just located in very different places. From the navigation at the left of the screen users can adjust the colors and layout of each slide as well as add images, animation, and presenter notes. Options to add, duplicate, or delete a slide are available on the right of the screen. The organize feature also allows you to zoom out and see all of the slides in the presentation. Beautiful.AI offers a built-in template to create a word cloud. One of Beautiful.AI’s best features, and my personal favorite, is its built-in free stock image library. You can choose from pre-selected categories such as Data, Meeting, Nature, or Technology or search for other images. An import feature is also available, but providing the stock images is extremely useful if you don’t have your own photos at the ready. Using these images also ensures that no copyright restrictions are violated and helps add a professional polish to your presentation. 
The options to add an audio track and advance times to slides are also nice to have for creating presentations as tutorials or introductions to a topic. When you’re ready to present, you can do so directly from the browser or export to PDF or PowerPoint. Options to share with a link or embed with code are also available. Usability While intuitive design and overall usability won’t necessarily make or break the existence of a presentation software platform, each will play a role in influencing whether someone uses it more than once. For the most part, I found Beautiful.AI to be easy and fun to use. The interface is bold, yet simplistic, and on trend with current website design aesthetics. Still, users who are new to creating presentations online in a non-PowerPoint environment may find the Beautiful.AI interface to be confusing at first. Most features are consolidated within icons and require you to hover over them to reveal their function. Icons like the camera to represent “Add Image” are pretty obvious, but others such as Layout and Organize are less intuitive. Some of Beautiful.AI’s terminology may also not be as easily recognizable. For example, the use of the term “variations” was confusing to me at first, especially since it’s only an option for the title slide. The absence of any drag and drop capability for text boxes is definitely a feature that’s missing for me. This is really where the automated design adaptability didn’t seem to work as well as I would’ve expected given that it’s one of the company’s most prominent marketing statements. On the title slide of my presentation, capitalizing a letter in the title caused the text to move closer to the edge of the slide. In Canva, I could easily pull the text block over to the left a little or adjust the font size down by a few points. I really am a stickler for spacing in my presentations, and I would’ve expected this to be an element that the “Design AI” would pick up on. Each template also has different pre-set design elements, and it can be confusing when you choose one that includes a feature that you didn’t expect. Yet, text sizes that are pre-set to fit the dimensions of each template does help not only with readability in the creation phase but with overall visibility for audiences. Again, this alleviates some of the guesswork that often happens in PowerPoint with not knowing exactly how large your text sizes will appear when projected onto larger screens. A slide created using a basic template and stock photos available in Beautiful.AI. One feature that does work really well is the export option. Exporting to PowerPoint creates a perfectly sized facsimile presentation, and being able to easily download a PDF is very useful for creating handouts or archiving a presentation later on. Both are nice to have as a backup for conferences where Internet access may be spotty, and it’s nice that Beautiful.AI understands the need for these options. Unfortunately, Beautiful.AI doesn’t address accessibility on its FAQ page nor does it offer alternative text or other web accessibility features. Users will need to add their own slide titles and alt text in PowerPoint and Adobe Acrobat after exporting from Beautiful.AI to create an accessible presentation.  Conclusion Beautiful.AI challenged me to think in new ways about how best to deliver information in a visually engaging way. It’s a useful option for librarians and students who are looking for a presentation website that is fun to use, engaging, and on trend with current web design. 
Click here to view “My first ALA”presentation created with Beautiful.AI. Jeanette Sewell is the Database and Metadata Management Coordinator at Fondren Library, Rice University. Author Jeanette SewellPosted on November 12, 2018November 12, 2018Categories conferences, library, presentation, technology, tools National Forum on Web Privacy and Web Analytics We had the fantastic experience of participating in the National Forum on Web Privacy and Web Analytics in Bozeman, Montana last month. This event brought together around forty people from different areas and types of libraries to do in-depth discussion and planning about privacy issues in libraries. Our hosts from Montana State University, Scott Young, Jason Clark, Sara Mannheimer, and Jacqueline Frank, framed the event with different (though overlapping) areas of focus. We broke into groups based on our interests from a pre-event survey and worked through a number of activities to identify projects. You can follow along with all the activities and documents produced during the Forum in this document that collates all of them. Float your boat exercise             While initially worried that the activities would feel too forced, instead they really worked to release creative ideas. Here’s an example: our groups drew pictures of boats with sails showing opportunities, and anchors showing problems. We started out in two smaller subgroups of our subgroups and drew a boat, then met with the large subgroup to combine the boat ideas. This meant that it was easy to spot the common themes—each smaller group had written some of the same themes (like GDPR). Working in metaphor meant we could express some more complex issues, like politics, as the ocean—something that always surrounds the issue and can be helpful or unhelpful without much warning. This helped us think differently about issues and not get too focused on our own individual perspective. The process of turning metaphor into action was hard. We had to take the whole world of problems and opportunities and come up with how these could be realistically accomplished. Good and important ideas had to get left behind because they were so big there was no way to feasibly plan them, certainly not in a day or two. The differing assortment of groups (which were mixable where ideas overlapped) ensured that we were able to question each other’s assumptions and ask some hard questions. For example, one of the issues Margaret’s group had identified as a problem was disagreement in the profession about what the proper limits were on privacy. Individually identifiable usage metrics are a valuable commodity to some, and a thing not to be touched to others. While everyone in the room was probably biased more in favor of privacy than perhaps the profession at large is, we could share stories and realities of the types of data we were collecting and what it was being used for. Considering the realities of our environments, one of our ideas to bring everyone from across the library and archives world to create a unified set of privacy values was not going to happen. Despite that, we were able to identify one of the core problems that led to a lack of unity, which was, in many cases, lack of knowledge about what privacy issues existed and how these might affect institutions. When you don’t completely understand something, or only half understand it, you are more likely to be afraid of it.             
On the afternoon of the second day and continuing into the morning of the third day, we had to get serious and pick just one idea to focus on to create a project plan. Again, the facilitators utilized a few processes that helped us take a big idea and break it down into more manageable components. We used “Big SCAI” thinking to frame the project: what is the status quo, what are the challenges, what actions are required, and what are the ideals. From there we worked through what was necessary for the project, nice to have, unlikely to get, and completely unnecessary to the project. This helped focus efforts and made the process of writing a project implementation plan much easier. What the workday looked like. Writing the project implementation plan as a group was made easier by shared documents, but we all commented on the irony of using Google Docs to write privacy plans. On the other hand, trying to figure out how to write in groups and easily share what we wrote using any other platform was a challenge in the moment. This reality illustrates the problems with privacy: the tool that is easiest to use and comes to mind first will be the one that ends up being used. We have to create tools that make privacy easy (which was a discussion many of us at the Forum had), but even more so we need to think about the tradeoffs that we make in choosing a tool and educate ourselves and others about this. In this case, since all the outcomes of the project were going to be public anyway, going on the “quick and easy” side was ok.             The Forum project leaders recently presented about their work at the DLF Forum 2018 conference. In this presentation, they outlined the work that they did leading up to the Forum, and the strategies that emerged from the day. They characterized the strategies as Privacy Badging and Certifications, Privacy Leadership Training, Privacy for Tribal Communities and Organizations, Model License for Vendor Contracts, Privacy Research Institute, and a Responsible Assessment Toolkit. You can read through the thought process and implementation strategies for these projects and others yourself at the project plan index. The goal is to ensure that whoever wants to do the work can do it. To quote Scott Young’s follow-up email, “We ask only that you keep in touch with us for the purposes of community facilitation and grant reporting, and to note the provenance of the idea in future proposals—a sort of CC BY designation, to speak in copyright terms.”             For us, this three-day deep dive into privacy was an inspiration and a chance to make new connections (while also catching up with some old friends). But even more, it was a reminder that you don’t need much of anything to create a community. Provided the right framing, as long as you have people with differing experiences and perspectives coming together to learn from each other, you’ve facilitated the community-building.   Author Margaret HellerPosted on October 29, 2018October 29, 2018Categories conferences, privacy The Ex Libris Knowledge Center and Orangewashing Two days after ProQuest completed their acquisition of Ex Libris in December 2015, Ex Libris announced the launch of their new online Customer Knowledge Center. In the press release for the Knowledge Center, the company describes it as “a single gateway to all Ex Libris knowledge resources,” including training materials, release notes, and product manuals. 
A defining feature is that there has never been any paywall or log-on requirement, so that all Knowledge Center materials remain freely accessible to any site visitor. Historically, access to documentation for automated library systems has been restricted to subscribing institutions, so the Knowledge Center represents a unique change in approach. Within the press release, it is also readily apparent how Ex Libris aims to frame the openness of the Knowledge Center as a form of support for open access. As the company states in the second paragraph, “Demonstrating the Company’s belief in the importance of open access, the site is open to all, without requiring any logon procedure.” Former Ex Libris CEO Matti Shem Tov goes a step further in the following paragraph: “We want our resources and documentation to be as accessible and as open as our library management, discovery, and higher-education technology solutions are.” The problem with how Ex Libris frames their press release is that it elides the difference between mere openness and actual open access. They are a for-profit company, and their currently burgeoning market share is dependent upon a software-as-a-service (SaaS) business model. Therefore, one way to describe their approach in this case is orangewashing. During a recent conversation with me, Margaret Heller came up with the term, based on the color of the PLOS open access symbol. Similar in concept to greenwashing, we can define orangewashing as a misappropriation of open access rhetoric for business purposes. What perhaps makes orangewashing more initially difficult to diagnose in Ex Libris’s (and more broadly, ProQuest’s) case is that they attempt to tie support for open access to other product offerings. Even before purchasing Ex Libris, ProQuest had been including an author-side paid open-access publishing option to its Electronic Thesis and Dissertation platform, though we can question whether this is actually a good option for authors. For its part, Ex Libris has listened to customer feedback about open access discovery. As an example, there are now open access filters for both the Primo and Summon discovery layers. Ex Libris has also, generally speaking, remained open to customer participation regarding systems development, particularly with initiatives like the Developer Network and Idea Exchange. Perhaps the most credible example is in a June 24, 2015 press release, where the company declares “support of the Open Discovery Initiative (ODI) and conformance with ODI’s recommended practice for pre-indexed ‘web-scale’ discovery services.” A key implication is that “conforming to ODI regulations about ranking of search results, linking to content, inclusion of materials in Primo Central, and discovery of open access content all uphold the principles of content neutrality.” Given the above information, in the case of the Knowledge Center, it is tempting to give Ex Libris the benefit of the doubt. As an access services librarian, I understand how much of a hassle it can be to find and obtain systems documentation in order to properly do my job. I currently work for an Ex Libris institution, and can affirm that the Knowledge Center is of tangible benefit. Besides providing easier availability for their materials, Ex Libris has done fairly well in keeping information and pathing up to date. Notably, as of last month, customers can also contribute their own documentation to product-specific Community Knowledge sections within the Knowledge Center. 
Nevertheless, this does not change the fact that while the Knowledge Center is unique in its format, it represents a low bar to clear for a company of Ex Libris’s size. Their systems documentation should be openly accessible in any case. Moreover, the Knowledge Center represents openness—in the form of company transparency and customer participation—for systems and products that are not open. This is why when we go back to the Knowledge Center press release, we can identify it as orangewashing. Open access is not the point of a profit-driven company offering freely accessible documentation, and any claims to this effect ultimately ring hollow. So what is the likely point of the Knowledge Center, then? We should consider that Alma has become the predominant service platform within academic libraries, with Primo and Summon being the only supported discovery layers for it. While OCLC and EBSCO offer or support competing products, Ex Libris already held an advantageous position even before the ProQuest purchase. Therefore, besides the Knowledge Center serving as supportive measure for current customers, we can view it as a sales pitch to future ones. This may be a smart business strategy, but again, it has little to do with open access. Two other recent developments provide further evidence of Ex Libris’s orangewashing. The first is MLA’s announcement that EBSCO will become the exclusive vendor for the MLA International Bibliography. On the PRIMO-L listserv, Ex Libris posted a statement [listserv subscription required] noting that the agreement “goes against the goals of NISO’s Open Discovery Initiative…to promote collaboration and transparency among content and discovery providers.” Nevertheless, despite not being involved in the agreement, Ex Libris shares some blame given the long-standing difficulty over EBSCO not providing content to the Primo Central Index. As a result, what may occur is the “siloing” of an indispensable research database, while Ex Libris customers remain dependent on the company to help determine an eventual route to access. Secondly, in addition to offering research publications through ProQuest and discovery service through Primo/Summon, Ex Libris now provides end-to-end content management through Esploro. Monetizing more aspects of the research process is certainly far from unusual among academic publishers and service providers. Elsevier arguably provides the most egregious example, and as Lisa Janicke Hinchliffe notes, their pattern of recent acquisitions belies an apparent goal of creating a vertical stack service model for publication services. In considering what Elsevier is doing, it is unsurprising—from a business standpoint—for Ex Libris and ProQuest to pursue profits in a similar manner. That said, we should bear in mind that libraries are already losing control over open access as a consequence of the general strategy that Elsevier is employing. Esploro will likely benefit from having strong library development partners and “open” customer feedback, but the potential end result could place its customers in a more financially disadvantageous and less autonomous position. This is simply antithetical to open access. Over the past few years, Ex Libris has done well not just in their product development, but also their customer support. Making the Knowledge Center “open to all” in late 2015 was a very positive step forward. Yet the company’s decision to orangewash through claiming support for open access as part of a product unveiling still warrants critique. 
Peter Suber reminds us that open access is a “revolutionary kind of access”—one that is “unencumbered by a motive of financial gain.” While Ex Libris can perhaps talk about openness with a little more credibility than their competitors, their bottom line is still what really matters. Author: Chris Martin | Posted on September 25, 2018 | Categories: open access, Scholarly Communication Managing ILS Updates We’ve done a few screencasts in the past here at TechConnect and I wanted to make a new one to cover a topic that’s come up this summer: managing ILS updates. Integrated Library Systems are huge, unwieldy pieces of software, and it can be difficult to track what changes with each update: new settings are introduced, behaviors change, bugs are (hopefully) fixed. The video below shows my approach to managing this process and keeping track of ongoing issues with our Koha ILS. Author: Eric Phetteplace | Posted on August 13, 2018 | Categories: library Blockchain: Merits, Issues, and Suggestions for Compelling Use Cases Blockchain holds great potential for both innovation and disruption. The adoption of blockchain also poses certain risks, and those risks will need to be addressed and mitigated before blockchain becomes mainstream. A lot of people have heard of blockchain at this point, but many are unfamiliar with how exactly this new technology works and unsure under what circumstances it may be useful to libraries. In this post, I will provide a brief overview of the merits and the issues of blockchain, and at the end I will make some suggestions for compelling use cases. What Blockchain Accomplishes Blockchain is the technology that underpins a well-known decentralized cryptocurrency, Bitcoin. To put it simply, blockchain is a kind of distributed digital ledger on a peer-to-peer (P2P) network, in which records are confirmed and encrypted. Blockchain records and keeps data in its original state in a secure and tamper-proof manner[1] by its technical implementation alone, thereby obviating the need for a third-party authority to guarantee the authenticity of the data. Records in a blockchain are stored in multiple ledgers in a distributed network instead of one central location. This prevents a single point of failure and secures records by protecting them from potential damage or loss. Blocks in each blockchain ledger are chained to one another by the mechanism called ‘proof of work.’ (For those familiar with a version control system such as Git, a blockchain ledger can be thought of as something similar to a P2P-hosted git repository that allows sequential commits only.[2]) This makes records in a block immutable and irreversible, that is, tamper-proof. In areas where the authenticity and security of records is of paramount importance, such as electronic health records, digital identity authentication/authorization, digital rights management, historic records that may be contested or challenged due to the vested interests of certain groups, and digital provenance, to name a few, blockchain can lead to efficiency, convenience, and cost savings. For example, with blockchain implemented in banking, one will be able to transfer funds across different countries without going through banks.[3] This can drastically lower the fees involved, and the transaction will take effect much more quickly, if not immediately.
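To make the idea of hash-chained blocks and proof of work concrete, here is a minimal, purely illustrative Python sketch (a toy example, not part of any real blockchain implementation); an actual blockchain distributes these blocks across many peer nodes and uses much harder puzzles.

import hashlib
import json
import time

def hash_block(block):
    # Hash the block's contents deterministically.
    return hashlib.sha256(json.dumps(block, sort_keys=True).encode()).hexdigest()

def mine_block(data, previous_hash, difficulty=4):
    # Toy proof of work: find a nonce so the block's hash starts with `difficulty` zeros.
    block = {"timestamp": time.time(), "data": data,
             "previous_hash": previous_hash, "nonce": 0}
    while not hash_block(block).startswith("0" * difficulty):
        block["nonce"] += 1
    return block

# Each block embeds the previous block's hash, so tampering with an old record
# invalidates every later block unless all of them are re-mined.
chain = [mine_block("genesis record", previous_hash="0")]
chain.append(mine_block("a later record", previous_hash=hash_block(chain[-1])))

The chaining of hashes is what makes earlier records tamper-evident; the proof-of-work loop is what makes rewriting them expensive.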
Adopted in real estate transactions, blockchain could similarly make the process of buying and selling a property more straightforward and efficient, saving time and money.[4] Disruptive Potential of Blockchain The disruptive potential of blockchain lies in its aforementioned ability to render obsolete the role of a third-party authority, which records and validates transactions and guarantees their authenticity should a dispute arise. In this respect, blockchain can serve as an alternative trust protocol that decentralizes traditional authorities. Since blockchain achieves this through public key cryptography, however, if one loses one’s personal key to the blockchain ledger holding one’s financial or real estate assets, for example, that will result in the permanent loss of those assets. With the third-party authority gone, there will be no institution to step in and remedy the situation. Issues This is only one of the issues with blockchain. Other issues include (a) interoperability between different blockchain systems, (b) scalability of blockchain at a global scale with large amounts of data, (c) potential security issues such as the 51% attack,[5] and (d) the huge energy consumption[6] that a blockchain requires to add a block to a ledger. Note that the last issue, energy consumption, has both environmental and economic ramifications because it can cancel out the cost savings gained from eliminating a third-party authority and related processes and fees. Challenges for Wider Adoption There is growing interest in blockchain among information professionals, but there are also some obstacles to that interest gaining momentum and moving towards wider trial and adoption. One obstacle is the lack of general understanding about blockchain among the larger audience of information professionals. Due to its original association with Bitcoin, many mistake blockchain for cryptocurrency. Another obstacle is technical. The use of blockchain requires setting up and running a node in a blockchain network, such as Ethereum,[7] which may be daunting to those who are not tech-savvy. This raises the barrier to entry for those who are not familiar with command-line scripting and yet still want to try out and test how a blockchain functions. The last and most important obstacle is the lack of compelling use cases for libraries, archives, and museums. To many, blockchain is an interesting new technology, but even many blockchain enthusiasts are skeptical of its practical benefits at this point, when all associated costs are considered. Of course, this is not an insurmountable obstacle. The more familiar people become with blockchain, the more ways they will discover to use it in the information profession for purposes where it is uniquely beneficial. Suggestions for Compelling Use Cases of Blockchain In order to determine what may make a compelling use case of blockchain, the information profession would benefit from considering the following: (a) What kind of data/records (or series thereof) must be stored and preserved exactly the way they were created? (b) What kind of information is at great risk of being altered and compromised by changing circumstances? (c) What type of interactions may need to take place between such data/records and their users?[8] (d) What would be a reasonable cost for implementation?
These questions will help connect the potential benefits of blockchain with real-world use cases and take the information profession one step closer to wider testing and adoption. To those further interested in blockchain and libraries, I recommend the recordings from the Library 2.018 online mini-conference, “Blockchain Applied: Impact on the Information Profession,” held back in June. The Blockchain National Forum, which is funded by IMLS and is to take place in San Jose, CA on August 6th, will also be livestreamed. Notes [1] For an excellent introduction to blockchain, see “The Great Chain of Being Sure about Things,” The Economist, October 31, 2015, https://www.economist.com/news/briefing/21677228-technology-behind-bitcoin-lets-people-who-do-not-know-or-trust-each-other-build-dependable. [2] Justin Ramos, “Blockchain: Under the Hood,” ThoughtWorks (blog), August 12, 2016, https://www.thoughtworks.com/insights/blog/blockchain-under-hood. [3] The World Food Programme, the food-assistance branch of the United Nations, is using blockchain to increase its humanitarian aid to refugees. Blockchain may possibly be used not only for financial transactions but also for identity verification for refugees. Russ Juskalian, “Inside the Jordan Refugee Camp That Runs on Blockchain,” MIT Technology Review, April 12, 2018, https://www.technologyreview.com/s/610806/inside-the-jordan-refugee-camp-that-runs-on-blockchain/. [4] Joanne Cleaver, “Could Blockchain Technology Transform Homebuying in Cook County — and Beyond?,” Chicago Tribune, July 9, 2018, http://www.chicagotribune.com/classified/realestate/ct-re-0715-blockchain-homebuying-20180628-story.html. [5] “51% Attack,” Investopedia, September 7, 2016, https://www.investopedia.com/terms/1/51-attack.asp. [6] Sherman Lee, “Bitcoin’s Energy Consumption Can Power An Entire Country — But EOS Is Trying To Fix That,” Forbes, April 19, 2018, https://www.forbes.com/sites/shermanlee/2018/04/19/bitcoins-energy-consumption-can-power-an-entire-country-but-eos-is-trying-to-fix-that/#49ff3aa41bc8. [7] Osita Chibuike, “How to Setup an Ethereum Node,” The Practical Dev, May 23, 2018, https://dev.to/legobox/how-to-setup-an-ethereum-node-41a7. [8] The interaction can also be a self-executing program that runs when certain conditions are met in a blockchain ledger. This is called a “smart contract.” See Mike Orcutt, “States That Are Passing Laws to Govern ‘Smart Contracts’ Have No Idea What They’re Doing,” MIT Technology Review, March 29, 2018, https://www.technologyreview.com/s/610718/states-that-are-passing-laws-to-govern-smart-contracts-have-no-idea-what-theyre-doing/. Author: Bohyun Kim | Posted on July 24, 2018 | Categories: coding, data, technology | Tags: bitcoin, blockchain, distributed ledger technology Introducing Our New Best Friend, GDPR You’ve seen the letters GDPR in every single email you’ve gotten from a vendor or a mailing list lately, but you might not be exactly sure what it is. With GDPR enforcement starting on May 25, it’s time for a crash course in what GDPR is, and why it could be your new best friend whether you are in the EU or not. First, you can check out the EU GDPR information site (though it probably will be under heavy load for a few days!) for lots of information on this.
It’s important to recognize, however, that for universities like mine with a campus located in the EU, the GDPR has created additional oversight to ensure that our own data collection practices are GDPR compliant, or that we restrict people residing in the EU from accessing those services. You should definitely work with legal counsel on your own campus in making any decisions about GDPR compliance. So what does the GDPR actually mean in practice? The requirements break down this way: any company which holds the data of any EU citizen must provide data controls, no matter where the company or the data is located. This means that every large web platform and pretty much every library vendor must comply or face heavy fines. The GDPR offers the following protections for personally identifiable information, which includes things like IP address: privacy terms and conditions must be written in easy-to-understand language; data breaches require quick notification; individuals have the right to know what data is being collected and to receive a copy of it; there is a “right to be forgotten,” or data erasure (unless it is in the public interest for the data to be retained); data can be transferred between providers; systems must be private by design and collect only necessary data; and companies must appoint data privacy officers without conflicts of interest. How this all works in practice is not consistent, and there will be a lot to be worked out in the courts in the coming years. Note that Google recently lost several right-to-be-forgotten cases and was required to remove information that it had originally stated was in the public interest to retain. The GDPR has actually been around for a few years, but May 25, 2018 was set as the enforcement date, so many people have been scrambling to meet that deadline. If you’re reading this today, there’s probably not a lot of time to do anything about your own practices, but if you haven’t yet reviewed what your vendors are doing, this would be a good time. Note too that there are no rights guaranteed for any Americans, and several companies, including Facebook, have moved data governance out of their Irish office to California to be out of reach of suits brought in Irish courts. Where possible, however, we should be using all the features at our disposal. As librarians, we already tend toward the “privacy by design” philosophy, even though we aren’t always perfect at it. As I wrote in my last post, my library worked on auditing our practices and creating a new privacy policy, and one of the last issues was trying to figure out how we would approach some of the third-party services that we need in order to serve our patrons but that did not allow deleting data. Now some of those features are being made available. For example, Google Analytics now has a data retention feature, which allows you to set data to expire and be deleted after a certain amount of time. Google provides some more detailed instructions to ensure that you are not accidentally collecting personally identifiable information in your analytics data. Lots of our library vendors provide personal account features, and those too are subject to these new GDPR requirements. This means that there are new levels of transparency about what kinds of tracking they are doing, greater ability for patrons to control data, and greater ability for you to control data on behalf of patrons.
Here are a few example vendor GDPR compliance statements or FAQs: EBSCO, Ex Libris, ProQuest, Springshare. Note that some vendors, like EBSCO, are moving to HTTPS for all sites that weren’t using it before, and so this may require changes to proxy servers or other links. I am excited about GDPR because no matter where we are located, it gives us new tools to defend the privacy of our patrons. Even better than that, it is providing lots of opportunities on our campuses to talk about privacy with all stakeholders. At my institution, the library has been able to showcase our privacy expertise and have some good conversations about data governance and future goals for privacy. It doesn’t mean that all our problems will be solved, but we are moving in a more positive direction. Author: Margaret Heller | Posted on May 24, 2018 | Categories: administration, privacy | Tags: gdpr Names are Hard A while ago I stumbled onto the post “Falsehoods Programmers Believe About Names” and was stunned. Personal names are one of the most deceptively difficult forms of data to work with and this article touched on so many common but unaddressed problems. Assumptions like “people have exactly one canonical name” and “My system will never have to deal with names from China/Japan/Korea” were apparent everywhere. I consider myself a fairly critical and studious person; I devote time to thinking about the consequences of design decisions and carefully attempt to avoid poor assumptions. But I’ve repeatedly run into trouble when handling personal names as data. There is a cognitive dissonance surrounding names; we treat them as rigid identifiers when they’re anything but. We acknowledge their importance but struggle to take them seriously. Names change. They change due to marriage, divorce, child custody, adoption, gender identity, religious devotion, performance art, witness protection, or none of these at all. Sometimes people just want a new name. And none of these reasons for change are more or less valid than others, though our legal system doesn’t always treat them equally. We have students who change their legal name, which is often something systems expect, but then they have the audacity to want to change their username, too! And that works less often because all sorts of system integrations expect usernames to be persistent. Names do not have a universal structure. There is no set quantity of components in a name nor an established order to those components. At my college, we have students without surnames. In almost all our systems, surname is a required field, so we put a period “.” there to satisfy that requirement. Then, on displays in our digital repository where surnames are assumed, we end up with bolded section headers like “., Johnathan” which look awkward. Many Western names might follow a [Given name] – [Middle name] – [Surname] structure, and an unfortunate number of the systems I have to deal with assume all names share this structure. It’s easy to see how this yields problematic results. For instance, if you want to see a sorted list of users, you probably want to sort by family name, but many systems sort by the name in the last position, causing, for instance, Chinese names[1] to be handled differently from Western ones.[2] But it’s not only that someone might not have a middle name, or might have two middle names, or might have a family name in the first position—no, even that would be too simple! Some name components defy simple classification. I once met a person named “Bus Stop”.
“Stop” is clearly not a family affiliation, despite coming in the final position of the name. Sometimes the second component of a tripartite Western name isn’t a middle name at all, but a maiden name or the second word of a two-word first name (e.g. “Mary Anne” or “Lady Bird”)! One cannot determine, just by looking at a familiar structure, the roles of all of a name’s pieces! Names are also contextual. One’s name with family, with legal institutions, and with classmates can all differ. Many of our international students have alternative Westernized first names. Their family may call them Qiáng but they introduce themselves as Brian in class. We ask for a “preferred name” in a lot of systems, which is a nice step forward, but don’t ask when it’s preferred. Names might be meant for different situations. We have no system remotely ready for this, despite the personalization that’s been seeping into web platforms for decades. So if names are such trouble, why not do our best and move on? Aren’t these fringe cases that don’t affect the vast majority of our users? These issues simply cannot be ignored because names are vital. What one is called, even if it’s not a stable identifier, has great effects on one’s life. It’s dispiriting to witness one’s name misspelled, mispronounced, treated as an inconvenience, botched at every turn. A system that won’t adapt to suit a name delegitimizes the name. It says, “oh that’s not your real name” as if names had differing degrees of reality. But a person may have multiple names—or many overlapping names over time—and while one may be more institutionally recognized at a given time, none are less real than the others. If even a single student a year is affected, affirming their name(s) is the absolute least amount of respect we can show. So what are we to do? Endless enumerations of the difficulties of working with names do little but paralyze us. Honestly, when I consider the best implementation of personal names, the MODS metadata schema comes to mind. Having a <name> element with any number of <namePart> children is the best model available. The <namePart>s can be ordered in particular ways, a “@type” attribute can define a part’s function,[3] a record can include multiple names referencing the same person, multiple names with distinct parts can be linked to the same authority record, etc. MODS has a flexible and comprehensive treatment of name data. Unfortunately, returning to “Falsehoods Programmers Believe”, none of the library systems I administer do anywhere near as good a job as this metadata schema. Nor is it necessarily a problem with Western bias—even the Chinese government can’t develop computer systems to accurately represent the names of people in the country, or even agree on what the legal character set should be![4] It seems that programmers start their apps by creating a “users” database table with columns for unique identifier, username, “firstname”/“lastname” [sic], and work from there. On the bright side, the name isn’t used as the identifier at least! We all learned that in databases class, but we didn’t learn to make “names” a separate table linked to “users” in our relational databases (a rough sketch of that idea appears below). In my day-to-day work, the best I’ve done is to be sensitive to the importance of name changes specifically and how our systems handle them. After a few meetings with a cross-departmental team, we developed a name change process at our college. System administrators from across the institution are on a shared listserv where name changes are announced.
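As a rough illustration of that separate names table, here is a minimal, hypothetical sketch using Python's sqlite3 module; the schema and the inserted values are invented for this example and are not drawn from any real library system.

import sqlite3

# Hypothetical schema: one row per name part, not one name column per user.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE users (
    id INTEGER PRIMARY KEY,
    username TEXT UNIQUE
);
CREATE TABLE names (
    id INTEGER PRIMARY KEY,
    user_id INTEGER REFERENCES users(id),
    part TEXT NOT NULL,      -- the name component itself
    part_type TEXT,          -- e.g. 'given', 'family', 'matronymic'; NULL when unknown
    position INTEGER,        -- display order, since ordering is not universal
    context TEXT             -- e.g. 'legal', 'preferred', 'classroom'
);
""")
conn.execute("INSERT INTO users (id, username) VALUES (1, 'jdoe')")
conn.executemany(
    "INSERT INTO names (user_id, part, part_type, position, context) VALUES (?, ?, ?, ?, ?)",
    [(1, "Qiáng", "given", 1, "legal"),
     (1, "Brian", "given", 1, "preferred")],
)

In a model like this, a name change means adding or updating rows in one table rather than rewriting a column that every integration assumes is immutable.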
In the libraries, I spoke with our frontline service staff about assisting with name changes. Our people at the circulation desk know to notice name discrepancies—sometimes a name badge has been updated but not our catalog records, and we can offer to make them match—but also to guide students who may need to contact the registrar or other departments on campus to initiate the top-down name change process. While most of the library’s systems don’t easily accommodate username changes, I can write administrative scripts for our institutional repository that alter the ownership of a set of items from an old username to a new one. I think it’s important to remember that we’re inconveniencing the user with the work of implementing their name change and not the other way around. So taking whatever extra steps we can on our own, without pushing labor onto our students and staff, is the best way we can mitigate how poorly our tools support the protean nature of personal names. Notes [1] Chinese names typically have the surname first, followed by the given name. [2] Another poor implementation can be seen in The Chicago Manual of Style’s indexing instructions, which have an extensive list of exceptions to the Western norm and how to handle them. But CMoS provides no guidance on how one would go about identifying a name’s cultural background or, for instance, identifying a compound surname. [3] Although the MODS user guidelines sadly limit the use of the type attribute to a fixed list of values which includes “family” and “given”, rendering it subject to most of the critiques in this post. Substantially expanding this list with “maiden”, “patronymic/matronymic” (names based on a parental given name, e.g. Mikhailovich), and more, as well as some sort of open-ended “other” option, would be a great improvement. [4] https://www.nytimes.com/2009/04/21/world/asia/21china.html Author: Eric Phetteplace | Posted on May 14, 2018 | Categories: change, data, diversity About: ACRL TechConnect is a moderated blog written by librarians and archivists covering innovative projects, emerging tech tools, coding, usability, design, and more. ACRL TechConnect serves as your source for technology-related content from the Association of College and Research Libraries, a division of the American Library Association, and C&RL News magazine. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. Based on a work at acrl.ala.org/techconnect.
afonte-info-140 ---- Home - Afonte Jornalismo de Dados. Recent posts: “Cásper Líbero online course covers political marketing on social networks” (April 19, 2021): tools, methods, and strategies for building a public image will be explored over two Saturdays of classes (May 15 and 22). “10 references on fact-checking for research and academic work” (March 31, 2021): reading suggestions on fact-checking and disinformation for beginning researchers. Data: “Journalists and data scientists are the most cited by Brazilian experts on Twitter” (February 11, 2021): research by IBPAD and Science Pulse analyzed the most-mentioned profiles in discussions about Covid-19. “So, what is professional journalism?” (January 13, 2021): a study recently published by Folha suggests that people who consume “professional journalism” are less likely to believe disinformation, but there is a prior concept to be discussed before that conclusion makes sense. Events: “See how the launch of the Postar ou Não project went” (March 31, 2021): the site and e-book seek to encourage critical reading of digital content, offering bibliographic references and activities focused on young audiences. “Afonte and Goethe-Institut Porto Alegre launch a media literacy site and e-book” (March 25, 2021): in site and e-book form, “Postar ou Não?” is a hypermedia guide that seeks to encourage critical reading of digital content with concepts, tips, and quizzes.
afroimpacto-com-470 ---- Afroimpacto: transforming the lives of Black people, creating an afro-impact. Who we are: We are Afroimpacto, a hub for Afro-entrepreneurial development that runs initiatives along the lines of consulting, entrepreneurial education, and development programs, with the goal of reducing social, economic, and educational inequality in the entrepreneurship landscape. In the innovation world, a “hub” is a set of integrated services offered to the entrepreneurial community, connecting people and promoting equal opportunities for development. We want to connect ecosystems to boost Black entrepreneurs and, in doing so, promote their socio-economic development. To fulfill our mission, we work on several fronts, adapting the language and content of entrepreneurship education to the reality of Black people. Clube Afro: Clube Afro is an Afro-entrepreneurial content club that connects and strengthens Black entrepreneurs: weekly content on Afro-entrepreneurship and business in plain language; exercises and tools to develop your business, along with a forum for questions; and a network of Afro-entrepreneurs at different stages, available around the clock. The club subscription is monthly and affordably priced. E-book: To be an entrepreneur is to launch yourself, to innovate, to create solutions, to turn problems into business opportunities, among many other traits. Entrepreneurship also has many branches, among them Black entrepreneurship. But how is this form of entrepreneurship defined? Before answering that question, the e-book presents a series of introductory data needed to understand the specific, current context of the Black population in Brazil, which shapes both how these entrepreneurs start out and the perspective from which they run their businesses. Contact: contato@afroimpacto.com allcontributors-org-6127 ---- All Contributors: ✨ Recognize all contributors, not just the ones who push code ✨ Add new contributors in seconds: We’ve built a bot to automate the tedious stuff for adding project contributors, so you can focus on your project instead of managing your ReadMe.
How it works: 1. Install the bot to your project (check the Installation doc for how to add it). 2. Start a pull request or comment. 3. Mention the @all-contributors bot. 4. Add a contributor’s username and contribution type (check the contribution types in the Emoji Key cheatsheet). 5. Post, and your ReadMe updates automatically! It’ll add the Contributor Table for your first time, too. Who’s using it? There are 2000+ projects using All Contributors. andromedayelton-com-2311 ---- andromeda yelton I haven’t failed, I’ve just tried a lot of ML approaches that don’t work “Let’s blog every Friday,” I thought. “It’ll be great. People can see what I’m doing with ML, and it will be a useful practice for me!” And then I went through weeks on end of feeling like I had nothing to report because I was trying approach after approach to this one problem that simply didn’t work, hence not blogging. And finally realized: oh, the process is the thing to talk about… Hi. I’m Andromeda! I am trying to make a neural net better at recognizing people in archival photos. After running a series of experiments — enough for me to have written 3,804 words of notes — I now have a neural net that is ten times worse at its task. 🎉 And now I have 3,804 words of notes to turn into a blog post (a situation which gets harder every week). So let me catch you up on the outline of the problem: download a whole bunch of archival photos and their metadata (thanks, DPLA!); use a face detection ML library to locate faces, crop them out, and save them in a standardized way; benchmark an off-the-shelf face recognition system to see how good it is at identifying these faces; retrain it; benchmark my new system. Step 3: profit, right? Well. Let me also catch you up on some problems along the way: Alas, metadata Archival photos are great because they have metadata, and metadata is like labels, and labels mean you can do supervised learning, right? Well…. Is he “Du Bois, W. E. B. (William Edward Burghardt), 1868-1963” or “Du Bois, W. E. B. (William Edward Burghardt) 1868-1963” or “Du Bois, W. E. B. (William Edward Burghardt)” or “W.E.B. Du Bois”? I mean, these are all options. People have used a lot of different metadata practices at different institutions and in different times. But I’m going to confuse the poor computer if I imply to it that all these photos of the same person are photos of different people. (I have gone through several attempts to resolve this computationally without needing to do everything by hand, with only modest success.) What about “Photographs”? That appears in the list of subject labels for lots of things in my data set. “Photographs” is a person, right? I ended up pulling in an entire other ML component here — spaCy, to do some natural language processing to at least guess which lines are probably names, so I can clear the rest of them out of my way.
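A minimal, illustrative sketch of that spaCy filtering step (made-up strings rather than the actual DPLA metadata, and not the real pipeline):

import spacy

# Hypothetical subject strings standing in for the metadata described above.
subjects = [
    "Du Bois, W. E. B. (William Edward Burghardt), 1868-1963",
    "Photographs",
    "Martin Luther King Day",
]

nlp = spacy.load("en_core_web_sm")  # small English model, downloaded separately

def probably_a_person(label):
    # Keep a label only if spaCy tags something in it as a PERSON entity.
    return any(ent.label_ == "PERSON" for ent in nlp(label).ents)

names = [s for s in subjects if probably_a_person(s)]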
Even so, spaCy only has ~90% accuracy on personal names and, guess what, because everything is terrible in predictable ways, it has no idea “Kweisi Mfume” is a person. Is a person who appears in the photo guaranteed to be a person who appears in the metadata? Nope. Is a person who appears in the metadata guaranteed to be a person who appears in the photo? Also nope! Often they’re a photographer or other creator. Sometimes they are the subject of the depicted event, but not themselves in the photo. (spaCy will happily tell you that there’s personal name content in something like “Martin Luther King Day”, but MLK is unlikely to appear in a photo of an MLK Day event.) Oh dear, linear algebra OK but let’s imagine for the sake of argument that we live in a perfect world where the metadata is exactly what we need — no more, no less — and its formatting is perfectly consistent. 🦄 Here you are, in this perfect world, confronted with a photo that contains two people and has two names. How do you like them apples? I spent more time than I care to admit trying to figure this out. Can I bootstrap from photos that have one person and one name — identify those, subtract them out of photos of two people, go from there? (Not reliably — there’s a lot of data I never reach that way — and it’s horribly inefficient.) Can I do something extremely clever with matrix multiplication? Like…once I generate vector space embeddings of all the photos, can I do some sort of like dot-product thing across all of my photos, or big batches of them, and correlate the closest-match photos with overlaps in metadata? Not only is this a process which begs the question — I’d have to do that with the ML system I have not yet optimized for archival photo recognition, thus possibly just baking bad data in — but have I mentioned I have taken exactly one linear algebra class, which I didn’t really grasp, in 1995? What if I train yet another ML system to do some kind of k-means clustering on the embeddings? This is both a promising approach and some really first-rate yak-shaving, combining all the question-begging concerns of the previous paragraph with all the crystalline clarity of black box ML. Possibly at this point it would have been faster to tag them all by hand, but that would be admitting defeat. Also I don’t have a research assistant, which, let’s be honest, is the person who would usually be doing this actual work. I do have a 14-year-old and I am strongly considering paying her to do it for me, but to facilitate that I’d have to actually build a web interface and probably learn more about AWS, and the prospect of reading AWS documentation has a bracing way of reminding me of all of the more delightful and engaging elements of my todo list, like calling some people on the actual telephone to sort out however they’ve screwed up some health insurance billing. Nowhere to go but up Despite all of that, I did actually get all the way through the 5 steps above. I have a truly, spectacularly terrible neural net. Go me! But at a thousand-plus words, perhaps I should leave that story for next week…. Andromeda Uncategorized Leave a comment April 16, 2021 this time: speaking about machine learning No tech blogging this week because most of my time was taken up with telling people about ML instead! One talk for an internal Harvard audience, “Alice in Dataland”, where I explained some of the basics of neural nets and walked people through the stories I found through visualizing HAMLET data.
One talk for the NISO plus conference, “Discoverability in an AI World”, about ways libraries and other cultural heritage institutions are using AI both to enhance traditional discovery interfaces and provide new ones. This was recorded today but will be played at the conference on the 23rd, so there’s still time to register if you want to see it! NISO Plus will also include a session on AI, metadata, and bias featuring Dominique Luster, who gave one of my favorite code4lib talks, and one on AI and copyright featuring one of my go-to JD/MLSes, Nancy Sims. And I’m prepping for an upcoming talk that has not yet been formally announced. Which is to say, I guess, I have a lot of talks about AI and cultural heritage in my back pocket, if you were looking for someone to speak about that 😉 Andromeda Uncategorized Leave a comment February 12, 2021 archival face recognition for fun and nonprofit In 2019, Dominique Luster gave a super good Code4Lib talk about applying AI to metadata for the Charles “Teenie” Harris collection at the Carnegie Museum of Art — more than 70,000 photographs of Black life in Pittsburgh. They experimented with solutions to various metadata problems, but the one that’s stuck in my head since 2019 is the face recognition one. It sure would be cool if you could throw AI at your digitized archival photos to find all the instances of the same person, right? Or automatically label them, given that any of them are labeled correctly? Sadly, because we cannot have nice things, the data sets used for pretrained face recognition embeddings are things like lots of modern photos of celebrities, a corpus which wildly underrepresents 1) archival photos and 2) Black people. So the results of the face recognition process are not all that great. I have some extremely technical ideas for how to improve this — ideas which, weirdly, some computer science PhDs I’ve spoken with haven’t seen in the field. So I would like to experiment with them. But I must first invent the universe set up a data processing pipeline. Three steps here: Fetch archival photographs; Do face detection (draw bounding boxes around faces and crop them out for use in the next step); Do face recognition. For step 1, I’m using DPLA, which has a super straightforward and well-documented API and an easy-to-use Python wrapper (which, despite not having been updated in a while, works just fine with Python 3.6, the latest version compatible with some of my dependencies). For step 2, I’m using mtcnn, because I’ve been following this tutorial. For step 3, face recognition, I’m using the steps in the same tutorial, but purely for proof-of-concept — the results are garbage because archival photos from mid-century don’t actually look anything like modern-day celebrities. (Neural net: “I have 6% confidence this is Stevie Wonder!” How nice for you.) Clearly I’m going to need to build my own corpus of people, which I have a plan for (i.e. I spent some quality time thinking about numpy) but haven’t yet implemented. So far the gotchas have been: Gotcha 1: If you fetch a page from the API and assume you can treat its contents as an image, you will be sad. 
You have to treat them as a raw data stream and interpret that as an image, thusly:

import io
import requests
from PIL import Image

response = requests.get(url, stream=True)
response.raw.decode_content = True
data = response.content  # read the body of the streamed response
Image.open(io.BytesIO(data))

This code is, of course, hilariously lacking in error handling, despite fetching content from a cesspool of untrustworthiness, aka the internet. It’s a first draft. Gotcha 2: You see code snippets to convert images to pixel arrays (suitable for AI ingestion) that look kinda like this: np.array(image).astype('uint8'). Except they say astype('float32') instead of astype('uint8'). I got a creepy photonegative effect when I used floats. Gotcha 3: Although PIL was happy to manipulate the .pngs fetched from the API, it was not happy to write them to disk; I needed to convert formats first (image.convert('RGB')). Gotcha 4: The suggested keras_vggface library doesn’t have a Pipfile or requirements.txt, so I had to manually install keras and tensorflow. Luckily the setup.py documented the correct versions. Sadly the tensorflow version is only compatible with python up to 3.6 (hence the comment about DPyLA compatibility above). I don’t love this, but it got me up and running, and it seems like an easy enough part of the pipeline to rip out and replace if it’s bugging me too much. The plan from here, not entirely in order, subject to change as I don’t entirely know what I’m doing until after I’ve done it:
- Build my own corpus of identified people. This means the numpy thoughts, above. It also means spending more quality time with the API to see if I can automatically apply names from photo metadata rather than having to spend too much of my own time manually labeling the corpus.
- Decide how much metadata I need to pull down in my data pipeline and how to store it.
- Figure out some kind of benchmark and measure it.
- Try out my idea for improving recognition accuracy.
- Benchmark again.
- Hopefully celebrate awesomeness.
Andromeda Uncategorized Leave a comment February 5, 2021 sequence models of language: slightly irksome Not much AI blogging this week because I have been buried in adulting all week, which hasn’t left much time for machine learning. Sadface. However, I’m in the last week of the last deeplearning.ai course! (Well. Of the deeplearning.ai sequence that existed when I started, anyway. They’ve since added an NLP course and a GANs course, so I’ll have to think about whether I want to take those too, but at the moment I’m leaning toward a break from the formal structure in order to give myself more time for project-based learning.) This one is on sequence models (i.e. “the data comes in as a stream, like music or language”) and machine translation (“what if we also want our output to be a stream, because we are going from a sentence to a sentence, and not from a sentence to a single output as in, say, sentiment analysis”). And I have to say, as a former language teacher, I’m slightly irked. Because the way the models work is — OK, consume your input sentence one token at a time, with some sort of memory that allows you to keep track of prior tokens in processing current ones (so far, so okay). And then for your output — spit out a few most-likely candidate tokens for the first output term, and then consider your options for the second term and pick your most-likely two-token pairs, and then consider all the ways your third term could combine with those pairs and pick your most likely three-token sequences, et cetera, continue until done.
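Here is a toy sketch of that beam-search decoding loop, purely illustrative: the made-up next_token_probs function stands in for a trained model, and the probabilities mean nothing.

import math

def next_token_probs(prefix):
    # Stand-in for a real model: return {token: probability} given the sequence so far.
    return {"the": 0.4, "cat": 0.3, "sat": 0.2, "<end>": 0.1}

def beam_search(beam_width=2, max_len=4):
    beams = [([], 0.0)]  # each beam is (token sequence, log-probability)
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            if seq and seq[-1] == "<end>":
                candidates.append((seq, score))  # finished sequences carry forward unchanged
                continue
            for token, p in next_token_probs(seq).items():
                candidates.append((seq + [token], score + math.log(p)))
        # keep only the top `beam_width` partial sequences at each step
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_width]
    return beams

print(beam_search())

The sort-and-truncate line is the step being described: only a handful of partial sequences survive from one token to the next.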
And that is…not how language works? Look at Cicero, presuming upon your patience as he cascades through clause after clause which hang together in parallel but are not resolved until finally, at the end, a verb. The sentence’s full range of meanings doesn’t collapse until that verb at the end, which means you cannot be certain if you move one token at a time; you need to reconsider the end in light of the beginning. But, at the same time, that ending token is not equally presaged by all former tokens. It is a verb, it has a subject, and when we reached that subject, likely near the beginning of the sentence, helpfully (in Latin) identified by the nominative case, we already knew something about the verb — a fact we retained all the way until the end. And on our way there, perhaps we tied off clause after clause, chunking them into neat little packages, but none of them nearly so relevant to the verb — perhaps in fact none of them really tied to the verb at all, because they’re illuminating some noun we met along the way. Pronouns, pointing at nouns. Adjectives, pointing at nouns. Nouns, suspended with verbs like a mobile, hanging above and below, subject and object. Adverbs, keeping company only with verbs and each other. There’s so much data in the sentence about which word informs which that the beam model casually discards. Wasteful. And forcing the model to reinvent all these things we already knew — to allocate some of its neural space to re-engineering things we could have told it from the beginning. Clearly I need to get my hands on more modern language models (a bizarre sentence since this class is all of 3 years old, but the field moves that fast). Andromeda Uncategorized 1 Comment January 15, 2021 Adapting Coursera’s neural style transfer code to localhost Last time, when making cats from the void, I promised that I’d discuss how I adapted the neural style transfer code from Coursera’s Convolutional Neural Networks course to run on localhost. Here you go! Step 1: First, of course, download (as python) the script. You’ll also need the nst_utils.py file, which you can access via File > Open. Step 2: While the Coursera file is in .py format, it’s iPython in its heart of hearts. So I opened a new file and started copying over the bits I actually needed, reading them as I went to be sure I understood how they all fit together. Along the way I also organized them into functions, to clarify where each responsibility happened and give it a name. The goal here was ultimately to get something I could run at the command line via python dpla_cats.py, so that I could find out where it blew up in step 3. Step 3: Time to install dependencies. I promptly made a pipenv and, in running the code and finding what ImportErrors showed up, discovered what I needed to have installed: scipy, pillow, imageio, tensorflow. Whatever available versions of the former three worked, but for tensorflow I pinned to the version used in Coursera — 1.2.1 — because there are major breaking API changes with the current (2.x) versions. This turned out to be a bummer, because tensorflow promptly threw warnings that it could be much faster on my system if I compiled it with various flags my computer supports. 
OK, so I looked up the docs for doing that, which said I needed bazel/bazelisk — but of course I needed a paleolithic version of that for tensorflow 1.2.1 compat, so it was irritating to install — and then running that failed because it needed a version of Java old enough that I didn’t have it, and at that point I gave up because I have better things to do than installing quasi-EOLed Java versions. Updating the code to be compatible with the latest tensorflow version and compiling an optimized version of that would clearly be the right answer, but also it would have been work and I wanted messed-up cat pictures now. (As for the rest of my dependencies, I ended up with scipy==1.5.4, pillow==8.0.1, and imageio==2.9.0, and then whatever sub-dependencies pipenv installed. Just in case the latest versions don’t work by the time you read this. 🙂 At this point I had achieved goal 1, aka “getting anything to run at all”. Step 4: I realized that, honestly, almost everything in nst_utils wanted to be an ImageUtility, which was initialized with metadata about the content and style files (height, width, channels, paths), and carried the globals (shudder) originally in nst_utils as class data. This meant that my new dpla_cats script only had to import ImageUtility rather than * (from X import * is, of course, deeply unnerving), and that utility could pingpong around knowing how to do the things it knew how to do, whenever I needed to interact with image-y functions (like creating a generated image or saving outputs) rather than neural-net-ish stuff. Everything in nst_utils that properly belonged in an ImageUtility got moved, step by step, into that class; I think one or two functions remained, and they got moved into the main script. Step 5: Ughhh, scope. The notebook plays fast and loose with scope; the raw python script is, rightly, not so forgiving. But that meant I had to think about what got defined at what level, what got passed around in an argument, what order things happened in, et cetera. I’m not happy with the result — there’s a lot of stuff that will fail with minor edits — but it works. Scope errors will announce themselves pretty loudly with exceptions; it’s just nice to know you’re going to run into them. Step 5a: You have to initialize the Adam optimizer before you run sess.run(tf.global_variables_initializer()). (Thanks, StackOverflow!) The error message if you don’t is maddeningly unhelpful. (FailedPreconditionError, I mean, what.) Step 6: argparse! I spent some quality time reading this neural style implementation early on and thought, gosh, that’s argparse-heavy. Then I found myself wanting to kick off a whole bunch of different script runs to do their thing overnight investigating multiple hypotheses and discovered how very much I wanted there to be command-line arguments, so I could configure all the different things I wanted to try right there and leave it alone. Aw yeah. 
I’ve ended up with the following:

parser.add_argument('--content', required=True)
parser.add_argument('--style', required=True)
parser.add_argument('--iterations', default=400)  # was 200
parser.add_argument('--learning_rate', default=3.0)  # was 2.0
parser.add_argument('--layer_weights', nargs=5, default=[0.2, 0.2, 0.2, 0.2, 0.2])
parser.add_argument('--run_until_steady', default=False)
parser.add_argument('--noisy_start', default=True)

content is the path to the content image; style is the path to the style image; iterations and learning_rate are the usual; layer_weights is the value of STYLE_LAYERS in the original code, i.e. how much to weight each layer; run_until_steady is a bad API because it means to ignore the value of the iterations parameter and instead run until there is no longer significant change in cost; and noisy_start is whether to use the content image plus static as the first input or just the plain content image. I can definitely see adding more command line flags if I were going to be spending a lot of time with this code. (For instance, a layer_names parameter that adjusted what STYLE_LAYERS considered could be fun! Or making “significant change in cost” be a user-supplied rather than hardcoded parameter!) Step 6a: Correspondingly, I configured the output filenames to record some of the metadata used to create the image (content, style, layer_weights), to make it easier to keep track of which images came from which script runs. Stuff I haven’t done but it might be great:
- Updating tensorflow, per above, and recompiling it. The slowness is acceptable — I can run quite a few trials on my 2015 MacBook overnight — but it would get frustrating if I were doing a lot of this.
- Supporting both num_iterations and run_until_steady means my iterator inside the model_nn function is kind of a mess right now. I think they’re itching to be two very thin subclasses of a superclass that knows all the things about neural net training, with the subclass just handling the iterator, but I didn’t spend a lot of time thinking about this.
- Reshaping input files. Right now it needs both input files to be the same dimensions. Maybe it would be cool if it didn’t need that.
- Trying different pretrained models! It would be easy to pass a different arg to load_vgg_model. It would subsequently be annoying to make sure that STYLE_LAYERS worked — the available layer names would be different, and load_vgg_model makes a lot of assumptions about how that model is shaped.
As your reward for reading this post, you get another cat image! A friend commented that a thing he dislikes about neural style transfer is that it’s allergic to whitespace; it wants to paint everything with a texture. This makes sense — it sees subtle variations within that whitespace and it tries to make them conform to patterns of variation it knows. This is why I ended up with the noisy_start flag; I wondered what would happen if I didn’t add the static to the initial image, so that the original negative space stayed more negative-spacey. This, as you can probably tell, uses the Harlem renaissance style image. It’s still allergic to negative space — even without the generated static there are variations in pixel color in the original — but they are much subtler, so instead of saying “maybe what I see is coiled hair?” it says “big open blue patches; we like those”.
But the semantics of the original image are more in place — the kittens more kitteny, the card more readable — even though the whole image has been pushed more to colorblocks and bold lines. I find I like the results better without the static — even though the cost function is larger, and thus in a sense the algorithm is less successful. Look, one more. Superhero! Andromeda Uncategorized Leave a comment January 3, 2021 Dear Internet, merry Christmas; my robot made you cats from the void Recently I learned how neural style transfer works. I wanted to be able to play with it more and gain some insights, so I adapted the Coursera notebook code to something that works on localhost (more on that in a later post), found myself a nice historical cat image via DPLA, and started mashing it up with all manner of images of varying styles culled from DPLA’s list of primary source sets. (It really helped me that these display images were already curated for looking cool, and cropped to uniform size!) These sweet babies do not know what is about to happen to them. Let’s get started, shall we? Style image from the Fake News in the 1890s: Yellow Journalism primary source set. I really love how this one turned out. It’s pulled the blue and yellow colors, and the concerned face of the lower kitten was a perfect match for the expression on the right-hand muckraker. The lines of the card have taken on the precise quality of those in the cartoon — strong outlines and textured interiors. “Merry Christmas” the bird waves, like an eager newsboy. Style image from the Food and Social Justice exhibit. This is one of the first ones I made, and I was delighted by how it learned the square-iness of its style image. Everything is more snapped to a grid. The colors are bolder, too, cueing off of that dominant yellow. The Christmas banner remains almost readable and somehow heraldic. Style image from the Truth, Justice, and the American Way primary source set. How about Christmas of Steel? These kittens have broadly retained their shape (perhaps as the figures in the comic book foreground have organic detail?), but the background holly is more polygon-esque. The colors have been nudged toward primary, and the static of the background has taken on a swirl of dynamic motion lines. Style image from the Visual Art During the Harlem Renaissance primary source set. How about starting with something boldly colored and almost abstract? Why look: the kittens have learned a world of black and white and blue, with the background transformed into that stippled texture it picked up from the hair. The holly has gone more colorblocky and the lines bolder. Style image from the Treaty of Versailles and the End of World War I primary source set. This one learned its style so aptly that I couldn’t actually tell where the boundary between the second and third images was when I was placing that equals sign. The soft pencil lines, the vertical textures of shadows and jail bars, the fact that all the colors in the world are black and white and orange (the latter mostly in the middle) — these kittens are positively melting before the force of Wilsonian propaganda. Imagine them in the Hall of Mirrors, drowning in gold and reflecting back at you dozens of times, for full nightmare effect. Style image from the Victorian Era primary source set. Shall we step back a few decades to something slightly more calming? These kittens have learned to take on soft lines and swathes of pale pink. 
The holly is perfectly happy to conform itself to the texture of these New England trees. The dark space behind the kittens wonders if, perhaps, it is meant to be lapels. I totally can’t remember how I found this cropped version of US food propaganda. And now for kittens from the void. Brown, it has learned. The world is brown. The space behind the kittens is brown. Those dark stripes were helpfully already brown. The eyes were brown. Perhaps they can be the same brown, a hole dropped through kitten-space. I thought this was honestly pretty creepy, and I wondered if rerunning the process with different layer weights might help. Each layer of the neural net notices different sorts of things about its image; it starts with simpler things (colors, straight lines), moves through compositions of those (textures, basic shapes), and builds its way up to entire features (faces). The style transfer algorithm looks at each of those layers and applies some of its knowledge to the generated image. So I thought, what if I change the weights? The initial algorithm weights each of five layers equally; I reran it weighted toward the middle layers and entirely ignoring the first layer, in hopes that it would learn a little less about gaping voids of brown. Same thing, less void. This worked! There’s still a lot of brown, but the kitten’s eye is at least separate from its facial markings. My daughter was also delighted by how both of these images want to be letters; there are lots of letter-ish shapes strewn throughout, particularly on the horizontal line that used to be the edge of a planter, between the lower cat and the demon holly. So there you go, internet; some Christmas cards from the nightmare realm. May 2021 bring fewer nightmares to us all. Andromeda Uncategorized 1 Comment December 24, 2020December 24, 2020 this week in my AI After visualizing a whole bunch of theses and learning about neural style transfer and flinging myself at t-SNE I feel like I should have something meaty this week but they can’t all be those weeks, I guess. Still, I’m trying to hold myself to Friday AI blogging, so here are some work notes: Finished course 4 of the deeplearning.ai sequence. Yay! The facial recognition assignment is kind of buggy and poorly documented and I felt creepy for learning it in the first place, but I’m glad to have finished. Only one more course to go! It’s a 3-week course, so if I’m particularly aggressive I might be able to get it all done by year’s end. Tried making a 3d version of last week’s visualization — several people had asked — but it turned out to not really add anything. Oh well. Been thinking about Charlie Harper’s talk at SWiB this year, Generating metadata subject labels with Doc2Vec and DBPedia. This talk really grabbed me because he started with the exact same questions and challenges as HAMLET — seriously, the first seven and a half minutes of this talk could be the first seven and a half minutes of a talk on HAMLET, essentially verbatim — but took it off in a totally different direction (assigning subject labels). I have lots of ideas about where one might go with this but right now they are all sparkling Voronoi diagrams in my head and that’s not a language I can readily communicate. All done with the second iteration of my AI for librarians course. There were some really good final projects this term. Yay, students! Andromeda Uncategorized 1 Comment December 18, 2020December 19, 2020 Though these be matrices, yet there is method in them. 
When I first trained a neural net on 43,331 theses to make HAMLET, one of the things I most wanted to do is be able to visualize them. If word2vec places documents ‘near’ each other in some kind of inferred conceptual space, we should be able to see some kind of map of them, yes? Even if I don’t actually know what I’m doing? Turns out: yes. And it’s even better than I’d imagined. 43,331 graduate theses, arranged by their conceptual similarity. Let me take you on a tour! Region 1 is biochemistry. The red dots are biology; the orange ones, chemistry. Theses here include Positional cloning and characterization of the mouse pudgy locus and Biosynthetic engineering for the assembly of better drugs. If you look closely, you will see a handful of dots in different colors, like a buttery yellow. This color is electrical engineering & computer science, and its dots in this region include Computational regulatory genomics : motifs, networks, and dynamics — that is to say, a computational biology thesis that happens to have been housed in computation rather than biology. The green south of Region 2 is physics. But you will note a bit of orange here. Yes, that’s chemistry again; for example, Dynamic nuclear polarization of amorphous and crystalline small molecules. If (like me), you almost majored in chemistry and realized only your senior year that the only chemistry classes that interested you were the ones that were secretly physics…this is your happy place. In fact, most of the theses here concern nuclear magnetic resonance applications. Region 3 has a striking vertical green stripe which turns out to be the nuclear engineering department. But you’ll see some orange streaks curling around it like fingers, almost suggesting three-dimensional depth. I point this out as a reminder that the original neural net embeds these 43,331 documents in a 52-dimensional space; I have projected that down to 2 dimensions because I don’t know about you but I find 52 dimensions somewhat challenging to visualize. However — just as objects may overlap in a 2-dimensional photo even when they are quite distant in 3-dimensional space — dots that are close together in this projection may be quite far apart in reality. Trust the overall structure more than each individual element. The map is not the territory. That little yellow thumb by Region 4 is mathematics, now a tiny appendage off of the giant discipline it spawned — our old friend buttery yellow, aka electrical engineering & computer science. If you zoom in enough you find EECS absolutely everywhere, applied to all manner of disciplines (as above with biology), but the bulk of it — including the quintessential parts, like compilers — is right here. Dramatically red Region 5, clustered together tightly and at the far end, is architecture. This is a renowned department (it graduated I.M. Pei!), but definitely a different sort of creature than most of MIT, so it makes sense that it’s at one extreme of the map. That said, the other two programs in its school — Urban Studies & Planning and Media Arts & Sciences — are just to its north. Region 6 — tiny, yellow, and pale; you may have missed it at first glance — is linguistics island, housing theses such as Topics in the stress and syntax of words. You see how there are also a handful of red dots on this island? They are Brain & Cognitive Science theses — and in particular, ones that are secretly linguistics, like Intonational phrasing in language production and comprehension. 
Similarly — although at MIT it is not the department of linguistics, but the department of linguistics & philosophy — the philosophy papers are elsewhere. (A few of the very most abstract ones are hanging out near math.) And what about Region 7, the stingray swimming vigorously away from everything else? I spent a long time looking at this and not seeing a pattern. You can tell there’s a lot of colors (departments) there, randomly assorted; even looking at individual titles I couldn’t see anything. Only when I looked at the original documents did I realize that this is the island of terrible OCR. Almost everything here is an older thesis, with low-quality printing or even typewriting, often in a regrettable font, maybe with the reverse side of the page showing through. (A randomly chosen example; pdf download.) A good reminder of the importance of high-quality digitization labor. A heartbreaking example of the things we throw away when we make paper the archival format for born-digital items. And also a technical inspiration — look how much vector space we’ve had to carve out to make room for these! the poor neural net, trying desperately to find signal in the noise, needing all this space to do it. I’m tempted to throw out the entire leftmost quarter of this graph, rerun the 2d projection, and see what I get — would we be better able to see the structures in the high-quality data if they had room to breathe? And were I to rerun the entire neural net training process again, I’d want to include some sort of threshold score for OCR quality. It would be a shame to throw things away — especially since they will be a nonrandom sample, mostly older theses — but I have already had to throw away things I could not OCR at all in an earlier pass, and, again, I suspect the neural net would do a better job organizing the high-quality documents if it could use the whole vector space to spread them out, rather than needing some of it to encode the information “this is terrible OCR and must be kept away from its fellows”. Clearly I need to share the technical details of how I did this, but this post is already too long, so maybe next week. tl;dr I reached out to Matt Miller after reading his cool post on vectorizing the DPLA and he tipped me off to UMAP and here we are — thanks, Matt! And just as clearly you want to play with this too, right? Well, it’s super not ready to be integrated into HAMLET due to any number of usability issues but if you promise to forgive me those — have fun. You see how when you hover over a dot you get a label with the format 1721.1-X.txt? It corresponds to a URL of the format https://hamlet.andromedayelton.com/similar_to/X. Go play :). Andromeda Uncategorized 2 Comments December 11, 2020 Of such stuff are (deep)dreams made: convolutional networks and neural style transfer Skipped FridAI blogging last week because of Thanksgiving, but let’s get back on it! Top-of-mind today are the firing of AI queen Timnit Gebru (letter of support here) and a couple of grant applications that I’m actually eligible for (this is rare for me! I typically need things for which I can apply in my individual capacity, so it’s always heartening when they exist — wish me luck). But for blogging today, I’m gonna talk about neural style transfer, because it’s cool as hell. I started my ML-learning journey on Coursera’s intro ML class and have been continuing with their deeplearning.ai sequence; I’m on course 4 of 5 there, so I’ve just gotten to neural style transfer.
This is the thing where a neural net outputs the content of one picture in the style of another: Via https://medium.com/@build_it_for_fun/neural-style-transfer-with-swift-for-tensorflow-b8544105b854. OK, so! Let me explain while it’s still fresh. If you have a neural net trained on images, it turns out that each layer is responsible for recognizing different, and progressively more complicated, things. The specifics vary by neural net and data set, but you might find that the first layer gets excited about straight lines and colors; the second about curves and simple textures (like stripes) that can be readily composed from straight lines; the third about complex textures and simple objects (e.g. wheels, which are honestly just fancy circles); and so on, until the final layers recognize complex whole objects. You can interrogate this by feeding different images into the neural net and seeing which ones trigger the highest activation in different neurons. Below, each 3×3 grid represents the most exciting images for a particular neuron. You can see that in this network, there are Layer 1 neurons excited about colors (green, orange), and about lines of particular angles that form boundaries between dark and colored space. In Layer 2, these get built together like tiny image legos; now we have neurons excited about simple textures such as vertical stripes, concentric circles, and right angles. Via https://adeshpande3.github.io/The-9-Deep-Learning-Papers-You-Need-To-Know-About.html, originally from Zeiler & Fergus, Visualizing and Understanding Convolutional Networks So how do we get from here to neural style transfer? We need to extract information about the content of one image, and the style of another, in order to make a third image that approximates both of them. As you already expect if you have done a little machine learning, that means that we need to write cost functions that mean “how close is this image to the desired content?” and “how close is this image to the desired style?” And then there’s a wrinkle that I haven’t fully understood, which is that we don’t actually evaluate these cost functions (necessarily) against the outputs of the neural net; we actually compare the activations of the neurons, as they react to different images — and not necessarily from the final layer! In fact, choice of layer is a hyperparameter we can vary (I super look forward to playing with this on the Coursera assignment and thereby getting some intuition). So how do we write those cost functions? The content one is straightforward: if two images have the same content, they should yield the same activations. The greater the differences, the greater the cost (specifically via a squared error function that, again, you may have guessed if you’ve done some machine learning). The style one is beautifully sneaky; it’s a measure of the difference in correlation between activations across channels. What does that mean in English? Well, let’s look at the van Gogh painting, above. If an edge detector is firing (a boundary between colors), then a swirliness detector is probably also firing, because all the lines are curves — that’s characteristic of van Gogh’s style in this painting. On the other hand, if a yellowness detector is firing, a blueness detector may or may not be (sometimes we have tight parallel yellow and blue lines, but sometimes yellow is in the middle of a large yellow region). Style transfer posits that artistic style lies in the correlations between different features. See? Sneaky. And elegant.
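If it helps to see that written down, here is a minimal numpy sketch of the Gram-matrix style cost, roughly as the Coursera assignment formulates it; in the real code these are tensorflow tensors, and the function names are illustrative:

import numpy as np

def gram_matrix(activations):
    """One layer's activations, shape (n_H, n_W, n_C), reduced to channel-by-channel correlations."""
    n_H, n_W, n_C = activations.shape
    A = activations.reshape(n_H * n_W, n_C).T   # unroll to (n_C, n_H * n_W)
    return A @ A.T                              # (n_C, n_C): how strongly pairs of channels co-fire

def style_cost_for_layer(a_style, a_generated):
    """Squared difference between the two Gram matrices, with the usual normalization."""
    n_H, n_W, n_C = a_style.shape
    G_style = gram_matrix(a_style)
    G_generated = gram_matrix(a_generated)
    return np.sum((G_style - G_generated) ** 2) / (4 * n_C ** 2 * (n_H * n_W) ** 2)

The "swirliness fires whenever edges fire" intuition lives in the off-diagonal entries of those Gram matrices; matching them between the style image and the generated image is what transfers the style.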
Finally, for the style-transferred output, you need to generate an image that does as well as possible on both cost functions simultaneously — getting as close to the content as it can without unduly sacrificing the style, and vice versa. As a side note, I think I now understand why DeepDream is fixated on a really rather alarming number of eyes. Since the layer choice is a hyperparameter, I hypothesize that choosing too deep a layer — one that’s started to find complex features rather than mere textures and shapes — will communicate to the system, yes, what I truly want is for you to paint this image as if those complex features are matters of genuine stylistic significance. And, of course, eyes are simple enough shapes to be recognized relatively early (not very different from concentric circles), yet ubiquitous in image data sets. So…this is what you wanted, right? the eager robot helpfully offers. https://www.ucreative.com/inspiration/google-deep-dream-is-the-trippiest-thing-in-the-internet/ I’m going to have fun figuring out what the right layer hyperparameter is for the Coursera assignment, but I’m going to have so much more fun figuring out the wrong ones. Andromeda Uncategorized 2 Comments December 4, 2020 Let’s visualize some HAMLET data! Or, d3 and t-SNE for the lols. In 2017, I trained a neural net on ~44K graduate theses using the Doc2Vec algorithm, in hopes that doing so would provide a backend that could support novel and delightful discovery mechanisms for unique library content. The result, HAMLET, worked better than I hoped; it not only pulls together related works from different departments (thus enabling discovery that can’t be supported with existing metadata), but it does a spirited job on documents whose topics are poorly represented in my initial data set (e.g. when given a fiction sample it finds theses from programs like media studies, even though there are few humanities theses in the data set). That said, there are a bunch of exploratory tools I’ve had in my head ever since 2017 that I’ve not gotten around to implementing. But here, in the spirit of tossing out things that don’t bring me joy (like 2020) and keeping those that do, I’m gonna make some data viz! There are only two challenges with this: By default Doc2Vec embeds content in a 100-dimensional space, which is kind of hard to visualize. I need to project that down to 2 or 3 dimensions. I don’t actually know anything about dimensionality reduction techniques, other than that they exist. I also don’t know JavaScript much beyond a copy-paste level. I definitely don’t know d3, or indeed the pros and cons of various visualization libraries. Also art. Or, like, all that stuff in Tufte’s book, which I bounced off of. (But aside from that, Mr. Lincoln, how was the play?) I decided I should start with the pages that display the theses most similar to a given thesis (shout-out to Jeremy Brown, startup founder par excellence) rather than with my ideas for visualizing the whole collection, because I’ll only need to plot ten or so points instead of 44K. This will make it easier for me to tell visually if I’m on the right track and should let me skip dealing with performance issues for now. On the down side, it means I may need to throw out any code I write at this stage when I’m working on the next one. 🤷‍♀️ And I now have a visualization on localhost! Which you can’t see because I don’t trust it yet.
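Under the hood, the shape of that similar-theses pipeline is pretty small. A hedged sketch follows; the function names and the choice of scikit-learn's t-SNE are illustrative assumptions, not the actual HAMLET code:

import numpy as np
from sklearn.manifold import TSNE

def project_to_2d(vectors, random_state=0):
    """Reduce the ten-ish 100-dimensional Doc2Vec vectors to 2-d points for plotting."""
    coords = TSNE(n_components=2, perplexity=5, random_state=random_state).fit_transform(np.array(vectors))
    return [{"x": float(x), "y": float(y)} for x, y in coords]

# points = project_to_2d(similar_thesis_vectors)  # hand these to d3 on the page

Pinning random_state at least makes the layout deterministic between pageloads, though it does nothing about how faithfully the projection preserves distances.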
But here are the problems I’ve solved thus far: It’s hard to copy-paste d3 examples on the internet. d3’s been around for long enough there’s substantial content about different versions, so you have to double-check. But also most of the examples are live code notebooks on Observable, which is a wicked cool service but not the same environment as a web page! If you just copy-paste from there you will have things that don’t work due to invisible environment differences and then you will be sad. 😢 I got tipped off to this by Mollie Marie Pettit’s great Your First d3 Scatterplot notebook, which both names the phenomenon and provides two versions of the code (the live-editable version and the one you can actually copy/paste into your editor). If you start googling for dimensionality reduction techniques you will mostly find people saying “use t-SNE”, but t-SNE is a lying liar who lies. Mind you, it’s what I’m using right now because it’s so well-documented it was the easiest thing to set up. (This is why I said above that I don’t trust my viz.) But it produces different results for the same data on different pageloads (obviously different, so no one looking at the page will trust it either), and it’s not doing a good job preserving the distances I care about. (I accept that anything projecting from 100d down to 2d will need to distort distances, but I want to adequately preserve meaning — I want the visualization to not just look pretty but to give people an intellectually honest insight into the data — and I’m not there yet.) Conveniently this is not my first time at the software engineering rodeo, so I encapsulated my dimensionality reduction strategy inside a function, and I can swap it out for whatever I like without needing to rewrite the d3 as long as I return the same data structure. So that’s my next goal — try out UMAP (hat tip to Matt Miller for suggesting that to me), try out PCA, fiddle some parameters, try feeding it just the data I want to visualize vs larger neighborhoods, see if I’m happier with what I get. UMAP in particular alleges itself to be fast with large data sets, so if I can get it working here I should be able to leverage that knowledge for my ideas for visualizing the whole thing. Onward, upward, et cetera. 🎉 Andromeda Uncategorized 2 Comments November 20, 2020 andromedayelton-com-4308 ---- andromeda yelton I haven’t failed, I’ve just tried a lot of ML approaches that don’t work “Let’s blog every Friday,” I thought. “It’ll be great. People can see what I’m doing with ML, and it will be a useful practice for me!” And then I went through weeks on end of feeling like I had nothing to report because I was trying approach after approach to this one problem that simply … Continue reading I haven’t failed, I’ve just tried a lot of ML approaches that don’t work → this time: speaking about machine learning No tech blogging this week because most of my time was taken up with telling people about ML instead! One talk for an internal Harvard audience, “Alice in Dataland”, where I explained some of the basics of neural nets and walked people through the stories I found through visualizing HAMLET data.
One talk for the … Continue reading this time: speaking about machine learning → archival face recognition for fun and nonprofit In 2019, Dominique Luster gave a super good Code4Lib talk about applying AI to metadata for the Charles “Teenie” Harris collection at the Carnegie Museum of Art — more than 70,000 photographs of Black life in Pittsburgh. They experimented with solutions to various metadata problems, but the one that’s stuck in my head since 2019 … Continue reading archival face recognition for fun and nonprofit → sequence models of language: slightly irksome Not much AI blogging this week because I have been buried in adulting all week, which hasn’t left much time for machine learning. Sadface. However, I’m in the last week of the last deeplearning.ai course! (Well. Of the deeplearning.ai sequence that existed when I started, anyway. They’ve since added an NLP course and a GANs … Continue reading sequence models of language: slightly irksome → Adapting Coursera’s neural style transfer code to localhost Last time, when making cats from the void, I promised that I’d discuss how I adapted the neural style transfer code from Coursera’s Convolutional Neural Networks course to run on localhost. Here you go! Step 1: First, of course, download (as python) the script. You’ll also need the nst_utils.py file, which you can access via … Continue reading Adapting Coursera’s neural style transfer code to localhost → Dear Internet, merry Christmas; my robot made you cats from the void Recently I learned how neural style transfer works. I wanted to be able to play with it more and gain some insights, so I adapted the Coursera notebook code to something that works on localhost (more on that in a later post), found myself a nice historical cat image via DPLA, and started mashing it … Continue reading Dear Internet, merry Christmas; my robot made you cats from the void → this week in my AI After visualizing a whole bunch of theses and learning about neural style transfer and flinging myself at t-SNE I feel like I should have something meaty this week but they can’t all be those weeks, I guess. Still, I’m trying to hold myself to Friday AI blogging, so here are some work notes: Finished course … Continue reading this week in my AI → Though these be matrices, yet there is method in them. When I first trained a neural net on 43,331 theses to make HAMLET, one of the things I most wanted to do is be able to visualize them. If word2vec places documents ‘near’ each other in some kind of inferred conceptual space, we should be able to see some kind of map of them, yes? … Continue reading Though these be matrices, yet there is method in them. → Of such stuff are (deep)dreams made: convolutional networks and neural style transfer Skipped FridAI blogging last week because of Thanksgiving, but let’s get back on it! Top-of-mind today are the firing of AI queen Timnit Gebru (letter of support here) and a couple of grant applications that I’m actually eligible for (this is rare for me! I typically need things for which I can apply in my … Continue reading Of such stuff are (deep)dreams made: convolutional networks and neural style transfer → Let’s visualize some HAMLET data! Or, d3 and t-SNE for the lols. In 2017, I trained a neural net on ~44K graduate theses using the Doc2Vec algorithm, in hopes that doing so would provide a backend that could support novel and delightful discovery mechanisms for unique library content. 
The result, HAMLET, worked better than I hoped; it not only pulls together related works from different departments (thus … Continue reading Let’s visualize some HAMLET data! Or, d3 and t-SNE for the lols. → andromedayelton-com-509 ---- I haven’t failed, I’ve just tried a lot of ML approaches that don’t work – andromeda yelton Skip to content andromeda yelton Menu Home About Contact Resume HAMLET LITA Talks Machine Learning (ALA Midwinter 2019) Boston Python Meetup (August 21, 2018) SWiB16 LibTechConf 2016 Code4Lib 2015 Keynote Texas Library Association 2014 Online Northwest 2014: Five Conversations About Code New Jersey ESummit (May 2, 2013) Westchester Library Association (January 7, 2013) Bridging the Digital Divide with Mobile Services (Webjunction, July 25 2012) I haven’t failed, I’ve just tried a lot of ML approaches that don’t work Andromeda Uncategorized April 16, 2021 “Let’s blog every Friday,” I thought. “It’ll be great. People can see what I’m doing with ML, and it will be a useful practice for me!” And then I went through weeks on end of feeling like I had nothing to report because I was trying approach after approach to this one problem that simply didn’t work, hence not blogging. And finally realized: oh, the process is the thing to talk about… Hi. I’m Andromeda! I am trying to make a neural net better at recognizing people in archival photos. After running a series of experiments — enough for me to have written 3,804 words of notes — I now have a neural net that is ten times worse at its task. 🎉 And now I have 3,804 words of notes to turn into a blog post (a situation which gets harder every week). So let me catch you up on the outline of the problem: Download a whole bunch of archival photos and their metadata (thanks, DPLA!) Use a face detection ML library to locate faces, crop them out, and save them in a standardized way Benchmark an off-the-shelf face recognition system to see how good it is at identifying these faces Retrain it Benchmark my new system Step 3: profit, right? Well. Let me also catch you up on some problems along the way: Alas, metadata Archival photos are great because they have metadata, and metadata is like labels, and labels mean you can do supervised learning, right? Well…. Is he “Du Bois, W. E. B. (William Edward Burghardt), 1868-1963” or “Du Bois, W. E. B. (William Edward Burghardt) 1868-1963” or “Du Bois, W. E. B. (William Edward Burghardt)” or “W.E.B. Du Bois”? I mean, these are all options. People have used a lot of different metadata practices at different institutions and in different times. But I’m going to confuse the poor computer if I imply to it that all these photos of the same person are photos of different people. (I have gone through several attempts to resolve this computationally without needing to do everything by hand, with only modest success.) What about “Photographs”? That appears in the list of subject labels for lots of things in my data set. “Photographs” is a person, right? I ended up pulling in an entire other ML component here — spaCy, to do some natural language processing to at least guess which lines are probably names, so I can clear the rest of them out of my way. But spaCy only has ~90% accuracy on personal names anyway and, guess what, because everything is terrible, in predictable ways, it has no idea “Kweisi Mfume” is a person. Is a person who appears in the photo guaranteed to be a person who appears in the photo? Nope. Is a person who appears in the metadata guaranteed to be a person who appears in the photo? 
Also nope! Often they’re a photographer or other creator. Sometimes they are the subject of the depicted event, but not themselves in the photo. (spaCy will happily tell you that there’s personal name content in something like “Martin Luther King Day”, but MLK is unlikely to appear in a photo of an MLK day event.) Oh dear, linear algebra OK but let’s imagine for the sake of argument that we live in a perfect world where the metadata is exactly what we need — no more, no less — and its formatting is perfectly consistent. 🦄 Here you are, in this perfect world, confronted with a photo that contains two people and has two names. How do you like them apples? I spent more time than I care to admit trying to figure this out. Can I bootstrap from photos that have one person and one name — identify those, subtract them out of photos of two people, go from there? (Not reliably — there’s a lot of data I never reach that way — and it’s horribly inefficient.) Can I do something extremely clever with matrix multiplication? Like…once I generate vector space embeddings of all the photos, can I do some sort of like dot-product thing across all of my photos, or big batches of them, and correlate the closest-match photos with overlaps in metadata? Not only is this a process which begs the question — I’d have to do that with the ML system I have not yet optimized for archival photo recognition, thus possibly just baking bad data in — but have I mentioned I have taken exactly one linear algebra class, which I didn’t really grasp, in 1995? What if I train yet another ML system to do some kind of k-means clustering on the embeddings? This is both a promising approach and some really first-rate yak-shaving, combining all the question-begging concerns of the previous paragraph with all the crystalline clarity of black box ML. Possibly at this point it would have been faster to tag them all by hand, but that would be admitting defeat. Also I don’t have a research assistant, which, let’s be honest, is the person who would usually be doing this actual work. I do have a 14-year-old and I am strongly considering paying her to do it for me, but to facilitate that I’d have to actually build a web interface and probably learn more about AWS, and the prospect of reading AWS documentation has a bracing way of reminding me of all of the more delightful and engaging elements of my todo list, like calling some people on the actual telephone to sort out however they’ve screwed up some health insurance billing. Nowhere to go but up Despite all of that, I did actually get all the way through the 5 steps above. I have a truly, spectacularly terrible neural net. Go me! But at a thousand-plus words, perhaps I should leave that story for next week…. Tagged fridAI Published by Andromeda Romantic analytical technologist librarian. Published April 16, 2021
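For what it's worth, the spaCy filter mentioned earlier boils down to something like this: a minimal sketch, with the model choice and the shape of the metadata as assumptions, and with exactly the accuracy caveats described above.

import spacy

nlp = spacy.load("en_core_web_sm")  # assumption: the small English model; none of them are perfect on names

def probable_person_headings(subject_headings):
    """Keep only the metadata lines spaCy thinks contain a personal name."""
    keep = []
    for heading in subject_headings:
        doc = nlp(heading)
        if any(ent.label_ == "PERSON" for ent in doc.ents):
            keep.append(heading)
    return keep

# probable_person_headings(["Du Bois, W. E. B. (William Edward Burghardt), 1868-1963", "Photographs"])
# ideally keeps just the first entry, though as noted spaCy still misses some real
# names and happily passes through things like "Martin Luther King Day".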
api-flickr-com-6743 ---- Recent Uploads tagged code4lib IMG_9817 IMG_9861 IMG_9945 IMG_9946 IMG_9922 IMG_9924 IMG_9932 IMG_9941 IMG_9881 IMG_9866 IMG_9952 IMG_9877 IMG_9959 IMG_9882 IMG_9905 IMG_9845 IMG_9823 IMG_9843 IMG_9895 IMG_9855 apps-lib-umich-edu-4914 ---- Library Tech Talk - U-M Library Technology Innovations and Project Updates from the U-M Library I.T. Division Library IT Services Portfolio Academic library service portfolios are mostly a mix of big to small strategic initiatives and tactical projects. Systems developed in the past can become a durable bedrock of workflows and services around the library, remaining relevant and needed for five, ten, and sometimes as long as twenty years. There is, of course, never enough time and resources to do everything. The challenge faced by Library IT divisions is to balance the tension of sustaining these legacy systems while continuing to innovate and develop new services. The University of Michigan’s Library IT portfolio has legacy systems in need of ongoing maintenance and support, in addition to new projects and services that add to and expand the portfolio. We, at Michigan, worked on a process to balance the portfolio of services and projects for our Library IT division. We started working on the idea of developing a custom tool for our needs since all the other available tools are oriented towards corporate organizations and we needed a light-weight tool to support our process. We went through a complete planning process first on whiteboards and paper, then developed an open source tool called TRACC for helping us with portfolio management. 4 keys to a dazzling library website redesign The U-M Library launched a completely new primary website in July after 2 years of work. The redesign project team focused on building a strong team, internal communication, content strategy, and practicing needs informed design and development to make the project a success. Sweet Sixteen: Digital Collections Completed July 2019 - June 2020 Digital Content & Collections (DCC) relies on content and subject experts to bring us new digital collections. This year, 16 digital collections were created or significantly enhanced. Here you will find links to videos and articles by the subject experts speaking in their own words about the digital collections they were involved in and why they found it so important to engage in this work with us. Thank you to all of the people involved in each of these digital collections! Adding Ordered Metadata Fields to Samvera Hyrax How to add ordered metadata fields in Samvera Hyrax. Includes example code and links to actual code. Sinking our Teeth into Metadata Improvement Like many attempts at revisiting older materials, working with a couple dozen volumes of dental pamphlets started very simply but ended up being an interesting opportunity to explore the challenges of making the diverse range of materials held in libraries accessible to patrons in a digital environment. And while improving metadata may not sound glamorous, having sufficient metadata for users to be able to find what they are looking for is essential for the utility of digital libraries.
Collaboration and Generosity Provide the Missing Issue of The American Jewess What started with a bit of wondering and conversation within our unit of the Library led to my reaching out to Princeton University with a request but no expectations of having that request fulfilled. Individuals at Princeton, however, considered the request and agreed to provide us with the single issue of The American Jewess that we needed to complete the full run of the periodical within our digital collection. Especially in these stressful times, we are delighted to bring you a positive story, one of collaboration and generosity across institutions, while also sharing the now-complete digital collection itself. How to stop being negative, or digitizing the Harry A. Franck film collection This article reviews how 9,000+ frames of photographic negatives from the Harry A. Franck collection are being digitally preserved. Combine Metadata Harvester: Aggregate ALL the data! The Digital Public Library of America (DPLA) has collected and made searchable a vast quantity of metadata from digital collections all across the country. The Michigan Service Hub works with cultural heritage institutions throughout the state to collect their metadata, transform those metadata to be compatible with the DPLA’s online library, and send the transformed metadata to the DPLA, using the Combine aggregator software, which is being developed here at the U of M Library. Hacks with Friends 2020 Retrospective: A pitch to hitch in 2021 When the students go on winter break I go to Hacks with Friends (HWF) and highly recommend and encourage everyone who can to participate in HWF 2021. Not only is it two days of free breakfast, lunch, and snacks at the Ross School of Business, but it’s a chance to work with a diverse cross section of faculty, staff, and students on innovative solutions to complex problems. U-M Library’s Digital Collection Items are now Included in Library Search The University Library’s digital collections, encompassing more than 300 collections with over a million items, are now discoverable through the library’s Articles discovery tool, powered by Summon. Read on to learn about searching this trove of images and text, and how to add it to your library’s Summon instance. apps-lib-umich-edu-9881 ---- Library Tech Talk Blog | U-M Library Skip to main content Log in Library Tech Talk Technology Innovations and Project Updates from the U-M Library I.T. 
Division Search Library Tech Talk Subscribe To RSS feed Get updates via Email (U-M Only) Popular posts for Library Tech Talk Library IT Services Portfolio 4 keys to a dazzling library website redesign Sweet Sixteen: Digital Collections Completed July 2019 - June 2020 Adding Ordered Metadata Fields to Samvera Hyrax Sinking our Teeth into Metadata Improvement Tags in Library Tech Talk HathiTrust Library Website MLibrary Labs DLXS Web Content Strategy Mirlyn Digital Collections Digitization search Design MTagger OAI Accessibility Usability Group UX Archive for Library Tech Talk Show 2020 October 2020 (1) September 2020 (1) August 2020 (1) July 2020 (1) June 2020 (2) April 2020 (2) March 2020 (1) January 2020 (1) Show 2019 October 2019 (1) June 2019 (2) April 2019 (1) February 2019 (2) January 2019 (1) Show 2018 December 2018 (1) November 2018 (1) September 2018 (1) July 2018 (2) April 2018 (1) February 2018 (1) Show Older Show 2017 November 2017 (3) September 2017 (1) August 2017 (1) June 2017 (1) April 2017 (1) March 2017 (1) February 2017 (1) January 2017 (1) Show 2016 December 2016 (2) November 2016 (2) August 2016 (2) June 2016 (1) April 2016 (1) March 2016 (1) February 2016 (1) January 2016 (1) Show 2015 December 2015 (1) November 2015 (1) October 2015 (2) September 2015 (2) July 2015 (2) June 2015 (2) May 2015 (2) April 2015 (2) March 2015 (2) February 2015 (2) January 2015 (2) Show 2014 December 2014 (2) November 2014 (2) October 2014 (2) September 2014 (2) August 2014 (2) July 2014 (2) June 2014 (2) Show 2012 December 2012 (1) October 2012 (1) September 2012 (2) April 2012 (2) March 2012 (1) January 2012 (1) Show 2011 August 2011 (2) July 2011 (1) June 2011 (1) May 2011 (1) Show 2010 December 2010 (1) November 2010 (2) September 2010 (2) July 2010 (5) May 2010 (1) April 2010 (1) March 2010 (2) Show 2009 December 2009 (3) October 2009 (2) September 2009 (1) August 2009 (1) July 2009 (1) May 2009 (1) February 2009 (1) January 2009 (2) Show 2008 December 2008 (3) November 2008 (1) October 2008 (2) September 2008 (2) August 2008 (3) July 2008 (5) June 2008 (6) May 2008 (6) Library IT Services Portfolio Academic library service portfolios are mostly a mix of big to small strategic initiatives and tactical projects. Systems developed in the past can become a durable bedrock of workflows and services around the library, remaining relevant and needed for five, ten, and sometimes as long as twenty years. There is, of course, never enough time and resources to do everything. The challenge faced by Library IT divisions is to balance the tension of sustaining these legacy systems while continuing to... October 7, 2020 See all posts by Nabeela Jaffer 4 keys to a dazzling library website redesign The U-M Library launched a completely new primary website in July after 2 years of work. The redesign project team focused on building a strong team, internal communication, content strategy, and practicing needs informed design and development to make the project a success. September 8, 2020 See all posts by Heidi Steiner Burkhardt Sweet Sixteen: Digital Collections Completed July 2019 - June 2020 Digital Content & Collections (DCC) relies on content and subject experts to bring us new digital collections. This year, 16 digital collections were created or significantly enhanced. Here you will find links to videos and articles by the subject experts speaking in their own words about the digital collections they were involved in and why they found it so important to engage in this work with us. 
Thank you to all of the people involved in each of these digital collections! August 6, 2020 See all posts by Lauren Havens Adding Ordered Metadata Fields to Samvera Hyrax How to add ordered metadata fields in Samvera Hyrax. Includes example code and links to actual code. July 20, 2020 See all posts by Fritz Freiheit Sinking our Teeth into Metadata Improvement Like many attempts at revisiting older materials, working with a couple dozen volumes of dental pamphlets started very simply but ended up being an interesting opportunity to explore the challenges of making the diverse range of materials held in libraries accessible to patrons in a digital environment. And while improving metadata may not sound glamorous, having sufficient metadata for users to be able to find what they are looking for is essential for the utility of digital libraries. June 30, 2020 See all posts by Jackson Huang Collaboration and Generosity Provide the Missing Issue of The American Jewess What started with a bit of wondering and conversation within our unit of the Library led to my reaching out to Princeton University with a request but no expectations of having that request fulfilled. Individuals at Princeton, however, considered the request and agreed to provide us with the single issue of The American Jewess that we needed to complete the full run of the periodical within our digital collection. Especially in these stressful times, we are delighted to bring you a positive... June 15, 2020 See all posts by Lauren Havens How to stop being negative, or digitizing the Harry A. Franck film collection This article reviews how 9,000+ frames of photographic negatives from the Harry A. Franck collection are being digitally preserved. April 27, 2020 See all posts by Larry Wentzel Library Contact Information University of Michigan Library 818 Hatcher Graduate Library South, 913 S. University Avenue Ann Arbor, MI 48109-1190 (734) 764-0400 | contact-mlibrary@umich.edu Except where otherwise noted, this work is subject to a Creative Commons Attribution 4.0 license. For details and exceptions, see the Library Copyright Policy. ©2014, Regents of the University of Michigan archive-org-5338 ---- Internet Archive: About IA
About Blog Projects Help Donate An illustration of a heart shape Contact Jobs Volunteer People Search Metadata Search text contents Search TV news captions Search archived websites Advanced Search Sign up for free Log in Read More Server Statistics Archive Statistics Job Opportunities at the Internet Archive Events News [more] Hooniverse: Wayback Machine Allows a Peek into Defunct Detroit Automaker Websites Laughing Squid: An Amazing Collection of Pulp Magazines Going Back 75 Years Is Available Online at The Internet Archive Far Out Magazine: Over 100,000 historic vinyl records are being digitised and made available to stream online for free GigaZine: ウェブ上の情報を記録・保存する「インターネット・アーカイブ」の存続をひっそりと脅かしているものとは? ActuaLitte: Plongez dans l'art japonais de la fin du XIXe siècle grâce à ce magazine numérisé Library Journal: Better World Libraries, Internet Archive Partner, Acquires Better World Books Open Culture: The Internet Archive Is Digitizing & Preserving Over 100,000 Vinyl Records: Hear 750 Full Albums Now Against The Grain: ATG Newsflash: For the Love of Literacy–Better World Books and the Internet Archive Unite to Preserve Millions of Books Research Information: Better World Books affiliates with Internet Archive Wired: The Internet Archive Is Making Wikipedia More Reliable About the Internet Archive The Internet Archive, a 501(c)(3) non-profit, is building a digital library of Internet sites and other cultural artifacts in digital form. Like a paper library, we provide free access to researchers, historians, scholars, the print disabled, and the general public. Our mission is to provide Universal Access to All Knowledge. We began in 1996 by archiving the Internet itself, a medium that was just beginning to grow in use. Like newspapers, the content published on the web was ephemeral - but unlike newspapers, no one was saving it. Today we have 25+ years of web history accessible through the Wayback Machine and we work with 750+ library and other partners through our Archive-It program to identify important web pages. As our web archive grew, so did our commitment to providing digital versions of other published works. Today our archive contains: 475 billion web pages 28 million books and texts 14 million audio recordings (including 220,000 live concerts) 6 million videos (including 2 million Television News programs) 3.5 million images 580,000 software programs Anyone with a free account can upload media to the Internet Archive. We work with thousands of partners globally to save copies of their work into special collections. Because we are a library, we pay special attention to books. Not everyone has access to a public or academic library with a good collection, so to provide universal access we need to provide digital versions of books. We began a program to digitize books in 2005 and today we scan 3,500 books per day in 18 locations around the world. Books published prior to 1926 are available for download, and hundreds of thousands of modern books can be borrowed through our Open Library site. Some of our digitized books are only available to people with print disabilities. Like the Internet, television is also an ephemeral medium. We began archiving television programs in late 2000, and our first public TV project was an archive of TV news surrounding the events of September 11, 2001. In 2009 we began to make selected U.S. television news broadcasts searchable by captions in our TV News Archive. 
This service allows researchers and the public to use television as a citable and sharable reference. The Internet Archive serves millions of people each day and is one of the top 300 web sites in the world. A single copy of the Internet Archive library collection occupies 70+ Petabytes of server space (and we store at least 2 copies of everything). We are funded through donations, grants, and by providing web archiving and book digitization services for our partners. As with most libraries we value the privacy of our patrons, so we avoid keeping the IP (Internet Protocol) addresses of our readers and offer our site in https (secure) protocol. You can find information about our projects on our blog (including important announcements), contact us, buy swag in our store, and follow us on Twitter and Facebook. Welcome to the library! Recent foundation funding generously provided by: Andrew W. Mellon Foundation Council on Library and Information Resources Democracy Fund Federal Communications Commission Universal Service Program for Schools and Libraries (E-Rate) Institute of Museum and Library Services (IMLS) Knight Foundation Laura and John Arnold Foundation National Endowment for the Humanities, Office of Digital Humanities National Science Foundation The Peter and Carmen Lucia Buck Foundation The Philadelphia Foundation Rita Allen Foundation The Internet Archive is a member of: American Library Association (ALA) Biodiversity Heritage Library (BHL) Boston Library Consortium (BLC) Califa Council on Library and Information Resources (CLIR) Coalition for Networked Information (CNI) Digital Library Federation (DLF) Digital Preservation Coalition (DPC) Digital Public Library of America (DPLA) International Federation of Library Associations and Institutions (IFLA) International Internet Preservation Consortium (IIPC) Music Library Association National Digital Stewardship Alliance (NDSA) ReShare archive-org-9038 ---- Internet Archive: Wayback Machine Explore more than 544 billion web pages saved over time
deviantart.com Oct 15, 2013 21:28:20 cl.cam.ac.uk Feb 29, 2000 18:34:39 foodnetwork.com Oct 20, 2013 22:40:56 yahoo.com Dec 20, 1996 15:45:10 spiegel.com Oct 01, 2013 15:26:30 imdb.com Oct 21, 2013 16:53:47 stackoverflow.com Oct 14, 2013 21:22:10 ubl.com Dec 27, 1996 20:38:47 bloomberg.com Oct 01, 2013 23:10:45 reference.com Oct 18, 2013 07:12:58 feedmag.com Dec 23, 1996 10:53:17 wikihow.com Oct 21, 2013 20:56:46 nbcnews.com Oct 21, 2013 17:24:52 goodreads.com Oct 21, 2013 00:42:42 obamaforillinois.com Nov 09, 2004 04:28:06 geocities.com Feb 22, 1997 17:47:51 amazon.com Feb 04, 2005 00:47:33 nytimes.com Oct 01, 2013 01:42:36 bbc.co.uk Oct 01, 2013 00:13:32 huffingtonpost.com Oct 21, 2013 17:11:12 reddit.com Oct 01, 2013 03:15:39 cnet.com Oct 21, 2013 02:07:03 whitehouse.gov Dec 27, 1996 06:25:41 aol.com Oct 01, 2013 05:01:31 yelp.com Oct 19, 2013 02:44:53 etsy.com Jun 01, 2013 01:38:52 foxnews.com Oct 01, 2013 01:08:27 well.com Jan 08, 1997 06:53:37 w3schools.com Oct 19, 2013 00:55:10 buzzfeed.com Oct 21, 2013 17:32:21 nasa.gov Dec 31, 1996 23:58:47 mashable.com Oct 21, 2013 02:16:14 nfl.com Oct 21, 2013 07:39:25 Tools Wayback Machine Availability API Build your own tools. WordPress Broken Link Checker Banish broken links from your blog. 404 Handler for Webmasters Help users get where they were going. Subscription Service Archive-It enables you to capture, manage and search collections of digital content without any technical expertise or hosting facilities. Visit Archive-It to build and browse the collections. Save Page Now Capture a web page as it appears now for use as a trusted citation in the future. Only available for sites that allow crawlers. arstechnica-com-2640 ---- Fender bender in Arizona illustrates Waymo’s commercialization challenge | Ars Technica Self-driving — Fender bender in Arizona illustrates Waymo’s commercialization challenge Self-driving systems won’t necessarily make the same mistakes as human drivers. Timothy B. Lee - Apr 2, 2021 5:07 pm UTC A Waymo self-driving car in Silicon Valley in 2019. Sundry Photography / Getty A police report obtained by the Phoenix New Times this week reveals a minor Waymo-related crash that occurred last October but hadn’t been publicly reported until now. Here’s how the New Times describes the incident: A white Waymo minivan was traveling westbound in the middle of three westbound lanes on Chandler Boulevard, in autonomous mode, when it unexpectedly braked for no reason. A Waymo backup driver behind the wheel at the time told Chandler police that "all of a sudden the vehicle began to stop and gave a code to the effect of 'stop recommended' and came to a sudden stop without warning."
A red Chevrolet Silverado pickup behind the vehicle swerved to the right but clipped its back panel, causing minor damage. Nobody was hurt. Overall, Waymo has a strong safety record. Waymo has racked up more than 20 million testing miles in Arizona, California, and other states, far more than any human being will drive in a lifetime. Waymo's vehicles have been involved in a relatively small number of crashes, and those crashes have been overwhelmingly minor, with no fatalities and few, if any, serious injuries. Waymo says that a large majority of those crashes have been the fault of the other driver. So it's very possible that Waymo's self-driving software is significantly safer than a human driver. Further Reading: This Arizona college student has taken over 60 driverless Waymo rides. At the same time, Waymo isn't acting like a company with a multi-year head start on potentially world-changing technology. Three years ago, Waymo announced plans to buy "up to" 20,000 electric Jaguars and 62,000 Pacifica minivans for its self-driving fleet. The company hasn't recently released numbers on its fleet size, but it's safe to say it is nowhere near hitting those targets. The service territory for the Waymo One taxi service in suburban Phoenix hasn't expanded much since it launched two years ago. Waymo hasn't addressed the slow pace of expansion, but incidents like last October's fender-bender might help explain it. It's hard to be sure if self-driving technology is safe. Rear-end collisions like this rarely get anyone killed, and Waymo likes to point out that Arizona law prohibits tailgating. In most rear-end crashes, the driver in the back is considered to be at fault. At the same time, it's obviously not ideal for a self-driving car to suddenly come to a stop in the middle of the road. More generally, Waymo's vehicles sometimes hesitate longer than a human would when they encounter complex situations they don't fully understand. Human drivers sometimes find this frustrating, and it occasionally leads to crashes. In January 2020, a Waymo vehicle unexpectedly stopped as it approached an intersection where the stoplight was green. A police officer in an unmarked vehicle couldn't stop in time and hit the Waymo vehicle from behind. Again, no one was seriously injured. It's difficult to know if this kind of thing happens more often with Waymo's vehicles than with human drivers. Minor fender benders aren't always reported to the police and may not be reflected in official crash statistics, overstating the safety of human drivers. By contrast, any crash involving cutting-edge self-driving technology is likely to attract public attention. The more serious problem for Waymo is that the company can't be sure that the idiosyncrasies of its self-driving software won't contribute to a more serious crash in the future. Human drivers cause a fatality about once every 100 million miles of driving—far more miles than Waymo has tested so far (a rough back-of-envelope calculation at the end of this section puts numbers on that gap). If Waymo scaled up rapidly, it would be taking a risk that an unnoticed flaw in Waymo's programming could lead to someone getting killed. And crucially, self-driving cars are likely to make different types of mistakes than human drivers. So it's not sufficient to make a list of mistakes human drivers commonly make and verify that self-driving software avoids making them. You also need to figure out if self-driving cars will screw up in scenarios that human drivers deal with easily.
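To make that mileage gap concrete, here is a rough back-of-envelope sketch, not an analysis from Waymo or from the article, using the two figures cited above (roughly 20 million test miles, and roughly one fatality per 100 million human-driven miles) and a simple Poisson assumption:

```python
# Back-of-envelope only: assumes the figures quoted in the article and a simple
# Poisson model of fatal crashes. Not Waymo's data or methodology.
import math

test_miles = 20_000_000                 # reported Waymo testing mileage
human_fatality_rate = 1 / 100_000_000   # ~1 fatality per 100 million human-driven miles

expected_fatalities = test_miles * human_fatality_rate  # if the software were merely human-level
p_zero_even_if_human_level = math.exp(-expected_fatalities)

print(f"Expected fatalities at human-level risk: {expected_fatalities:.2f}")                   # ~0.20
print(f"Chance of a clean record even at human-level risk: {p_zero_even_if_human_level:.0%}")  # ~82%
```

In other words, a zero-fatality record over 20 million miles is likely even for a system that is no safer than a human driver, which is why fatality-level safety claims need far more mileage (or other evidence) to support them.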
And there may be no other way to find these scenarios than with lots and lots of testing. Waymo has logged far more testing miles than other companies in the US, but there's every reason to think Waymo's competitors will face this same dilemma as they move toward large-scale commercial deployments. By now, a number of companies have developed self-driving cars that can handle most situations correctly most of the time. But building a car that can go millions of miles without a significant mistake is hard. And proving it is even harder. Timothy B. Lee is a senior reporter covering tech policy, blockchain technologies and the future of transportation. He lives in Washington DC. Email timothy.lee@arstechnica.com // Twitter @binarybits arstechnica-com-4270 ---- Ars Technica’s non-fungible guide to NFTs | Ars Technica. This article is for sale as an NFT, probably. Is blockchain item authentication a speculative fad or a technological sea change? Kyle Orland - Mar 29, 2021 11:15 am UTC. Look ma, I'm on the blockchain. Chris Torres | Beeple | Aurich Lawson. It has been nearly 10 years now since Ars Technica first described Bitcoin to readers as “the world’s first virtual currency… designed by an enigmatic, freedom-loving hacker, and currently used by the geek underground to buy and sell everything from servers to cellphone jammers.” A decade later, Bitcoin and other cryptocurrencies are practically mainstream, and even most non-techies know the blockchain basics powering a decentralized financial revolution (or a persistent bubble, if you prefer). What Bitcoin was to 2011, NFTs are to 2021.
So-called “non-fungible tokens” are having a bit of a moment in recent weeks, attracting a surge of venture capital cash and eye-watering speculative values for traceable digital goods. That's despite the fact that most of the general public barely understands how this blockchain-based system of digital authentication works, or why it's behind people paying $69 million for a single GIF. Fungible? Token? Perhaps the simplest way to start thinking about NFTs is as a digital version of the various “certificates of authenticity” that are prevalent in the market for real-world art and collectibles. Instead of a slip of paper, though, NFTs use cryptographic smart contracts and a distributed blockchain (most often built on top of Ethereum these days) to certify who owns each distinct, authentic token. As with cryptocurrencies, those contracts are verified by the collective distributed work of miners who keep the entire system honest with their computational work (the electricity for which creates a lot of nasty carbon emissions). And just like cryptocurrencies, those NFTs can be sold and traded directly on any number of marketplaces without any centralized control structure dictating the rules of those transfers. What makes NFTs different from your run-of-the-mill cryptocurrency is each token’s distinctiveness. With a cryptocurrency like Bitcoin, each individual unit is indistinguishable from another and has an identical value. Each individual Bitcoin can be traded or divided up just like any other Bitcoin (i.e., the Bitcoins are fungible). NFTs being “non-fungible” means each one represents a distinct entity with a distinct value that can’t be divided into smaller units. Just as anyone can start printing their own line of Certificates of Authenticity (or anyone can start up their own cryptocurrency to try to be “the next Bitcoin”), anyone with just a little technical know-how can start minting their own distinct NFTs. Etherscan currently lists over 9,600 distinct NFT contracts, each its own network of trust representing and tracking its own set of digital goods. It's trivial to make a digital copy of any of the images for sale on Rarible. But those copies won't have the "authenticity" of the actual NFT being sold... These NFT contracts can represent pretty much anything that can exist digitally: a webpage, a GIF, a video clip, you name it. Digital artists are using NFTs to create “scarce” verified versions of their pieces, while collectible companies are using them to create traceable, unforgeable digital trading cards. Video game items and characters can be represented as NFTs, too, allowing for easy proof of ownership and portability even between games controlled by different companies (though the market for such games is still very immature). There are plenty of even odder examples out there. Vid is a TikTok-like social media network that gives users NFT-traced ownership of their posted videos (and royalty payments for the same). The Ethereum Name Service is using NFTs to set up a decentralized version of the ICANN-controlled Domain Name Service for finding online content. Aavegotchi is a weird hybrid that uses digital pets to represent your stake in a decentralized finance protocol called Aave. Essentially, there are hundreds of companies looking to NFTs for situations where they need to trace and verify ownership of distinct digital goods. The idea has been catching on quickly, at least among speculators with a lot of money to throw around.
Nonfungibles’ database of hundreds of different NFTs has tracked over $48 million in sales across nearly 40,000 NFT transactions in just the last week. Rarible, one of the most popular NFT marketplaces, saw its daily trading volume hit $1.9 million earlier this month, tripling the same number from just a day before. Cryptopunks, an early NFT representing 10,000 unique pixellated avatars, has seen over $176 million in total transactions since its creation in 2017 (with over 10 percent of that volume coming in the last week). How does it work? On a technical level, most NFTs are built on the ERC-721 standard. That framework sets up the basic cryptographic system to track ownership of each individual token (by linking it to user-controlled digital wallets) and allow for secure, verified transfer on the blockchain. Some NFT contracts have built additional attributes and features on top of that standard. The NFT for a cryptokitty, for instance, contains metadata representing that digital avatar’s unique look and traits. That metadata also establishes rules for how often it can “breed” new cryptokitty NFTs and what traits it will pass down to future generations. Those attributes are set and verified on the blockchain, and they can’t be altered no matter how or where the cryptokitty is used. When NFTs are used to represent digital files (like GIFs or videos), however, those files usually aren’t stored directly “on-chain” in the token itself. Doing so for any decently sized file could get prohibitively expensive, given the cost of replicating those files across every user on the chain. Instead, most NFTs store the actual content as a simple URI string in their metadata, pointing to an Internet address where the digital thing actually resides. (A minimal code sketch of reading a token's owner and metadata URI this way appears a little further down.) It may seem odd to link a system of decentralized, distributed digital goods to content hosted on centralized servers controlled by actual people or companies. Given that the vast majority of webpage links become defunct after just a few years, an NFT pointing to a plain-old web address wouldn’t seem to be a good long-term store of value. A diagram laying out the basic difference between IPFS distributed file storage and standard, centrally controlled HTTP servers. Blocknomi / MaxCDN. Many NFTs get around this by using burgeoning blockchain-based file networks such as IPFS or pixelchain. These networks are designed to let users find, copy, and store cryptographically signed files that could be distributed among any number of independent nodes (including ones controlled by the NFT owner). In theory, linking an NFT to an IPFS address could ensure the digital file in question will continue to be accessible in perpetuity, as long as someone has mirrored a verifiable copy on some node in the IPFS network. Are NFTs really that valuable? Just like a certificate of authenticity, the value of an NFT (and the “unique” digital item it represents) is strongly tied to its provenance. The person who spent $560,000 for an NFT representing the original Nyan Cat meme, for instance, obviously didn’t purchase every copy of the famous animated GIF of a pop-tart cat with a rainbow trail behind it. You can still download your own identical copy with a few clicks. The NFT doesn’t even include the copyright to Nyan Cat, which would at least give the owner some legal control over the work (though some NFTs try to embed such rights in their contracts).
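To make the "How does it work?" mechanics above concrete: the following is a minimal sketch, assuming the web3.py library and an Ethereum JSON-RPC endpoint, of reading an ERC-721 token's current owner and metadata URI. The endpoint URL, contract address, and token ID are placeholders for illustration, not values from the article.

```python
# Minimal sketch of reading ERC-721 ownership and metadata fields with web3.py.
# The RPC endpoint, contract address, and token ID below are placeholders.
from web3 import Web3

ERC721_MIN_ABI = [
    {"name": "ownerOf", "type": "function", "stateMutability": "view",
     "inputs": [{"name": "tokenId", "type": "uint256"}],
     "outputs": [{"name": "", "type": "address"}]},
    {"name": "tokenURI", "type": "function", "stateMutability": "view",
     "inputs": [{"name": "tokenId", "type": "uint256"}],
     "outputs": [{"name": "", "type": "string"}]},
]

w3 = Web3(Web3.HTTPProvider("https://example-ethereum-node.invalid"))  # placeholder endpoint
nft = w3.eth.contract(
    address="0x0000000000000000000000000000000000000000",  # placeholder ERC-721 contract
    abi=ERC721_MIN_ABI,
)

token_id = 1  # placeholder
print("owner:", nft.functions.ownerOf(token_id).call())         # wallet currently holding the token
print("metadata URI:", nft.functions.tokenURI(token_id).call())  # often an ipfs:// or https:// address
```

The tokenURI value is the "simple URI string" described above; whether it points at an ordinary web server or at an IPFS address is exactly the longevity question the article raises.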
What makes the Nyan Cat NFT interesting (and potentially valuable) is that it was verified and sold by Chris Torres, the person who created and posted the original Nyan Cat video to YouTube in 2011. That gives this copy of Nyan Cat a unique history and a tie to the meme’s creation that can’t be matched by any other copy (or any other NFT, unless Torres starts diluting the value by minting more). And the blockchain technology behind the NFT ensures the chain of custody for that version of the GIF can be traced back to Torres' original minting, no matter how many times it's sold or transferred. This Nyan Cat GIF is practically worthless. So why is an NFT of an "identical" GIF worth so much money to a collector? Does that fact alone really give this NFT any more value than all of the other identical Nyan Cat GIFs floating around on the Internet? That’s for a highly speculative market to figure out. But just as a stroke-for-stroke copy of a Vermeer masterpiece doesn’t have the same value as the one-of-a-kind original, a verified “original” Nyan Cat from the meme’s creator may retain some persistent value to collectors. Just because digital goods are easier to copy than paintings doesn’t make one less valuable than the other, either. It’s trivial to make a near-perfect copy of a photographic print, but original photographs can still sell for millions of dollars to the right buyer. On the other hand, these NFTs might end up being more akin to those novelty deeds that claim the document gives you “ownership” of a star in the night sky. While there’s probably some sentimental value to the idea of owning a star, there isn’t any real robust market where the most coveted stars trade for large sums. And just like there are a lot of competing organizations offering “star deeds” these days, there are a lot of competing firms that could dilute the market with their own NFT offerings. Do you know where your NFT came from? All of this means that tracing the provenance of any given NFT can be of prime importance to its implicit value. NFT marketplace SuperRare ensures its NFTs are “authentic” by only minting tokens for a set of “hand-picked artists” for the time being. NBA Top Shot, meanwhile, relies on its NBA license to make sure its randomized packs of basketball video clips are each unique and have an “official” air to them. But there are plenty of situations where the original ownership of a particular NFT is more questionable. Game developer Jason Rohrer drew some controversy earlier this month by trying to sell NFT tokens for artwork originally created by other artists for his 2012 game The Castle Doctrine. To say the least, this did not please the many artists who were not aware their digital work was being resold as tokens. Then there’s Tokenized Tweets, a simple service that can create a sellable NFT token representing any tweet on the service, including ones created by other people. The service has recently stopped tokenizing tweets that include visual media, and it lets artists make takedown requests if their copyrighted art/photography is tokenized by the service. But that seems like a pretty skimpy Band-Aid for an offering that seems rife with fraud potential. The NFT-backed "Marble Card" frame for Reddit.com has no actual connection to the creators or owners of Reddit. But does that matter? There are also gray areas like Marble Cards, which lets you create an NFT “frame” intended to go around a specific, unique webpage URL.
That makes each frame akin to a unique trading card with a picture of a webpage on it. While the service states clearly that “no third-party content is claimed or saved on any blockchain,” the direct link and implicit association with the webpage in question could lead to some thorny questions of ownership. With literally thousands of companies jumping into the NFT space, there’s a gold rush mentality that seems primed to spawn plenty of scams. And even legitimate NFT efforts could see their values fade away quickly if the market’s attention moves on to a different blockchain as its store of “authentic” value. Cryptokitties, one of the first popular NFT collectibles in late 2017, saw transaction volume plummet 98 percent in 2018 as high Ethereum fees and lack of novelty drove some of the more speculative players away. Back in 2011, it was unclear if Bitcoin was going to be a lasting financial instrument or a flash-in-the-pan technological fad. And here in 2021, you can say the same thing about the future for NFTs. Promoted comment from kaworu1986: So the NFT gives you... nothing? It has no connection to copyright ownership of the work it refers to and there's nothing stopping the creation of multiple NFTs for the same work either. Seems really pointless. Surprised to see there's people willing to pay actual money for any of this. Kyle Orland is the Senior Gaming Editor at Ars Technica, specializing in video game hardware and software. He has journalism and computer science degrees from University of Maryland. He is based in the Washington, DC area. Email kyle.orland@arstechnica.com // Twitter @KyleOrl archivesblogs-com-4254 ---- ArchivesBlogs | a syndicated collection of blogs by and for archivists Meet Ike Posted on September 18, 2020 from AOTUS “I come from the very heart of America.” – Dwight Eisenhower, June 12, 1945 At a time when the world fought to overcome tyranny, he helped lead the course to victory as the Supreme Allied Commander in Europe. When our nation needed a leader, he upheld the torch of liberty as our 34th president.
As a new memorial is unveiled, now is the time for us to meet Dwight David Eisenhower. Eisenhower Memorial statue and sculptures, photo by the Dwight D. Eisenhower Memorial Commission An opportunity to get to know this man can be found at the newly unveiled Eisenhower Memorial in Washington, DC, and the all-new exhibits in the Eisenhower Presidential Library and Museum in Abilene, Kansas. Each site in its own way tells the story of a humble man who grew up in small-town America and became the leader of the free world. The Eisenhower Presidential Library and Museum is a 22-acre campus which includes several buildings where visitors can interact with the life of this president. Starting with the Boyhood Home, guests discover the early years of Eisenhower as he avidly read history books, played sports, and learned lessons of faith and leadership. The library building houses the documents of his administration. With more than 26 million pages and 350,000 images, researchers can explore the career of a 40+-year public servant. The 25,000 square feet of all-new exhibits located in the museum building is where visitors get to meet Ike and Mamie again…for the first time. Using NARA’s holdings, guests gain insight into the life and times of President Eisenhower. Finally, visitors can be reflective in the Place of Meditation where Eisenhower rests beside his first-born son, Doud, and his beloved wife Mamie. A true encapsulation of his life. Eisenhower Presidential Library and Museum, Abilene, Kansas The updated gallery spaces were opened in 2019. The exhibition includes many historic objects from our holdings which highlight Eisenhower’s career through the military years and into the White House. Showcased items include Ike’s West Point letterman’s sweater, the D-Day Planning Table, Soviet lunasphere, and letters related to the Crisis at Little Rock. Several new films and interactives have been added throughout the exhibit including a D-Day film using newly digitized footage from the archives. Eisenhower Presidential Library and Museum, Abilene, Kansas In addition to facts and quotes, visitors will leave with an understanding of how his experiences made Ike the perfect candidate for Supreme Allied Commander of the Allied Expeditionary Force in Europe and the 34th President of the United States. The Eisenhower Memorial, which opened to the public on September 18, is located at an important historical corridor in Washington, DC. The 4-acre urban memorial park is surrounded by four buildings housing institutions that were formed during the Eisenhower Administration and was designed by award-winning architect, Frank Gehry. In 2011, the National Archives hosted Frank Gehry and his collaborator, theater artist Robert Wilson in a discussion about the creation of the Eisenhower National Memorial.  As part of the creative process, Gehry’s team visited the Eisenhower Presidential Library and drew inspiration from the campus. They also used the holdings of the Eisenhower Presidential Library to form the plans for the memorial itself. This also led to the development of online educational programs which will have a continued life through the Eisenhower Foundation. Visitors to both sites will learn lasting lessons from President Eisenhower’s life of public service. Eisenhower Memorial, photo by the Dwight D. 
Eisenhower Memorial Commission Link to Post | Language: English The First Post 9/11 Phone-In: Richard Hake Sitting In for Brian Lehrer Posted on September 16, 2020 from NYPR Archives & Preservation On September 18, 2001, the late Richard Hake sat in for Brian Lehrer at Columbia University’s new studios at WKCR. Just one week after the attack on the World Trade Center, WNYC was broadcasting on FM at reduced power from the Empire State Building and over WNYE (91.5 FM). Richard spoke with New York Times columnist Paul Krugman on airport security, author James Fallows on the airline industry, Robert Roach Jr. of the International Association of Machinists, and security expert and former New York City Police Commissioner William Bratton as well as WNYC listeners. Link to Post | Language: English Capturing Virtual FSU Posted on September 16, 2020 from Illuminations When the world of FSU changed in March 2020, the website for FSU was used as one of the primary communication tools to let students, faculty, and staff know what was going on. New webpages created specifically to share information and news popped up all over fsu.edu, and we had no idea how long those pages would exist (ah, the hopeful days of March), so Heritage & University Archives wanted to be sure to capture those pages quickly and often as they changed and morphed into new online resources for the FSU community. Screenshot of a capture of the main FSU News feed regarding coronavirus. Captured March 13, 2020. While FSU has had an Archive-It account for a while, we hadn't fully implemented its use yet. Archive-It is a web archiving service that captures and preserves content on websites as well as allowing us to provide metadata and a public interface for viewing the collected webpages. COVID-19 fast-tracked me on figuring out Archive-It and how we could best use it to capture these unique webpages documenting FSU's response to the pandemic. I worked to configure crawls of websites to capture the data we needed, set up a schedule that would be sufficient to capture changes but also not overwhelm our data allowance, and describe the sites being captured (see the sketch below for one generic way to check how often a page has been captured). It took me a few tries, but we've successfully been capturing a set of COVID-related FSU URLs since March. One of the challenges of this work was that some of the webpages had functionality the web crawler just wouldn't capture. This was due to some interactive widgets on pages or potentially some CSS choices the crawler didn't like. I decided the content was the most important thing to capture in this case, more so than making sure the webpage looked exactly like the original. A good example of this is the International Programs Alerts page. We're capturing this to track information about our study abroad programs, but what Archive-It displays is quite different from the current site in terms of design. The content is all there though. On the left is how Archive-It displays a capture of the International Programs Alerts page. On the right is how the site actually looks. While the content is the same, the formatting and design are not. As the pandemic dragged on and it became clear that Fall 2020 would be a unique semester, I added the online orientation site and the Fall 2020 site to my collection line-up. The Fall 2020 page, once used to track the re-opening plan, recently morphed into the Stay Healthy FSU site where the community can look for current information and resources but also see the original re-opening document.
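As a generic illustration of the kind of capture-frequency check described above (this is not the workflow or tooling FSU's Heritage & University Archives used; their captures live in Archive-It, which has its own interfaces), one way to list how often a page has been captured in a Wayback-style index is the public CDX API at web.archive.org. The seed URL and date range below are placeholders:

```python
# Illustrative only: lists captures of a page in the public Wayback Machine CDX index.
# The URL and date range are placeholders, not FSU's actual Archive-It seed list.
import requests

resp = requests.get(
    "https://web.archive.org/cdx/search/cdx",
    params={
        "url": "news.fsu.edu",        # placeholder seed URL
        "from": "20200301",
        "to": "20201001",
        "output": "json",
        "fl": "timestamp,statuscode",
    },
    timeout=60,
)
rows = resp.json() if resp.text.strip() else []
if rows:
    header, captures = rows[0], rows[1:]   # first row lists the field names
    print(f"{len(captures)} captures in the requested window")
    for timestamp, status in captures[:5]:
        print(timestamp, status)
else:
    print("No captures found in that window.")
```

A check like this is one way to confirm that a crawl schedule is actually keeping pace with how often a page changes.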
We’ll continue crawling and archiving these pages in our FSU Coronavirus Archive for future researchers until they are retired and the university community returns to “normal” operations – whatever that might look like when we get there! Link to Post | Language: English Welcome to the New ClintonLibrary.Gov! Posted on September 14, 2020 from AOTUS The National Archives’ Presidential Libraries and Museums preserve and provide access to the records of 14 presidential administrations. In support of this mission, we developed an ongoing program to modernize the technologies and designs that support the user experience of our Presidential Library websites. Through this program, we have updated the websites of the Hoover, Truman, Eisenhower and Nixon Presidential Libraries.  Recently we launched an updated website for the William J. Clinton Presidential Library & Museum. The website, which received more than 227,000 visitors over the past year, now improves access to the Clinton Presidential Library holdings by providing better performance, improving accessibility, and delivering a mobile-friendly experience. The updated website’s platform and design, based in the Drupal web content management framework, enables the Clinton Presidential Library staff to make increasing amounts of resources available online—especially while working remotely during the COVID-19 crisis. To achieve this website redesign, staff from the National Archives’ Office of Innovation, with both web development and user experience expertise, collaborated with staff from the Clinton Presidential Library to define goals for the new website. Our user experience team first launched the project by interviewing staff of the Clinton Presidential Library to determine the necessary improvements for the updated website to facilitate their work. Next, the user experience team researched the Library’s customers—researchers, students, educators, and the general public—by analyzing user analytics, heatmaps, recordings of real users navigating the site, and top search referrals. Based on the data collected, the user experience team produced wireframes and moodboards that informed the final site design. The team also refined the website’s information architecture to improve the user experience and meet the Clinton Library staff’s needs.  Throughout the project, the team used Agile project management development processes to deliver iterative changes focused on constant improvement. To be Agile, specific goals were outlined, defined, and distributed among team members for mutual agreement. Work on website designs and features was broken into development “sprints”—two-week periods to complete defined amounts of work. At the end of each development sprint, the resulting designs and features were demonstrated to the Clinton Presidential Library staff stakeholders for feedback which helped further refine the website. The project to update the Clinton Presidential Library and Museum website was guided by the National Archives’ strategic goals—to Make Access Happen, Connect with Customers, Maximize NARA’s Value to the Nation, and Build our Future Through our People. By understanding the needs of the Clinton Library’s online users and staff, and leveraging the in-house expertise of our web development and user experience staff, the National Archives is providing an improved website experience for all visitors. Please visit the site, and let us know what you think! 
Link to Post | Language: English The Road to Edinburgh (Part 2) Posted on September 11, 2020 from Culture on Campus “Inevitably, official thoughts early turned to the time when Scotland would be granted the honour of acting as hosts. Thought was soon turned into action and resulted in Scotland pursuing the opportunity to be host to the Games more relentlessly than any other country has.” From foreword to The Official History of the IXth Commonwealth Games (1970) In our last blog post we left the campaigners working to bring the Commonwealth Games to Edinburgh reflecting on the loss of the 1966 Games to Kingston, Jamaica. The original plan of action sketched out by Willie Carmichael in 1957 had factored in a renewed campaign for 1970 if the initial approach to host the 1966 Games proved unsuccessful. The choice of host cities for the Games was made at the biennial General Assemblies of the Commonwealth Games Federation. The campaign to choose the host for 1970 began at a meeting held in Tokyo in 1964 (to coincide with the Olympics), with the final vote taking place at the 1966 Kingston Games. In 1964 the Edinburgh campaign presented a document to the Federation restating its desire to be host city for the Games in 1970. Entitled ‘Scotland Invites’ it laid out Scotland’s case: “We are founder members of the Federation; we have taken part in each Games since the inception in 1930; and we are the only one of six countries who have taken part in every Games, who have not yet had the honour of celebrating the Games.” From Scotland Invites, British Empire and Commonwealth Games Council for Scotland (1964) Documents supporting Edinburgh’s bid to host the 1970 Commonwealth Games presented to meetings of the General Assembly of the Commonwealth Games Federation at Tokyo in 1964 and Kingston in 1966 (ref. WC/2/9/2) Edinburgh faced a rival bid from Christchurch, New Zealand, with the competition between the two cities recorded in a series of press cutting files collected by Willie Carmichael. Reports in the Scottish press presented Edinburgh as the favourites for 1970, with Christchurch using their bid as a rehearsal for a more serious campaign to host the 1974 competition. However, the New Zealanders rejected this assessment, arguing that it was the turn of a country in the Southern Hemisphere to host the Games. The 1966 Games brought the final frantic round of lobbying and promotion for the rival bids as members of the Commonwealth Games Federation gathered in Kingston. The British Empire and Commonwealth Games Council for Scotland presented a bid document entitled ‘Scotland 1970’ which included detailed information on the venues and facilities to be provided for the competition along with a broader description of the city of Edinburgh. Artist’s impression of the new Meadowbank athletics stadium, Edinburgh (ref. WC/2/9/2/12) At the General Assembly of the Commonwealth Games Federation held in Kingston, Jamaica, on 7 August 1966 the vote took place to decide the host of the 1970 Games. Edinburgh was chosen as host city by 18 votes to 11. The Edinburgh campaign team kept a souvenir of this important event. At the end of the meeting they collected together the evidence of their success and put it in an envelope marked ‘Ballot Cards – which recorded votes for Scotland at Kingston 1966.’ The voting cards and envelope now sit in an administrative file which forms part of the Commonwealth Games Scotland Archive. Voting card recording vote for Scotland to host the 1970 Commonwealth Games (ref.
CG/2/9/1/2/7) Link to Post | Language: English New Ancient Texts Research Guide Posted on September 10, 2020 from Illuminations “What are the oldest books you have?” is a common question posed to Special Collections & Archives staff at Strozier Library. In fact, the oldest materials in the collection are not books at all but cuneiform tablets ranging in date from 2350 to 1788 BCE (4370-3808 years old). These cuneiform tablets, along with papyrus fragments and ostraka comprise the ancient texts collection in Special Collections & Archives. In an effort to enhance remote research opportunities for students to engage with the oldest materials housed in Strozier Library, a research guide to Ancient Texts at FSU Libraries has been created by Special Collections & Archives staff. Ancient Texts Research Guide The Ancient Texts at FSU Libraries research guide provides links to finding aids with collections information, high-resolution photos of the objects in the digital library, and links to articles or books about the collections. Research guides can be accessed through the tile, “Research Guides,” on the library’s main page. Special Collections & Archives currently has 11 research guides published that share information and resources on specific collections or subjects that can be accessed remotely. While direct access to physical collections is unavailable at this time due to Covid-19, we hope to resume in-person research when it is safe to do so, and Special Collections & Archives is still available to assist you remotely with research and instruction. Please get in touch with us via email at: lib-specialcollections@fsu.edu. For a full list of our remote services, please visit our services page. Link to Post | Language: English SSCI Members Embrace Need for Declassification Reform, Discuss PIDB Recommendations at Senate Hearing Posted on September 10, 2020 from Transforming Classification The Board would like to thank Acting Chairman Marco Rubio (R-FL), Vice Chairman Mark Warner (D-VA), and members of the Senate Select Committee on Intelligence (SSCI) for their invitation to testify yesterday (September 9, 2020) at the open hearing on “Declassification Policy and Prospects for Reform.”    At the hearing, PIDB Member John Tierney responded to questions from committee members about recommendations in the PIDB’s May 2020 Report to the President. He stressed the need for modernizing information security systems and the critical importance of sustained leadership through a senior-level Executive Agent (EA) to oversee and implement meaningful reform. In addition to Congressman Tierney, Greg Koch, the Acting Director of Information Management in the Office of the Director of National Intelligence (ODNI), testified in response to the SSCI’s concerns about the urgent need to improve how the Executive Branch classifies and declassifies national security information. Much of the discussion focused on the PIDB recommendation that the President designate the ODNI as the EA to coordinate the application of information technology, including artificial intelligence and machine learning, to modernize classification and declassification across the Executive Branch. Senator Jerry Moran (R-KS), and Senator Ron Wyden (D-OR), who is a member of the SSCI, joined the hearing to discuss the bill they are cosponsoring to modernize declassification. 
Their proposed “Declassification Reform Act of 2020” aligns with the PIDB Report recommendations, including the recommendation to designate the ODNI as the EA for coordinating the required reforms. The Board would like to thank Senators Moran and Wyden for their continued support and attention to this crucial issue. Modernizing the classification and declassification system is important for our 21st-century national security, and it is important for transparency and our democracy. Video of the entire hearing is available on the SSCI’s website and from C-SPAN. The transcript of prepared testimony submitted to the SSCI by Mr. Tierney is posted on the PIDB website. Link to Post | Language: English Be Connected, Keep A Stir Diary Posted on September 9, 2020 from Culture on Campus The new semester approaches and it’s going to be a bit different from what we’re used to here at the University of Stirling. To help you with your mental health and wellbeing this semester, we’ve teamed up with the Chaplaincy to provide students new and returning with a diary where you can keep your thoughts and feelings, process your new environment, record your joys and capture what the University was like for you in this unprecedented time. Diaries will be stationed at the Welcome Lounges from 12th September and we encourage students to take one for their personal use. Please be considerate of others and only take one diary each. Inside each diary is a QR code which will take you to our project page where you can learn more about the project and where we will be creating an online resource for you to explore the amazing diaries that we keep in Archives and Special Collections. We will be updating this page throughout the semester with information from the Archives and events for you to join. Keep an eye out for #StirDiary on social media for all the updates! At the end of semester, you are able to donate your diary to the Archive where it will sit with the University’s institutional records and form a truthful and creative account of what student life was like in 2020. You absolutely don’t have to donate your diary if you don’t want to; the diary belongs to you, and you can keep it, throw it away, donate it or anything else (wreck it?) as you like. If you would like to take part in the project but you have missed the Welcome Lounges, don’t worry! Contact Rosie on archives@stir.ac.uk or Janet on janet.foggie1@stir.ac.uk. Welcome to the University of Stirling – pick a colour! Link to Post | Language: English PIDB Member John Tierney to Support Modernizing Classification and Declassification before the Senate Select Committee on Intelligence, Tomorrow at 3:00 p.m., Live on C-SPAN Posted on September 8, 2020 from Transforming Classification PIDB member John Tierney will testify at an open hearing on declassification policy and the prospects for reform, to be held by the Senate Select Committee on Intelligence (SSCI) tomorrow, Wednesday, September 9, 2020, from 3:00-4:30 p.m. EST. The hearing will be shown on the SSCI’s website, and televised live on C-SPAN. SSCI members Senators Ron Wyden (D-OR) and Jerry Moran (R-KS) have cosponsored the proposed “Declassification Reform Act of 2020,” which aligns with recommendations of the PIDB’s latest report to the President, A Vision for the Digital Age: Modernization of the U.S. National Security Classification and Declassification System (May 2020).
In an Opinion-Editorial appearing today on the website Just Security, Senators Wyden and Moran present their case for legislative reform to address the challenges of outmoded systems for classification and declassification. At the hearing tomorrow, Mr. Tierney will discuss how the PIDB recommendations present a vision for a uniform, integrated, and modernized security classification system that appropriately defends national security interests, instills confidence in the American people, and maintains sustainability in the digital environment. Mr. Greg Koch, Acting Director of the Information Management Office for the Office of the Director of National Intelligence, will also testify at the hearing. The PIDB welcomes the opportunity to speak before the SSCI and looks forward to discussing the need for reform with the Senators. After the hearing, the PIDB will post a copy of Mr. Tierney’s prepared testimony on its website and on this blog. Link to Post | Language: English Wiki loves monuments – digital skills and exploring stirling Posted on September 8, 2020 from Culture on Campus Every year the Wikimedia Foundation runs Wiki Loves Monuments – the world’s largest photo competition. Throughout September there is a push to take good quality images of listed buildings and monuments and add them to Wiki Commons where they will be openly licensed and available for use across the world – they may end up featuring on Wikipedia pages, on Google, in research and presentations worldwide and will be entered into the UK competition where there are prizes to be had! Below you’ll see a map covered in red and blue pins. These represent all of the listed buildings and monuments that are covered by the Wiki Loves Monuments competition, blue pins are places that already have a photograph and red pins have no photograph at all. The aim of the campaign is to turn as many red pins blue as possible, greatly enhancing the amazing bank of open knowledge across the Wikimedia platforms. The University of Stirling sits within the black circle. The two big clusters of red pins on the map are Stirling and Bridge of Allan – right on your doorstep! We encourage you to explore your local area. Knowing your surroundings, finding hidden gems and learning about the history of the area will all help Stirling feel like home to you, whether you’re a first year or returning student. Look at all those red dots! Of course, this year we must be cautious and safe while taking part in this campaign and you should follow social distancing rules and all government coronavirus guidelines, such as wearing facemasks where appropriate, while you are out taking photographs. We encourage you to walk to locations you wish to photograph, or use the NextBikes which are situated on campus and in Stirling rather than take excessive public transport purely for the purposes of this project. Walking and cycling will help you to get a better sense of where everything is in relation to where you live and keeping active is beneficial to your mental health and wellbeing. Here are your NextBike points on campus where you can pick up a bike to use We hope you’ll join us for this campaign – we have a session planned for 4-5pm on Thursday 17th September on Teams where we’ll tell you more about Wiki Loves Monuments and show you how to upload your images. Sign up to the session on Eventbrite. If you cannot make our own University of Stirling session then Wikimedia UK have their own training session on the 21st September which you can join. 
Please note that if you want your photographs to be considered for the competition prizes then they must be submitted before midnight on the 30th September. Photographs in general can be added at any time so you can carry on exploring for as long as you like! Finally, just to add a little incentive, this year we’re having a friendly competition between the University of Stirling and the University of St Andrews students to see who can make the most edits, so come along to a training session, pick up some brilliant digital skills and let’s paint the town green! Link to Post | Language: English What’s the Tea? Posted on September 4, 2020 from Illuminations Katie McCormick, Associate Dean (she/her/hers) For this post, I interviewed Katie McCormick in order to get a better understanding of the dynamics of Special Collections & Archives. Katie is one of the Associate Deans and has been with SCA for about nine years now (here’s a video of Katie discussing some of our collections on C-SPAN in 2014!). Because she is a vital part of the library and our leader in Special Collections & Archives, I wanted to get her opinion on how the division has progressed thus far and how it plans to continue to do so with regard to diversity and inclusion. How would you describe FSU SCA when you first started? “…People didn’t feel comfortable communicating [with each other]… There was one person who really wrote for the blog, and maybe it would happen once every couple of months. When I came on board, my general sense was that we were a department and a group of people with a lot of really great ideas and some fantastic materials, who had come a long way from where things had been, but who hadn’t gotten to a place to be able to organize to change more or to really work more as a team… We were definitely valued as (mostly) the fancy crown jewel group. Really all that mattered was the stuff… it didn’t matter what we were doing with it.” How do you feel the lapse in communication affected diversity and inclusion? “While I don’t have any direct evidence that it excluded people or helped create an environment that was exclusive, I do know that even with our staff at the time, there were times where it contributed to hostilities, frustrations, an environment where people didn’t feel able to speak or be comfortable in… Everybody just wanted to be comfortable with the people who were just like them, and that definitely created some potentially hostile environments. Looking back, I recognize what a poor job we did, as a workplace and a community, truly being inclusive, and not just in ways that are immediately visible.” How diverse was SCA when you started? “In Special Collections there was minimal diversity, certainly less than we have now… [For the libraries as a whole] as you go up in classification and pay, the diversity decreases. That was certainly true when I got here and that remains true.” How would you rank SCA’s diversity and inclusion when you first started? “…Squarely a 5, possibly in some arenas a 4. Not nothing, but I feel like no one was really thinking of it.” And how would you describe it now? “Maybe we’re approaching a 7, I feel like there’s been progress, but there’s still a long way to go in my opinion.” What are some ways we can start addressing these issues? What are some tangible changes you are planning to enact?
“For me, some of the first places [are] forming the inclusive research services task force in Special Collections, pulling together a group to look at descriptive practices and applications, and what we’re doing with creating coordinated processing workflows. Putting these issues on the table from the beginning is really important… Right now, because we’re primarily in an online environment, I think we have some time to negotiate and change our practices so when we re-open to the public and people are physically coming into the spaces, we have new forms, new trainings, people have gone through training that gives them a better sense of identity, communication, diversity.” After my conversation with Katie, I feel optimistic about the direction we are heading in. Knowing how open Special Collections & Archives is about taking critique and trying to put it into action brought me comfort. I’m excited to see how these concerns are addressed and how the department will be putting Dynamic Inclusivity, one of Florida State University’s core values, at the forefront of its practice. I would like to give a big thank you to Katie McCormick for taking the time to do this post with me and for having these conversations! Link to Post | Language: English friday art blog: Terry Frost Posted on September 3, 2020 from Culture on Campus Black and Red on Blue (Screenprint, A/P, 1968) Born in Leamington Spa, Warwickshire, in 1915, Terry Frost KBE RA did not become an artist until he was in his 30s. During World War II, he served in France, the Middle East and Greece, before joining the commandos. While in Crete in June 1941 he was captured and sent to various prisoner of war camps. As a prisoner at Stalag 383 in Bavaria, he met Adrian Heath, who encouraged him to paint. After the war he attended Camberwell School of Art and the St. Ives School of Art and painted his first abstract work in 1949. In 1951 he moved to Newlyn and worked as an assistant to the sculptor Barbara Hepworth. He was joined there by Roger Hilton, and they began a collaboration in collage and construction techniques. In 1960 he put on his first exhibition in the USA, in New York, and there he met many of the American abstract expressionists, including Mark Rothko, who became a great friend. Terry Frost’s career included teaching at the Bath Academy of Art, serving as Gregory Fellow at the University of Leeds, and also teaching at the Cyprus College of Art. He later became the artist in residence and Professor of Painting at the Department of Fine Art of the University of Reading. Orange Dusk (Lithograph, 2/75, 1970) Frost was renowned for his use of the Cornish light, colour and shape. He became a leading exponent of abstract art and a recognised figure of the British art establishment. These two prints were purchased in the early days of the Art Collection at the beginning of the 1970s. Terry Frost married Kathleen Clarke in 1945 and they had six children, two of whom became artists (and another, Stephen Frost, a comedian). His grandson Luke Frost, also an artist, is shown here, speaking about his grandfather. Link to Post | Language: English PIDB Sets Next Virtual Public Meeting for October 7, 2020 Posted on September 3, 2020 from Transforming Classification The Public Interest Declassification Board (PIDB) has scheduled its next virtual public meeting for Wednesday, October 7, 2020, from 1:00 to 2:30 p.m.
At the meeting, PIDB members will discuss their priorities for improving classification and declassification in the next 18 months. They will also introduce former Congressman Trey Gowdy, who was appointed on August 24, 2020, to a three-year term on the PIDB. A full agenda, as well as information on how to pre-register, and how to submit questions and comments to the PIDB prior to the virtual meeting, will be posted soon to Transforming Classification. The PIDB looks forward to your participation in continuing our public discussion of priorities for modernizing the classification system going forward. Link to Post | Language: English Digital Collections Updates Posted on September 3, 2020 from UNC Greensboro Digital Collections So as we start a new academic year, we thought this would be a good time for an update on what we’ve been working on recently. Digital collections migration: After more than a year’s delay, the migration of our collections into a new and more user-friendly (and mobile-friendly) platform driven by the Islandora open-source content management system is in the home stretch. This has been a major undertaking and has given us the opportunity to reassess how our collections work. We hope to be live with the new platform in November. 30,000 items (over 380,000 digital images) have already been migrated. 2019-2020 Projects: We’ve made significant progress on most of this year’s projects (see link for project descriptions), though many of these are currently not yet online pending our migration to the Islandora platform: Grant-funded projects: Temple Emanuel Project: We are working with the Public History department and a graduate student in that program. Several hundred items have already been digitized and more work is being done. We are also exploring grant options with the temple to digitize more material. People Not Property: NC Slave Deeds Project: We are in the final year of this project funded by the National Archives and hope to have it online as part of the Digital Library on American Slavery late next year. We are also exploring additional funding options to continue this work. Women Who Answered the Call: This project was funded by a CLIR Recordings at Risk grant. The fragile cassettes have been digitized and we are midway through the process of getting them online in the new platform. Library-funded projects: Poetas sin Fronteras: Poets Without Borders, the Scrapbooks of Dr. Ramiro Lagos: These items have been digitized and will go online when the new platform launches. North Carolina Runaway Slaves Ads Project, Phase 2: Work continues on this ongoing project and over 5700 ads are now online. This second phase has involved both locating and digitizing/transcribing the ads, and we will soon triple the number of ads done in Phase One. We are also working on tighter integration of this project into the Digital Library on American Slavery. PRIDE! of the Community: This ongoing project stemmed from an NEH grant two years ago and is growing to include numerous new oral history interviews and (just added) a project to digitize and display ads from LGBTQ+ bars and other businesses in the Triad during the 1980s and 1990s. We are also working with two Public History students on contextual and interpretive projects based on the digital collection. Faculty-involved projects: Black Lives Matter Collections: This is a community-based initiative to document the Black Lives Matter movement and recent demonstrations and artwork in the area. Faculty: Dr. 
Tara Green (African American and Diaspora Studies); Stacey Krim, Erin Lawrimore, Dr. Rhonda Jones, David Gwynn (University Libraries). Civil Rights Oral Histories: This has become multiple projects. We are working with several faculty members in the Media Studies department to make these transcribed interviews available online. November is the target. Faculty: Matt Barr, Jenida Chase, Hassan Pitts, and Michael Frierson (Media Studies); Richard Cox, Erin Lawrimore, David Gwynn (University Libraries). Oral Contraceptive Ads: Working with a faculty member and a student on this project, which may be online by the end of the year. Faculty: Dr. Heather Adams (English); David Gwynn and Richard Cox (University Libraries). Well-Crafted NC: Work is ongoing and we are in the second year of a UNCG P2 grant, working with a faculty member in the Bryan School and a brewer based in Asheboro. Faculty: Erin Lawrimore, Richard Cox, David Gwynn (University Libraries), Dr. Erick Byrd (Marketing, Entrepreneurship, Hospitality, and Tourism). New projects taken on during the pandemic: City of Greensboro Scrapbooks: Huge collection of scrapbooks from the Greensboro Urban Development Department dating back to the 1940s. These items have been digitized and will go online when the new platform launches. Negro Health Week Pamphlets: 1930s-1950s pamphlets published by the State of North Carolina. These items are currently being digitized and will go online when the new platform launches. Clara Booth Byrd Collection: Manuscript collection. These items are currently being digitized and will go online when the new platform launches. North Carolina Speaker Ban Collection: Manuscript collection. These items are currently being digitized and will go online when the new platform launches. Mary Dail Dixon Papers: Manuscript collection. These items are currently being digitized and will go online when the new platform launches. Ruth Wade Hunter Collection: Manuscript collection. These items are currently being digitized and will go online when the new platform launches. Projects on hold due to the pandemic: Junior League of Greensboro: Much of this has already been digitized and will go online when the new platform launches. UNCG Graduate School Bulletins: Much of this has already been digitized and will go online when the new platform launches. David Gwynn (Digitization Coordinator, me) offers kudos to Erica Rau and Kathy Howard (Digitization and Metadata Technicians); Callie Coward (Special Collections Cataloging & Digital Projects Library Technician); Charley Birkner (Technology Support Technician); and Dr. Brian Robinson (Fellow for Digital Curation and Scholarship) for their great work in very surreal circumstances over the past six months. Link to Post | Language: English CORRECTION: Creative Fellowship Call for Proposals Posted on September 3, 2020 from Notes For Bibliophiles We have an update to our last post! We’re still accepting proposals for our 2021 Creative Fellowship… But we’ve decided to postpone both the Fellowship and our annual Exhibition & Program Series by six months due to the coronavirus. The annual exhibition will now open on October 1, 2021 (which is 13 months away, but we’re still hard at work planning!). The new due date for Fellowship proposals is April 1, 2021. We’ve adjusted the timeline and due dates in the call for proposals accordingly.
Link to Post | Language: English On This Day in the Florida Flambeau, Friday, September 2, 1983 Posted on September 2, 2020 from Illuminations Today in 1983, a disgruntled reader sent in this letter to the editor of the Flambeau. In it, the reader describes the outcome of a trial and the potential effects that outcome will have on the City of Tallahassee. Florida Flambeau, September 2, 1983 It is such a beautifully written letter that I still can’t tell whether or not it’s satire. Do you think the author is being serious or sarcastic? Leave a comment below telling us what you think! Link to Post | Language: English Hartgrove, Meriwether, and Mattingly Posted on September 2, 2020 from The Consecrated Eminence The past few months have been a challenging time for archivists everywhere as we adjust to doing our work remotely. Fortunately, the materials available in Amherst College Digital Collections enable us to continue doing much of our work. Back in February, I posted about five Black students from the 1870s and 1880s — Black Men of Amherst, 1877-1883 — and now we’re moving into the early 20th century. A small clue in The Olio has revealed another Black student that was not included in Harold Wade’s Black Men of Amherst. Robert Sinclair Hartgrove (AC 1905) was known to Wade, as was Robert Mattingly (AC 1906), but we did not know about Robert Henry Meriwether. These three appear to be the first Black students to attend Amherst in the twentieth century. Robert Sinclair Hartgrove, Class of 1905 The text next to Hartgrove’s picture in the 1905 yearbook gives us a tiny glimpse into his time at Amherst. The same yearbook shows Hartgrove not just jollying the players, but playing second base for the Freshman baseball team during the 1902 season. Freshman Baseball Team, 1902 The reference to Meriwether sent me to the Amherst College Biographical Record, where I found Robert Henry Meriwether listed as a member of the Class of 1904. A little digging into the College Catalogs revealed that he belongs with the Class of 1905. College Catalog, 1901-02 Hartgrove and Meriwether are both listed as members of the Freshman class in the 1901-02 catalog. The catalog also notes that they were both from Washington, DC and the Biographical Record indicates that they both prepped at Howard University before coming to Amherst. We find Meriwether’s name in the catalog for 1902-03, but he did not “pull through” as The Olio hopes Hartgrove will; Meriwether returned to Howard University where he earned his LLB in 1907. Hartgrove also became a lawyer, earning his JB from Boston University in 1908 and spending most of his career in Jersey City, NJ. Robert Nicholas Mattingly, Class of 1906 Mattingly was born in Louisville, KY in 1884 and prepped for Amherst at The M Street School in Washington, DC, which changed its name in 1916 to The Dunbar School. Matt Randolph (AC 2016) wrote “Remembering Dunbar: Amherst College and African-American Education in Washington, DC” for the book Amherst in the World, which includes more details of Mattingly’s life. The Amherst College Archives and Special Collections reading room is closed to on-site researchers. However, many of our regular services are available remotely, with some modifications. Please read our Services during COVID-19 page for more information. Contact us at archives@amherst.edu. 
Link to Post | Language: English Democratizing Access to our Records Posted on September 1, 2020 from AOTUS The National Archives has a big, hairy, audacious strategic goal to provide public access to 500 million digital copies of our records through our online Catalog by FY24. When we first announced this goal in 2010, we had less than a million digital copies in the Catalog and getting to 500 million sounded to some like a fairy tale. The goal received a variety of reactions from people across the archival profession, our colleagues and our staff. Some were excited to work on the effort and wanted particular sets of records to be first in line to scan. Some laughed out loud at the sheer impossibility of it. Some were angry and said it was a waste of time and money. Others were fearful that digitizing the records could take their jobs away. We moved ahead. Staff researched emerging technologies and tested them through pilots in order to increase our efficiency. We set up a room at our facilities in College Park to transfer our digital copies from individual hard drives to new technology from Amazon, known as snowballs. We worked on developing new partnership projects in order to get more records digitized. We streamlined the work in our internal digitization labs and we piloted digitization projects with staff in order to find new ways to get digital copies into the Catalog. By 2015, we had 10 million in the Catalog. We persisted. In 2017, we added more digital objects, with their metadata, to the Catalog in a single year than we had for the preceding decade of the project. Late in 2019, we surpassed a major milestone by having more than 100 million digital copies of our records in the Catalog. And yes, it has strained our technology. The Catalog has developed growing pains, which we continue to monitor and mitigate. We also created new finding aids that focus on digital copies of our records that are now available online: see our Record Group Explorer and our Presidential Library Explorer. So now, anyone with a smartphone or access to a computer with wifi can view at least some of the permanent records of the U.S. Federal government without having to book a trip to Washington, D.C. or one of our other facilities around the country. The descriptions of over 95% of our records are also available through the Catalog, so even if you can’t see it immediately, you can know what records exist. And that is convenient for the millions of visitors we get each year to our website, even more so during the pandemic. National Archives Identifier 20802392 We are well on our way to 500 million digital copies in the Catalog by FY24. And yet, with over 13 billion pages of records in our holdings, we know we have only just begun. Link to Post | Language: English Lola Hayes and “Tone Pictures of the Negro in Music” Posted on August 31, 2020 from NYPR Archives & Preservation Lola Wilson Hayes (1906-2001) was a highly regarded African-American mezzo-soprano, WNYC producer, and later, much sought-after vocal teacher and coach. A Boston native, Hayes was a music graduate of Radcliffe College and studied voice with Frank Bibb at Baltimore’s Peabody Conservatory. She taught briefly at a black vocational boarding school in New Jersey known as the ‘Tuskeegee of the north'[1] before embarking on a recital and show career which took her to Europe and around the United States.
During World War II, she also made frequent appearances at the American Theatre Wing of the Stage Door Canteen of New York and entertained troops at USO clubs and hospitals. Headline from The New York Age, August 12, 1944, pg. 10. (WNYC Archive Collections) Hayes also made time to produce a short but notable run of WNYC programs, which she hosted and performed on the home front. Her November and December 1943 broadcasts were part of a rotating half-hour time slot designated for known recitalists. She shared the late weekday afternoon slot with sopranos Marjorie Hamill, Pina La Corte, Jean Carlton, Elaine Malbin, and the Hungarian pianist Arpád Sándor. Hayes’ series, Tone Pictures of the Negro in Music, sought to highlight African-American composers and was frequently referred to as The Negro in Music. The following outline of 1943 and 1944 broadcasts was pieced together from the WNYC Masterwork Bulletin program guide and period newspaper radio listings. Details on the 1943 programs are sparse. We know that Hayes’ last broadcast in 1943 featured the pianist William Duncan Allen (1906-1999) performing They Led My Lord Away by Roland Hayes and Good Lord Done Been Here by Hall Johnson, and a Porgy and Bess medley by George Gershwin. Excerpt from “Behind the Mike,” November/December 1944, WNYC Masterwork Bulletin. (WNYC Archive Collections) The show was scheduled again in August 1944 as a 15-minute late Tuesday afternoon program and in November that year as a half-hour Wednesday evening broadcast. The August programs began with an interview of soprano Abbie Mitchell (1884-1960), the widow of composer and choral director Will Marion Cook (1869-1944). The composer and arranger Hall Johnson (1888-1970) was her studio guest the following week. The third Tuesday of the month featured pianist Jonathan Brice performing “songs of young contemporary Negro composers,” and the August shows concluded with selections from Porgy and Bess and Carmen Jones. The November broadcasts focused on the work of William Grant Still, “the art songs, spirituals and street cries” of William Lawrence, as well as the songs and spirituals of William Rhodes, lyric soprano Lillian Evanti, and baritone Harry T. Burleigh. Hayes also spent airtime on the work of neo-romantic composer and violinist Clarence Cameron White. The November 29th program considered “the musical setting of poems by Langston Hughes” and reportedly included the bard himself. “Langston Hughes was guest of honor and punctuated his interview with a reading from his opera Troubled Island.”[2] This was not the first time the poet’s work was the subject of Hayes’ broadcast. Below is a rare copy of her script from a program airing eight months earlier when she sat in for the regularly scheduled host, soprano Marjorie Hamill. The script for Tone Pictures of the Negro in Music hosted by Lola Hayes on March 24, 1944. (Image used with permission of the Van Vechten Trust and courtesy of the Carl Van Vechten Papers Relating to African American Arts and Letters, James Weldon Johnson Collection in the Yale Collection of American Literature, Beinecke Rare Book and Manuscript Library)[3] It is unfortunate, but it appears there are no recordings of Lola Hayes’ WNYC program. We can’t say if that’s because they weren’t recorded or, if they were, the lacquer discs have not survived.
We do know that World War II-era transcription discs, in general, are less likely to have survived since most of them were cut on coated glass, rather than aluminum, to save vital metals for the war effort. After the war, Hayes focused on voice teaching and coaching. Her students included well-known performers like Dorothy Rudd Moore, Hilda Harris, Raoul Abdul-Rahim, Carol Brice, Nadine Brewer, Elinor Harper, Lucia Hawkins, and Margaret Tynes. She was the first African-American president of the New York Singing Teachers Association (NYSTA), serving in that post from 1970-1972. In her later years, she devoted much of her time to the Lola Wilson Hayes Vocal Artists Award, which gave substantial financial aid to young professional singers worldwide.[4]  ___________________________________________________________ [1] The Manual Training and Industrial School for Colored Youth in Bordentown, New Jersey [2] “The Listening Room,” The People’s Voice, December 2, 1944, pg. 29. The newspaper noted that the broadcast included Hall Johnson’s Mother to Son, Cecil Cohen’s Death of an Old Seaman and Florence Price’s Song to a Dark Virgin, all presumably sung by host, Lola Hayes.  Troubled Island is an opera set in Haiti in 1791. It was composed by William Grant Still with a libretto by Langston Hughes and Verna Arvey. [3] Page two of the script notes Langston Hughes’ grandmother was married to a veteran of the 1859 Harper’s Ferry raid led by abolitionist John Brown. Indeed, Hughes’ grandmother’s first husband was Lewis Sheridan Leary, who was one of Brown’s raiders at Harper’s Ferry. For more on the story please see: A Shawl From Harper’s Ferry. [4] Abdul, Raoul, “Winners of the Lola Hayes Vocal Scholarship and Awards,” The New York Amsterdam News, February 8, 1992, pg. 25. Special thanks to Valeria Martinez for research assistance.   Link to Post | Language: English the road to edinburgh Posted on August 28, 2020 from Culture on Campus On the 50th anniversary of the 1970 Edinburgh Commonwealth Games newly catalogued collections trace the long road to the first Games held in Scotland. A handwritten note dated 10th April 1957 sits on the top of a file marked ‘Scotland for 1970 Host’. The document forms part of a series of files recording the planning, organisation and operation of the 1970 Edinburgh Commonwealth Games, the first to be held in Scotland. Written by Willie Carmichael, a key figure in Scotland’s Games history, the note sets out his plans to secure the Commonwealth Games for Scotland. He begins by noting that Scotland’s intention to host the Games was made at a meeting of Commonwealth Games Federations at the 1956 Melbourne Olympic Games. Carmichael then proceeds to lay out the steps required to make Scotland’s case to be the host of the Games in 1966 or 1970. Willie Carmichael The steps which Carmichael traced out in his note can be followed through the official records and personal papers relating to the Games held in the University Archives. The recently catalogued administrative papers of Commonwealth Games Scotland for the period provide a detailed account of the long process of planning for this major event, recording in particular the close collaboration with Edinburgh Corporation which was an essential element in securing the Games for Scotland (with major new venues being required for the city to host the event). 
Further details and perspectives on the road to the 1970 Games can be found in the personal papers of figures associated with Commonwealth Games Scotland also held in the University Archives, including Sir Peter Heatly and Willie Carmichael himself. The choice of host city for the 1966 Games was to be made at a meeting held at the 1962 Games in Perth, Australia. Meeting the first target in Carmichael’s plan, the Edinburgh campaign put forward its application as host city at a Federation meeting held in Rome in 1960. A series of press cutting files collected by Carmichael trace the campaign’s progress from this initial declaration of intent through to the final decision made in Perth. Documents supporting Edinburgh’s bid to host the 1966 Commonwealth Games presented to meetings of the Commonwealth Games Federation in Rome (1960) and Perth (1962), part of the Willie Carmichael Archive. Edinburgh faced competition both within Scotland, with the press reporting a rival bid from Glasgow, and across the Commonwealth, with other nations including Jamaica, India and Southern Rhodesia expressing an interest in hosting the 1966 competition. When it came to the final decision in 1962, three cities remained in contention: Edinburgh, Kingston in Jamaica, and Salisbury in Southern Rhodesia. The first round of voting saw Salisbury eliminated. In the subsequent head-to-head vote, Kingston was selected as host city for the 1966 Games by the narrowest of margins (17 votes to 16). As Carmichael had sketched out in his 1957 plan, if Edinburgh failed in its attempt to host the 1966 Games, it would have another opportunity to make its case to hold the 1970 event. Carmichael and his colleagues travelled to Kingston in 1966 confident of securing the support required to bring the Games to Scotland in 1970. In our next blog we’ll look at how they succeeded in making the case for Edinburgh. ‘Scotland Invites’, title page to document supporting Edinburgh’s bid to host the 1966 Commonwealth Games (Willie Carmichael Archive). Link to Post | Language: English friday art blog: kate downie Posted on August 27, 2020 from Culture on Campus Nanbei by Kate Downie (Oil on canvas, 2013) During a series of visits to China a few years ago, Kate Downie was brought into contact with traditional ink painting techniques, and also with the China of today. There she encountered the contrasts and meeting points between the epic industrial and epic romantic landscapes: the motorways, rivers, cityscapes and geology – all of which she absorbed and reflected on in a series of oil and ink paintings. As Kate creates studies for her paintings in situ, she is very much immersed in the landscapes that she is responding to and reflecting on. The artwork shown above, ‘Nanbei’, which was purchased by the Art Collection in 2013, tackles similar themes to Downie’s Scottish-based work, reflecting both her interest in the urban landscape and also the edges where land meets water. Here we encounter both aspects within a new setting – an industrial Chinese landscape set by the edge of a vast river. Downie is also obsessed with bridges. As well as the bridge that appears in this image, seemingly supported by trees that follow its line, the space depicted forms an unseen bridge between two worlds and two extremes, between epic natural and epic industrial forms. In this imagined landscape, north meets south (Nanbei literally means North South) and mountains meet skyscrapers; here both natural and industrial structures dominate the landscape.
This juxtaposition is one of the aspects of China that impressed the artist and inspired the resulting work. After purchasing this work by Kate Downie, the Art Collection invited her to be one of three exhibiting artists in its exhibition ‘Reflections of the East’ in 2015 (the other two artists were Fanny Lam Christie and Emma Scott Smith). All artists had links to China, and ‘Nanbei’ was central to the display of works in the Crush Hall that Kate had entitled ‘Shared Vision’. Temple Bridge (Monoprint, 2015) Kate Downie studied Fine Art at Gray’s School of Art, Aberdeen, and has held artists’ residencies in the USA and Europe. She has exhibited widely and has also taught and directed major art projects. In 2010 Kate Downie travelled to Beijing and Shanghai to work with ink painting masters and she has since returned there several times, slowly building a lasting relationship with Chinese culture. On a recent visit she learned how to carve seals from soapstone, and these red stamps can now be seen on all of her work, including on her print ‘Temple Bridge’ above, which was purchased by the Collection at the end of the exhibition. Kate Downie recently gave an interesting online talk about her work and life in lockdown. It was organised by The Scottish Gallery in Edinburgh, which is currently holding an exhibition entitled ‘Modern Masters Women’ featuring many women artists. Watch Kate Downie’s talk below: Link to Post | Language: English Telling Untold Stories Through the Emmett Till Archives Posted on August 27, 2020 from Illuminations Detail of a newspaper clipping from the Joseph Tobias Papers, MSS 2017-002 Friday, August 28th, marks the 65th anniversary of the abduction and murder of Emmett Till. Till’s murder is regarded as a significant catalyst for the mid-century African-American Civil Rights Movement. Calls for justice for Till still drive national conversations about racism and oppression in the United States. In 2015, Florida State University (FSU) Libraries Special Collections & Archives established the Emmett Till Archives in collaboration with Emmett Till scholar Davis Houck, filmmaker Keith Beauchamp, and author Devery Anderson. Since then, we have continued to build robust research collections of primary and secondary sources related to the life, murder, and commemoration of Emmett Till. We invite researchers from around the world, from any age group, to explore these collections and ask questions. It is through research and exploration of original, primary resources that Till’s story can be best understood and that truth can be shared. “Mamie had a little boy…”, from the Wright Family Interview, Keith Beauchamp Audiovisual Recordings, MSS 2015-016 FSU Special Collections & Archives. As noted in our Emmett Till birthday post this year, an interview with Emmett Till’s family, conducted by civil rights filmmaker Keith Beauchamp in 2018, is now available through the FSU Digital Library in two parts. Willie Wright, Thelma Wright Edwards, and Wilma Wright Edwards were kind enough to share their perspectives with Beauchamp and in a panel presentation at the FSU Libraries Heritage Museum that spring. Soon after this writing, original audio and video files from the interview will also be available to any visitor, researcher, or aspiring documentary filmmaker through the FSU Digital Library. Emmett Till, December 1954.
Image from the Davis Houck Papers A presentation by a Till scholar in 2019 led to renewed contact with and a valuable donation from FSU alum Steve Whitaker, who in a way was the earliest contributor to Emmett Till research at FSU. His seminal 1963 master’s thesis, completed right here at Florida State University, is still the earliest known scholarly work on the kidnapping and murder of Till, and was influential on many subsequent retellings of the story. The Till Archives recently received a few personal items from Whitaker documenting life in mid-century Mississippi, as well as a small library of books on Till, Mississippi law, and other topics that can give researchers valuable context for his thesis and the larger Till story. In the future, the newly-founded Emmett Till Lecture and Archives Fund will ensure further opportunities to commemorate Till through events and collection development. FSU Libraries will continue to partner with Till’s family, the Emmett Till Memory Project, Emmett Till Interpretive Center, the Emmett Till Project, the FSU Civil Rights Institute, and other institutions and private donors to collect, preserve and provide access to the ongoing story of Emmett Till. Sources and Further Reading FSU Libraries. Emmett Till Archives Research Guide. https://guides.lib.fsu.edu/till Wright Family Interview, Keith Beauchamp Audiovisual Recordings, MSS 2015-016, Special Collections & Archives, Florida State University, Tallahassee, Florida. Interview Part I: http://purl.flvc.org/fsu/fd/FSU_MSS2015-016_BD_001 Interview Part II: http://purl.flvc.org/fsu/fd/FSU_MSS2015-016_BD_002 Link to Post | Language: English Former Congressman Trey Gowdy Appointed to the PIDB Posted on August 26, 2020 from Transforming Classification On August 24, 2020, House Minority Leader Kevin McCarthy (R-CA) appointed former Congressman Harold W. “Trey” Gowdy, III as a member of the Public Interest Declassification Board. Mr. Gowdy served four terms in Congress, representing his hometown of Spartanburg in South Carolina’s 4th congressional district. The Board members and staff welcome Mr. Gowdy and look forward to working with him in continuing efforts to modernize and improve how the Federal Government classifies and declassifies sensitive information. Mr. Gowdy was appointed by Minority Leader McCarthy on August 24, 2020. He is serving his first three-year term on the Board. His appointment was announced on August 25, 2020, in the Congressional Record: https://www.congress.gov/116/crec/2020/08/25/CREC-2020-08-25-house.pdf Link to Post | Language: English Tracey Sterne Posted on August 25, 2020 from NYPR Archives & Preservation In November of 1981, an item appeared in The New York Times -and it seemed all of us in New York (and elsewhere) who were interested in music, radio, and culture in general, saw it:  “Teresa Sterne,” it read, “who in 14 years helped build the Nonesuch Record label into one of the most distinguished and innovative in the recording industry, will be named Director of Music Programming at WNYC radio next month.” The piece went on to promise that Ms. Sterne, under WNYC’s management, would be creating “new kinds of programming -including some innovative approaches to new music and a series of live music programs.”  This was incredible news. Sterne, by this time, was a true cultural legend.
She was known not only for those 14 years she’d spent building Nonesuch, a remarkably smart, serious, and daring record label —but also for how it had all ended, with her sudden dismissal from that label by Elektra, its parent company (whose own parent company was Warner Communications), two years earlier. The widely publicized outrage over her termination from Nonesuch included passionate letters of protest from the likes of Leonard Bernstein, Elliott Carter, Aaron Copland —only the alphabetical beginning of a long list of notable musicians, critics and journalists who saw her firing as a sharp blow to excellence and diversity in music. But the dismissal stood.  By coincidence, only three weeks before the news of her hiring broke, I had applied for a job as a part-time music-host at WNYC. Steve Post, a colleague whom I’d met while doing some producing and on-air work at New York’s decidedly non-profit Pacifica station, WBAI, had come over from there to WNYC, a year before, to do the weekday morning music and news program. “Fishko,” he said to me, “they need someone on the weekends -and I think they want a woman.” My day job of longstanding was as a freelance film editor, but I wanted to keep my hand in the radio world. Weekends would be perfect. In two interviews with executives at WNYC, I had failed to impress. But now I could feel hopeful about making a connection to Ms. Sterne, who was a music person, as was I.  Soon after her tenure began, I threw together a sample tape and got it to her through a contact on the inside. And she said, simply: Yeah, let’s give her a chance. And so it began.  Tracey—the name she was called by all friends and colleagues — seemed, immediately, to be a fascinating, controversial character: she was uniquely qualified to do the work at hand, but at the same time she was a fish out of water. She was un-corporate, not inclined to be polite to the young executives upstairs, and not at all enamored of current trends or audience research. For this we dearly loved her, those of us on the air. She cared how the station sounded, how the music connected, how the information about the music surrounded it. Her preoccupations seemed, even then, to be of the Old School. But she was also fiercely modern in her attitude toward the music, unafraid to mix styles and periods, admiring of new music, up on every instrumentalist and conductor and composer, young, old, avant-garde, traditional. And she had her own emphatic and impeccable taste. Always the best, that was her motto —whatever it is, if it’s great, or even just extremely good, it will distinguish itself and find its audience, she felt.  Tracey Sterne, age 13, rehearsing for a Tchaikovsky concerto performance at WNYC in March 1940. (Finkelstein/WNYC Archive Collections) She had developed her ear and her convictions, as it turned out, as a musician, having been a piano prodigy who performed at Madison Square Garden at age 12. She went on to a debut with the New York Philharmonic, gave concerts at Lewisohn Stadium and the Brooklyn Museum, and so on. I could relate. Though my gifts were not nearly at her level, I, too, had been a dedicated, early pianist and I, too, had looked later for other ways to use what I’d learned at the piano keyboard. And our birthdays were on the same date in March. So, despite being at least a couple of decades apart in age, we bonded.  Tracey’s tenure at WNYC was fruitful, though not long. As she had at Nonesuch, she embraced ambitious and adventurous music programming. 
She encouraged some of the on-air personalities to express themselves about the music, to “personalize” the air, to some degree. That was also happening in special programs launched shortly before she arrived as part of a New Music initiative, with John Schaefer and Tim Page presenting a range of music way beyond the standard classical fare. And because of Tracey’s deep history and contacts in the New York music business, she forged partnerships with music institutions and found ways to work live performances by individual musicians and chamber groups into the programming. She helped me carve out a segment on air for something we called Great Collaborations, a simple and very flexible idea of hers that spread out to every area of music and made a nice framework for some observations about musical style and history. She loved to talk (sometimes to a fault) and brainstorm about ways to enliven the idea of classical music on the radio, not something all that many people were thinking about, then.  But management found her difficult, slow and entirely too perfectionistic. She found management difficult, slow and entirely too superficial. And after a short time, maybe a year, she packed up her sneakers —essential for navigating the unforgiving marble floors in that old place— and left the long, dusty hallways of the Municipal Building.  After that, I occasionally visited Tracey’s house in Brooklyn for events which I can only refer to as “musicales.” Her residence was on the Upper West Side, but this family house was treated as a country place, she’d go on the weekends. She’d have people over, they’d play piano, and sing, and it might be William Bolcom and Joan Morris, or some other notables, spending a musical and social afternoon. Later, she and I produced a big, New York concert together for the 300th birthday of Domenico Scarlatti –which exact date fell on a Saturday in 1985. “Scarlatti Saturday,” we called it, with endless phone-calling, musician-wrangling and fundraising needed for months to get it off the ground.  The concert itself, much of which was also broadcast on WNYC, went on for many hours, with appearances by some of the finest pianists and harpsichordists in town and out, lines all up and down Broadway to get into Symphony Space.  Throughout, Tracey was her incorruptible self — and a brilliant organizer, writer, thinker, planner, and impossibly driven producing-partner.  I should make clear, however, that for all her knowledge and perfectionistic, obsessive behavior, she was never the cliche of the driven, lonely careerist -or whatever other cliche you might want to choose. She was a warm, haimish person with friends all over the world, friends made mostly through music. A case in point: the “Scarlatti Saturday” event was produced by the two of us on a shoestring. And Tracey, being Tracey, she insisted that we provide full musical and performance information in printed programs, offered free to all audience members, and of course accurate to the last comma. How to assure this? She quite naturally charmed and befriended the printer — who wound up practically donating the costly programs to the event. By the time we were finished she was making him batches of her famous rum balls and he was giving us additional, corrected pages —at no extra charge. It was not a calculated maneuver -it was just how she did things.  You just had to love and respect her for the life force, the intelligence, the excellence and even the temperament she displayed at every turn. 
Sometimes even now, after her death many years ago at 73 from ALS, I still feel Tracey Sterne’s high standards hanging over me —in the friendliest possible way. ___________________________________________ Sara Fishko hosts WNYC’s culture series, Fishko Files. Link to Post | Language: English Heroes Work Here Posted on August 24, 2020 from AOTUS The National Archives is home to an abundance of remarkable records that chronicle and celebrate the rich history of our nation. It is a privilege to be Archivist of the United States—to be the custodian of our most treasured documents and the head of an agency with such a unique and rewarding mission. But it is my greatest privilege to work with such an accomplished and dedicated staff—the real treasures of the National Archives go home at night. Today I want to recognize and thank the mission-essential staff of NARA’s National Personnel Records Center (NPRC). Like all NARA offices, the NPRC closed in late March to protect its workforce and patrons from the spread of the pandemic and comply with local government movement orders. While modern military records are available electronically and can be referenced remotely, the majority of NPRC’s holdings and reference activity involve paper records that can be accessed only by on-site staff. Furthermore, these records are often needed to support veterans and their families with urgent matters such as medical emergencies, homeless veterans seeking shelter, and funeral services for deceased veterans. Concerned about the impact a disruption in service would have on veterans and their families, over 150 staff voluntarily set aside concerns for their personal welfare and regularly reported to the office throughout the period of closure to respond to these types of urgent requests. These exceptional staff were pioneers in the development of alternative work processes to incorporate social distancing and other protective measures to ensure a safe work environment while providing this critical service. National Personnel Records Center (NPRC) building in St. Louis The Center is now in Phase One of a gradual re-opening, allowing for additional on-site staff.  The same group that stepped up during the period of closure continues to report to the office and are now joined by additional staff volunteers, enabling them to also respond to requests supporting employment opportunities and home loan guaranty benefits. There are now over 200 staff supporting on-site reference services on a rotational basis. Together they have responded to over 32,000 requests since the facility closed in late March. More than half of these requests supported funeral honors for deceased veterans. With each passing day we are a day closer to the pandemic being behind us. Though it may seem far off, there will come a time when Covid-19 is no longer the threat that it is today, and the Pandemic of 2020 will be discussed in the context of history. When that time comes, the mission essential staff of NPRC will be able to look back with pride and know that during this unprecedented crisis, when their country most needed them, they looked beyond their personal well-being to serve others in the best way they were able. As Archivist of the United States, I applaud you for your commitment to the important work of the National Archives, and as a Navy veteran whose service records are held at NPRC, I thank you for your unwavering support to America’s veterans. 
Link to Post | Language: English Contribute to the FSU Community COVID 19 Project Posted on August 21, 2020 from Illuminations Masks Sign, contributed by Lorraine Mon, view this item in the digital library here Students, faculty, and alumni! Heritage & University Archives is collecting stories and experiences from the FSU community during COVID-19. University life during a pandemic will be studied by future scholars. During this pandemic, we have received requests surrounding the 1918 Flu Pandemic. Unfortunately, not many documents describing these experiences survive in the archive. To create a rich record of life in these unique times, we are asking the FSU Community to contribute their thoughts, experiences, plans, and photographs to the archive. Working from Home, contributed by Shaundra Lee, view this item in the digital library here How did COVID-19 affect your summer? Tell us about your plans for fall. How did COVID-19 change your plans for classes? Upload photographs of your dorm rooms or your work-from-home setups. If you’d like to see examples of what people have already contributed, please see the collection on Diginole. You can add your story to the project here. Link to Post | Language: English 2021 Creative Fellowship – Call for Proposals Posted on August 21, 2020 from Notes For Bibliophiles PPL is now accepting proposals for our 2021 Creative Fellowship! We’re looking for an artist working in illustration or two-dimensional artwork to create new work related to the theme of our 2021 exhibition, Tomboys. View the full call for proposals, including application instructions, here. The application deadline is now April 1, 2021 (originally October 1, 2020)*. *This deadline has shifted since we originally posted this call for proposals! The 2021 Fellowship, and the Exhibition & Program Series, have both been shifted forward by six months due to the coronavirus. Updated deadlines and timeline in the call for proposals! Link to Post | Language: English Friday art blog: still life in the collection Posted on August 20, 2020 from Culture on Campus Welcome to our new regular blog slot, the ‘Friday Art Blog’. We look forward to your continued company over the next weeks and months. You can return to the Art Collection website here, and search our entire permanent collection here. Pears by Jack Knox (Oil on board, 1973) This week we are taking a look at some of the still life works of art in the permanent collection. ‘Still life’ (or ‘nature morte’ as it is also widely known) refers to the depiction of mostly inanimate subject matter. It has been a part of art from the very earliest days, from thousands of years ago in Ancient Egypt, found also on the walls in 1st century Pompeii, and featured in illuminated medieval manuscripts. During the Renaissance, when it began to gain recognition as a genre in its own right, it was adapted for religious purposes. Dutch Golden Age artists in particular, in the early 17th century, depicted objects which had a symbolic significance. The still life became a moralising meditation on the brevity of life and the vanity of the acquisition of possessions. But, with urbanization and the rise of a middle class with money to spend, it also became fashionable simply as a celebration of those possessions – in paintings of rare flowers or sumptuous food-laden table tops with expensive silverware and the best china. The still life has remained a popular feature through many modern art movements.
Artists might use it as an exercise in technique (much cheaper than a live model), as a study in colour, form, or light and shade, or as a meditation in order to express a deeper mood. Or indeed all of these. The works collected by the University of Stirling Art Collection over the past fifty years reflect its continuing popularity amongst artists and art connoisseurs alike. Bouteille et Fruits by Henri Hayden (Lithograph, 75/75, 1968) In the modern era, the still life featured in the post-impressionist art of Van Gogh, Cezanne and Picasso. Henri Hayden trained in Warsaw, but moved to Paris in 1907 where Cezanne and Cubism were influences. From 1922 he rejected this aesthetic and developed a more figurative manner, but later in life there were signs of a return to a sub-cubist mannerism in his work. As a result, the landscapes and still lifes of his last 20 years became both more simplified and more definitely composed than those of the previous period, with an elegant calligraphy. They combine a new richness of colour with lyrical melancholy. Meditation and purity of vision mark the painter’s last years. Black Lace by Anne Redpath (Gouache, 1951) Anne Redpath is best known for her still lifes and interiors, often with added textural interest, and also with the slightly forward-tilted table top, of which this painting is a good example. Although this work is largely monochrome, it retains the fascination the artist had with fabric and textiles – the depiction of the lace is enhanced by the restrained palette. Untitled still life by Euan Heng (Linocut, 1/5, 1974) While Euan Heng’s work is contemporary in practice, his imagery is not always contemporary in origin. He has long been influenced by Italian iconography, medieval paintings and frescoes. Origin of a rose by Ceri Richards (Lithograph, 30/70, 1967) In Ceri Richards’ work there is a constant recurrence of visual symbols and motifs always associated with the mythic cycles of nature and life. These symbols include rock formations, plant forms, sun, moon and seed-pods, leaf and flower. These themes refer to the cycle of human life and its transience within the landscape of earth. Still Life, Summer by Elizabeth Blackadder (Oil on canvas, 1963) This is a typical example of one of Elizabeth Blackadder’s ‘flattened’ still life paintings, with no perspective. Works such as this retain the form of the table, with the top raised to give the fullest view. Broken Cast by David Donaldson (Oil on canvas, 1975) David Donaldson was well known for his still lifes and landscape paintings as well as literary, biblical and allegorical subjects. Flowers for Fanny by William MacTaggart (Oil on board, 1954) William MacTaggart typically painted landscapes, seascapes and still lifes featuring vases of flowers. These flowers, for his wife, Fanny Aavatsmark, are unusual for not being poppies, his most commonly painted flower. Cake by Fiona Watson (Digital print, 18/25, 2009) We end this blog post with one of the most popular still lifes in the collection. This depiction of the Scottish classic, the Tunnock’s teacake, is a modern take on the still life. It is a firm favourite whenever it is on display. Image by Julie Howden Link to Post | Language: English Solar Energy: A Brief Look Back Posted on August 20, 2020 from Illuminations In the early 1970s, the United States was in the midst of an energy crisis. Massive oil shortages and high prices made it clear that alternative ideas for energy production were needed, and solar power was a clear front-runner.
The origins of the solar cell in the United States date back to inventor Charles Fritts in the 1880s, and the first attempts at harvesting solar energy for homes, to the late 1930s. In 1974, the State of Florida put its name in the ring to become the host of the National Solar Energy Research Institute. Site proposal for the National Solar Energy Research Institute. Claude Pepper Papers S. 301 B. 502 F. 4 With potential build sites in Miami and Cape Canaveral, the latter possessing the added benefit of proximity to NASA, the Florida Solar Energy Task Force, led by Robert Nabors and endorsed by Representative Pepper, felt confident. The state made it to the final rounds of the search before the final location of Golden, Colorado, was settled upon; the institute opened there in 1977. Around this same time, however (1975), the Florida Solar Energy Center was established at the University of Central Florida. The Claude Pepper Papers contain a wealth of information on Florida’s efforts in the solar energy arena from the onset of the energy crisis to the late 1980s. Carbon copy of correspondence between Claude Pepper and Robert L. Nabors regarding the Cape Canaveral proposed site for the National Solar Research Institute. Claude Pepper Papers S. 301 B. 502 F. 4 Earlier this year, “Tallahassee Solar II”, a new solar energy farm, began operating in Florida’s capital city. Located near the Tallahassee International Airport, it provides electricity for more than 9,500 homes in the Leon County area. With the steady gains that the State of Florida continues to make in the area of solar energy expansion, it gets closer to fully realizing its nickname, “the Sunshine State.” Link to Post | Language: English (C)istory Lesson Posted on August 18, 2020 from Illuminations Our next submission is from Rachel Duke, our Rare Books Librarian, who has been with Special Collections for two years. This project was primarily geared towards full-time faculty and staff, so I chose to highlight her contribution to see what a full-time faculty member’s experience would be like looking through the catalog. Frontispiece and Title Page, Salome, 1894. Image from https://collection.cooperhewitt.org/objects/68775953/ The item she chose as her object was Salome, originally written in French by Oscar Wilde and then translated into English. While this book does not explicitly identify as a “Queer Text,” Wilde has become canonized in queer historical literature. In the first edition of the book, there is even a dedication to his lover, Lord Alfred Bruce Douglas, who helped with the translation. While there are documented historical examples of what we would refer to today as “queerness” (queer meaning non-straight), there is still no demarcation of his queerness anywhere in the catalog record. Although the author is not necessarily unpacking his own queer experiences in the text, “both [Salome’s] author and its legacy participate strongly in queer history” as Duke states in her submission.  Oscar Wilde and Lord Alfred Bruce Douglas Even though Wilde was in a queer relationship with Lord Alfred Bruce Douglas, and has been accepted into the Queer canon, why doesn’t his catalog record reflect that history? Well, a few factors come into play. One of the main ones is an aversion to retroactively labeling historical figures. Since we cannot confirm which modern label would fit Wilde, we can’t necessarily outright label him as gay.
How would a queer researcher like me go about finding authors and artists from the past who are connected with queer history? It is important to acknowledge LGBTQ+ erasure when discussing this topic. Since the LGBTQ+ community has historically been marginalized, documentation of queerness is hard to come by because: People did not collect, and even actively erased, Queer and Trans Histories. LGBTQ+ history has been passed down primarily as an oral tradition. Historically, we cannot confirm which labels people would have identified with. Language and social conventions change over time. So while we view and know someone to be queer, since it is not in official documentation we have no “proof.” On the other hand, in some cultures, gay relations were socially acceptable. For example, in the Middle Ages, there was a legislatively approved form of same-sex marriage, known as affrèrement. This example is clearly labeled as *gay* in related library-based description because it was codified that way in the historical record. By contrast, Shakespeare’s sonnets, which (arguably) use queer motifs and themes, are not labeled as “queer” or “gay.” Does queer content mean we retroactively label the AUTHOR queer? Does the implication of queerness mean we should make the text discoverable under queer search terms? Cartoon depicting Oscar Wilde’s visit to San Francisco. By George Frederick Keller – The Wasp, March 31, 1882. Personally, I see both sides. As someone who is queer, I would not want a random person trying to retroactively label me as something I don’t identify with. On the other hand, as a queer researcher, I find it vital to have access to that information. Although they might not have been seen as queer in their time period, their experiences speak to queer history. Identities and people will change, which is completely normal, but as a group that has experienced erasure of their history, it is important to acknowledge all examples of historical queerness as a proof that LGBTQ+ individuals have existed throughout time. How do we responsibly and ethically go about making historical queerness discoverable in our finding aids and catalogs? Click Here to see some more historical figures you might not have known were LGBTQ+. Link to Post | Language: English
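One practical question underneath the (C)istory Lesson post above is where, mechanically, a queer-history connection can live in a catalog record once a library decides to record it. The sketch below is only an illustration, not FSU's practice and not a recommendation of particular headings: it uses pymarc (assuming pymarc 4.x-style subfield lists; pymarc 5 switched to Subfield objects) to add a hypothetical local subject access point and a public note to a minimal record for Salome. MARC reserves the 69X block for locally defined subject fields, and 500 general notes are keyword-searchable in most catalogs, so both are plausible homes for this kind of context.

```python
# A minimal, hypothetical sketch of recording a queer-history connection in a
# catalog record without asserting an identity label in an authority-controlled
# heading. Field values here are illustrative only.
# Assumes pymarc 4.x-style subfield lists (pymarc 5.x uses Subfield objects).
from pymarc import Record, Field

record = Record()
record.add_field(
    # Basic title statement for the work discussed in the post.
    Field(tag="245", indicators=["1", "0"],
          subfields=["a", "Salome :", "b", "a tragedy in one act /",
                     "c", "Oscar Wilde."]),
    # 69X is reserved for local use; the heading text is a made-up local term.
    Field(tag="690", indicators=[" ", " "],
          subfields=["a", "Queer history (local heading)"]),
    # A public note explaining the basis for the access point, visible to
    # researchers and indexed for keyword searching in most catalogs.
    Field(tag="500", indicators=[" ", " "],
          subfields=["a", "The 1894 English edition was translated with the "
                          "involvement of Lord Alfred Douglas; the work figures "
                          "prominently in queer literary history."]),
)
print(record)  # text (mnemonic) view of the record for review before loading
```

Keeping the assertion in a local field plus a note, rather than in a controlled name or subject heading, is one way to make the material findable under queer-history searches while leaving the retroactive-labeling question to local policy.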
arstechnica-com-5132 ---- Dogecoin has risen 400 percent in the last week because why not | Ars Technica Dogecoin rallied after Elon Musk tweeted a photo of "Doge Barking at the Moon." Timothy B.
Lee - Apr 16, 2021 6:56 pm UTC Dogecoin, a blockchain-based digital currency named for a meme about an excitable canine, has seen its price rise by a factor of five over the last week. The price spike has made it one of the world's 10 most valuable cryptocurrencies, with a market capitalization of $45 billion. Understanding the value of cryptocurrencies is never easy, and it's especially hard for Dogecoin, which was created as a joke. Dogecoin isn't known for any particular technology innovations and doesn't seem to have many practical applications. What Dogecoin does have going for it, however, is memorable branding and an enthusiastic community of fans. And in 2021, that counts for a lot. In recent months, we've seen shares of GameStop soar to levels that are hard to justify based on the performance of GameStop's actual business. People bought GameStop because it was fun and they thought the price might go up. So too for Dogecoin. Tesla CEO Elon Musk may have also played an important role in Dogecoin's ascendancy. Musk has periodically tweeted about the cryptocurrency, and those tweets are frequently followed by rallies in Dogecoin's price. Late on Wednesday night, Musk tweeted out this image: Doge Barking at the Moon pic.twitter.com/QFB81D7zOL — Elon Musk (@elonmusk) April 15, 2021 Dogecoin's price tripled over the next 36 hours. My editor suggested that I write about whether Dogecoin's rise is a sign of an overheated crypto market, but for a coin like Dogecoin, I'm not sure that's even a meaningful concept. Dogecoin isn't a company that has revenues or profits. And unlike bitcoin and ether, no one seriously thinks it's going to be the foundation of a new financial system. People are trading Dogecoin because it's fun to trade and because they think they might make money from it. The rising price is a sign that a lot of people have decided it would be fun to speculate in Dogecoin. Of course, the fact that lots of people have money to spend on joke investments might itself be a result of larger macroeconomic forces. The combination of stimulus spending, low interest rates, and pandemic-related saving means that a lot of people have more money than usual sitting in their bank accounts. And restrictions on travel and nightlife mean that many of those same people have a lot of time on their hands.
arstechnica-com-8015 ---- Tesla: "Full self-driving beta" isn't designed for full self-driving | Ars Technica Tesla told California regulators the FSD beta lacks "true autonomous features." Timothy B. Lee - Mar 9, 2021 11:15 pm UTC YouTuber Brandon M captured this drone footage of his Tesla steering toward a parked car in October 2020, shortly after the FSD beta became available to the public. Brandon M / YouTube The transparency site PlainSite recently published a pair of letters Tesla wrote to the California Department of Motor Vehicles in late 2020. The letters cast doubt on Elon Musk's optimistic timeline for the development of fully driverless technology. For years, Elon Musk has been predicting that fully driverless technology is right around the corner. At an April 2019 event, Musk predicted that Teslas would be capable of fully driverless operation—known in industry jargon as "level 5"—by the end of 2020. "There's three steps to self-driving," Musk told Tesla investors at the event. "There's being feature complete. Then there's being feature complete to the degree where we think the person in the car does not need to pay attention. And then there's being at a reliability level where we also convince regulators that that is true." Tesla obviously missed Musk's 2020 deadline. But you might be forgiven for thinking Tesla is now belatedly executing the strategy he described two years ago. In October, Tesla released what it called its "full self-driving beta" software to a few-dozen Tesla owners. A few days ago, Musk announced plans to expand the program to more customers. Given that the product is called "full self-driving," this might seem like the first step in Musk's three-step progression. After a few more months of testing, perhaps it will become reliable enough to operate without human supervision. That could allow Musk to make good on his latest optimistic timeline for Autopilot: in a December 2020 interview, Musk said he was "extremely confident" that Tesla vehicles would reach level 5 by the end of 2021. But a letter Tesla sent to California regulators the same month had a different tone.
Despite the "full self-driving" name, Tesla admitted it doesn't consider the current beta software suitable for fully driverless operation. The company said it wouldn't start testing "true autonomous features" until some unspecified point in the future. “We do not expect significant enhancements” Enlarge In a pair of letters last November and December, officials at the California DMV asked Tesla for details about the FSD beta program. Tesla requires drivers using the beta software to actively supervise it so they can quickly intervene if needed. The DMV wanted to know if Tesla planned to relax requirements for human supervision once the software was made available to the general public. Advertisement In its first response, sent in November, Tesla emphasized that the beta software had limited functionality. Tesla told state regulators that the software is "not capable of recognizing or responding" to "static objects and road debris, emergency vehicles, construction zones, large uncontrolled intersections with multiple incoming ways, occlusions, adverse weather, complicated or adversarial vehicles in the driving path, and unmapped roads." In a December follow-up, Tesla added that "we expect the functionality to remain largely unchanged in a future, full release to the customer fleet." Tesla added that "we do not expect significant enhancements" that would "shift the responsibility for the entire dynamic driving task to the system." The system "will continue to be an SAE Level 2, advanced driver-assistance feature." SAE level 2 is industry jargon for driver-assistance systems that perform functions like lane-keeping and adaptive cruise control. By definition, level 2 systems require continual human oversight. Fully driverless systems—like the taxi service Waymo is operating in the Phoenix area—are considered level 4 systems. In its letter to California officials, Tesla added that "Tesla's development of true autonomous features will follow our iterative process (development, validation, early release, etc.) and any such features will not be released to the general public until we have fully validated them." Critics pounced on the disclosure. "Here it is, straight from Tesla," tweeted prominent Tesla skeptic Ed Niedermeyer. "'Full Self-Driving' is not, and will never be, actually self-driving." This might not be quite fair to Tesla—the company apparently does plan to develop more advanced software eventually. But at a minimum, Tesla's public communication about the full self-driving package could easily give customers the wrong impression about the software's future capabilities. Full autonomy is always right around the corner Enlarge / Elon Musk in 2020. BRENDAN SMIALOWSKI / Getty Since 2016, Tesla has given customers every reason to expect that its "full self-driving" software would be, well, fully self-driving. Early promotional materials for the FSD package described a driver getting out of the vehicle and having it find a parking spot on its own. Tesla has repeatedly talked about the FSD package enabling a Tesla vehicle to operate as an autonomous taxi—an application that requires the car to drive itself without anyone behind the wheel. In 2016, Musk predicted that, within two years, a Tesla owner in Los Angeles would be able to summon their vehicle from New York City. 
Advertisement Further Reading Tesla’s autonomy event: Impressive progress with an unrealistic timeline If Tesla is really going to achieve fully driverless operation in 2021, that doesn't leave much time to develop, test, and validate complex, safety-critical software. So it would be natural for customers to assume that the software Tesla named "Full Self Driving beta" is, in fact, a beta version of Tesla's long-awaited fully self-driving software. But in its communications with California officials, Tesla makes it clear that's not true. Of course, Elon Musk has a long history of announcing over-optimistic timelines for his products. It's not really news that Tesla failed to meet an optimistic deadline set by its CEO. But there's a deeper philosophical issue that may go beyond a few blown deadlines. The long road to full autonomy Enlarge / Waymo tested its driverless taxis in the Phoenix area for more than three years before beginning driverless commercial operations. Waymo Tesla's overall Autopilot strategy is to start with a driver-assistance system and gradually evolve it into a fully driverless system. A bunch of other companies in the industry—led by Google's Waymo—believe that this is a mistake. They think the requirements of the two products are so different that it makes more sense to create a driverless taxi, shuttle, or delivery service from scratch. In particular, companies like Waymo argue that it's too difficult to get regular customers to pay close attention to an almost-but-not-fully driverless vehicle. If a car drives perfectly for 1,000 miles and then makes a big mistake, there's a significant risk the human driver won't be paying close enough attention to prevent a crash. Waymo initially considered creating an Autopilot-like driver assistance system and licensing it to automakers, but the company ultimately decided that doing so would be too risky. Musk has always shrugged this critique off. As we've seen, he believes improvements to Autopilot's driver-assistance features will transform it into a system capable of fully driverless operation. But in its comments to the DMV, Tesla seems to endorse the opposite viewpoint: that adding "true autonomous features" to Autopilot will require more than just incrementally improving the performance of its existing software. Tesla acknowledged that it needs more sophisticated systems for handling the "static objects, road debris, emergency vehicles, construction zones." And this makes it a little hard to believe Musk's boast that Tesla will achieve level 5 autonomy by the end of 2021. Notably, Google's prototype self-driving vehicles have been able to navigate most roadway conditions—much like today's Tesla FSD beta software—since roughly 2015. Yet the company needed another five years to refine the technology enough to enable fully driverless operation. And that was within a limited geographic area and with help from powerful lidar sensors. Tesla is trying to achieve the same feat for every street nationwide—and using only cameras and radar. Perhaps Tesla will move faster than Waymo, and it won't take another five years to achieve fully driverless operation. But customers considering whether to pay $10,000 for Tesla's full self-driving software package should certainly take Musk's optimistic timeline with a grain of salt. Promoted Comments Frodo Douchebaggins Ars Tribunus Militum jump to post I bought FSD over three years ago when Elon's charisma roll beat my wisdom save. 
There have been a number of questionable claims, and a few outright lies regarding things that were 100% within their control ( https://web.archive.org/web/20190304232 ... capability being a notable example. TLDR; If you bought FSD and were not thrilled with the price drop after you bought it but well before a single feature was delivered, take heart, for you won’t receive a refund of the difference like an ethical business will do, but you will receive an invite to the early access program and get to use the upcoming features before other people! Except that a month or so later they took down that blog post, and the invites never happened.) I understand that they probably really did think they'd be further along now than they are, but the fact that they're not letting us transfer our FSD license if we want to buy a new car means I'm likely not buying another tesla. It's tantamount to preordering a product and then being told you can't change your shipping address when you move a few years later, and is a slap in the face to the early FSD buyers who have received only a single small feature for that money. Why would I give them more money after what they've done? Their pride is costing them their relationship with the customers that should be the most loyal, and they're doing it over something that costs them nothing except flipping a bit each on the current car and the new car, and for that they get to sell another car. At this point I think we have to be close to large class actions finally emerging, and while they won't help us get what we paid for, and won't get our money back, maybe it'll hurt enough to make them stop over promising and underdelivering 2710 posts | registered 12/17/2012 nimble Ars Centurion et Subscriptor jump to post jeffpkamp wrote: Soon tesla will have four assist levels. Autopilot, Full Self driving, level 5, and , no-for-real-la-to-ny-without-assistance (NFRLNWA). But all joking and sarcasm aside. I've watched probably 2 hours of FSD beta videos, and the only thing that really seemed to give the car trouble was roundabouts. Residential level streets, busy commercial streets, and highways were all navigated with relatives from what I could see. And the computer seemed to be accurately picking up everything important. There were some hilarious glitches as the CV software tried to classify things like turning semi trucks, but it got the positions right. Honestly I think this is just a report from someone who speaks fluent bureaucrat, which Elon obviously does not. In terms of achieving full unmonitored self driving, a video like that means very little. It can demonstrate that the car did the right thing in the particular circumstances of the video. It says next to nothing about how reliably it can do it, nor whether it can deal with the enormous range of possible situations that aren't in the video. For such a video to be meaningful, it would need to be hundeds of thousands of hours long, contain a random selection of driving situations that the software is expected to deal with, and show that no driver interventions were required. In other words, it's only large scale statistics that can demonstrate whether self driving systems are safe, not a bunch of short and quite possibly cherry-picked video clips. The good news is that Tesla are collecting those statistics. The bad news is that they're not sharing them. https://www.forbes.com/sites/bradtemple ... 
95cc0f7fab 216 posts | registered 5/30/2005 arxiv-org-2331 ---- None bibwild-wordpress-com-389 ---- Bibliographic Wilderness Code that Lasts: Sustainable And Usable Open Source Code A presentation I gave at online conference Code4Lib 2021, on Monday March 21. I have realized that the open source projects I am most proud of are a few that have existed for years now, increasing in popularity, with very little maintenance required. Including traject and bento_search. While community aspects matter for open source sustainability, … Continue reading Code that Lasts: Sustainable And Usable Open Source Code → Product management In my career working in the academic sector, I have realized that one thing that is often missing from in-house software development is “product management.” But what does that mean exactly? You don’t know it’s missing if you don’t even realize it’s a thing and people can use different terms to mean different roles/responsibilities. Basically, … Continue reading Product management → Rails auto-scaling on Heroku We are investigating moving our medium-small-ish Rails app to heroku. We looked at both the Rails Autoscale add-on available on heroku marketplace, and the hirefire.io service which is not listed on heroku marketplace and I almost didn’t realize it existed. I guess hirefire.io doesn’t have any kind of a partnership with heroku, but still uses … Continue reading Rails auto-scaling on Heroku → Managed Solr SaaS Options I was recently looking for managed Solr “software-as-a-service” (SaaS) options, and had trouble figuring out what was out there. So I figured I’d share what I learned. Even though my knowledge here is far from exhaustive, and I have only looked seriously at one of the ones I found. The only managed Solr options I … Continue reading Managed Solr SaaS Options → Gem authors, check your release sizes Most gems should probably be a couple hundred kb at most. I’m talking about the package actually stored in and downloaded from rubygems by an app using the gem. After all, source code is just text, and it doesn’t take up much space. OK, maybe some gems have a couple images in there.
But if … Continue reading Gem authors, check your release sizes → Every time you decide to solve a problem with code… Every time you decide to solve a problem with code, you are committing part of your future capacity to maintaining and operating that code. Software is never done. Software is drowning the world by James Abley Updating SolrCloud configuration in ruby We have an app that uses Solr. We currently run a Solr in legacy “not cloud” mode. Our solr configuration directory is on disk on the Solr server, and it’s up to our processes to get our desired solr configuration there, and to update it when it changes. We are in the process of moving … Continue reading Updating SolrCloud configuration in ruby → Are you talking to Heroku redis in cleartext or SSL? In “typical” Redis installation, you might be talking to redis on localhost or on a private network, and clients typically talk to redis in cleartext. Redis doesn’t even natively support communications over SSL. (Or maybe it does now with redis6?) However, the Heroku redis add-on (the one from Heroku itself) supports SSL connections via “Stunnel”, … Continue reading Are you talking to Heroku redis in cleartext or SSL? → Comparing performance of a Rails app on different Heroku formations I develop a “digital collections” or “asset management” app, which manages and makes digitized historical objects and their descriptions available to the public, from the collections here at the Science History Institute. The app receives relatively low level of traffic (according to Google Analytics, around 25K pageviews a month), although we want it to be … Continue reading Comparing performance of a Rails app on different Heroku formations → Deep Dive: Moving ruby projects from Travis to Github Actions for CI So this is one of my super wordy posts, if that’s not your thing abort now, but some people like them. We’ll start with a bit of context, then get to some detailed looks at Github Actions features I used to replace my travis builds, with example config files and examination of options available. For … Continue reading Deep Dive: Moving ruby projects from Travis to Github Actions for CI → Unexpected performance characteristics when exploring migrating a Rails app to Heroku I work at a small non-profit research institute. I work on a Rails app that is a “digital collections” or “digital asset management” app. Basically it manages and provides access (public as well as internal) to lots of files and description about those files, mostly images. It’s currently deployed on some self-managed Amazon EC2 instances … Continue reading Unexpected performance characteristics when exploring migrating a Rails app to Heroku → faster_s3_url: Optimized S3 url generation in ruby Subsequent to my previous investigation about S3 URL generation performance, I ended up writing a gem with optimized implementations of S3 URL generation. github: faster_s3_url It has no dependencies (not even aws-sdk). It can speed up both public and presigned URL generation by around an order of magnitude. In benchmarks on my 2015 MacBook compared … Continue reading faster_s3_url: Optimized S3 url generation in ruby → Delete all S3 key versions with ruby AWS SDK v3 If your S3 bucket is versioned, then deleting an object from s3 will leave a previous version there, as a sort of undo history. You may have a “noncurrent expiration lifecycle policy” set which will delete the old versions after so many days, but within that window, they are there. 
What if you were deleting … Continue reading Delete all S3 key versions with ruby AWS SDK v3 → Github Actions tutorial for ruby CI on Drifting Ruby I’ve been using travis for free automated testing (“continuous integration”, CI) on my open source projects for a long time. It works pretty well. But it’s got some little annoyances here and there, including with github integration, that I don’t really expect to get fixed after its acquisition by private equity. They also seem to … Continue reading Github Actions tutorial for ruby CI on Drifting Ruby → More benchmarking optimized S3 presigned_url generation In a recent post, I explored profiling and optimizing S3 presigned_url generation in ruby to be much faster. In that post, I got down to using a Aws::Sigv4::Signer instance from the AWS SDK, but wondered if there was a bunch more optimization to be done within that black box. Julik posted a comment on that … Continue reading More benchmarking optimized S3 presigned_url generation → Delivery patterns for non-public resources hosted on S3 I work at the Science History Institute on our Digital Collections app (written in Rails), which is kind of a “digital asset management” app combined with a public catalog of our collection. We store many high-resolution TIFF images that can be 100MB+ each, as well as, currently, a handful of PDFs and audio files. We … Continue reading Delivery patterns for non-public resources hosted on S3 → Speeding up S3 URL generation in ruby It looks like the AWS SDK is very slow at generating S3 URLs, both public and presigned, and that you can generate around an order of magnitude faster in both cases. This can matter if you are generating hundreds of S3 URLs at once. My app The app I work is a “digital collections” or … Continue reading Speeding up S3 URL generation in ruby → A custom local OHMS front-end Here at the Science History Institute, we’ve written a custom OHMS viewer front-end, to integrate seamlessly with our local custom “content management system” (a Rails-based digital repository app with source available), and provide some local functionality like the ability to download certain artifacts related to the oral history. We spent quite a bit of energy … Continue reading A custom local OHMS front-end → Encrypting patron data (in Rails): why and how Special guest post by Eddie Rubeiz I’m Eddie Rubeiz. Along with the owner of this blog, Jonathan Rochkind, and our system administrator, Dan, I work on the Science History Institute’s digital collections website, where you will find, among other marvels, this picture of the inventor of Styrofoam posing with a Santa “sculpture”, which predates the … Continue reading Encrypting patron data (in Rails): why and how → Intentionally considering fixity checking In our digital collections app rewrite at Science History Institute, we took a moment to step back and  be intentional about how we approach “fixity checking” features and UI, to make sure it’s well-supporting the needs it’s meant to.  I think we do a good job of providing UI to let repository managers and technical … Continue reading Intentionally considering fixity checking → bitcoinmagazine-com-5669 ---- What Is The Bitcoin Block Size Limit? 
- Bitcoin Magazine: Bitcoin News, Articles, Charts, and Guides What Is The Bitcoin Block Size Limit? Author: Bitcoin Magazine Publish date: Aug 17, 2020 The Bitcoin block size limit is a parameter in the Bitcoin protocol that limits the size of Bitcoin blocks, and, therefore, the number of transactions that can be confirmed on the network approximately every 10 minutes. Although Bitcoin launched without this parameter, Satoshi Nakamoto added a 1 megabyte block size limit back when he was still the lead developer of the project. This translated into about three to seven transactions per second, depending on the size of transactions. Further Reading: Who Created Bitcoin? In 2017, Bitcoin’s block size limit was replaced by a block weight limit of 4 million “weight units.” This changed how data in blocks is “counted”: some data weighs more than other data. Perhaps more importantly, it also represented an effective block size limit increase: Bitcoin blocks now have a theoretical maximum size of 4 megabytes and a more realistic maximum size of 2 megabytes. The exact size depends on the types of transactions included. Why Is the Block Size Limit Controversial? The block size limit is controversial because there is disagreement over whether or not such a limit “should be” part of the Bitcoin protocol, and if it should, how big it should be. Satoshi Nakamoto never publicly specified why he added a block size limit to the Bitcoin protocol. It has been speculated that he intended it to be an anti-spam measure, to prevent an attacker from overloading the Bitcoin network with artificially large Bitcoin blocks full of bogus transactions. Some have also speculated that he intended for it to be a temporary measure, but it is unclear how temporary or under what conditions he foresaw the block size limit being increased or lifted. The code itself that enforces the block size limit certainly wasn’t temporary. Further Reading: Can Bitcoin Scale? A couple of years after Satoshi Nakamoto left the project, developers and users started to disagree on the temporality and necessity of the block size limit. As Bitcoin’s user base grew, some believed it was time to increase or lift the block size limit entirely, specifically before Bitcoin blocks would start filling up with transactions. Others came to believe that the block size limit represents a vital security parameter of the protocol and believed it should not be lifted — or at least, it should be lifted more conservatively. Yet others think that the 1 megabyte limit put in place by Satoshi Nakamoto was actually too large, and advocated for a block size limit decrease. Adding more complications, since Bitcoin is decentralized, no particular group or person is in charge of decisions like increasing or decreasing the block size. Disagreements on how such decisions should be made, by whom, or if they should be made at all, have probably led to at least as much controversy as the block size limit itself — but this aspect of the debate is outside the scope of this article. Further Reading: What Is Bitcoin? Why Shouldn’t Bitcoin Blocks Be Too Small? Note: Almost anything about Bitcoin’s block size limit and the risks of it being too big or too small is contested, but these are some of the more general arguments.
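To make the “weight units” accounting introduced earlier in this article concrete, here is a minimal sketch in Python of how block weight is computed under the 2017 rules (BIP 141): bytes outside the segregated-witness portion of a transaction count as 4 weight units each, witness bytes count as 1, and a block may contain at most 4,000,000 weight units. The byte counts below are illustrative assumptions, not figures from the article.

    MAX_BLOCK_WEIGHT = 4_000_000  # consensus limit, in weight units

    def block_weight(non_witness_bytes: int, witness_bytes: int) -> int:
        # Non-witness data is weighted 4x; segregated-witness data is weighted 1x.
        return 4 * non_witness_bytes + witness_bytes

    # A block with no witness data hits the limit at 1,000,000 bytes -- the old 1 MB cap:
    print(block_weight(1_000_000, 0) == MAX_BLOCK_WEIGHT)        # True
    # A block that is mostly witness data can approach 4 MB of raw size:
    print(block_weight(100_000, 3_600_000) <= MAX_BLOCK_WEIGHT)  # True (3.7 MB of data)
    # Realistic transaction mixes land closer to 2 MB per block, as the article notes.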
If Bitcoin blocks are too small, not many transactions can be processed by the Bitcoin network. Broadly speaking, proponents of a block size limit increase (“big blockers”) argue this can have two negative consequences. Not Enough Space? Firstly, smaller bitcoin blocks would mean that there isn’t enough space to include everyone’s transactions in these blocks, and the transaction fee “bidding war” to get transactions confirmed would price most people out of using bitcoin at all. Instead, it could lead to a future where only bank-like institutions make transactions with one another, while regular users hold accounts with these institutions. This would, in turn, open the door to fractional reserve banking, transaction censorship and more of the problems with traditional finance that many bitcoiners hoped to get away from. Deterrent to Adoption Secondly — and this is probably what many “big blockers” consider to be a more pressing concern — users would simply give up on Bitcoin altogether because blocks are too small. Perhaps users would switch to a competing cryptocurrency or they would give up on this type of technology altogether. Why Shouldn’t Bitcoin Blocks Be Too Big? Note: Almost anything about Bitcoin’s block size limit and the risks of it being too big or too small is contested, but these are some of the more general arguments. Opponents of a block size limit increase (“small blockers”) argue there are, roughly speaking, three risks if blocks are too big, each of which have several “sub-risks” as well as nuances. Increased Cost for Bitcoin Nodes The first of these risks is that bigger blocks increase the cost of operating a Bitcoin node. It increases this cost in four ways: It increases the cost of storing the blockchain, as the blockchain would grow faster. It increases bandwidth costs to download (and upload) all transactions and blocks. It increases CPU costs required to validate all transactions and blocks. The bigger the total blockchain is, the longer it takes to bootstrap a new node on the network: It has to download and validate all past transactions and blocks. If the cost to operate a Bitcoin node becomes too high, and users have to (or choose to) use lightweight clients instead, they can no longer verify that the transactions they receive are valid. They could, for example, receive a transaction from an attacker that created coins out of thin air; without knowing the entire history of the Bitcoin blockchain, there is no way to tell the difference. In that case, users would only find out that their coins are fake once they try to spend them later on. Even if users do validate that the block that includes the transaction was mined sufficiently (which is common), miners could be colluding with the attacker. Further Reading: What Is Bitcoin Mining? Perhaps an even bigger risk could arise if, over time, so few users choose to run Bitcoin nodes that the fraudulent coins are noticed too late or not at all. In that case, the Bitcoin protocol itself effectively becomes subject to changes imposed by miners. Miners could go as far as to increase the coin supply or spend coins they do not own. Only a healthy ecosystem with a significant share of users validating their own transactions prevents this. 
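As a rough illustration of the storage-cost argument above: with one block roughly every 10 minutes there are about 144 blocks per day, so the chain's maximum growth rate scales directly with the block size. A small back-of-the-envelope sketch in Python; the block sizes chosen are hypothetical examples, not figures from the article.

    BLOCKS_PER_DAY = 24 * 60 // 10  # one block roughly every 10 minutes -> about 144 per day

    def max_chain_growth_gb_per_year(block_size_mb: float) -> float:
        # Upper bound on blockchain growth if every block were completely full.
        return block_size_mb * BLOCKS_PER_DAY * 365 / 1024

    for size_mb in (1, 2, 8, 32):  # hypothetical block sizes, in megabytes
        print(f"{size_mb} MB blocks -> up to ~{max_chain_growth_gb_per_year(size_mb):.0f} GB of new chain data per year")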
In the Bitcoin white paper, Satoshi Nakamoto acknowledged the above mentioned problems and suggested that light clients could be made secure through a technical solution called “fraud proofs.” Unfortunately, however, he did not detail what these fraud proofs would look like exactly, and so far no one has been able to figure it out. (In fact, some of today’s Bitcoin developers do not believe fraud proofs are viable.) Mining Centralization The second risk of bigger blocks is that they could lead to mining centralization. Whenever a miner finds a new block, it sends this block to the rest of the network, and, in normal circumstances, bigger blocks take longer to find their way to all other miners. While the block is finding its way, however, the miner that found it can immediately start mining on top of the new block himself, giving him a head start on finding the next block. Bigger miners (or pools) find more blocks than smaller miners, thereby gaining more head starts. This means that smaller miners will be less profitable and will eventually be outcompeted, leading to a more centralized mining ecosystem. If mining becomes too centralized, some miners could end up in a position where they can 51 attack the network. That said, this is probably the most complex and nuanced argument against smaller blocks. For one, even big miners have an incentive against creating blocks that are too big: While they can benefit from a head start, too much delay can work to their detriment as a competing block may find its way through the network faster, and other miners will mine on that block instead. There are also technical solutions to speed up block relay, as well as technical solutions to limit the damage from mining centralization itself, but these solutions come with trade-offs of their own. Lower Block Subsidies Could Lead to Less Network Security The third and final risk of big blocks is that they could disincentivize users from adding fees to their transactions. As long as block space is limited, users must outbid each other to have their transactions included in blocks, and as Bitcoin’s block subsidy diminishes, this will have to become a more significant part of the block reward to support Bitcoin’s security model. Without a block size limit, this incentive is taken away. (While individual miners can still choose to only include fees with a minimum fee, other miners would still have an incentive to include transactions below that threshold — thereby diminishing the fee incentive after all.) Attentive readers will have noticed that this last argument in particular works both ways. While “big blockers” see high fees as a problem as it would make Bitcoin less attractive, “small blockers” see high fees as a positive as it would benefit Bitcoin’s security. Will Bitcoin Core Developers Ever Increase the Block Size Limit? Bitcoin Core is the predominant — though not only — Bitcoin implementation in use on the Bitcoin network today. Therefore, many “big blockers” have been looking at Bitcoin Core developers to implement an increase.  Bitcoin Core developers did indeed increase the block size limit, through the Segregated Witness (SegWit) protocol upgrade. By replacing it for a block weight limit, blocks now have a theoretical limit of 4 megabytes and a more realistic limit of 2 megabytes. Cleverly, this was a backwards-compatible soft fork protocol upgrade, which meant that users could opt into the change without splitting the network. 
However, exactly because this was a soft fork, and not a hard fork as many “big blockers” preferred, they sometimes do not “count” this increase as a block size limit increase at all. Further Reading: What Are Bitcoin Forks? Indeed, Bitcoin Core developers have not deployed a block size limit increase through a hard fork, which is a backwards-incompatible protocol upgrade. This would either require consensus from all of Bitcoin’s users or possibly split the Bitcoin network in two: a version of Bitcoin with the current block weight limit and a version of Bitcoin with the increased block size/weight limit. Users of the version of Bitcoin with the current block weight limit would probably not even consider the hard-forked version of Bitcoin to be “Bitcoin” at all; they might refer to it as “Bitcoin Core coin” or something along these lines. Perhaps more importantly, the current group of Bitcoin Core contributors seem to have no desire to dictate Bitcoin’s protocol rules, nor do they want to split the network. Therefore, they are unlikely to deploy a hard fork (for the block size limit or otherwise) without broad consensus throughout Bitcoin’s user base for such a protocol upgrade. Given the controversial nature of the block size/weight parameter, it’s unlikely that such consensus will form anytime soon, but it could happen down the road. Alternative Solutions There are some alternative solutions to increase Bitcoin’s block size limit, like Extension Blocks, as well as solutions that could achieve something similar, such as “big block” sidechains. It’s not clear that any of these solutions will see the light of day anytime soon either, however; current focus seems more directed toward “layer two” scaling solutions like the Lightning Network. Further Reading: What Is the Lightning Network? Is Bitcoin Block Size Limit Discussion Censored? The short answer is no. As for a slightly longer answer… During the heat of the block size limit debate, one of the most popular Bitcoin discussion platforms on the internet, the Bitcoin-focused subreddit r/bitcoin, imposed heavy-handed moderation. This moderation was intended to stop forum users from promoting consensus-breaking software before the greater user base had actually come to a consensus on the best way forward.  At the time, it was not obvious to everyone that using such software could lead to a split (a non-backwards-compatible hard fork) of the network, and it was often advertised as if it couldn’t. Arguing in favor of a block size limit increase and/or hard fork without directly promoting consensus-breaking software was always allowed. Whether this constituted a form of “censorship” is perhaps in the eye of the beholder, but what’s certain is that anyone who disagreed with this policy was free to start or contribute to competing Bitcoin subreddits, and this is exactly what happened. The r/btc subreddit in particular become a popular discussion platform for those who favored a block size limit increase hard fork. Furthermore, Reddit is only a relatively small part of the internet and an even smaller part of the entire world. While there are some other platforms that have been accused of similar censorship (such as the Bitcointalk forum and the Bitcoin-development mailing list), it is hard to deny that the debate took place loud and clear across social media, news sites, conferences, chat groups and far beyond. 
Anyone interested in hearing about the different arguments had every chance to inform themselves and even those who didn’t care had a hard time escaping the fallout from the debate. In the end, those who favored a block size limit increase hard fork were unable to convince enough people of their case, and it seems as if some of them have channeled their frustration about this disappointment into anger toward a particular subreddit and its moderators. (Or maybe, by writing this, Bitcoin Magazine is just part of a great cover-up conspiracy. Spooky!) What Is Bitcoin Cash? What Is Bitcoin SV? When it became clear that Bitcoin would increase its block size limit (among other things) through the SegWit soft fork protocol upgrade, some “big blockers” decided to move forward with a block size limit increase hard fork, even knowing that they would be in a minority and split off into their own network to become a new cryptocurrency. This new network and the resulting cryptocurrency is called Bitcoin Cash. Since Bitcoin Cash split off from Bitcoin, it has itself implemented several more hard fork upgrades, some of which, in turn, led to even more splits in the network and new cryptocurrencies. The most notable of these is Bitcoin SV, loosely centered around Craig Wright, one of the men who (almost certainly fraudulently) claims to have been behind the pseudonym Satoshi Nakamoto. It has an even bigger block size limit than Bitcoin Cash does. By Bitcoin Magazine bit-ly-1192 ---- Documenting the Now Slack The DocNow team and advisory board is using Slack as a collaboration space. You are welcome to join us by filling out the form below if you are interested in contributing to the conversation around the ethics of social media archiving, the DocNow application, and web archiving practices in general. Once you've been invited you can join our slack at: http://docnowteam.slack.com If you have questions or comments that are not answered in a timely manner, or that you would prefer to ask privately please get in touch with the core team at info@docnow.io and we will get back to you. Documenting the Now is dedicated to a harassment-free experience for everyone.
Our anti-harassment policy can be found at: https://github.com/DocNow/code-of-conduct blog-cbeer-info-8871 ---- blog.cbeer.info Autoscaling AWS Elastic Beanstalk worker tier based on SQS queue length LDPath in 3 examples Building a Pivotal Tracker IRC bot with Sinatra and Cinch Real-time statistics with Graphite, Statsd, and GDash Icemelt: A stand-in for integration tests against AWS Glacier blog-dataunbound-com-3127 ---- Data Unbound : Helping organizations access and share data effectively. Special focus on web APIs for data integration. Some of what I missed from the Cmd-D Automation Conference The CMD-D|Masters of Automation one-day conference in early August would have been right up my alley: It’ll be a full day of exploring the current state of automation technology on both Apple platforms, sharing ideas and concepts, and showing what’s possible—all with the goal of inspiring and furthering development of your own automation projects. Fortunately, those of us who missed it can still get a meaty summary of the meeting by listening to the podcast segment Upgrade #154: Masters of Automation – Relay FM. I've been keen on automation for a long time now and was delighted to hear the panelists express their own enthusiasm for customizing their Macs, iPhones, or iPads to make repetitive tasks much easier and less time-consuming. Noteworthy take-aways from the podcast include: Something that I hear and believe but have yet to experience in person: non-programmers can make use of automation through applications such as Automator — for macOS — and Workflow for iOS. Also mentioned often as tools that are accessible to non-geeks: Hazel and Alfred – Productivity App for Mac OS X. Automation can make the lives of computer users easier but it's not immediately obvious to many people exactly how. To make a lot of headway in automating your workflow, you need a problem that you are motivated to solve. Many people use AppleScript by borrowing from others, just like how many learn HTML and CSS from copying, pasting, and adapting source on the web. Once you get a taste for automation, you will seek out applications that are scriptable and avoid those that are not. My question is how to make it easier for developers to make their applications scriptable without incurring onerous development or maintenance costs? E-book production is an interesting use case for automation. People have built businesses around scripting Photoshop [is there really a large enough market?] OmniGroup's automation model is well worth studying and using. I hope there will be a conference next year to continue fostering this community of automation enthusiasts and professionals. 2017 09 25 Raymond Yee automation macOS Comments (0) Permalink Fine-tuning a Python wrapper for the hypothes.is web API and other #ianno17 followup In anticipation of #ianno17 Hack Day, I wrote about my plans for the event, one of which was to revisit my own Python wrapper for the nascent hypothes.is web API. Instead of spending much time on my own wrapper, I spent most of the day working with Jon Udell's wrapper for the API.
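For readers unfamiliar with what such a wrapper wraps: Hypothesis exposes a JSON search endpoint, so a minimal client is only a few lines of Python. The sketch below is an illustration, not Jon Udell's or this post's actual library; it assumes the public api.hypothes.is/api/search endpoint and shows the kind of retry-with-exponential-backoff configuration the post goes on to mention.

    import requests
    from requests.adapters import HTTPAdapter
    from urllib3.util.retry import Retry

    def make_session() -> requests.Session:
        # Retry transient failures with exponential backoff (0.5s, 1s, 2s, ...).
        retry = Retry(total=5, backoff_factor=0.5,
                      status_forcelist=[429, 500, 502, 503, 504])
        session = requests.Session()
        session.mount("https://", HTTPAdapter(max_retries=retry))
        return session

    def search_annotations(uri: str, limit: int = 20) -> list:
        # Query the public Hypothesis search API for annotations on a given page.
        resp = make_session().get("https://api.hypothes.is/api/search",
                                  params={"uri": uri, "limit": limit},
                                  timeout=10)
        resp.raise_for_status()
        return resp.json().get("rows", [])

    # Example: list annotators and text snippets for one of the posts mentioned below.
    for row in search_annotations("https://blog.dataunbound.com/2017/05/01/revisiting-hypothes-is-at-i-annotate-2017/"):
        print(row.get("user"), "-", (row.get("text") or "")[:60])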
I've been working on my own revisions of the library but haven't yet incorporated Jon's latest changes. One nice little piece of the puzzle is that I learned how to introduce retries and exponential backoff into the library, thanks to a hint from Nick Stenning and a nice answer on Stackoverflow . Other matters In addition to the Python wrapper, there are other pieces of follow-up for me. I hope to write more extensively on those matters down the road but simply note those topics for the moment. Videos from the conference I might start by watching videos from #ianno17 conference: I Annotate 2017 – YouTube. Because I didn't attend the conference per se, I might glean insight into two particular topics of interest to me (the role of page owner in annotations and the intermingling of annotations in ebooks.) An extension for embedding selectors in the URL I will study and try Treora/precise-links: Browser extension to support Web Annotation Selectors in URIs. I've noticed that the same annotation is shown in two related forms: https://hyp.is/Zj2dyi9tEeeTmxvuPjLhSw/blog.dataunbound.com/2017/05/01/revisiting-hypothes-is-at-i-annotate-2017/ https://blog.dataunbound.com/2017/05/01/revisiting-hypothes-is-at-i-annotate-2017/#annotations:Zj2dyi9tEeeTmxvuPjLhSw Does the precise-links extension let me write the selectors into the URL? 2017 05 22 Raymond Yee annotation Comments (0) Permalink Revisiting hypothes.is at I Annotate 2017 I'm looking forward to hacking on web and epub annotation at the #ianno17 Hack Day. I won't be at the I Annotate 2017 conference per se but will be curious to see what comes out of the annual conference. I continue to have high hopes for digital annotations, both on the Web and in non-web digital contexts. I have used Hypothesis on and off since Oct 2013. My experiences so far: I like the ability to highlight and comment on very granular sections of articles for comment, something the hypothes.is annotation tool makes easy to do. I appreciate being able to share annotation/highlight with others (on Twitter or Facebook), though I'm pretty sure most people who bother to click on the links might wonder "what's this" when they click on the link. A small user request: hypothes.is should allow a user to better customize the Facebook preview image for the annotation. I've enjoyed using hypothes.is for code review on top of GitHub. (Exactly how hypothes.is complements the extensive code-commenting functionality in GitHub might be worth a future blog post.) My Plans for Hack Day Python wrapper for hypothes.is This week, I plan to revisit rdhyee/hypothesisapi: A Python wrapper for the nascent hypothes.is web API to update or abandon it in favor of new developments. (For example, I should look at kshaffer/pypothesis: Python scripts for interacting with the hypothes.is API.) Epubs + annotations I want to figure out the state of art for epubs and annotations. I'm happy to see the announcement of a partnership to bring open annotation to eBooks from March 2017. I'd definitely like to figure out how to annotate epubs (e.g., Oral Literature in Africa (at unglue.it) or Moby Dick). The best approach is probably for me to wait until summer at which time we'll see the fruits of the partnership: Together, our goal is to complete a working integration of Hypothesis with both EPUB frameworks by Summer 2017. NYU plans to deploy the ReadiumJS implementation in the NYU Press Enhanced Networked Monographs site as a first use case. 
Based on lessons learned in the NYU deployment, we expect to see wider integration of annotation capabilities in eBooks as EPUB uptake continues to grow. In the meantime, I can catch up on the current state of futurepress/epub.js: Enhanced eBooks in the browser., grok Epub CFI Updates, and relearn how to parse epubs using Python (e.g., rdhyee/epub_avant_garde: an experiment to apply ideas from https://github.com/sandersk/ebook_avant_garde to arbitrary epubs). Role of page owners I plan to check in on what's going on with efforts at Hypothes.is to involve owners in page annotations: In the past months we launched a small research initiative to gather different points of view about website publishers and authors consent to annotation. Our goal was to identify different paths forward taking into account the perspectives of publishers, engineers, developers and people working on abuse and harassment issues. We have published a first summary of our discussion on our blog post about involving page owners in annotation. I was reminded of these efforts after reading that Audrey Watters had blocked annotation services like hypothes.is and genius from her domains: Un-Annotated Episode 52: Marginalia In the spirit of communal conversation, I threw in my two cents: Have there been any serious exploration of easy opt-out mechanisms for domain owners? Something like robots.txt for annotation tools? 2017 05 01 Raymond Yee annotation Comments (2) Permalink My thoughts about Fargo.io using fargo.io 2013 11 03 Raymond Yee Uncategorized Comments (0) Permalink Organizing Your Life With Python: a submission for PyCon 2015? I have penciled into my calendar a trip  to Montreal to attend PyCon 2014.   In my moments of suboptimal planning, I wrote an overly ambitious abstract for a talk or poster session I was planning to submit.  As I sat down this morning to meet the deadline for submitting a proposal for a poster session (Nov 1), I once again encountered the ominous (but for me, definitive) admonition: Avoid presenting a proposal for code that is far from completion. The program committee is very skeptical of "conference-driven development". It's true: my efforts to organize my life with Python are in the early stages. I hope that I'll be able to write something like the following for PyCon 2015. Organizing Your Life with Python David Allen's Getting Things Done (GTD) system is a popular system for personal productivity. Although GTD can be implemented without any computer technology, I have pursued two different digital implementations, including my current implementation using Evernote, the popular note-taking program. This talk explores using Python in conjunction with the Evernote API to implement GTD on top of Evernote. I have found that a major practical hinderance for using GTD is that it way too easy to commit to too many projects. I will discuss how to combine Evernote, Python, GTD with concepts from Personal Kanban to solve this problem. Addendum: Whoops…I find it embarrassing that I already quoted my abstract in a previous blog post in September that I had forgotten about. Oh well. Where's my fully functioning organization system when I need it! Tagged PyCon, Python 2013 10 30 Raymond Yee Evernote GTD Comments (0) Permalink Current Status of Data Unbound LLC in Pennsylvania I'm currently in the process of closing down Data Unbound LLC in Pennsylvania.  
I submitted the paperwork to dissolve the legal entity in April 2013 and have been amazed to learn that it may take up to a year to get the final approval done. In the meantime, as I establish a similar California legal entity, I will certainly continue to write on this blog about APIs, mashups, and open data. 2013 10 30 Raymond Yee Data Unbound LLC Comments (0) Permalink Must Get Cracking on Organizing Your Life with Python Talk and tutorial proposals for PyCon 2014 are due tomorrow (9/15). I was considering submitting a proposal until I took to heart the program committee's appropriate admonition against "conference-driven" development. I will nonetheless use the Oct 15 and Nov 1 deadlines for lightning talks and proposals respectively to judge whether to submit a refinement of the following proposal idea: Organizing Your Life with Python David Allen's Getting Things Done (GTD) system is a popular system for personal productivity. Although GTD can be implemented without any computer technology, I have pursued two different digital implementations, including my current implementation using Evernote, the popular note-taking program. This talk explores using Python in conjunction with the Evernote API to implement GTD on top of Evernote. I have found that a major practical hindrance for using GTD is that it is way too easy to commit to too many projects. I will discuss how to combine Evernote, Python, and GTD with concepts from Personal Kanban to solve this problem. 2013 09 14 Raymond Yee Getting Things Done Python Comments (0) Permalink Embedding Github gists in WordPress As I gear up to write more about programming, I have installed the Embed GitHub Gist plugin. So by writing [gist id=5625043] in the text of this post, I can embed https://gist.github.com/rdhyee/5625043 into the post to get:

    from itertools import islice

    def triangular():
        n = 1
        i = 1
        while True:
            yield n
            i += 1
            n += i

    # for i, n in enumerate(islice(triangular(), 10)): print(i + 1, n)

Tagged gist, github 2013 05 21 Raymond Yee Wordpress Comments (2) Permalink Working with Open Data I'm very excited to be teaching a new course Working with Open Data at the UC Berkeley School of Information in the Spring 2013 semester: Open data — data that is free for use, reuse, and redistribution — is an intellectual treasure-trove that has given rise to many unexpected and often fruitful applications. In this course, students will 1) learn how to access, visualize, clean, interpret, and share data, especially open data, using Python, Python-based libraries, and supplementary computational frameworks and 2) understand the theoretical underpinnings of open data and their connections to implementations in the physical and life sciences, government, social sciences, and journalism. 2012 11 23 Raymond Yee Uncategorized Comments (0) Permalink A mundane task: updating a config file to retain old settings I want to have a hand in creating an excellent personal information manager (PIM) that can be a worthy successor to Ecco Pro. So far, running EccoExt (a clever and expansive hack of Ecco Pro) has been an eminently practical solution. You can download the most recent version of this actively developed extension from the files section of the ecco_pro Yahoo! group. I would do so regularly but one of the painful problems with unpacking (using unrar) the new files is that there wasn't an updater that would retain the configuration options of the existing setup.
So a mundane but happy-making programming task of this afternoon was to write a Python script to do exactly that, making use of the builtin ConfigParser library.

    """
    compare eccoext.ini files
    My goal is to edit the new file so that any overlapping values take on the current value
    """
    current_file_path = "/private/tmp/14868/C/Program Files/ECCO/eccoext.ini"
    new_file_path = "/private/tmp/14868/C/utils/eccoext.ini"
    updated_file = "/private/tmp/14868/C/utils/updated_eccoext.ini"

    # extract the key value pairs in both files to compare the two
    # http://docs.python.org/library/configparser.html
    import ConfigParser

    def extract_values(fname):
        # generate a parsed configuration object, plus a set of (section, option) pairs
        config = ConfigParser.SafeConfigParser()
        options_set = set()
        config.read(fname)
        sections = config.sections()
        for section in sections:
            options = config.options(section)
            for option in options:
                #value = config.get(section, option)
                options_set.add((section, option))
        return (config, options_set)

    # process current file and new file
    (current_config, current_options) = extract_values(current_file_path)
    (new_config, new_options) = extract_values(new_file_path)

    # what are the overlapping options
    overlapping_options = current_options & new_options

    # figure out which of the overlapping options have values that differ,
    # and carry the current value over into the new configuration
    for (section, option) in overlapping_options:
        current_value = current_config.get(section, option)
        new_value = new_config.get(section, option)
        if current_value != new_value:
            print section, option, current_value, new_value
            new_config.set(section, option, current_value)

    # write the updated config file
    with open(updated_file, 'wb') as configfile:
        new_config.write(configfile)

2011 02 12 Raymond Yee Ecco Pro Python Comments (0) Permalink blog-dataunbound-com-6587 ---- Data Unbound Helping organizations access and share data effectively. Special focus on web APIs for data integration.
Some of what I missed from the Cmd-D Automation Conference The CMD-D|Masters of Automation one-day conference in early August would have been right up my alley: It’ll be a full day of exploring the current state of automation technology on both Apple platforms, sharing ideas and concepts, and showing what’s possible—all with the goal of inspiring and furthering development of your own automation projects. Fortunately, […] Fine-tuning a Python wrapper for the hypothes.is web API and other #ianno17 followup In anticipation of #ianno17 Hack Day, I wrote about my plans for the event, one of which was to revisit my own Python wrapper for the nascent hypothes.is web API. Instead of spending much time on my own wrapper, I spent most of the day working with Jon Udell's wrapper for the API. I've been […] Revisiting hypothes.is at I Annotate 2017 I'm looking forward to hacking on web and epub annotation at the #ianno17 Hack Day. I won't be at the I Annotate 2017 conference per se but will be curious to see what comes out of the annual conference. I continue to have high hopes for digital annotations, both on the Web and in non-web […] My thoughts about Fargo.io using fargo.io Organizing Your Life With Python: a submission for PyCon 2015? I have penciled into my calendar a trip  to Montreal to attend PyCon 2014.   In my moments of suboptimal planning, I wrote an overly ambitious abstract for a talk or poster session I was planning to submit.  As I sat down this morning to meet the deadline for submitting a proposal for a poster […] Current Status of Data Unbound LLC in Pennsylvania I'm currently in the process of closing down Data Unbound LLC in Pennsylvania.  I submitted the paperwork to dissolve the legal entity in April 2013 and have been amazed to learn that it may take up to a year to get the final approval done.  In the meantime, as I establishing a similar California legal […] Must Get Cracking on Organizing Your Life with Python Talk and tutorial proposals for PyCon 2014 are due tomorrow (9/15) .  I was considering submitting a proposal until I took the heart the appropriate admonition against "conference-driven" development of the program committee.   I will nonetheless use the Oct 15 and Nov 1 deadlines for lightning talks and proposals respectively to judge whether to […] Embedding Github gists in WordPress As I gear up I to write more about programming, I have installed the Embed GitHub Gist plugin. So by writing [gist id=5625043] in the text of this post, I can embed https://gist.github.com/rdhyee/5625043 into the post to get: Working with Open Data I'm very excited to be teaching a new course Working with Open Data at the UC Berkeley School of Information in the Spring 2013 semester: Open data — data that is free for use, reuse, and redistribution — is an intellectual treasure-trove that has given rise to many unexpected and often fruitful applications. In this […] A mundane task: updating a config file to retain old settings I want to have a hand in creating an excellent personal information manager (PIM) that can be a worthy successor to Ecco Pro. So far, running EccoExt (a clever and expansive hack of Ecco Pro) has been a eminently practical solution.   
You can download the most recent version of this actively developed extension from […]

bibwild-wordpress-com-6809 ---- Bibliographic Wilderness

Code that Lasts: Sustainable And Usable Open Source Code

A presentation I gave at the online conference Code4Lib 2021, on Monday, March 22. I have realized that the open source projects I am most proud of are a few that have existed for years now, increasing in popularity, with very little maintenance required. Including traject and bento_search. While community aspects matter for open source sustainability, the task gets so much easier when the code requires less effort to keep alive, for maintainers and utilizers. Using these projects as examples, can we as developers identify what makes code "inexpensive" to use and maintain over the long haul with little "churn", and how to do that?

Slides on Google Docs

Rough transcript (really the script I wrote for myself)

Hi, I'm Jonathan Rochkind, and this is "Code that Lasts: Sustainable and Usable Open Source Code". So, who am I? I have been developing open source library software since 2006, mainly in ruby and Rails. Over that time, I have participated in a variety of open source projects meant to be used by multiple institutions, and I've often seen us having challenges with long-term maintenance sustainability and usability of our software. This includes projects I have been instrumental in creating myself; we've all been there!

We're used to thinking of this problem in terms of needing more maintainers. But let's first think more about what the situation looks like, before we assume what causes it. In addition to features or changes people want not getting done, it also can look like, for instance: being stuck using out-of-date dependencies like old, even end-of-lifed, versions of Rails or ruby; or a reduction in software "polish" over time.

What do I mean by "polish"? Engineer Richard Schneeman writes: [quote] "When we say something is "polished" it means that it is free from sharp edges, even the small ones. I view polished software to be ones that are mostly free from frustration. They do what you expect them to and are consistent." I have noticed that software can start out very well polished, but over time lose that polish. This usually goes along with decreasing "cohesion" in software over time, a feeling that different parts of the software no longer tell the developer a consistent story together.

While there can be an element of truth in needing more maintainers in some cases – zero maintainers is obviously too few – there are also ways that increasing the number of committers or maintainers can result in diminishing returns and additional challenges. One of the theses of Fred Brooks' famous 1975 book "The Mythical Man-Month" is sometimes called "Brooks' Law": "under certain conditions, an incremental person when added to a project makes the project take more, not less time." Why? One of the main reasons Brooks discusses is the additional time taken for communication and coordination between more people – with every person you add, the number of connections between people goes up combinatorially. That may explain the phenomenon we sometimes see with so-called "Design by committee" where "too many cooks in the kitchen" can produce inconsistency or excessive complexity.
Cohesion and polish require a unified design vision — that's not incompatible with increasing numbers of maintainers, but it does make it more challenging, because it takes more time to get everyone on the same page and iterate while maintaining a unifying vision. (There's also more to be said here about the difference between just a bunch of committers committing PR's, and the maintainer's role of maintaining historical context and design vision for how all the parts fit together.)

Instead of assuming adding more committers or maintainers is the solution, can there instead be ways to reduce the amount of maintenance required? I started thinking about this when I noticed a couple projects of mine which had become more widely successful than I had any right to expect, considering how little maintenance was being put into them.

Bento_search is a toolkit for searching different external search engines in a consistent way. It's especially but not exclusively for displaying multiple search results in "bento box" style, which is what Tito Sierra from NCSU first called these little side by side search results. I wrote bento_search for use at a former job in 2012. 55% of all commits to the project were made in 2012. 95% of all commits in 2016 or earlier. (I gave it a bit of attention for a contracting project in 2016). But bento_search has never gotten a lot of maintenance, and I don't use it anymore myself. It's not in wide use, but I found it kind of amazing, when I saw people giving me credit in conference presentations for the gem (thanks!), when I didn't even know they were using it and I hadn't been paying it any attention at all! It's still used by a handful of institutions for whom it just works with little attention from maintainers. (The screenshot is from Cornell University Libraries.)

Traject is a Marc-to-Solr indexing tool written in ruby (or, more generally, it can be a general purpose extract-transform-load tool) that I wrote with Bill Dueber from the University of Michigan in 2013. We hoped it would catch on in the Blacklight community, but for the first couple years, its uptake was slow. However, since then, it has come to be pretty popular in the Blacklight and Samvera communities, and among a few other library technologists. You can see the spikes of commit activity in the graph for a 2.0 release in 2015 and a 3.0 release in 2018 – but for the most part at other times, nobody has really been spending much time on maintaining traject. Every once in a while a community member submits a minor Pull Request, and it's usually me who reviews it. Bill and I remain the only maintainers. And yet traject just keeps plugging along, picking up adoption and working well for adopters.

So, this made me start thinking, based on what I've seen in my career, what are some of the things that might make open source projects both low-maintenance and successful in their adoption and ease-of-use for developers? One thing both of these projects did was take backwards compatibility very seriously. The first step there is following "semantic versioning", a set of rules whose main point is that releases can't include backwards incompatible changes unless they are a new major version, like going from 1.x to 2.0. This is important, but it alone is not enough to minimize backwards incompatible changes that add maintenance burden to the ecosystem.
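As a hedged aside: the practical payoff of semantic versioning for a consuming app is that it can lean on pessimistic version constraints and take upgrades with some confidence. A minimal sketch of a hypothetical consuming app's Gemfile (the constraint numbers here are illustrative):

# Gemfile of a hypothetical consuming app
source "https://rubygems.org"

gem "traject", "~> 3.0"       # any 3.x release is expected to stay backwards compatible
gem "bento_search", "~> 1.0"  # likewise for 1.x releases

Those "~>" constraints only stay safe if the gem really keeps the semantic versioning promise being described here.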
If the real goal is preventing the pain of backwards incompatibility, we also need to limit the number of major version releases, and limit the number and scope of backwards breaking changes in each major release! The Bento_search gem has only had one major release, it's never had a 2.0 release, and it's still backwards compatible to its initial release. Traject is on a 3.X release after 8 years, but the major releases of traject have had extremely few backwards breaking changes; most people could upgrade through major versions changing very little, or most often nothing, in their projects.

So OK, sure, everyone wants to minimize backwards incompatibility, but that's easy to say, how do you DO it? Well, it helps to have less code overall, that changes less often overall – ok, again, great, but how do you do THAT?

Parsimony is a word in general English that means "The quality of economy or frugality in the use of resources." In terms of software architecture, it means having as few moving parts as possible inside your code: fewer classes, types, components, entities, whatever. Or most fundamentally, I like to think of it in terms of minimizing the concepts in the mental model a programmer needs to grasp how the code works and what parts do what. The goal of architecture design is: what is the smallest possible architecture we can create to make [quote] "simple things simple and complex things possible", as computer scientist Alan Kay described the goal of software design.

We can see this in bento_search, which has very few internal architectural concepts. The main thing bento_search does is provide a standard API for querying a search engine and representing results of a search. These are consistent across different search engines, with a common metadata vocabulary for what results look like. This makes search engines interchangeable to calling code. And then it includes half a dozen or so search engine implementations for services I needed or wanted to evaluate when I wrote it. This search engine API at the ruby level can be used all by itself even without the next part, the actual "bento style", which is built-in support for displaying search engine results in boxes on a page of your choice in a Rails app, while writing very little boilerplate code.

Traject has an architecture which basically has just three parts at the top. There is a reader which sends objects into the pipeline. There are some indexing rules which are transformation steps from the source object to build an output Hash object. And then a writer which translates the Hash object to write to some store, such as Solr. The reader, transformation steps, and writer are all independent and uncaring about each other, and can be mixed and matched. That's MOST of traject right there. It seems simple and obvious once you have it, but it can take a lot of work to end up with what's simple and obvious in retrospect! When designing code I'm often reminded of the apocryphal quote: "I would have written a shorter letter, but I did not have the time."

And, to be fair, there's a lot of complexity within that "indexing rules" step in traject, but its design was approached the same way. We have use cases about supporting configuration settings in a file or on the command line, or about allowing re-usable custom transformation logic – what's the simplest possible architecture we can come up with to support those cases? OK, again, that sounds nice, but how do you do it?
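(To make that reader / indexing-rules / writer split concrete, here is a hedged sketch of what a developer-user's traject configuration file roughly looks like; the field names and MARC tags are purely illustrative, not from the talk:)

# my_index_config.rb -- run with something like: traject -c my_index_config.rb records.mrc
settings do
  provide "writer_class_name", "Traject::SolrJsonWriter"
  provide "solr.url", "http://localhost:8983/solr/my_collection"
end

# indexing rules: transformation steps from the source MARC record to the output Hash
to_field "id",      extract_marc("001", first: true)
to_field "title_t", extract_marc("245ab", trim_punctuation: true)

The reader is picked via settings, the to_field lines are the transformation rules, and the writer only ever sees the output Hash, which is what keeps the three parts swappable.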
I don't have a paint by numbers, but I can say that for both these projects I took some time – a few weeks even – at the beginning to work out these architectures: lots of diagramming, some prototyping I was prepared to throw out, and in some cases "documentation-driven design" where I wrote some docs for code I hadn't written yet. For traject it was invaluable to have Bill Dueber at the University of Michigan also interested in spending some design time up front, bouncing ideas back and forth with each other – to actually intentionally go through an architectural design phase before the implementation.

Figuring out a good parsimonious architecture takes domain knowledge: what things your "industry" – other potential institutions – are going to want to do in this area, and specifically what developers are going to want to do with your tool. We're maybe used to thinking of "use cases" in terms of end-users, but it can be useful at the architectural design stage to formalize this in terms of developer use cases. What is a developer going to want to do, and how can I come up with a small number of software pieces she can use to assemble together to do those things? When we said "make simple things simple and complex things possible", we can say domain analysis and use cases are about identifying which things we're going to put in either, or neither, of those categories. The "simple thing" for bento_search, for instance, is just "do a simple keyword search in a search engine, and display results, without having the calling code need to know anything about the specifics of that search engine."

Another way to get a head-start on solid domain knowledge is to start with another tool you have experience with, that you want to create a replacement for. Before Traject, I and other users used a tool written in Java called SolrMarc — I knew how we had used it, and where we had had roadblocks or things that we found harder or more complicated than we'd like, so I knew my goals were to make those things simpler.

We're used to hearing arguments about avoiding rewrites, but like most things in software engineering, there can be pitfalls at either extreme. I was amused to notice that Fred Brooks, in the previously mentioned Mythical Man-Month, makes some arguments in both directions. Brooks famously warns about a "second-system effect", the [quote] "tendency of small, elegant, and successful systems to be succeeded by over-engineered, bloated systems, due to inflated expectations and overconfidence" – one reason to be cautious of a rewrite. But Brooks in the very same book ALSO writes [quote] "In most projects, the first system built is barely usable….Hence plan to throw one away; you will, anyhow." It's up to us to figure out which case we're in. I personally think an application is more likely to be bitten by the "second-system effect" danger of a rewrite, while a shared re-usable library is more likely to benefit from a rewrite (in part because a reusable library is harder to change in place without disruption!).

We could sum up a lot of different principles as variations of "Keep it small". Both traject and bento_search are tools that developers can use to build something. Bento_search just puts search results in a box on a page; the developer is responsible for the page and an overall app. Yes, this means that you have to be a ruby developer to use it. Does this limit its audience?
While we might aspire to make tools that even not-really-developers can just use out of the box, my experience has been that our open source attempts at shrinkwrapped "solutions" often end up still needing development expertise to successfully deploy. Keeping our tools simple and small and not trying to supply a complete app can actually leave more time for these developers to focus on meeting local needs, instead of fighting with a complicated framework that doesn't do quite what they need.

It also means we can limit interactions with any external dependencies. Traject was developed for use with a Blacklight project, but traject code does not refer to Blacklight or even Rails at all, which means new releases of Blacklight or Rails can't possibly break traject. Bento_search, by doing one thing and not caring about the details of its host application, has kept working from Rails 3.2 all the way up to current Rails 6.1 with pretty much no changes needed except to the test suite setup. Sometimes when people try to have lots of small tools working together, it can turn into a nightmare where you get a pile of cascading software breakages every time one piece changes. Keeping assumptions and couplings down is what lets us avoid this maintenance nightmare.

And another way of keeping it small is: don't be afraid to say "no" to features when you can't figure out how to fit them in without serious harm to the parsimony of your architecture. Your domain knowledge is what lets you take an educated guess as to which features are core to your audience and need to be accommodated, and which are edge cases and can be fulfilled by extension points, or sometimes not at all.

By extension points we mean we prefer opportunities for developer-users to write their own code which works with your tools, rather than trying to build less commonly needed features in as configurable features. As an example, traject does include some built-in logic, but one of its extension point use cases is making sure it's simple to add whatever transformation logic a developer-user wants, and have it look just as "built-in" as what came with traject. And since traject makes it easy to write your own reader or writer, its built-in readers and writers don't need to include every possible feature – we plan for developers writing their own if they need something else. Looking at bento_search, it makes it easy to write your own search engine adapter — one that will be usable interchangeably with the built-in ones. Also, bento_search provides a standard way to add custom search arguments specific to a particular adapter – these won't be directly interchangeable with other adapters, but they are provided for in the architecture, and won't break in future bento_search releases – it's another form of extension point.

These extension points are the second half of "simple things simple, complex things possible" – the complex things possible. Planning for them is part of understanding your developer use-cases, and designing an architecture that can easily handle them. Ideally, it takes no extra layers of abstraction to handle them; you are using the exact architectural join points the out-of-the-box code is using, just supplying custom components.

So here's an example of how these things worked out in practice with traject, pretty well I think. Stanford ended up writing a package of extensions to traject called TrajectPlus, to take care of some features they needed that traject didn't provide.
Commit history suggests it was written in 2017, which was Traject 2.0 days. I can't recall, but I'd guess they approached me with change requests to traject at that time and I put them off because I couldn't figure out how to fit them in parsimoniously, or didn't have time to figure it out. But the fact that they were *able* to extend traject in this way I consider a validation of traject's architecture: that they could make it do what they needed, without much coordination with me, and use it in many projects (I think beyond just Stanford).

Much of the 3.0 release of traject was "back-porting" some features that TrajectPlus had implemented, including out-of-the-box support for XML sources. But I didn't always do them with the same implementation or API as TrajectPlus – this is another example of being able to use a second go at it to figure out how to do something even more parsimoniously, sometimes figuring out small changes to traject's architecture to support flexibility in the right dimensions. When Traject 3.0 came out, the TrajectPlus users didn't necessarily want to retrofit all their code to the new traject way of doing it. But TrajectPlus could still be used with traject 3.0 with few or possibly no changes, doing things the old way; they weren't forced to upgrade to the new way. This is a huge win for traject's backwards compat – everyone was able to do what they needed to do, even taking separate paths, with relatively minimized maintenance work.

As I think about these things philosophically, one of my takeaways is that software engineering is still a craft – and software design is a serious thing to be studied and engaged in. Especially for shared libraries rather than local apps, it's not always to be dismissed as so-called "bike-shedding". It's worth it to take time to think about design, self-reflectively and with your peers, instead of just rushing to put out fires or deliver features; it will reduce maintenance costs and increase value over the long term. And I want to just briefly plug "kithe", a project of mine which tries to be guided by these design goals to create a small focused toolkit for building Digital Collections applications in Rails.

I could easily talk about all of this for another twenty minutes, but that's our time! I'm always happy to talk more; find me on slack or IRC or email. This last slide has some sources mentioned in the talk. Thanks for your time!

jrochkind General Leave a comment March 23, 2021

Product management

In my career working in the academic sector, I have realized that one thing that is often missing from in-house software development is "product management." But what does that mean exactly? You don't know it's missing if you don't even realize it's a thing, and people can use different terms to mean different roles/responsibilities. Basically: deciding what the software should do. This is not about colors on screen or margins (what our stakeholders often enjoy micro-managing) — I'd consider those still the how of doing it, rather than the what to do. The what is often at a much higher level, about what features or components to develop at all. When done right, it is going to be based on both knowledge of the end-user's needs and preferences (user research), but also knowledge of internal stakeholders' desires and preferences (overall organizational strategy, but also just practically what is going to make the right people happy to keep us resourced).
Also knowledge of the local capacity: what pieces do we need to put in place to get these things developed. When done seriously, it will necessarily involve prioritization — there are many things we could possibly do, some subset of them we very well may do eventually, but which ones should we do now?

My experience tells me it is a very big mistake to try to have a developer doing this kind of product management. Not because a developer can't have the right skillset to do it. But because having the same person leading development and product management is a mistake. The developer is too close to the development lens, and there's just a clarity that comes when these roles are separate. My experience also tells me that it's a mistake to have a committee doing these things, much as that is popular in the academic sector. Because, well, just of course it is.

But okay, this is all still pretty abstract. Things might become more clear if we get more specific about the actual tasks and work of this kind of product management role. I found Damilola Ajiboye's blog post on "Product Manager vs Product Marketing Manager vs Product Owner" very clear and helpful here. While it is written to distinguish between three different product-management-related roles, Ajiboye also acknowledges that in a smaller organization "a product manager is often tasked with the duty of these 3 roles." Regardless of whether the responsibilities are done by one, two, or three people, Ajiboye's post serves as a concise listing of the work to be done in managing a product — deciding the what of the product, in an ongoing iterative and collaborative manner, so that developers and designers can get to the how and to implementation. I recommend reading the whole article, and I'll excerpt much of it here, slightly rearranged.

The Product Manager

These individuals are often referred to as mini CEOs of a product. They conduct customer surveys to figure out the customer's pain and build solutions to address it. The PM also prioritizes what features are to be built next and prepares and manages a cohesive and digital product roadmap and strategy. The Product Manager will interface with the users through user interviews/feedback surveys or other means to hear directly from the users. They will come up with hypotheses alongside the team and validate them through prototyping and user testing. They will then create a strategy on the feature and align the team and stakeholders around it. The PM who is also the chief custodian of the entire product roadmap will, therefore, be tasked with the duty of prioritization. Before going ahead to carry out research and strategy, they will have to convince the stakeholders if it is a good choice to build the feature in context at that particular time or wait a bit longer based on the content of the roadmap.

The Product Marketing Manager

The PMM communicates vital product value — the "why", "what" and "when" of a product to intending buyers. He manages the go-to-market strategy/roadmap and also oversees the pricing model of the product. The primary goal of a PMM is to create demand for the products through effective messaging and marketing programs so that the product has a shorter sales cycle and higher revenue. The product marketing manager is tasked with market feasibility and discovering if the features being built align with the company's sales and revenue plan for the period.
They also make research on how sought-after the feature is being anticipated and how it will impact the budget. They communicate the values of the feature; the why, what, and when to potential buyers — In this case users in countries with poor internet connection.

[While expressed in terms of a for-profit enterprise selling something, I think it's not hard to translate this to a non-profit or academic environment. You still have an audience whose uptake you need to be successful, whether internal or external. — jrochkind]

The Product Owner

A product owner (PO) maximizes the value of a product through the creation and management of the product backlog, creation of user stories for the development team. The product owner is the customer's representative to the development team. He addresses customer's pain points by managing and prioritizing a visible product backlog. The PO is the first point of call when the development team needs clarity about interpreting a product feature to be implemented. The product owner will first have to prioritize the backlog to see if there are no important tasks to be executed and if this new feature is worth leaving whatever is being built currently. They will also consider the development effort required to build the feature i.e the time, tools, and skill set that will be required. They will be the one to tell if the expertise of the current developers is enough or if more engineers or designers are needed to be able to deliver at the scheduled time. The product owner is also armed with the task of interpreting the product/feature requirements for the development team. They serve as the interface between the stakeholders and the development team.

When you have someone(s) doing these roles well, it ensures that the development team is actually spending time on things that meet user and business needs. I have found that it makes things so much less stressful and more rewarding for everyone involved. When you have nobody doing these roles, or someone doing it in a cursory or un-intentional way not recognized as part of their core job responsibilities, or have a lead developer trying to do it on top of development, I find it leads to feelings of: spinning wheels, everything-is-an-emergency, lack of appreciation, miscommunication and lack of shared understanding between stakeholders and developers, general burnout and dissatisfaction — and at the root, a product that is not meeting user or business needs well, leading to these inter-personal and personal problems.

jrochkind General Leave a comment February 3, 2021

Rails auto-scaling on Heroku

We are investigating moving our medium-small-ish Rails app to heroku. We looked at both the Rails Autoscale add-on available on the heroku marketplace, and the hirefire.io service, which is not listed on the heroku marketplace and I almost didn't realize it existed. I guess hirefire.io doesn't have any kind of a partnership with heroku, but still uses the heroku API to provide an autoscale service. hirefire.io ended up looking more fully-featured and lower priced than Rails Autoscale; so the main purpose of this post is just trying to increase visibility of hirefire.io and therefore competition in the field, which benefits us consumers.

Background: Interest in auto-scaling Rails background jobs

At first I didn't realize there was such a thing as "auto-scaling" on heroku, but once I did, I realized it could indeed save us lots of money.
I am more interested in scaling Rails background workers than web workers, though — our background workers are busiest when we are doing "ingests" into our digital collections/digital asset management system, so the work is highly variable. Auto-scaling up to more when there is ingest work piling up can give us really nice ingest throughput while keeping costs low. On the other hand, our web traffic is fairly low and probably isn't going to go up by an order of magnitude (non-profit cultural institution here). And after discovering that a "standard" dyno is just too slow, we will likely be running a performance-m or performance-l anyway — which likely can handle all anticipated traffic on its own. If we have an auto-scaling solution, we might configure it for web dynos, but we are especially interested in good features for background scaling.

There is a heroku built-in autoscale feature, but it only works for performance dynos, and won't do anything for Rails background job dynos, so that was right out. The Rails Autoscale add-on on the heroku marketplace could work for Rails bg jobs; and then we found hirefire.io.

Pricing: Pretty different

hirefire

As of now, January 2021, hirefire.io has pretty simple and affordable pricing: $15/month/heroku application, auto-scaling as many dynos and process types as you like. hirefire.io by default only checks your app's metrics once per minute to decide if a scaling event should occur. If you want more frequent than that (up to once every 15 seconds), you have to pay an additional $10/month, for $25/month/heroku application. Even though it is not a heroku add-on, hirefire does advertise that they bill pro-rated to the second, just like heroku and heroku add-ons.

Rails autoscale

Rails autoscale has a more tiered approach to pricing that is based on the number and type of dynos you are scaling. Starting at $9/month for 1-3 standard dynos, the next tier up is $39 for up to 9 standard dynos, all the way up to $279 (!) for 1 to 99 dynos. If you have performance dynos involved, it's from $39/month for 1-3 performance dynos, up to $599/month for up to 99 performance dynos. For our anticipated uses… if we only scale bg dynos, I might want to scale from (low) 1 or 2 to (high) 5 or 6 standard dynos, so we'd be at $39/month. Our web dynos are likely to be performance and I wouldn't want/need to scale more than probably 2, but that puts us into the performance dyno tier, so we're looking at $99/month. This is of course significantly more expensive than hirefire.io's flat rate.

Metric Resolution

Since Hirefire had an additional charge for finer than 1-minute resolution on checks for autoscaling, we'll discuss resolution here in this section too. Rails Autoscale has the same resolution for all tiers, and I think it's generally 10 seconds, so approximately the same as hirefire if you pay the extra $10 for increased resolution.

Configuration

Let's look at configuration screens to get a sense of feature-sets.

Rails Autoscale

web dynos

To configure web dynos, here's what you get, with default values: The metric Rails Autoscale uses for scaling web dynos is time in the heroku routing queue, which seems right to me — when things are spending longer in the heroku routing queue before getting to a dyno, it means scale up.

worker dynos

For scaling worker dynos, Rails Autoscale can scale a dyno type named "worker" — it can understand the ruby queuing libraries Sidekiq, Resque, Delayed Job, or Que. I'm not certain if there are options for writing custom adapter code for other backends.
Here's what the configuration options are — sorry, these aren't the defaults, I've already customized them and lost track of what the defaults are. You can see that worker dynos are scaled based on the metric "number of jobs queued", and you can tell it to only pay attention to certain queues if you want.

Hirefire

Hirefire has far more options for customization than Rails Autoscale, which can make it a bit overwhelming, but also potentially more powerful.

web dynos

You can actually configure as many Heroku process types as you have for autoscale, not just ones named "web" and "worker". And for each, you have your choice of several metrics to be used as scaling triggers. For web, I think Queue Time (percentile, average) matches what Rails Autoscale does, configured to percentile, 95, and is probably the best to use unless you have a reason to use another. ("Rails Autoscale tracks the 95th percentile queue time, which for most applications will hover well below the default threshold of 100ms.") Here's what configuration Hirefire makes available if you are scaling on "queue time" like Rails Autoscale; configuration may vary for other metrics. I think if you fill in the right numbers, you can configure it to work equivalently to Rails Autoscale.

worker dynos

If you have more than one heroku process type for workers — say, working on different queues — Hirefire can scale them independently, with entirely separate configuration. This is pretty handy, and I don't think Rails Autoscale offers this. (Update: I may be wrong, Rails Autoscale says they do support this, so check on it yourself if it matters to you.)

For worker dynos, you could choose to scale based on actual "dyno load", but I think this is probably mostly for types of processes where there isn't the ability to look at "number of jobs". A "number of jobs in queue" metric like Rails Autoscale uses makes a lot more sense to me as an effective metric for scaling queue-based bg workers. Hirefire's metric is slightly different from Rails Autoscale's "jobs in queue". For recognized ruby queue systems (a larger list than Rails Autoscale's; and you can write your own custom adapter for whatever you like), it actually measures jobs in queue plus workers currently busy. So queued+in-progress, rather than Rails Autoscale's just queued. I actually have a bit of trouble wrapping my head around the implications of this, but basically, it means that Hirefire's "jobs in queue" metric strategy is intended to try to scale all the way to emptying your queue, or reaching your max scale limit, whichever comes first. I think this may make sense and work out at least as well or perhaps better than Rails Autoscale's approach?

Here's what configuration Hirefire makes available for worker dynos scaling on the "job queue" metric. Since the metric isn't the same as Rails Autoscale's, we can't configure this to work identically. But there are a whole bunch of configuration options, some similar to Rails Autoscale's. The most important thing here is that "Ratio" configuration. It may not be obvious, but with the way the hirefire metric works, you are basically meant to configure this to equal the number of workers/threads you have on each dyno. I have it configured to 3 because my heroku worker processes use resque, with resque_pool, configured to run 3 resque workers on each dyno. If you use sidekiq, set ratio to your configured concurrency — or if you are running more than one sidekiq process, processes*concurrency.
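To make that arithmetic concrete, here is a hedged sketch in plain ruby (the numbers are from my setup or hypothetical; this is not Hirefire's API, just the calculation it expects you to do):

# resque via resque-pool, 3 resque workers per dyno (my setup):
resque_workers_per_dyno = 3
ratio = resque_workers_per_dyno                            # => 3

# sidekiq instead (hypothetical numbers):
sidekiq_processes_per_dyno = 1
sidekiq_concurrency        = 10
ratio = sidekiq_processes_per_dyno * sidekiq_concurrency   # => 10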
Basically, how many jobs your dyno can be concurrently working on is what you should normally set for 'ratio'.

Hirefire not a heroku plugin

Hirefire isn't actually a heroku plugin. In addition to that meaning separate invoicing, there can be some other inconveniences. Since hirefire can only interact with the heroku API, for some metrics (including the "queue time" metric that is probably optimal for web dyno scaling) you have to configure your app to log regular statistics to heroku's "Logplex" system. This can add a lot of noise to your log, and for heroku logging add-ons that are tiered based on number of log lines or bytes, it can push you up to higher pricing tiers. If you use Papertrail, I think you should be able to use its log filtering feature to solve this, keeping that noise out of your logs and avoiding impact on log data transfer limits. However, if you ever have cause to look at heroku's raw logs, that noise will still be there.

Support and Docs

I asked a couple questions of both Hirefire and Rails Autoscale as part of my evaluation, and got back well-informed and easy-to-understand answers quickly from both. Support for both seems to be great. I would say the documentation is decent-but-not-exhaustive for both products. Hirefire may have slightly more complete documentation.

Other Features?

There are other things you might want to compare, various kinds of observability (bar chart or graph of dynos or observed metrics) and notification. I don't have time to get into the details (and didn't actually spend much time exploring them to evaluate), but they seem to offer roughly similar features.

Conclusion

Rails Autoscale is quite a bit more expensive than hirefire.io's flat rate, once you get past Rails Autoscale's most basic tier (scaling no more than 3 standard dynos). It's true that autoscaling saves you money over not autoscaling, so even an expensive price could be considered a 'cut' of that, and possibly for many ecommerce sites even $99 a month might be a drop in the bucket (!)…. but this price difference is so significant with hirefire (which has a flat rate regardless of dynos) that it seems to me it would take a lot of additional features/value to justify. And it's not clear that Rails Autoscale has any feature advantage. In general, hirefire.io seems to have more features and flexibility.

Until 2021, hirefire.io could only analyze metrics with 1-minute resolution, so perhaps that was a "killer feature"? Honestly I wonder if this price difference is sustained by Rails Autoscale only because most customers aren't aware of hirefire.io, it not being listed on the heroku marketplace? Single-invoice billing is handy, but probably not worth $80+ a month. I guess hirefire's logplex noise is a bit inconvenient? Or is there something else I'm missing? Pricing competition is good for the consumer. And are there any other heroku autoscale solutions, that can handle Rails bg job dynos, that I still don't know about?

Update, a day after writing: djcp on a reddit thread writes:

I used to be a principal engineer for the heroku add-ons program. One issue with hirefire is they request account level oauth tokens that essentially give them ability to do anything with your apps, where Rails Autoscaling worked with us to create a partnership and integrate with our "official" add-on APIs that limits security concerns and are scoped to the application that's being scaled.
Part of the reason for hirefire working the way it does is historical, but we’ve supported the endpoints they need to scale for “official” partners for years now. A lot of heroku customers use hirefire so please don’t think I’m spreading FUD, but you should be aware you’re giving a third party very broad rights to do things to your apps. They probably won’t, of course, but what if there’s a compromise? “Official” add-on providers are given limited scoped tokens to (mostly) only the actions / endpoints they need, minimizing blast radius if they do get compromised. You can read some more discussion at that thread. jrochkind General 2 Comments January 27, 2021January 30, 2021 Managed Solr SaaS Options I was recently looking for managed Solr “software-as-a-service” (SaaS) options, and had trouble figuring out what was out there. So I figured I’d share what I learned. Even though my knowledge here is far from exhaustive, and I have only looked seriously at one of the ones I found. The only managed Solr options I found were: WebSolr; SearchStax; and OpenSolr. Of these, i think WebSolr and SearchStax are more well-known, I couldn’t find anyone with experience with OpenSolr, which perhaps is newer. Of them all, SearchStax is the only one I actually took for a test drive, so will have the most to say about. Why we were looking We run a fairly small-scale app, whose infrastructure is currently 4 self-managed AWS EC2 instances, running respectively: 1) A rails web app 2) Bg workers for the rails web app 3) Postgres, and 4) Solr. Oh yeah, there’s also a redis running one of those servers, on #3 with pg or #4 with solr, I forget. Currently we manage this all ourselves, right on the EC2. But we’re looking to move as much as we can into “managed” servers. Perhaps we’ll move to Heroku. Perhaps we’ll use hatchbox. Or if we do stay on AWS resources we manage directly, we’d look at things like using an AWS RDS Postgres instead of installing it on an EC2 ourselves, an AWS ElastiCache for Redis, maybe look into Elastic Beanstalk, etc. But no matter what we do, we need a Solr, and we’d like to get it managed. Hatchbox has no special Solr support, AWS doesn’t have a Solr service, Heroku does have a solr add-on but you can also use any Solr with it and we’ll get to that later. Our current Solr use is pretty small scale. We don’t run “SolrCloud mode“, just legacy ordinary Solr. We only have around 10,000 documents in there (tiny for Solr), our index size is only 70MB. Our traffic is pretty low — when I tried to figure out how low, it doesn’t seem we have sufficient logging turned on to answer that specifically but using proxy metrics to guess I’d say 20K-40K requests a day, query as well as add. This is a pretty small Solr installation, although it is used centrally for the primary functions of the (fairly low-traffic) app. It currently runs on an EC2 t3a.small, which is a “burstable” EC2 type with only 2G of RAM. It does have two vCPUs (that is one core with ‘hyperthreading’). The t3a.small EC2 instance only costs $14/month on-demand price! We know we’ll be paying more for managed Solr, but we want to do get out of the business of managing servers — we no longer really have the staff for it. WebSolr (didn’t actually try out) WebSolr is the only managed Solr currently listed as a Heroku add-on. It is also available as a managed Solr independent of heroku. The pricing in the heroku plans vs the independent plans seems about the same. 
As a heroku add-on there is a $20 “staging” plan that doesn’t exist in the independent plans. (Unlike some other heroku add-ons, no time-limited free plan is available for WebSolr). But once we go up from there, the plans seem to line up. Starting at: $59/month for: 1 million document limit 40K requests/day 1 index 954MB storage 5 concurrent requests limit (this limit is not mentioned on the independent pricing page?) Next level up is $189/month for: 5 million document limit 150K requests/day 4.6GB storage 10 concurrent request limit (again concurrent request limits aren’t mentioned on independent pricing page) As you can see, WebSolr has their plans metered by usage. $59/month is around the price range we were hoping for (we’ll need two, one for staging one for production). Our small solr is well under 1 million documents and ~1GB storage, and we do only use one index at present. However, the 40K requests/day limit I’m not sure about, even if we fit under it, we might be pushing up against it. And the “concurrent request” limit simply isn’t one I’m even used to thinking about. On a self-managed Solr it hasn’t really come up. What does “concurrent” mean exactly in this case, how is it measured? With 10 puma web workers and sometimes a possibly multi-threaded batch index going on, could we exceed a limit of 4? Seems plausible. What happens when they are exceeded? Your Solr request results in an HTTP 429 error! Do I need to now write the app to rescue those gracefully, or use connection pooling to try to avoid them, or something? Having to rewrite the way our app functions for a particular managed solr is the last thing we want to do. (Although it’s not entirely clear if those connection limits exist on the non-heroku-plugin plans, I suspect they do?). And in general, I’m not thrilled with the way the pricing works here, and the price points. I am positive for a lot of (eg) heroku customers an additional $189*2=$378/month is peanuts not even worth accounting for, but for us, a small non-profit whose app’s traffic does not scale with revenue, that starts to be real money. It is not clear to me if WebSolr installations (at “standard” plans) are set up in “SolrCloud mode” or not; I’m not sure what API’s exist for uploading your custom schema.xml (which we’d need to do), or if they expect you to do this only manually through a web UI (that would not be good); I’m not sure if you can upload custom solrconfig.xml settings (this may be running on a shared solr instance with standard solrconfig.xml?). Basically, all of this made WebSolr not the first one we looked at. Does it matter if we’re on heroku using a managed Solr that’s not a Heroku plugin? I don’t think so. In some cases, you can get a better price from a Heroku plug-in than you could get from that same vendor not on heroku or other competitors. But that doesn’t seem to be the case here, and other that that does it matter? Well, all heroku plug-ins are required to bill you by-the-minute, which is nice but not really crucial, other forms of billing could also be okay at the right price. With a heroku add-on, your billing is combined into one heroku invoice, no need to give a credit card to anyone else, and it can be tracked using heroku tools. Which is certainly convenient and a plus, but not essential if the best tool for the job is not a heroku add-on. And as a heroku add-on, WebSolr provides a WEBSOLR_URL heroku config/env variable automatically to code running on heroku. 
OK, that's kind of nice, but it's not a big deal to set a SOLR_URL heroku config manually referencing the appropriate address. I suppose as a heroku add-on, WebSolr also takes care of securing and authenticating connections between the heroku dynos and the solr, so we need to make sure we have a reasonable way to do this from any alternative.

SearchStax (did take it for a spin)

SearchStax's pricing tiers are not based on metering usage. There are no limits based on requests/day or concurrent connections. SearchStax runs on dedicated-to-you individual Solr instances (I would guess running on dedicated-to-you individual (eg) EC2, but I'm not sure). Instead, the pricing is based on the size of the host running Solr. You can choose to run on instances deployed to AWS, Google Cloud, or Azure. We'll be sticking to AWS (the others, I think, have a slight price premium).

While SearchStax gives you a pricing page that looks like the "new-way-of-doing-things" transparent pricing, in fact there isn't really enough info on public pages to see all the price points and understand what you're getting; there is still a kind of "talk to a salesperson who has a price sheet" thing going on. What I think I have figured out from talking to a salesperson and support is that the "Silver" plans ("Starting at $19 a month", although we'll say more about that in a bit) are basically: we give you a Solr, we don't provide any technical support for Solr. While the "Gold" plans "from $549/month" are actually about paying for Solr consultants to set up and tune your schema/index etc. That is not something we need, and $549+/month is way more than the price range we are looking for.

While the SearchStax pricing/plan pages kind of imply the "Silver" plan is not suitable for production, in fact there is no real reason not to use it for production I think, and the salesperson I talked to confirmed that — just reaffirming that you were on your own managing the Solr configuration/setup. That's fine, that's what we want; we just don't want to manage the OS or set up the Solr or upgrade it etc. The Silver plans have no SLA, but as far as I can tell their uptime is just fine. The Silver plans only guarantee a 72-hour support response time — but for the couple support tickets I filed asking questions while under a free 14-day trial (oh yeah, that's available), I got prompt same-day responses, and knowledgeable responses that answered my questions.

So a "silver" plan is what we are interested in, but the pricing is not actually transparent. $19/month is for the smallest instance available, and IF you prepay/contract for a year. They call that small instance an NDN1 and it has 1GB of RAM and 8GB of storage. If you pay-as-you-go instead of contracting for a year, that already jumps to $40/month. (That price is available on the trial page). When you are paying-as-you-go, you are actually billed per-day, which might not be as nice as heroku's per-minute, but it's pretty okay, and useful if you need to bring up a temporary solr instance as part of a migration/upgrade or something like that. The next step up is an "NDN2" which has 2G of RAM and 16GB of storage, and has a ~$80/month pay-as-you-go price — you can find that price if you sign up for a free trial. The discounted price for an annual contract is a discount similar to the NDN1's 50%: $40/month — that price I got only from a salesperson, I don't know if it's always stable. It only occurs to me now that they don't tell you how many CPUs are available.
I’m not sure if I can fit our Solr in the 1G NDN1, but I am sure I can fit it in the 2G NDN2 with some headroom, so I didn’t look at plans above that — but they are available, still under “silver”, with prices going up accordingly. All SearchStax solr instances run in “SolrCloud” mode — these NDN1 and NDN2 ones we’re looking at just run one node with one zookeeper, but still in cloud mode. There are also “silver” plans available with more than one node in a “high availability” configuration, but the prices start going up steeply, and we weren’t really interested in that. Because it’s SolrCloud mode though, you can use the standard Solr API for uploading your configuration. It’s just Solr! So no arbitrary usage limits, no features disabled. The SearchStax web console seems competently implemented; it let’s you create and delete individual Solr “deployments”, manage accounts to login to console (on “silver” plan you only get two, or can pay $10/month/account for more, nah), and set up auth for a solr deployment. They support IP-based authentication or HTTP Basic Auth to the Solr (no limit to how many Solr Basic Auth accounts you can create). HTTP Basic Auth is great for us, because trying to do IP-based from somewhere like heroku isn’t going to work. All Solrs are available over HTTPS/SSL — great! SearchStax also has their own proprietary HTTP API that lets you do most anything, including creating/destroying deployments, managing Solr basic auth users, basically everything. There is some API that duplicates the Solr Cloud API for adding configsets, I don’t think there’s a good reason to use it instead of standard SolrCloud API, although their docs try to point you to it. There’s even some kind of webhooks for alerts! (which I haven’t really explored). Basically, SearchStax just seems to be a sane and rational managed Solr option, it has all the features you’d expect/need/want for dealing with such. The prices seem reasonable-ish, generally more affordable than WebSolr, especially if you stay in “silver” and “one node”. At present, we plan to move forward with it. OpenSolr (didn’t look at it much) I have the least to say about this, have spent the least time with it, after spending time with SearchStax and seeing it met our needs. But I wanted to make sure to mention it, because it’s the only other managed Solr I am even aware of. Definitely curious to hear from any users. Here is the pricing page. The prices seem pretty decent, perhaps even cheaper than SearchStax, although it’s unclear to me what you get. Does “0 Solr Clusters” mean that it’s not SolrCloud mode? After seeing how useful SolrCloud APIs are for management (and having this confirmed by many of my peers in other libraries/museums/archives who choose to run SolrCloud), I wouldn’t want to do without it. So I guess that pushes us to “executive” tier? Which at $50/month (billed yearly!) is still just fine, around the same as SearchStax. But they do limit you to one solr index; I prefer SearchStax’s model of just giving you certain host resources and do what you want with it. It does say “shared infrastructure”. Might be worth investigating, curious to hear more from anyone who did. Now, what about ElasticSearch? We’re using Solr mostly because that’s what various collaborative and open source projects in the library/museum/archive world have been doing for years, since before ElasticSearch even existed. So there are various open source libraries and toolsets available that we’re using. 
But for whatever reason, there seem to be SO MANY MORE managed ElasticSearch SaaS available, at possibly much cheaper pricepoints. Is this because the ElasticSearch market is just bigger? Or is ElasticSearch easier/cheaper to run in a SaaS environment? Or what? I don't know. But there's the controversial AWS ElasticSearch Service; there's the Elastic Cloud "from the creators of ElasticSearch". On Heroku, which lists one Solr add-on, there are THREE ElasticSearch add-ons listed: ElasticCloud, Bonsai ElasticSearch, and SearchBox ElasticSearch. If you just google "managed ElasticSearch" you immediately see 3 or 4 other names. I don't know enough about ElasticSearch to evaluate them. They seem, on first glance at pricing pages, to be more affordable, but I may not know what I'm comparing and be looking at tiers that aren't actually usable for anything or will have hidden fees. But I know there are definitely many more managed ElasticSearch SaaS than Solr.

I think ElasticSearch probably does everything our app needs. If I were to start from scratch, I would definitely consider ElasticSearch over Solr just based on how many more SaaS options there are. While it would require some knowledge-building (I have developed a lot of knowledge of Solr and zero of ElasticSearch) and rewriting some parts of our stack, I might still consider switching to ES in the future; we don't do anything too too complicated with Solr that would be too too hard to switch to ES, probably.

jrochkind General Leave a comment January 12, 2021 / January 27, 2021

Gem authors, check your release sizes

Most gems should probably be a couple hundred kb at most. I'm talking about the package actually stored in and downloaded from rubygems by an app using the gem. After all, source code is just text, and it doesn't take up much space. OK, maybe some gems have a couple images in there. But if you are looking at your gem in rubygems and realize that it's 10MB or bigger… and that it seems to be getting bigger with every release… something is probably wrong and worth looking into.

One way to look into it is to look at the actual gem package. If you use the handy bundler rake task to release your gem (and I recommend it), you have a ./pkg directory in the source you last released from. Inside it are ".gem" files for each release you've made from there, unless you've cleaned it up recently. .gem files are just .tar files, it turns out, that have more tar and gz files inside them, etc. We can go into one, extract the contents, and use the handy unix utility du -sh to see what is taking up all the space.

How I found the bytes

jrochkind-chf kithe (master ?) $ cd pkg
jrochkind-chf pkg (master ?) $ ls
kithe-2.0.0.beta1.gem      kithe-2.0.0.pre.rc1.gem
kithe-2.0.0.gem            kithe-2.0.1.gem
kithe-2.0.0.pre.beta1.gem  kithe-2.0.2.gem
jrochkind-chf pkg (master ?) $ mkdir exploded
jrochkind-chf pkg (master ?) $ cp kithe-2.0.0.gem exploded/kithe-2.0.0.tar
jrochkind-chf pkg (master ?) $ cd exploded
jrochkind-chf exploded (master ?) $ tar -xvf kithe-2.0.0.tar
x metadata.gz
x data.tar.gz
x checksums.yaml.gz
jrochkind-chf exploded (master ?) $ mkdir unpacked_data_tar
jrochkind-chf exploded (master ?) $ tar -xvf data.tar.gz -C unpacked_data_tar/
jrochkind-chf exploded (master ?) $ cd unpacked_data_tar/
/Users/jrochkind/code/kithe/pkg/exploded/unpacked_data_tar
jrochkind-chf unpacked_data_tar (master ?) $ du -sh *
4.0K    MIT-LICENSE
 12K    README.md
4.0K    Rakefile
160K    app
8.0K    config
 32K    db
100K    lib
300M    spec
jrochkind-chf unpacked_data_tar (master ?) $ cd spec
jrochkind-chf spec (master ?) $ du -sh *
8.0K    derivative_transformers
300M    dummy
 12K    factories
 24K    indexing
 72K    models
4.0K    rails_helper.rb
 44K    shrine
 12K    simple_form_enhancements
8.0K    spec_helper.rb
188K    test_support
4.0K    validators
jrochkind-chf spec (master ?) $ cd dummy/
jrochkind-chf dummy (master ?) $ du -sh *
4.0K    Rakefile
 56K    app
 24K    bin
124K    config
4.0K    config.ru
8.0K    db
300M    log
4.0K    package.json
 12K    public
4.0K    tmp

Doh! In this particular gem, I have a dummy rails app, and it has 300MB of logs, because I haven't bothered trimming them in a while, and they are winding up included in the gem release package distributed to rubygems and downloaded by all consumers! Even if they were small, I don't want these in the released gem package at all! That's not good! It only turns into 12MB instead of 300MB, because log files are so compressible and there is compression involved in assembling the rubygems package. But I have no idea how much space it's actually taking up on consuming applications' machines. This is very irresponsible!

What controls what files are included in the gem package? Your .gemspec file, of course. The line s.files = is an array of every file to include in the gem package. Well, plus s.test_files is another array of more files, that aren't supposed to be necessary to run the gem, but are to test it. (Rubygems was set up to allow automated *testing* of gems after download, which is why test files are included in the release package. I am not sure how useful this is, and who if anyone does it; although I believe that some linux distro packagers try to make use of it, for better or worse.)

But nobody wants to list every file in your gem individually, manually editing the array every time you add, remove, or move one. Fortunately, gemspec files are executable ruby code, so you can use ruby as a shortcut. I have seen two main ways of doing this, with different "gem skeleton generators" taking one of two approaches.

Sometimes a shell out to git is used — the idea is that everything you have checked into your git should be in the gem release package, no more or no less. For instance, one of my gems has this in it, not sure where it came from or who/what generated it:

spec.files = `git ls-files -z`.split("\x0").reject do |f|
  f.match(%r{^(test|spec|features)/})
end

In that case, it wouldn't have included anything in ./spec anyway, so this obviously isn't actually the gem we were looking at before. But in this case, in addition to using ruby logic to manipulate the results, nothing excluded by your .gitignore file will end up included in your gem package, great! In kithe, which we were looking at before, those log files were in the .gitignore (they weren't in my repo!), so if I had been using that git-shellout technique, they wouldn't have been included in the gem release already. But… I wasn't. Instead this gem has a gemspec that looks like:

s.test_files = Dir["spec/*/"]

Just include every single file inside ./spec in the test_files list. Oops. Then I get all those log files!

One way to fix

I don't really know which is to be preferred of the git-shellout approach vs the dir-glob approach. I suspect it is the subject of historical religious wars in rubydom, when there were still more people around to argue about such things. Any opinions? Or another approach?
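Whichever approach a gemspec takes, one quick sanity check, sketched here with stock rubygems APIs (the file names reuse the kithe example above and are otherwise hypothetical), is to ask rubygems what would actually ship:

require "rubygems/package"

# List every file inside an already-built .gem package:
Gem::Package.new("pkg/kithe-2.0.0.gem").contents.each { |path| puts path }

# Or, before building, load the gemspec and inspect the computed file list:
spec = Gem::Specification.load("kithe.gemspec")
puts (spec.files + spec.test_files).sort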
Without being in the mood to restructure this gemspec in any way, I just did the simplest thing to keep those log files out…

Dir["spec/*/"].delete_if {|a| a =~ %r{/dummy/log/}}

Build the package without releasing, with the handy bundler-supplied rake build task… and my gem release package size goes from 12MB to 64K. (Which actually kind of sounds like a minimum block size or something, right?)

Phew! That’s a big difference! Sorry to anyone using previous versions and winding up downloading all that cruft! (Actually this particular gem is mostly a proof of concept at this point and I don’t think anyone else is using it).

Check your gem sizes!

I’d be willing to bet there are lots of released gems with heavily bloated release packages like this. This isn’t the first one I’ve realized was my fault. Because who pays attention to gem sizes anyway? Apparently not many! But rubygems does list them, so it’s pretty easy to see.

Are your gem release packages multiple megs, when there’s no good reason for them to be? Do they get bigger every release by far more than the lines of code you think were added? At some point in gem history was there a big jump from hundreds of KB to multiple MB, when nothing in particular actually happened to the gem logic to lead to that? All hints that you might be including things you didn’t mean to include, possibly things that grow each release.

You don’t need to have a dummy rails app in your repo to accidentally do this (I accidentally did it once with a gem that had nothing to do with rails). There could be other kinds of log files. Or test coverage or performance metric files, or any other artifacts of your build or your development, especially ones that grow over time — things that aren’t actually meant for, or needed as part of, the gem release package!

It’s good to sanity check your gem release packages now and then. In most cases, your gem release package should be hundreds of KB at most, not MBs. Help keep your users’ installs and builds faster and slimmer!

jrochkind General Leave a comment January 11, 2021

Every time you decide to solve a problem with code…

Every time you decide to solve a problem with code, you are committing part of your future capacity to maintaining and operating that code. Software is never done.

— Software is drowning the world, by James Abley

jrochkind General Leave a comment January 10, 2021

Updating SolrCloud configuration in ruby

We have an app that uses Solr. We currently run Solr in legacy “not cloud” mode. Our solr configuration directory is on disk on the Solr server, and it’s up to our processes to get our desired solr configuration there, and to update it when it changes.

We are in the process of moving to Solr in “SolrCloud mode“, probably via the SearchStax managed Solr service. Our Solr “Cloud” might only have one node, but “SolrCloud mode” gives us access to additional API’s for managing our solr configuration, as opposed to writing it directly to disk (which may not be possible at all in SolrCloud mode? And certainly isn’t using managed SearchStax). That is, the Solr ConfigSets API, although you might also want to use a few pieces of the Collection Management API for associating a configset with a Solr collection.

Basically, you are taking your desired solr config directory, zipping it up, and uploading it to Solr as a “config set” [or “configset”] with a certain name. Then you can create collections using this config set, or reassign which named configset an existing collection uses.
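To make that concrete, here is a hedged sketch of what the raw v1 ConfigSets upload call can look like from ruby (this isn’t the code I ended up with below; it assumes the rubyzip gem, a local ./solr/conf directory, a made-up Solr URL, and no Solr auth):

# Hypothetical sketch: zip up ./solr/conf and upload it to Solr's v1 ConfigSets
# API as a configset named "myConfigset". Names and URL are placeholders.
require "zip"       # rubyzip gem
require "net/http"
require "uri"

def zipped_config(conf_dir)
  Zip::OutputStream.write_buffer do |zio|
    Dir.glob("#{conf_dir}/**/*").select { |f| File.file?(f) }.sort.each do |path|
      zio.put_next_entry(path.delete_prefix("#{conf_dir}/")) # entry paths relative to conf dir
      zio.write(File.binread(path))
    end
  end.string
end

uri = URI("https://example.com/solr/admin/configs?action=UPLOAD&name=myConfigset")
request = Net::HTTP::Post.new(uri)
request["Content-Type"] = "application/octet-stream"
request.body = zipped_config("./solr/conf")

response = Net::HTTP.start(uri.host, uri.port, use_ssl: uri.scheme == "https") do |http|
  http.request(request)
end
raise "Solr configset upload failed: #{response.body}" unless response.code == "200"

The same admin/configs endpoint also handles action=LIST and action=DELETE, which is basically all the wrapper object below is doing, with nicer error handling.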
I wasn’t able to find any existing ruby gems for interacting with these Solr API’s. RSolr is a “ruby client for interacting with solr”, but was written before most of these administrative API’s existed for Solr, and doesn’t seem to have been updated to deal with them (unless I missed it), RSolr seems to be mostly/only about querying solr, and some limited indexing. But no worries, it’s not too hard to wrap the specific API I want to use in some ruby. Which did seem far better to me than writing the specific HTTP requests each time (and making sure you are dealing with errors etc!). (And yes, I will share the code with you). I decided I wanted an object that was bound to a particular solr collection at a particular solr instance; and was backed by a particular local directory with solr config. That worked well for my use case, and I wound up with an API that looks like this: updater = SolrConfigsetUpdater.new( solr_url: "https://example.com/solr", conf_dir: "./solr/conf", collection_name: "myCollection" ) # will zip up ./solr/conf and upload it as named MyConfigset: updater.upload("myConfigset") updater.list #=> ["myConfigSet"] updater.config_name # what configset name is MyCollection currently configured to use? # => "oldConfigSet" # what if we try to delete the one it's using? updater.delete("oldConfigSet") # => raises SolrConfigsetUpdater::SolrError with message: # "Can not delete ConfigSet as it is currently being used by collection [myConfigset]" # okay let's change it to use the new one and delete the old one updater.update_config_name("myConfigset") # now MyCollection uses this new configset, although we possibly # need to reload the collection to make that so updater.reload # now let's delete the one we're not using updater.delete("oldConfigSet") OK, great. There were some tricks in there in trying to catch the apparently multiple ways Solr can report different kinds of errors, to make sure Solr-reported errors turn into exceptions ideally with good error messages. Now, in addition to uploading a configset initially for a collection you are creating to use, the main use case I have is wanting to UPDATE the configuration to new values in an existing collection. Sure, this often requires a reindex afterwards. If you have the recently released Solr 8.7, it will let you overwrite an existing configset, so this can be done pretty easily. updater.upload(updater.config_name, overwrite: true) updater.reload But prior to Solr 8.7 you can not overwrite an existing configset. And SearchStax doesn’t yet have Solr 8.7. So one way or another, we need to do a dance where we upload the configset under a new name than switch the collection to use it. Having this updater object that lets us easily execute relevant Solr API lets us easily experiment with different logic flows for this. For instance in a Solr listserv thread, Alex Halovnic suggests a somewhat complicated 8-step process workaround, which we can implement like so: current_name = updater.config_name temp_name = "#{current_name}_temp" updater.create(from: current_name, to: temp_name) updater.change_config_name(temp_name) updater.reload updater.delete(current_name) updater.upload(configset_name: current_name) updater.change_config_name(current_name) updater.reload updater.delete(temp_name) That works. But talking to Dann Bohn at Penn State University, he shared a different algorithm, which goes like: Make a cryptographic digest hash of the entire solr directory, which we’re going to use in the configset name. 
Check if the collection is already using a configset named $name_$digest; if it already is, you’re done, no change needed. Otherwise, upload the configset with the fingerprint-based name, switch the collection to use it, reload, delete the configset that the collection used to use.

At first this seemed like overkill to me, but after thinking and experimenting with it, I like it! It is really quick to make a digest of a handful of files, that’s not a big deal. (I use the first 7 chars of a hex SHA256). And even if we had Solr 8.7, I like that we can avoid doing any operation on solr at all if there had been no changes — I really want to use this operation much like a Rails db:migrate, running it on every deploy to make sure the solr schema matches the one in the repo for the deploy.

Dann also shared his open source code with me, which was helpful for seeing how to make the digest, how to make a Zip file in ruby, etc. Thanks Dann!

Sharing my code

So I also wrote some methods to implement those variant updating strategies, Dann’s, Alex Halovnic’s from the list, etc. I thought about wrapping this all up as a gem, but I didn’t really have the time to make it good enough for that. My API is a little bit janky; I didn’t spend the extra time thinking it out really well to minimize the need for future backwards-incompatible changes, like I would if it were a gem. I also couldn’t figure out a great way to write automated tests for this that I would find particularly useful; so in my code base it’s actually not currently test-covered (shhhhh), but in a gem I’d want to solve that somehow.

But I did try to write the code general-purpose/flexible so other people could use it for their use cases; I tried to document it to my highest standards; and I put it all in one file, which actually might not be the best OO abstraction/design, but makes it easier for you to copy and paste the single file for your own use. :) So you can find my code here; it is apache-licensed; and you are welcome to copy and paste it and do whatever you like with it, including making a gem yourself if you want. Maybe I’ll get around to making it a gem in the future myself, I dunno, curious if there’s interest.

The SearchStax proprietary API’s

SearchStax has its own API’s that can, I think, be used for updating configsets and setting collections to use certain configsets, etc. When I started exploring them, they aren’t the worst vendor API’s I’ve seen, but I did find them a bit cumbersome to work with. The auth system involves a lot of steps (why can’t you just create an API Key from the SearchStax Web GUI?). Overall I found them harder to use than just the standard SolrCloud API’s, which worked fine in the SearchStax deployment, and have the added bonus of being transferable to any SolrCloud deployment instead of being SearchStax-specific.

While the SearchStax docs and support try to steer you to the SearchStax-specific API’s, I don’t think there’s really any good reason for this. (Perhaps the custom SearchStax API’s were written long ago when Solr API’s weren’t as complete?) SearchStax support suggested that the SearchStax APIs were somehow more secure; but my SearchStax Solr API’s are protected behind HTTP basic auth, and if I’ve created basic auth credentials (or an IP addr allowlist) those API’s will be available to anyone with auth to access Solr whether I use them or not!
And support also suggested that the SearchStax API use would be logged, whereas my direct Solr API use would not be, which seems to be true at least in default setup, I can probably configure solr logging differently, but it just isn’t that important to me for these particular functions. So after some initial exploration with SearchStax API, I realized that SolrCloud API (which I had never used before) could do everything I need and was more straightforward and transferable to use, and I’m happy with my decision to go with that. jrochkind General 3 Comments December 15, 2020December 16, 2020 Are you talking to Heroku redis in cleartext or SSL? In “typical” Redis installation, you might be talking to redis on localhost or on a private network, and clients typically talk to redis in cleartext. Redis doesn’t even natively support communications over SSL. (Or maybe it does now with redis6?) However, the Heroku redis add-on (the one from Heroku itself) supports SSL connections via “Stunnel”, a tool popular with other redis users use to get SSL redis connections too. (Or maybe via native redis with redis6? Not sure if you’d know the difference, or if it matters). There are heroku docs on all of this which say: While you can connect to Heroku Redis without the Stunnel buildpack, it is not recommend. The data traveling over the wire will be unencrypted. Perhaps especially because on heroku your app does not talk to redis via localhost or on a private network, but on a public network. But I think I’ve worked on heroku apps before that missed this advice and are still talking to heroku in the clear. I just happened to run across it when I got curious about the REDIS_TLS_URL env/config variable I noticed heroku setting. Which brings us to another thing, that heroku doc on it is out of date, it doesn’t mention the REDIS_TLS_URL config variable, just the REDIS_URL one. The difference? the TLS version will be a url beginning with rediss:// instead of redis:// , note extra s, which many redis clients use as a convention for “SSL connection to redis probably via stunnel since redis itself doens’t support it”. The redis docs provide ruby and go examples which instead use REDIS_URL and writing code to swap the redis:// for rediss:// and even hard-code port number adjustments, which is silly! (While I continue to be very impressed with heroku as a product, I keep running into weird things like this outdated documentation, that does not match my experience/impression of heroku’s all-around technical excellence, and makes me worry if heroku is slipping…). The docs also mention a weird driver: ruby arg for initializing the Redis client that I’m not sure what it is and it doesn’t seem necessary. The docs are correct that you have to tell the ruby Redis client not to try to verify SSL keys against trusted root certs, and this implementation uses a self-signed cert. 
Otherwise you will get an error that looks like:

OpenSSL::SSL::SSLError: SSL_connect returned=1 errno=0 state=error: certificate verify failed (self signed certificate in certificate chain)

So, it can be as simple as:

redis_client = Redis.new(url: ENV['REDIS_TLS_URL'], ssl_params: { verify_mode: OpenSSL::SSL::VERIFY_NONE })
$redis = redis_client
# and/or
Resque.redis = redis_client

I don’t use sidekiq on this project currently, but to get the SSL connection with VERIFY_NONE, looking at the sidekiq docs, you might have to do something like(?):

redis_conn = proc {
  Redis.new(url: ENV['REDIS_TLS_URL'], ssl_params: { verify_mode: OpenSSL::SSL::VERIFY_NONE })
}

Sidekiq.configure_client do |config|
  config.redis = ConnectionPool.new(size: 5, &redis_conn)
end

Sidekiq.configure_server do |config|
  config.redis = ConnectionPool.new(size: 25, &redis_conn)
end

(Not sure what values you should pick for connection pool size).

While the sidekiq docs mention heroku in passing, they don’t mention the need for SSL connections — I think awareness of this heroku feature and their recommendation that you use it may not actually be common!

Update: Beware REDIS_URL can also be rediss

On one of my apps I saw a REDIS_URL which used redis: and a REDIS_TLS_URL which uses (secure) rediss:. But on another app, it provides *only* a REDIS_URL, which is rediss — meaning you have to set the verify_mode: OpenSSL::SSL::VERIFY_NONE when passing it to the ruby redis client. So you have to be prepared to do this with REDIS_URL values too — I think it shouldn’t hurt to set the ssl_params option even if you pass it a non-ssl redis: url, so just set it all the time?

This second app was on the heroku-20 stack, and the first was on heroku-18; is that the difference? No idea. Documented anywhere? I doubt it. Definitely seems sloppy for what I expect of heroku, making me a bit suspicious of whether heroku is sticking to the really impressive level of technical excellence and documentation I expect from them.

So, your best bet is to check for both REDIS_TLS_URL and REDIS_URL, preferring the TLS one if present, realizing the REDIS_URL can have a rediss:// value in it too.

The heroku docs also say you don’t get a secure TLS redis connection on “hobby” plans, but I’m not sure that’s actually true anymore on heroku-20? Not trusting the docs is not a good sign.

jrochkind General 4 Comments November 24, 2020 November 25, 2020

Comparing performance of a Rails app on different Heroku formations

I develop a “digital collections” or “asset management” app, which manages and makes digitized historical objects and their descriptions available to the public, from the collections here at the Science History Institute. The app receives a relatively low level of traffic (according to Google Analytics, around 25K pageviews a month), although we want it to be able to handle spikes without falling down. It is not the most performance-optimized app; it does have some relatively slow responses and can be RAM-hungry. But it works adequately on our current infrastructure: web traffic is handled on a single AWS EC2 t2.medium instance, with 10 passenger processes (free version of passenger, so no multi-threading).

We are currently investigating the possibility of moving our infrastructure to heroku. After realizing that heroku standard dynos did not seem to have the performance characteristics I had expected, I decided to approach performance testing more methodically, to compare different heroku dyno formations to each other and to our current infrastructure.
Our basic research question is probably What heroku formation do we need to have similar performance to our existing infrastructure? I am not an expert at doing this — I did some research, read some blog posts, did some thinking, and embarked on this. I am going to lead you through how I approached this and what I found. Feedback or suggestions are welcome. The most surprising result I found was much poorer performance from heroku standard dynos than I expected, and specifically that standard dynos would not match performance of present infrastructure. What URLs to use in test Some older load-testing tools only support testing one URL over and over. I decided I wanted to test a larger sample list of URLs — to be a more “realistic” load, and also because repeatedly requesting only one URL might accidentally use caches in ways you aren’t expecting giving you unrepresentative results. (Our app does not currently use fragment caching, but caches you might not even be thinking about include postgres’s built-in automatic caches, or passenger’s automatic turbocache (which I don’t think we have turned on)). My initial thought to get a list of such URLs from our already-in-production app from production logs, to get a sample of what real traffic looks like. There were a couple barriers for me to using production logs as URLs: Some of those URLs might require authentication, or be POST requests. The bulk of our app’s traffic is GET requests available without authentication, and I didn’t feel like the added complexity of setting up anything else in a load traffic was worthwhile. Our app on heroku isn’t fully functional yet. Without having connected it to a Solr or background job workers, only certain URLs are available. In fact, a large portion of our traffic is an “item” or “work” detail page like this one. Additionally, those are the pages that can be the biggest performance challenge, since the current implementation includes a thumbnail for every scanned page or other image, so response time unfortunately scales with number of pages in an item. So I decided a good list of URLs was simply a representative same of those “work detail” pages. In fact, rather than completely random sample, I took the 50 largest/slowest work pages, and then added in another 150 randomly chosen from our current ~8K pages. And gave them all a randomly shuffled order. In our app, every time a browser requests a work detail page, the JS on that page makes an additional request for a JSON document that powers our page viewer. So for each of those 200 work detail pages, I added the JSON request URL, for a more “realistic” load, and 400 total URLs. Performance: “base speed” vs “throughput under load” Thinking about it, I realized there were two kinds of “performance” or “speed” to think about. You might just have a really slow app, to exagerate let’s say typical responses are 5 seconds. That’s under low/no-traffic, a single browser is the only thing interacting with the app, it makes a single request, and has to wait 5 seconds for a response. That number might be changed by optimizations or performance regressions in your code (including your dependencies). It might also be changed by moving or changing hardware or virtualization environment — including giving your database more CPU/RAM resources, etc. But that number will not change by horizontally scaling your deployment — adding more puma or passenger processes or threads, scaling out hosts with a load balancer or heroku dynos. 
None of that will change this base speed, because it’s just how long the app takes to prepare a response when not under load — how slow it is in a test with only one web worker, where adding web workers won’t matter because they won’t be used.

Then there’s what happens to the app actually under load by multiple users at once. The base speed is kind of a lower bound on throughput under load — page response time is never going to get better than 5s for our hypothetical very slow app (without changing the underlying base speed). But it can get a lot worse if it’s hammered by traffic. This throughput under load can be affected not only by changing base speed, but also by various forms of horizontal scaling — how many puma or passenger processes you have with how many threads each, and how many CPUs they have access to, as well as the number of heroku dynos or other hosts behind a load balancer.

(I had been thinking about this distinction already, but Nate Berkopec’s great blog post on scaling Rails apps gave me the “speed” vs “throughput” terminology to use).

For my situation, we are not changing the code at all. But we are changing the host architecture from a manual EC2 t2.medium to heroku dynos (of various possible types) in a way that could affect base speed, and we’re also changing our scaling architecture in a way that could change throughput under load on top of that — from one t2.medium with 10 passenger processes to possibly multiple heroku dynos behind heroku’s load balancer, and also (for Reasons) switching from free passenger to trying puma with multiple threads per process. (We are running puma 5 with new experimental performance features turned on).

So we’ll want to get a sense of the base speed of the various host choices, and also look at how throughput under load changes based on various choices.

Benchmarking tool: wrk

We’re going to use wrk. There are LOTS of choices for HTTP benchmarking/load testing, with really varying complexity and from different eras of web history. I got a bit overwhelmed by it, but settled on wrk. Some other choices didn’t have all the features we need (some way to test a list of URLs, with at least some limited percentile distribution reporting). Others were much more flexible and complicated and I had trouble even figuring out how to use them!

wrk does need a custom lua script in order to handle a list of URLs. I found a nice script here, and modified it slightly to take the filename from an ENV variable, and to not randomly shuffle the input list.

It’s a bit confusing understanding the meaning of “threads” vs “connections” in wrk arguments. This blog post from appfolio clears it up a bit. I decided to leave threads set to 1, and vary connections for load — so -c1 -t1 is a “one URL at a time” setting we can use to test “base speed”, and we can benchmark throughput under load by increasing connections.

We want to make sure we run the test for long enough to touch all 400 URLs in our list at least once, even in the slower setups, to have a good comparison — ideally it would go through the list more than once, but for my own ergonomics I had to get through a lot of tests, so I ended up with less than ideal. (Should I have put fewer than 400 URLs in? Not sure).

Conclusions in advance

As benchmarking posts go (especially when I’m the one writing them), I’m about to drop a lot of words and data on you. So to maximize the audience that sees the conclusions (because they surprise me, and I want feedback/pushback on them), I’m going to give you some conclusions up front.
Our current infrastructure has the web app on a single EC2 t2.medium, which is a burstable EC2 type — our relatively low-traffic app does not exhaust its burst credits.

Measuring base speed (just one concurrent request at a time), we found that performance dynos seem to have about the CPU speed of a bursting t2.medium (just a hair slower). But standard dynos are as a rule 2 to 3 times slower; additionally they are highly variable, and that variability can be over hours/days. A 3-minute period can have measured response times 2 or more times slower than another 3-minute period a couple hours later. But they seem to typically be 2-3x slower than our current infrastructure.

Under load, they scale about how you’d expect if you knew how many CPUs are present, no real surprises. Our existing t2.medium has two CPUs, so it can handle 2 simultaneous requests as fast as 1, and after that degrades linearly. A single performance-L ($500/month) has 4 CPUs (8 hyperthreads), so it scales under load much better than our current infrastructure. A single performance-M ($250/month) has only 1 CPU (!), so it scales pretty terribly under load. Testing scaling with 4 standard-2x’s ($200/month total), we see that it scales relatively evenly, although lumpily because of variability; and it starts out performing so much worse that even as it scales “evenly” it’s still out-performed by all the other architectures. :( (At these relatively fast median response times you might say it’s still fast enough, who cares; but in our fat tail of slower pages it gets more distressing).

Now we’ll give you lots of measurements, or you can skip all that and go to my summary discussion or the conclusions for our own project at the end.

Let’s compare base speed

OK, let’s get to actual measurements! For “base speed” measurements, we’ll be telling wrk to use only one connection and one thread.

Existing t2.medium: base speed

Our current infrastructure is one EC2 t2.medium. This EC2 instance type has two vCPUs and 4GB of RAM. On that single EC2 instance, we run passenger (free, not enterprise) set to have 10 passenger processes, although the base speed test with only one connection should only touch one of the workers. The t2 is a “burstable” type, and we do always have burst credits (this is not a high traffic app; I verified we never exhausted burst credits in these tests), so our test load may be taking advantage of burst cpu.

$ URLS=./sample_works.txt wrk -c 1 -t 1 -d 3m --timeout 20s --latency -s load_test/multiplepaths.lua.txt https://[current staging server]
multiplepaths: Found 400 paths
multiplepaths: Found 400 paths
Running 3m test @ https://staging-digital.sciencehistory.org
  1 threads and 1 connections
  Thread Stats   Avg      Stdev     Max   +/- Stdev
    Latency   311.00ms  388.11ms   2.37s    86.45%
    Req/Sec    11.89      8.96     40.00     69.95%
  Latency Distribution
     50%   90.99ms
     75%  453.40ms
     90%  868.81ms
     99%    1.72s
  966 requests in 3.00m, 177.43MB read
Requests/sec:      5.37
Transfer/sec:      0.99MB

I’m actually feeling pretty good about those numbers on our current infrastructure! 90ms median, not bad, and even a 453ms 75th percentile is not too bad. Now, our test load involves some JSON responses that are quicker to deliver than the corresponding HTML pages, but still, pretty good. The 90th/99th percentiles and max request (2.37s) aren’t great, but I knew I had some slow pages; this matches my previous understanding of how slow they are in our current infrastructure. The 90th percentile is ~9 times the 50th percentile.
I don’t have an understanding of why the two different Req/Sec and Requests/sec values are so different, and don’t totally understand what to do with the Stdev and +/- Stdev values, so I’m just going to stick to looking at the latency percentiles; I think “latency” could also be called “response time” here.

But ok, this is our baseline for this workload. And doing this 3-minute test at various points over the past few days, I can say it’s nicely regular and consistent; occasionally I got a slower run, but the 50th percentile was usually 90ms–105ms, right around there.

Heroku standard-2x: base speed

From previous mucking about, I learned I can only reliably fit one puma worker in a standard-1x, and heroku says “we typically recommend a minimum of 2 processes, if possible” (for routing algorithmic reasons when scaled to multiple dynos), so I am just starting at a standard-2x with two puma workers each with 5 threads, matching heroku recommendations for a standard-2x dyno.

So one thing I discovered is that benchmarks from a heroku standard dyno are really variable, but here are typical ones:

$ heroku dyno:resize
type     size         qty  cost/mo
───────  ───────────  ───  ───────
web      Standard-2X  1    50

$ heroku config:get --shell WEB_CONCURRENCY RAILS_MAX_THREADS
WEB_CONCURRENCY=2
RAILS_MAX_THREADS=5

$ URLS=./sample_works.txt wrk -c 1 -t 1 -d 3m --timeout 20s --latency -s load_test/multiplepaths.lua.txt https://scihist-digicoll.herokuapp.com/
multiplepaths: Found 400 paths
multiplepaths: Found 400 paths
Running 3m test @ https://scihist-digicoll.herokuapp.com/
  1 threads and 1 connections
  Thread Stats   Avg      Stdev     Max   +/- Stdev
    Latency   645.08ms  768.94ms   4.41s    85.52%
    Req/Sec     5.78      4.36     20.00     72.73%
  Latency Distribution
     50%  271.39ms
     75%  948.00ms
     90%    1.74s
     99%    3.50s
  427 requests in 3.00m, 74.51MB read
Requests/sec:      2.37
Transfer/sec:    423.67KB

I had heard that heroku standard dynos would have variable performance, because they are shared multi-tenant resources. I had been thinking of this as: during a 3-minute test I might see around the same median with more standard deviation — but instead, what it looks like to me is that running this benchmark on Monday at 9am might give very different results than at 9:50am or Tuesday at 2pm. The variability is over a way longer timeframe than my 3-minute test — so that’s something learned.

Running this here and there over the past week, the above results seem to me typical of what I saw. (To get better than “seem typical” on this resource, you’d have to run a test over several days or a week, I think, probably not hammering the server the whole time, to get a sense of the actual statistical distribution of the variability). I sometimes saw tests that were quite a bit slower than this, up to a 500ms median. I rarely if ever saw results much faster than this on a standard-2x.

The 90th percentile is ~6x the median, less than on my current infrastructure, but that still gets up there to 1.74s instead of ~870ms. This typical run is quite a bit slower than our current infrastructure: the median response time is about 3x ours, with the 90th percentile and max being around 2x. This was worse than I expected.

Heroku performance-m: base speed

Although we might be able to fit more puma workers in RAM, we’re running a single-connection base speed test, so it shouldn’t matter, and we won’t adjust it.
$ heroku dyno:resize type size qty cost/mo ─────── ───────────── ─── ─────── web Performance-M 1 250 $ heroku config:get --shell WEB_CONCURRENCY RAILS_MAX_THREADS WEB_CONCURRENCY=2 RAILS_MAX_THREADS=5 $ URLS=./sample_works.txt wrk -c 1 -t 1 -d 3m --timeout 20s --latency -s load_test/multiplepaths.lua.txt https://scihist-digicoll.herokuapp.com/ multiplepaths: Found 400 paths multiplepaths: Found 400 paths Running 3m test @ https://scihist-digicoll.herokuapp.com/ 1 threads and 1 connections Thread Stats Avg Stdev Max +/- Stdev Latency 377.88ms 481.96ms 3.33s 86.57% Req/Sec 10.36 7.78 30.00 37.03% Latency Distribution 50% 117.62ms 75% 528.68ms 90% 1.02s 99% 2.19s 793 requests in 3.00m, 145.70MB read Requests/sec: 4.40 Transfer/sec: 828.70KB This is a lot closer to the ballpark of our current infrastructure. It’s a bit slower (117ms median intead of 90ms median), but in running this now and then over the past week it was remarkably, thankfully, consistent. Median and 99th percentile are both 28% slower (makes me feel comforted that those numbers are the same in these two runs!), that doesn’t bother me so much if it’s predictable and regular, which it appears to be. The max appears to me still a little bit less regular on heroku for some reason, since performance is supposed to be non-shared AWS resources, you wouldn’t expect it to be, but slow requests are slow, ok. 90th percentile is ~9x median, about the same as my current infrastructure. heroku performance-l: base speed $ heroku dyno:resize type size qty cost/mo ─────── ───────────── ─── ─────── web Performance-L 1 500 $ heroku config:get --shell WEB_CONCURRENCY RAILS_MAX_THREADS WEB_CONCURRENCY=2 RAILS_MAX_THREADS=5 URLS=./sample_works.txt wrk -c 1 -t 1 -d 3m --timeout 20s --latency -s load_test/multiplepaths.lua.txt https://scihist-digicoll.herokuapp.com/ multiplepaths: Found 400 paths multiplepaths: Found 400 paths Running 3m test @ https://scihist-digicoll.herokuapp.com/ 1 threads and 1 connections Thread Stats Avg Stdev Max +/- Stdev Latency 471.29ms 658.35ms 5.15s 87.98% Req/Sec 10.18 7.78 30.00 36.20% Latency Distribution 50% 123.08ms 75% 635.00ms 90% 1.30s 99% 2.86s 704 requests in 3.00m, 130.43MB read Requests/sec: 3.91 Transfer/sec: 741.94KB No news is good news, it looks very much like performance-m, which is exactly what we expected, because this isn’t a load test. It tells us that performance-m and performance-l seem to have similar CPU speeds and similar predictable non-variable regularity, which is what I find running this test periodically over a week. 90th percentile is ~10x median, about the same as current infrastructure. The higher Max speed is just evidence of what I mentioned, the speed of slowest request did seem to vary more than on our manual t2.medium, can’t really explain why. Summary: Base speed Not sure how helpful this visualization is, charting 50th, 75th, and 90th percentile responses across architectures. But basically: performance dynos perform similarly to my (bursting) t2.medium. Can’t explain why performance-l seems slightly slower than performance-m, might be just incidental variation when I ran the tests. The standard-2x is about twice as slow as my (bursting) t2.medium. Again recall standard-2x results varied a lot every time I ran them, the one I reported seems “typical” to me, that’s not super scientific, admittedly, but I’m confident that standard-2x are a lot slower in median response times than my current infrastructure. 
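(Incidentally, every data point in this post is just the same wrk invocation with a different -c value; a rough sketch of a little ruby driver for collecting a series of them, not the exact harness behind these numbers, might look like this.)

# Rough sketch: run the same wrk command at increasing connection counts and
# pull the median latency out of each run's output. Connection counts here are
# just an illustrative list.
CONNECTION_COUNTS = [1, 2, 3, 4, 6, 8, 10, 12]

results = CONNECTION_COUNTS.map do |connections|
  output = `URLS=./sample_works.txt wrk -c #{connections} -t 1 -d 3m --timeout 20s --latency -s load_test/multiplepaths.lua.txt https://scihist-digicoll.herokuapp.com/`
  median = output[/^\s*50%\s+(\S+)/, 1]  # wrk prints a line like "    50%  271.39ms"
  [connections, median]
end

results.each { |connections, median| puts "#{connections} connections: median #{median}" }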
Throughput under load Ok, now we’re going to test using wrk to use more connections. In fact, I’ll test each setup with various number of connections, and graph the result, to get a sense of how each formation can handle throughput under load. (This means a lot of minutes to get all these results, at 3 minutes per number of connection test, per formation!). An additional thing we can learn from this test, on heroku we can look at how much RAM is being used after a load test, to get a sense of the app’s RAM usage under traffic to understand the maximum number of puma workers we might be able to fit in a given dyno. Existing t2.medium: Under load A t2.medium has 4G of RAM and 2 CPUs. We run 10 passenger workers (no multi-threading, since we are free, rather than enterprise, passenger). So what do we expect? With 2 CPUs and more than 2 workers, I’d expect it to handle 2 simultaneous streams of requests almost as well as 1; 3-10 should be quite a bit slower because they are competing for the 2 CPUs. Over 10, performance will probably become catastrophic. 2 connections are exactly flat with 1, as expected for our two CPUs, hooray! Then it goes up at a strikingly even line. Going over 10 (to 12) simultaneous connections doesn’t matter, even though we’ve exhausted our workers, I guess at this point there’s so much competition for the two CPUs already. The slope of this curve is really nice too actually. Without load, our median response time is 100ms, but even at a totally overloaded 12 overloaded connections, it’s only 550ms, which actually isn’t too bad. We can make a graph that in addition to median also has 75th, 90th, and 99th percentile response time on it: It doesn’t tell us too much; it tells us the upper percentiles rise at about the same rate as the median. At 1 simultaneous connection 90th percentile of 846ms is about 9 times the median of 93ms; at 10 requests the 90th percentile of 3.6 seconds is about 8 times the median of 471ms. This does remind us that under load when things get slow, this has more of a disastrous effect on already slow requests than fast requests. When not under load, even our 90th percentile was kind of sort of barley acceptable at 846ms, but under load at 3.6 seconds it really isn’t. Single Standard-2X dyno: Under load A standard-2X dyno has 1G of RAM. The (amazing, excellent, thanks schneems) heroku puma guide suggests running two puma workers with 5 threads each. At first I wanted to try running three workers, which seemed to fit into available RAM — but under heavy load-testing I was getting Heroku R14 Memory Quota Exceeded errors, so we’ll just stick with the heroku docs recommendations. Two workers with 5 threads each fit with plenty of headroom. A standard-2x dyno is runs on shared (multi-tenant) underlying Amazon virtual hardware. So while it is running on hardware with 4 CPUs (each of which can run two “hyperthreads“), the puma doc suggests “it is best to assume only one process can execute at a time” on standard dynos. What do we expect? Well, if it really only had one CPU, it would immediately start getting bad at 2 simulataneous connections, and just get worse from there. When we exceed the two worker count, will it get even worse? What about when we exceed the 10 thread (2 workers * 5 threads) count? You’d never run just one dyno if you were expecting this much traffic, you’d always horizontally scale. This very artificial test is just to get a sense of it’s characteristics. 
Also, we remember that standard-2x’s are just really variable; I could get much worse or better runs than this, but I graphed numbers from a run that seemed typical.

Well, it really does act like 1 CPU: 2 simultaneous connections is immediately a lot worse than 1. The line isn’t quite as straight as in our existing t2.medium, but it’s still pretty straight; I’d attribute the slight lumpiness to just the variability of a shared-architecture standard dyno, and figure it would get perfectly straight with more data.

It degrades at about the same rate as our baseline t2.medium, but when you start out slower, that’s more disastrous. Our t2.medium at an overloaded 10 simultaneous requests is 473ms (pretty tolerable actually), 5 times the median at one request only. This standard-2x has a median response time of 273ms at only one simultaneous request, and at an overloaded 10 requests has a median response time also about 5x worse, but that becomes a less tolerable 1480ms.

Does also graphing the 75th, 90th, and 99th percentiles tell us much? Eh, I think the lumpiness is still just standard shared-architecture variability. The rate of “getting worse” as we add more overloaded connections is actually a bit better than it was on our t2.medium, but since it already starts out so much slower, we’ll just call it a wash. (On t2.medium, the 90th percentile without load is 846ms and under an overloaded 10 connections 3.6s. On this single standard-2x, it’s 1.8s and 5.2s). I’m not sure how much these charts with various percentiles on them tell us, so I won’t include them for every architecture from here on.

standard-2x, 4 dynos: Under load

OK, realistically we already know you shouldn’t have just one standard-2x dyno under that kind of load. You’d scale out, either manually or perhaps using something like the neat Rails Autoscale add-on. Let’s measure with 4 dynos. Each is still running 2 puma workers, with 5 threads each.

What do we expect? Hm, treating each dyno as if it has only one CPU, we’d expect it to be able to handle traffic pretty levelly up to 4 simultaneous connections, distributed to 4 dynos. It’s going to do worse after that, but up to 8 there is still one puma worker per connection, so might it get even worse after 8?

Well… I think that actually is relatively flat from 1 to 4 simultaneous connections, except for lumpiness from variability. But lumpiness from variability is huge! We’re talking a 250ms median measured at 1 connection, up to a 369ms measured median at 2, down to 274ms at 3. And then maybe yeah, a fairly shallow slope up to 8 simultaneous connections, then steeper. But it’s all a fairly shallow slope compared to our base t2.medium. At 8 connections (after which we pretty much max out), the standard-2x median of 464ms is only 1.8 times the median at 1 connection. Compared to the t2.medium increase of 3.7 times.

As we’d expect, scaling out to 4 dynos (with four CPUs/8 hyperthreads) helps us scale well — the problem is the baseline is so slow to begin with (with very high bounds of variability making it regularly even slower).

performance-m: Under load

A performance-m has 2.5 GB of memory. It only has one physical CPU, although two “vCPUs” (two hyperthreads) — and these are all yours, it is not shared. By testing under load, I demonstrated I could actually fit 12 workers on there without any memory limit errors. But is there any point to doing that with only 1 physical CPU / 2 hyperthreads? Under a bit of testing, it appeared not. The heroku puma docs recommend only 2 processes with 5 threads.
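(For reference, in all of these runs the process and thread counts are just coming from the usual WEB_CONCURRENCY and RAILS_MAX_THREADS env vars; a minimal sketch of the kind of config/puma.rb in play, not necessarily our exact file, looks like this.)

# config/puma.rb -- minimal sketch, not necessarily the app's exact config
workers Integer(ENV.fetch("WEB_CONCURRENCY", 2))

max_threads = Integer(ENV.fetch("RAILS_MAX_THREADS", 5))
threads max_threads, max_threads

preload_app!

port        ENV.fetch("PORT", 3000)
environment ENV.fetch("RACK_ENV", "development")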
You could do a whole little mini-experiment just trying to measure/optimize process/thread count on performance-m! We’ve already got too much data here, but in some experimentation it looked to me like 5 processes with 2 threads each performed better (and certainly no worse) than 2 processes with 5 threads — if you’ve got the RAM just sitting there anyway (as we do), why not? I actually tested with 6 puma processes with 2 threads each. There is still a large amount of RAM headroom we aren’t going to use even under load.

What do we expect? Well, with the 2 “hyperthreads” perhaps it can handle 2 simultaneous requests nearly as well as 1 (or not?); after that, we expect it to degrade quickly, same as our original t2.medium did.

It can handle 2 connections slightly better than you’d expect if there really was only 1 CPU, so I guess a hyperthread does give you something. Then the slope picks up, as you’d expect; and it looks like it does get steeper after 4 simultaneous connections, yup.

performance-l: Under load

A performance-l ($500/month) costs twice as much as a performance-m ($250/month), but has far more than twice the resources. A performance-l has a whopping 14GB of RAM compared to performance-m’s 2.5GB; and a performance-l has 4 real CPUs/8 hyperthreads available to use (visible using the nproc technique in the heroku puma article).

Because we have plenty of RAM to do so, we’re going to run 10 worker processes to match our original t2.medium’s. We still ran with 2 threads, just because it seems like maybe you should never run a puma worker with only one thread? But who knows, maybe 10 workers with 1 thread each would perform better; plenty of room (but not plenty of my energy) for yet more experimentation.

What do we expect? The graph should be pretty flat up to 4 simultaneous connections, then it should start getting worse, pretty evenly, as simultaneous connections rise all the way up to 12.

It is indeed pretty flat up to 4 simultaneous connections. Then up to 8 it’s still not too bad — the median at 8 is only ~1.5x the median at 1 (!). Then it gets worse after 8 (oh yeah, 8 hyperthreads?). But the slope is wonderfully shallow all the way. Even at 12 simultaneous connections, the median response time of 266ms is only 2.5x what it was at one connection. (In our original t2.medium, at 12 simultaneous connections the median response time was over 5x what it was at 1 connection). This thing is indeed a monster.

Summary Comparison: Under load

We showed a lot of graphs that look similar, but they all had different scales on the y-axis. Let’s plot the median response times under load of all architectures on the same graph, and see what we’re really dealing with.

The blue t2.medium is our baseline, what we have now. We can see that there isn’t really a similar heroku option; we have our choice of better or worse.

The performance-l is just plain better than what we have now. It starts out performing about the same as what we have now for 1 or 2 simultaneous connections, but then scales so much flatter.

The performance-m also starts out about the same, but scales so much worse than even what we have now. (It’s that 1 real CPU instead of 2, I guess?).

The standard-2x scaled to 4 dynos… has its own characteristics. Its baseline is pretty terrible; it’s 2 to 3 times as slow as what we have now even not under load. But then it scales pretty well, since it’s 4 dynos after all; it doesn’t get worse as fast as performance-m does.
But it started out so bad that it remains far worse than our original t2.medium even under load. Adding more dynos to standard-2x will help it remain steady under even higher load, but won’t help its underlying problem, which is that it’s just slower than everyone else.

Discussion: Thoughts and Surprises

I had been thinking of a t2.medium (even with burst) as “typical” (it is after all much slower than my 2015 Macbook), and had been assuming (in retrospect with no particular basis) that a heroku standard dyno would perform similarly. Most discussion and heroku docs, as well as the naming itself, suggest that a ‘standard’ dyno is, well, standard, and performance dynos are for “super scale, high traffic apps”, which is not me.

But in fact, heroku standard dynos are much slower and more variable in performance than a bursting t2.medium. I suspect they are slower than other options you might consider “typical” non-heroku options.

My conclusion is honestly that “standard” dynos are really “for very fast, well-optimized apps that can handle slow and variable CPU” and “performance” dynos are really “standard, matching the CPU speeds you’d get from a typical non-heroku option”. But this is not how they are documented or usually talked about.

Are other people having really different experiences/conclusions than me? If so, why, or where have I gone wrong? This of course has implications for estimating your heroku budget if considering switching over. :(

If you have a well-optimized, fast app, say even the 95th percentile is 200ms (on a bursting t2.medium), then you can handle standard slowness — so what if your 95th percentile is now 600ms (and during some time periods even much slower, 1s or worse, due to variability)? That’s not so bad for a 95th percentile.

One way to get a very fast app is of course caching. There is lots of discussion of using caching in Rails; sometimes the message (explicit or implicit) is “you have to use lots of caching to get reasonable performance because Rails is so slow.” What if many of these people are on heroku, and it’s really “you have to use lots of caching to get reasonable performance on a heroku standard dyno”?? I personally don’t think caching is maintenance-free; in my experience, properly doing cache invalidation and dealing with the significant processing spikes needed when you choose to invalidate your entire cache (because cached HTML needs to change) lead to real maintenance/development cost. I have not needed caching to meet my performance goals on our present architecture.

Everyone doesn’t necessarily have the same performance goals/requirements. Mine, for a low-traffic non-commercial site, are maybe more modest; I just need users not to be super annoyed. But whatever your performance goals, you’re going to have to spend more time on optimization on a heroku standard dyno than on something with a much faster CPU — like a standard affordable mid-tier EC2. Am I wrong?

One significant factor in heroku standard dyno performance is that they use shared/multi-tenant infrastructure. I wonder if they’ve actually gotten lower performance over time, as many customers (who you may be sharing with) have gotten better at maximizing their utilization, so the shared CPUs are typically more busy? Like a frog boiling, maybe nobody noticed that standard dynos have become lower performance? I dunno, brainstorming. Or maybe there are so many apps that start on heroku instead of switching from somewhere else, that people just don’t realize that standard dynos are much slower than other low/mid-tier options?
I was expecting to pay a premium for heroku — but even standard-2x’s are a significant premium over paying for t2.medium EC2 yourself, one I found quite reasonable…. performance dynos are of course even more premium. I had a sort of baked-in premise that most Rails apps are “IO-bound”, they spend more time waiting on IO than using CPU. I don’t know where I got that idea, I heard it once a long time ago and it became part of my mental model. I now do not believe this is true true of my app, and I do not in fact believe it is true of most Rails apps in 2020. I would hypothesize that most Rails apps today are in fact CPU-bound. The performance-m dyno only has one CPU. I had somehow also been assuming that it would have two CPUs — I’m not sure why, maybe just because at that price! It would be a much better deal with two CPUs. Instead we have a huge jump from $250 performance-m to $500 performance-l that has 4x the CPUs and ~5x the RAM. So it doesn’t make financial sense to have more than one performance-m dyno, you might as well go to performance-l. But this really complicates auto-scaling, whether using Heroku’s feature , or the awesome Rails Autoscale add-on. I am not sure I can afford a performance-l all the time, and a performance-m might be sufficient most of the time. But if 20% of the time I’m going to need more (or even 5%, or even unexpectedly-mentioned-in-national-media), it would be nice to set things up to autoscale up…. I guess to financially irrational 2 or more performance-m’s? :( The performance-l is a very big machine, that is significantly beefier than my current infrastructure. And has far more RAM than I need/can use with only 4 physical cores. If I consider standard dynos to be pretty effectively low tier (as I do), heroku to me is kind of missing mid-tier options. A 2 CPU option at 2.5G or 5G of RAM would make a lot of sense to me, and actually be exactly what I need… really I think performance-m would make more sense with 2 CPUs at it’s existing already-premium price point, and to be called a “performance” dyno. . Maybe heroku is intentionally trying set options to funnel people to the highest-priced performance-l. Conclusion: What are we going to do? In my investigations of heroku, my opinion of the developer UX and general service quality only increases. It’s a great product, that would increase our operational capacity and reliability, and substitute for so many person-hours of sysadmin/operational time if we were self-managing (even on cloud architecture like EC2). But I had originally been figuring we’d use standard dynos (even more affordably, possibly auto-scaled with Rails Autoscale plugin), and am disappointed that they end up looking so much lower performance than our current infrastructure. Could we use them anyway? Response time going from 100ms to 300ms — hey, 300ms is still fine, even if I’m sad to lose those really nice numbers I got from a bit of optimization. But this app has a wide long-tail ; our 75th percentile going from 450ms to 1s, our 90th percentile going from 860ms to 1.74s and our 99th going from 2.3s to 4.4s — a lot harder to swallow. Especially when we know that due to standard dyno variability, a slow-ish page that on my present architecture is reliably 1.5s, could really be anywhere from 3 to 9(!) on heroku. 
I would anticipate having to spend a lot more developer time on optimization on heroku standard dynos — or, i this small over-burdened non-commercial shop, not prioritizing that (or not having the skills for it), and having our performance just get bad. So I’m really reluctant to suggest moving our app to heroku with standard dynos. A performance-l dyno is going to let us not have to think about performance any more than we do now, while scaling under high-traffic better than we do now — I suspect we’d never need to scale to more than one performance-l dyno. But it’s pricey for us. A performance-m dyno has a base-speed that’s fine, but scales very poorly and unaffordably. Doesn’t handle an increase in load very well as one dyno, and to get more CPUs you have to pay far too much (especially compared to standard dynos I had been assuming I’d use). So I don’t really like any of my options. If we do heroku, maybe we’ll try a performance-m, and “hope” our traffic is light enough that a single one will do? Maybe with Rails autoscale for traffic spikes, even though 2 performance-m dynos isn’t financially efficient? If we are scaling to 2 (or more!) performance-m’s more than very occasionally, switch to performance-l, which means we need to make sure we have the budget for it? jrochkind General Leave a comment November 19, 2020November 19, 2020 Deep Dive: Moving ruby projects from Travis to Github Actions for CI So this is one of my super wordy posts, if that’s not your thing abort now, but some people like them. We’ll start with a bit of context, then get to some detailed looks at Github Actions features I used to replace my travis builds, with example config files and examination of options available. For me, by “Continuous Integration” (CI), I mostly mean “Running automated tests automatically, on your code repo, as you develop”, on every PR and sometimes with scheduled runs. Other people may mean more expansive things by “CI”. For a lot of us, our first experience with CI was when Travis-ci started to become well-known, maybe 8 years ago or so. Travis was free for open source, and so darn easy to set up and use — especially for Rails projects, it was a time when it still felt like most services focused on docs and smooth fit for ruby and Rails specifically. I had heard of doing CI, but as a developer in a very small and non-profit shop, I want to spend time writing code not setting up infrastructure, and would have had to get any for-cost service approved up the chain from our limited budget. But it felt like I could almost just flip a switch and have Travis on ruby or rails projects working — and for free! Free for open source wasn’t entirely selfless, I think it’s part of what helped Travis literally define the market. (Btw, I think they were the first to invent the idea of a “badge” URL for a github readme?) Along with an amazing Developer UX (which is today still a paragon), it just gave you no reason not to use it. And then once using it, it started to seem insane to not have CI testing, nobody would ever again want to develop software without the build status on every PR before merge. Travis really set a high bar for ease of use in a developer tool, you didn’t need to think about it much, it just did what you needed, and told you what you needed to know in it’s read-outs. I think it’s an impressive engineering product. But then. End of an era Travis will no longer be supporting open source projects with free CI. 
The free open source travis projects originally ran on travis-ci.org, with paid commercial projects on travis-ci.com. In May 2018, they announced they’d be unifying these on travis-ci.com only, but with no announced plan that the policy for free open source would change. This migration seemed to proceed very slowly though. Perhaps because it was part of preparing the company for a sale, in Jan 2019 it was announced private equity firm Idera had bought travis. At the time the announcement said “We will continue to maintain a free, hosted service for open source projects,” but knowing what “private equity” usually means, some were concerned for the future. (HN discussion). While the FAQ on the migration to travis-ci.com still says that travis-ci.org should remain reliable until projects are fully migrated, in fact over the past few months travis-ci.org projects largely stopped building, as travis apparently significantly reduced resources on the platform. Some people began manually migrating their free open source projects to travis-ci.com where builds still worked. But, while the FAQ also still says “Will Travis CI be getting rid of free users? Travis CI will continue to offer a free tier for public or open-source repositories on travis-ci.com” — in fact, travis announced that they are ending the free service for open source. The “free tier” is a limited trial (available not just to open source), and when it expires, you can pay, or apply to a special program for an extension, over and over again. They are contradicting themselves enough that while I’m not sure exactly what is going to happen, but no longer trust them as a service. Enter Github Actions I work mostly on ruby and Rails projects. They are all open source, almost all of them use travis. So while (once moved to travis-ci.com) they are all currently working, it’s time to start moving them somewhere else, before I have dozens of projects with broken CI and still don’t know how to move them. And the new needs to be free — many of these projects are zero-budget old-school “volunteer” or “informal multi-institutional collaboration” open source. There might be several other options, but the one I chose is Github Actions — my sense that it had gotten mature enough to start approaching travis level of polish, and all of my projects are github-hosted, and Github Actions is free for unlimited use for open source. (pricing page; Aug 2019 announcement of free for open source). And we are really fortunate that it became mature and stable in time for travis to withdraw open source support (if travis had been a year earlier, we’d be in trouble). Github Actions is really powerful. It is built to do probably WAY MORE than travis does, definitely way beyond “automated testing” to various flows for deployment and artifact release, to really just about any kind of process for managing your project you want. The logic you can write almost unlimited, all running on github’s machines. As a result though…. I found it a bit overwhelming to get started. The Github Actions docs are just overwhelmingly abstract, there is so much there, you can almost anything — but I don’t actually want to learn a new platform, I just want to get automated test CI for my ruby project working! There are some language/project speccific Guides available, for node.js, python, a few different Java setups — but not for ruby or Rails! My how Rails has fallen, from when most services like this would be focusing on Rails use cases first. 
There are some third-party guides available that might focus on ruby/rails, but one of the problems is that Actions has been evolving for a few years with some pivots, so it’s easy to find outdated instructions. One orientation I found helpful was this Drifting Ruby screencast. This screencast showed me there is a kind of limited web UI with integrated docs searcher — but I didn’t end up using it, I just created the text config file by hand, same as I would have for travis. Github provides templates for “ruby” or “ruby gem”, but the Drifting Ruby screencast said “these won’t really work for our ruby on rails application so we’ll have to set up one manually”, so that’s what I did too. ¯\_(ツ)_/¯ But the cost of all the power github Actions provides is… there are a lot more switches and dials to understand and get right (and maintain over time and across multiple projects). I’m not someone who likes copy-paste without understanding it, so I spent some time trying to understand the relevant options and alternatives; in the process I found some things I might have otherwise copy-pasted from other people’s examples that could be improved. So I give you the results of my investigations, to hopefully save you some time, if wordy comprehensive reports are up your alley. A Simple Test Workflow: ruby gem, test with multiple ruby versions Here’s a file for a fairly simple test workflow. You can see it’s in the repo at .github/workflows. The name of the file doesn’t matter — while this one is called ruby.yml, I’ve since moved over to naming the file to match the name: key in the workflow for easier traceability, so I would have called it ci.yml instead. Triggers You can see we say that this workflow should be run on any push to the master branch, and also for any pull_request at all. Many other examples I’ve seen define pull_request: branches: ["main"], which seems to mean only run on Pull Requests with main as the base. While that’s most of my PRs, if there is ever a PR that uses another branch as a base for whatever reason, I still want to run CI! While hypothetically you should be able to leave branches out to mean “any branch”, I only got it to work by explicitly saying branches: ["**"] Matrix For this gem, we want to run CI on multiple ruby versions. You can see we define them here. This works similarly to travis matrixes. If you have more than one matrix variable defined, the workflow will run for every combination of variables (hence the name “matrix”).

matrix:
  ruby: [ '2.4.4', '2.5.1', '2.6.1', '2.7.0', 'jruby-9.1.17.0', 'jruby-9.2.9.0' ]

In a given run, the current value of the matrix variables is available in the github actions “context”, which you can access as eg ${{ matrix.ruby }}. You can see how I use that in the name, so that the job will show up with its ruby version in it: name: Ruby ${{ matrix.ruby }} Ruby install While Github itself provides an action for ruby install, it seems most people are using this third-party action. Which we reference as `ruby/setup-ruby@v1`. You can see we use the matrix.ruby context to tell the setup-ruby action what version of ruby to install, which works because our matrix values are the correct values recognized by the action. Which are documented in the README, but note that values like jruby-head are also supported. Note, although it isn’t clearly documented, you can say 2.4 to mean “latest available 2.4.x” (rather than it meaning “2.4.0”), which is hugely useful, and I’ve switched to doing that.
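To pull the pieces described so far together, here is a minimal sketch of what a whole workflow file along these lines can look like — the ruby versions, job id, and test command (bundle exec rspec) are illustrative placeholders rather than the exact contents of any of my projects:

name: CI

on:
  push:
    branches: [ master ]
  pull_request:
    branches: [ "**" ]

jobs:
  tests:
    runs-on: ubuntu-latest
    strategy:
      matrix:
        ruby: [ '2.5', '2.6', '2.7', 'jruby-9.2.9.0' ]
    # Include the ruby version in the job name so it shows up legibly in the GH UI
    name: Ruby ${{ matrix.ruby }}
    steps:
      # Check out the repo so the test suite is available to run
      - uses: actions/checkout@v2
      # Third-party action that installs the requested ruby (and a recent bundler)
      - uses: ruby/setup-ruby@v1
        with:
          ruby-version: ${{ matrix.ruby }}
      - name: Run tests
        run: |
          bundle install --jobs 4 --retry 3
          bundle exec rspec

Note that the bare '2.5'-style entries in that illustrative matrix lean on the “latest available patch release” behavior just described.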
I don’t believe that was available via travis’s rvm-based ruby install feature. For a project that isn’t testing under multiple rubies, if we leave out the with: ruby-version, the action will conveniently use a .ruby-version file present in the repo. Note you don’t need to put a gem install bundler into your workflow yourself; while I’m not sure it’s clearly documented, I found the ruby/setup-ruby action would do this for you (installing the latest available bundler, instead of using whatever was packaged with the ruby version), btw regardless of whether you are using the bundler-cache feature (see below). Note on How Matrix Jobs Show Up to Github With travis, testing for multiple ruby or rails versions with a matrix, we got one (or, well, actually two) jobs showing up on the Github PR: Each of those lines summarizes a collection of matrix jobs (eg different ruby versions). If any of the individual jobs within the matrix failed, the whole build would show up as failed. Success or failure, you could click on “Details” to see each job and its status: I thought this worked pretty well — especially for “green” builds I really don’t need to see the details on the PR, the summary is great, and if I want to see the details I can click through, great. With Github Actions, each matrix job shows up directly on the PR. If you have a large matrix, it can be… a lot. Some of my projects have way more than 6. On PR: Maybe it’s just because I was used to it, but I preferred the Travis way. (This also makes me think maybe I should change the name key in my workflow to say eg CI: Ruby 2.4.4 to be more clear? Oops, tried that, it just looks even weirder in other GH contexts, not sure.) Oh, also, that travis way of doing the build twice, once for “pr” and once for “push”? Github Actions doesn’t seem to do that, it just does one, I think corresponding to travis “push”. While the travis feature seemed technically smart, I’m not sure I ever actually saw one of these builds pass while the other failed in any of my projects, so I probably won’t miss it. Badge Did you have a README badge for travis? Don’t forget to swap it for the equivalent in Github Actions. The image url looks like: https://github.com/$OWNER/$REPOSITORY/workflows/$WORKFLOW_NAME/badge.svg?branch=master, where $WORKFLOW_NAME of course has to be URL-escaped if it contains spaces etc. The github page at https://github.com/owner/repo/actions, if you select a particular workflow/branch, does, like travis, give you a badge URL/markdown you can copy/paste if you click on the three-dots and then “Create status badge”. Unlike travis, what it gives you to copy/paste is just image markdown, it doesn’t include a link. But I definitely want the badge to link to viewing the results of the last build in the UI. So I do it manually. Limit to the specific workflow and branch that you made the badge for in the UI, then just copy and paste the URL from the browser. It’s a bit of confusing markdown to construct manually; here’s what it ended up looking like for me:

[![CI Status](https://github.com/jrochkind/attr_json/workflows/CI/badge.svg?branch=master)](https://github.com/jrochkind/attr_json/actions?query=workflow%3ACI+branch%3Amaster)

I copy and paste that from an existing project when I need it in a new one. :shrug: Require CI to merge PR? However, that difference in how jobs show up to Github, the way each matrix job shows up separately now, has an even more negative impact on requiring CI success to merge a PR.
If you want to require that CI passes before merging a PR, you configure that at https://github.com/acct/project/settings/branches under “Branch protection rules”. When you click “Add Rule”, you can/must choose WHICH jobs are “required”. For travis, that’d be those two “master” jobs, but for the new system, every matrix job shows up separately — in fact, if you’ve been messing with job names trying to get it right as I have, you are offered any job name that was ever used in the last 7 days, and they don’t have the Github workflow name appended to them or anything (another reason to put the github workflow name in the job name?). But the really problematic part is that if you edit your list of jobs in the matrix — adding or removing ruby versions as one does, or even just changing the name that shows up for a job — you have to go back to this screen to add or remove jobs as a “required status check”. That seems really unworkable to me, I’m not sure how it hasn’t been a major problem already for users. It would be better if we could configure “all the checks in the WORKFLOW, whatever they may be”, or perhaps best of all if we could configure a check as required in the workflow YML file, the same place we’re defining it, just a required_before_merge key you could set to true or use a matrix context to define or whatever. I’m currently not requiring status checks for merge on most of my projects (even though I did with travis), because I was finding it unmanageable to keep the job names sync’d, especially as I get used to Github Actions and kept tweaking things in a way that would change job names. So that’s a bit annoying. fail-fast: false By default, if one of the matrix jobs fails, Github Actions will cancel all remaining jobs, not bother to run them at all. After all, you know the build is going to fail if one job fails, what do you need those others for? Well, for my use case, it is pretty annoying to be told, say, “Job for ruby 2.7.0 failed, we can’t tell you whether the other ruby versions would have passed or failed or not” — the first thing I want to know is if it failed on all ruby versions or just 2.7.0, so now I’d have to spend extra time figuring that out manually? No thanks. So I set `fail-fast: false` on all of my workflows, to disable this behavior. Note that travis had a similar (opt-in) fast_finish feature, which worked subtly differently: Travis would report failure to Github on first failure (and notify, I think), but would actually keep running all jobs. So when I saw a failure, I could click through to ‘details’ to see which (eg) ruby versions passed, from the whole matrix. This did work for me, so I chose to opt in to that travis feature. Unfortunately, the Github Actions subtle difference in effect makes it not desirable to me. Note You may see some people referencing a Github Actions continue-on-error feature. I found the docs confusing, but after experimentation what this really does is mark a job as successful even when it fails. It shows up in all GH UI as succeeded even when it failed, the only way to know it failed would be to click through to the actual build log to see failure in the logged console. I think “continue on error” is a weird name for this; it is not useful to me with regard to fine-tuning fail-fast, or honestly in any other use case I can think of that I have.
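For reference, here is where the fail-fast flag itself sits in a workflow file — a minimal sketch, reusing an illustrative matrix rather than one of my real ones:

jobs:
  tests:
    strategy:
      # Keep running the other matrix jobs even if one of them fails
      fail-fast: false
      matrix:
        ruby: [ '2.5', '2.6', '2.7' ]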
Bundle cache? bundle install can take 60+ seconds, and be a significant drag on your build (not to mention a lot of load on rubygems servers from all these builds). So when travis introduced a feature to cache: bundler: true, it was very popular. True to form, Github Actions gives you a generic caching feature you can try to configure for your particular case (npm, bundler, whatever), instead of an out-of-the-box “just do the right thing for bundler” feature — you figure it out. The ruby/setup-ruby third-party action has a built-in feature to cache bundler installs for you, but I found that it does not work right if you do not have a Gemfile.lock checked into the repo. (Ie, for most any gem, rather than app, project). It will end up re-using cached dependencies even if there are new releases of some of your dependencies, which is a big problem for how I use CI for a gem — I expect it to always be building with the latest releases of dependencies, so I can find out if one breaks the build. This may get fixed in the action. If you have an app (rather than gem) with a Gemfile.lock checked into the repo, the bundler-cache: true feature should be just fine. Otherwise, Github has some suggestions for using the generic cache feature for ruby bundler (search for “ruby – bundler” on this page) — but I actually don’t believe they will work right without a Gemfile.lock checked into the repo either. Starting from that example, and using the restore-keys feature, I think it should be possible to design a use that works much like travis’s bundler cache did, and works fine without a checked-in Gemfile.lock. We’d want it to use a cache from the most recent previous (similar) job, then run bundle install anyway, and then cache the results again at the end, always, to be available for the next run. But I haven’t had time to work that out, so for now my gem builds are simply not using bundler caching. (My gem builds tend to take around 60 seconds to do a bundle install, so that’s in every build now, could be worse). update nov 27: The ruby/setup-ruby action should be fixed to properly cache-bust when you don’t have a Gemfile.lock checked in. If you are using a matrix of gemfiles, as below, you must tell it which gemfile to use by setting the BUNDLE_GEMFILE env variable rather than the way we did it below, and there is a certain way Github Actions requires/provides that you do that, it’s not just export. See the issue in the ruby/setup-ruby project.
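For the curious, here is roughly the shape such a restore-keys scheme could take — an untested sketch on my part, with made-up cache key names, not something I actually have running: restore the most recent cache for the same OS and ruby, run bundle install anyway, and always save a fresh cache at the end by using a key that is unique per run.

steps:
  - uses: actions/checkout@v2
  - uses: ruby/setup-ruby@v1
    with:
      ruby-version: ${{ matrix.ruby }}
  - name: Cache gems
    uses: actions/cache@v2
    with:
      path: vendor/bundle
      # Unique per run, so a fresh cache is saved at the end of every build...
      key: bundle-${{ runner.os }}-${{ matrix.ruby }}-${{ github.run_id }}
      # ...while the most recent previous cache for this OS/ruby is restored
      restore-keys: |
        bundle-${{ runner.os }}-${{ matrix.ruby }}-
  - name: Bundle install
    run: |
      # Install into the cached path rather than system gems
      bundle config set path vendor/bundle
      bundle install --jobs 4 --retry 3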
Notifications: Not great Travis has really nice defaults for notifications: The person submitting the PR would get an email generally only on status changes (from pass to fail or fail to pass) rather than on every build. And travis would even figure out what email to send to based on what email you used in your git commits. (Originally perhaps a workaround to lack of Github API at travis’ origin, I found it a nice feature). And then travis has sophisticated notification customization available on a per-repo basis. Github notifications are unfortunately much more basic and limited. The only notification settings available are for your entire account at https://github.com/settings/notifications, “GitHub Actions”. So they apply to all github workflows in all projects, there are no workflow- or project-specific settings. You can set to receive notification via web push or email or both or neither. You can receive notifications for all builds or only failed builds. That’s it. The author of a PR is the one who receives the notifications, same as in travis. You will get notifications for every single build, even repeated successes or failures in a series. I’m not super happy with the notification options. I may end up just turning off Github Actions notifications entirely for my account. Hypothetically, someone could probably write a custom Github action to give you notifications exactly how travis offered — after all, travis was using a public GH API that should be available to any other author, and I think should be usable from within an action. But when I started to think through it, while it seemed an interesting project, I realized it was definitely beyond the “spare hobby time” I was inclined to give to it at present, especially not being much of a JS developer (the language of custom GH actions, generally). (While you can list third-party actions on the github “marketplace”, I don’t think there’s a way to charge for them). There are custom third-party actions available to do things like notify slack for build completion; I haven’t looked too much into any of them, beyond seeing that I didn’t see any that would be “like travis defaults”. A more complicated gem: postgres, and Rails matrix Let’s move to a different example workflow file, in a different gem. You can see I called this one ci.yml, matching its name: CI, to have less friction for a developer (including future me) trying to figure out what’s going on. This gem does have rails as a dependency and does test against it, but isn’t actually a Rails engine as it happens. It also needs to test against Postgres, not just sqlite3. Scheduled Builds At one point travis introduced a feature for scheduling (eg) weekly builds even when no PR/commit had been made. I enthusiastically adopted this for my gem projects. Why? Gem releases are meant to work on a variety of different ruby versions and different exact versions of dependencies (including Rails). Sometimes a new release of ruby or rails will break the build, and you want to know about that and fix it. With CI builds happening only on new code, you find out about this with some random new code that is unlikely to be related to the failure; and you only find out about it on the next “new code” that triggers a build after a dependency release, which on some mature and stable gems could be a long time after the actual dependency release that broke it. So scheduled builds for gems! (I have no purpose for scheduled test runs on apps). Github Actions does have this feature. Hooray. One problem is that you will receive no notification of the result of the scheduled build, success or failure. :( I suppose you could include a third-party action to notify a fixed email address or Slack or something else; not sure how you’d configure that to apply only to the scheduled builds and not the commit/PR-triggered builds if that’s what you wanted. (Or make a custom action to file a GH issue on failure??? But make sure it doesn’t spam you with issues on repeated failures). I haven’t had the time to investigate this yet. Also oops just noticed this: “In a public repository, scheduled workflows are automatically disabled when no repository activity has occurred in 60 days.” Which poses some challenges for relying on scheduled builds to make sure a stable slow-moving gem isn’t broken by dependency updates. I am definitely a committer on gems that are still in wide use and can go 6-12+ months without a commit, because they are mature/done. I still have it configured in my workflow; I guess even without notifications it will affect the “badge” on the README, and… maybe I’ll notice? Very far from ideal, work in progress. :(
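For completeness, the scheduled trigger itself is just one more entry under on: in the workflow file — a minimal sketch, with an arbitrary example cron expression (times are UTC):

on:
  push:
    branches: [ master ]
  pull_request:
  schedule:
    # Also run every Monday at 08:00 UTC, even with no new commits
    - cron: '0 8 * * 1'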
Rails Matrix OK, this one needs to test against various ruby versions AND various Rails versions. A while ago I realized that an actual matrix of every ruby combined with every rails was far too many builds. Fortunately, Github Actions supports the same kind of matrix/include syntax as travis, which I use.

matrix:
  include:
    - gemfile: rails_5_0
      ruby: 2.4
    - gemfile: rails_6_0
      ruby: 2.7

I use the appraisal gem to handle setting up testing under multiple rails versions, which I highly recommend. You could use it for testing variant versions of any dependencies, I use it mostly for varying Rails. Appraisal results in a separate Gemfile committed to your repo for each (in my case) rails version, eg ./gemfiles/rails_5_0.gemfile. So those values I use for my gemfile matrix key are actually portions of the Gemfile path I’m going to want to use for each job. Then we just need to tell bundler, in a given matrix job, to use the gemfile we specified in the matrix. The old-school way to do this is with the BUNDLE_GEMFILE environmental variable, but I found it error-prone to make sure it stayed consistently set in each workflow step. I found that the newer (although not that new!) bundle config set gemfile worked swimmingly! I just set it before the bundle install, it stays set for the rest of the run including the actual test run.

steps:
  # [...]
  - name: Bundle install
    run: |
      bundle config set gemfile "${GITHUB_WORKSPACE}/gemfiles/${{ matrix.gemfile }}.gemfile"
      bundle install --jobs 4 --retry 3

Note that single braces are used for ordinary bash syntax to reference the ENV variable ${GITHUB_WORKSPACE}, but double braces for the github actions context value interpolation ${{ matrix.gemfile }}. Works great! Oh, note how we set the name of the job to include both ruby and rails matrix values, important for it showing up legibly in the Github UI: name: ${{ matrix.gemfile }}, ruby ${{ matrix.ruby }}. Because of how we constructed our gemfile matrix, that shows up with job names like rails_5_0, ruby 2.7. Still not using bundler caching in this workflow. As before, we’re concerned about the ruby/setup-ruby built-in bundler-cache feature not working as desired without a Gemfile.lock in the repo. This time, I’m also not sure how to get that feature to play nicely with the variant gemfiles and bundle config set gemfile. Github Actions makes you put a lot more pieces together yourself compared to travis, there are still things I just postponed figuring out for now. update jan 11: the ruby/setup-ruby action now includes a gemfile matrix example in its README: https://github.com/ruby/setup-ruby#matrix-of-gemfiles It does require you to use the BUNDLE_GEMFILE env variable, rather than the bundle config set gemfile command I used here. This should ordinarily be fine, but is something to watch out for in case other instructions you are following try to use bundle config set gemfile instead, for good reasons or not. Postgres This project needs to build against a real postgres. That is relatively easy to set up in Github Actions. Postgres normally by default allows connections on localhost without a username/password set, and my past builds (in travis or locally) took advantage of this to not bother setting one, which the app then didn’t have to know about. But the postgres image used for Github Actions doesn’t allow this, you have to set a username/password.
So the section of the workflow that sets up postgres looks like:

jobs:
  tests:
    services:
      db:
        image: postgres:9.4
        env:
          POSTGRES_USER: postgres
          POSTGRES_PASSWORD: postgres
        ports: ['5432:5432']

5432 is the default postgres port, we need to set it and map it so it will be available as expected. Note you also can specify whatever version of postgres you want, this one is intentionally testing on one a bit old. OK, now our Rails app that will be executed under rspec needs to know that username and password to use in its postgres connection, where before it connected without a username/password. That env under the postgres service image is not actually available to the job steps. I didn’t find any way to DRY the username/password in one place, I had to repeat it in another env block, which I put at the top level of the workflow so it would apply to all steps. And then I had to alter my database.yml to use those ENV variables, in the test environment. On a local dev machine, if your postgres doesn’t have a username/password requirement and you don’t set the ENV variables, it keeps working as before. I also needed to add host: localhost to the database.yml; before, the absence of the host key meant it used a unix-domain socket (filesystem-located) to connect to postgres, but that won’t work in the Github Actions containerized environment. Note, there are things you might see in other examples that I don’t believe you need: No need for an apt-get of pg dev libraries. I think everything you need is on the default GH Actions images now. Some examples I’ve seen do a thing with options: --health-cmd pg_isready, my builds seem to be working just fine without it, and less code is less code to maintain. allow_failures In travis, I took advantage of the travis allow_failures key in most of my gems. Why? I am testing against various ruby and Rails versions; I want to test against *future* (pre-release, edge) ruby and rails versions, because it’s useful to know if I’m already, with no effort, passing on them, and I’d like to keep passing on them — but I don’t want to mandate it, or prevent PR merges if the build fails on a pre-release dependency. (After all, it could very well be a bug in the dependency too!) There is no great equivalent to allow_failures in Github Actions. (Note again, continue-on-error just makes failed jobs look identical to successful jobs, and isn’t very helpful here). I investigated some alternatives, which I may go into more detail on in a future post, but on one project I am trying a separate workflow just for “future ruby/rails allowed failures” which only checks master commits (not PRs), and has a separate badge on the README (which is actually pretty nice for advertising to potential users “Yeah, we ALREADY work on rails edge/6.1.rc1!”). The main downside there is having to copy/paste-synchronize what’s really the same workflow in two files. A Rails app I’m a committer on many more projects that are gems, but I spend more of my time on apps, one app in particular. So here’s an example Github Actions CI workflow for a Rails app. It mostly remixes the features we’ve already seen. It doesn’t need any matrix. It does need a postgres. It does need some “OS-level” dependencies — the app does some shell-out to media utilities like vips and ffmpeg, and there are integration tests that utilize this. Easy enough to just install those with apt-get, works swimmingly.
- name: Install apt dependencies
  run: |
    sudo apt-get -y install libvips-tools ffmpeg mediainfo

Update 25 Nov: My apt-get that worked for a couple weeks started failing for some reason on trying to install a libpulse0 dependency of one of those packages; the solution was doing a sudo apt-get update before the sudo apt-get install. I guess this is always good practice? (That forum post also uses apt install and apt update instead of apt-get install and apt-get update, which I can’t tell you much about, I’m really not a linux admin). In addition to the bundle install, a modern Rails app using webpacker needs yarn install. This just worked for me — no need to include lines for installing npm itself or yarn or any yarn dependencies, although some examples I find online have them. (My yarn installs seem to happen in ~20 seconds, so I’m not motivated to try to figure out caching for yarn). And we need to create the test database in the postgres, which I do with RAILS_ENV=test bundle exec rails db:create — typical Rails test setup will then automatically run migrations if needed. There might be other (better?) ways to prep the database, but I was having trouble getting rake db:prepare to work, and didn’t spend the time to debug it, just went with something that worked.

- name: Set up app
  run: |
    RAILS_ENV=test bundle exec rails db:create
    yarn install

Rails test setup usually ends up running migrations automatically, which is why I think this worked alone, but you could also throw in a RAILS_ENV=test bundle exec rake db:schema:load if you wanted. Under travis I had to install chrome with addons: chrome: stable to have it available to use with capybara via the webdrivers gem. No need for installing chrome in Github Actions, some (recent-ish?) version of it is already there as part of the standard Github Actions build image. In this workflow, you can also see a custom use of the github “cache” action to cache a Solr install that the test setup automatically downloads and sets up. In this case the cache doesn’t actually save us any build time, but is kinder on the apache foundation servers we are downloading from with every build otherwise (and have gotten throttled from in the past). Conclusion Github Actions is a really impressively powerful product. And it’s totally going to work to replace travis for me. It’s also probably going to take more of my time to maintain. The trade-off of more power/flexibility and focusing on almost limitless use cases is more things the individual project has to get right for its use case. For instance figuring out the right configuration to get caching for bundler or yarn right, instead of just writing cache: { yarn: true, bundler: true }. And when you have to figure it out yourself, you can get it wrong, which when you are working on many projects at once means you have a bunch of places to fix. The amazingness of the third-party action “marketplace” means you have to figure out the right action to use (the third-party ruby/setup-ruby instead of the vendor’s actions/setup-ruby), and again if you change your mind about that you have a bunch of projects to update. Anyway, it is what it is — and I’m grateful to have such a powerful and in fact relatively easy to use service available for free! I could not really live without CI anymore, and won’t have to! Oh, and Github Actions is giving me way more (free) simultaneous parallel workers than travis ever did, for my many-job builds!
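P.S. For a concrete picture of that generic cache usage, here is roughly the shape such a step can take — the path, key, and version here are made-up illustrations rather than my project’s actual configuration; an exact-match key means the cached download is reused until you bump the version:

- name: Cache Solr download
  uses: actions/cache@v2
  with:
    # Hypothetical directory the test setup downloads Solr into
    path: tmp/solr_dist
    # Exact-match key: change the version to invalidate the cache
    key: solr-dist-8.7.0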
jrochkind General 7 Comments November 12, 2020 / January 11, 2021 Bibliographic Wilderness is a blog by Jonathan Rochkind about digital library services, ruby, and web development.
blog-dshr-org-2809 ---- DSHR's Blog: Elon Musk: Threat or Menace? DSHR's Blog I'm David Rosenthal, and this is a place to discuss the work I'm doing in Digital Preservation. Tuesday, April 6, 2021 Elon Musk: Threat or Menace? Although both Tesla and SpaceX are major engineering achievements, Elon Musk seems completely unable to understand the concept of externalities, unaccounted-for costs that society bears as a result of these achievements. First, in Tesla: carbon offsetting, but in reverse, Jaime Powell reacted to Tesla taking $1.6B in carbon offsets which provided the only profit Tesla ever made and putting them into Bitcoin: Looked at differently, a single Bitcoin purchase at a price of ~$50,000 has a carbon footprint of 270 tons, the equivalent of 60 ICE cars. Tesla’s average selling price in the fourth quarter of 2020? $49,333. We’re not sure about you, but FT Alphaville is struggling to square the circle of “buy a Tesla with a bitcoin and create the carbon output of 60 internal combustion engine cars” with its legendary environmental ambitions. Unless, of course, that was never the point in the first place. Below the fold, more externalities Musk is ignoring. Second, there is Musk's obsession with establishing a colony on Mars. Even assuming SpaceX can stop their Starship second stage exploding on landing, and do the same with the much bigger first stage, the Mars colony scheme would have massive environmental impacts. Musk envisages a huge fleet of Starships ferrying people and supplies to Mars for between 40 and 100 years. The climate effects of dumping this much rocket exhaust into the upper atmosphere over such a long period would be significant. The idea that a world suffering the catastrophic effects of climate change could sustain such an expensive program over many decades simply for the benefit of a minuscule fraction of the population is laughable. These externalities are in the future. But there is a more immediate set of externalities. Back in 2017 I expressed my skepticism about "Level 5" self-driving cars in Techno-hype part 1, stressing that the problem was that to get to Level 5, or as Musk calls it "Full Self-Driving", you need to pass through the levels where the software has to hand off to the human.
And the closer you get to Level 5, the harder this problem becomes: Suppose, for the sake of argument, that self-driving cars three times as good as Waymo's are in wide use by normal people. A normal person would encounter a hand-off once in 15,000 miles of driving, or less than once a year. Driving would be something they'd be asked to do maybe 50 times in their life. Even if, when the hand-off happened, the human was not "climbing into the back seat, climbing out of an open car window, and even smooching" and had full "situational awareness", they would be faced with a situation too complex for the car's software. How likely is it that they would have the skills needed to cope, when the last time they did any driving was over a year ago, and on average they've only driven 25 times in their life? Current testing of self-driving cars hands-off to drivers with more than a decade of driving experience, well over 100,000 miles of it. It bears no relationship to the hand-off problem with a mass deployment of self-driving technology. Mack Hogan's Tesla's "Full Self Driving" Beta Is Just Laughably Bad and Potentially Dangerous starts: A beta version of Tesla's "Full Self Driving" Autopilot update has begun rolling out to certain users. And man, if you thought "Full Self Driving" was even close to a reality, this video of the system in action will certainly relieve you of that notion. It is perhaps the best comprehensive video at illustrating just how morally dubious, technologically limited, and potentially dangerous Autopilot's "Full Self Driving" beta program is. Hogan sums up the lesson of the video: Tesla's software clearly does a decent job of identifying cars, stop signs, pedestrians, bikes, traffic lights, and other basic obstacles. Yet to think this constitutes anything close to "full self-driving" is ludicrous. There's nothing wrong with having limited capabilities, but Tesla stands alone in its inability to acknowledge its own shortcomings. Hogan goes on to point out the externalities: When technology is immature, the natural reaction is to continue working on it until it's ironed out. Tesla has opted against that strategy here, instead choosing to sell software it knows is incomplete, charging a substantial premium, and hoping that those who buy it have the nuanced, advanced understanding of its limitations—and the ability and responsibility to jump in and save it when it inevitably gets baffled. In short, every Tesla owner who purchases "Full Self-Driving" is serving as an unpaid safety supervisor, conducting research on Tesla's behalf. Perhaps more damning, the company takes no responsibility for its actions and leaves it up to driver discretion to decide when and where to test it out. That leads to videos like this, where early adopters carry out uncontrolled tests on city streets, with pedestrians, cyclists, and other drivers unaware that they're part of the experiment. If even one of those Tesla drivers slips up, the consequences can be deadly. Of course, the drivers are only human so they do slip up: the Tesla arrives at an intersection where it has a stop sign and cross traffic doesn't. It proceeds with two cars incoming, the first car narrowly passing the car's front bumper and the trailing car braking to avoid T-boning the Model 3. It is absolutely unbelievable and indefensible that the driver, who is supposed to be monitoring the car to ensure safe operation, did not intervene there. 
An example of the kinds of problems that can be caused by autonomous vehicles behaving in ways that humans don't expect is reported by Timothy B. Lee in Fender bender in Arizona illustrates Waymo’s commercialization challenge: A white Waymo minivan was traveling westbound in the middle of three westbound lanes on Chandler Boulevard, in autonomous mode, when it unexpectedly braked for no reason. A Waymo backup driver behind the wheel at the time told Chandler police that "all of a sudden the vehicle began to stop and gave a code to the effect of 'stop recommended' and came to a sudden stop without warning." A red Chevrolet Silverado pickup behind the vehicle swerved to the right but clipped its back panel, causing minor damage. Nobody was hurt. The Tesla in the video made a similar unexpected stop. Lee stresses that, unlike Tesla's, Waymo's responsible test program has resulted in a generally safe product, but not one that is safe enough: Waymo has racked up more than 20 million testing miles in Arizona, California, and other states. This is far more than any human being will drive in a lifetime. Waymo's vehicles have been involved in a relatively small number of crashes. These crashes have been overwhelmingly minor with no fatalities and few if any serious injuries. Waymo says that a large majority of those crashes have been the fault of the other driver. So it's very possible that Waymo's self-driving software is significantly safer than a human driver. ... The more serious problem for Waymo is that the company can't be sure that the idiosyncrasies of its self-driving software won't contribute to a more serious crash in the future. Human drivers cause a fatality about once every 100 million miles of driving—far more miles than Waymo has tested so far. If Waymo scaled up rapidly, it would be taking a risk that an unnoticed flaw in Waymo's programming could lead to someone getting killed. I'm a pedestrian, cyclist and driver in an area infested with Teslas owned, but potentially not actually being driven, by fanatical early adopters and members of the cult of Musk. I'm personally at risk from these people believing that what they paid good money for was "Full Self Driving". When SpaceX tests Starship at their Boca Chica site they take precautions, including road closures, to ensure innocent bystanders aren't at risk from the rain of debris when things go wrong. Tesla, not so much. Of course, Tesla doesn't tell the regulators that what the cult members paid for was "Full Self Driving"; that might cause legal problems. As Timothy B. Lee reports, Tesla: “Full self-driving beta” isn’t designed for full self-driving: "Despite the "full self-driving" name, Tesla admitted it doesn't consider the current beta software suitable for fully driverless operation. The company said it wouldn't start testing "true autonomous features" until some unspecified point in the future. ... Tesla added that "we do not expect significant enhancements" that would "shift the responsibility for the entire dynamic driving task to the system." The system "will continue to be an SAE Level 2, advanced driver-assistance feature." SAE level 2 is industry jargon for driver-assistance systems that perform functions like lane-keeping and adaptive cruise control. By definition, level 2 systems require continual human oversight. Fully driverless systems—like the taxi service Waymo is operating in the Phoenix area—are considered level 4 systems."
There is an urgent need for regulators to step up and stop this dangerous madness: The NHTSA should force Tesla to disable "Full Self Driving" in all its vehicles until the technology has passed an approved test program. Any vehicles taking part in such a test program on public roads should be clearly distinguishable from Teslas being driven by actual humans, for example with orange flashing lights. Self-driving test vehicles from less irresponsible companies such as Waymo are distinguishable in this way, Teslas in which some cult member has turned on "Full Self Driving Beta" are not. The FTC should force Tesla to refund, with interest, every dollar paid by their customers under the false pretense that they were paying for "Full Self Driving". Posted by David. at 8:00 AM Labels: techno-hype 5 comments: David. said... Aaron Gordon's This Is the Most Embarrassing News Clip in American Transportation History is a brutal takedown of yet another of Elon Musk's fantasies: "Last night, Shepard Smith ran a segment on his CNBC show revealing Elon Musk's Boring Company's new Las Vegas car tunnel, which was paid for by $50 million in taxpayer dollars. It is one of the most bizarre and embarrassing television segments in American transportation history, a perfect cap for one of the most bizarre and embarrassing transportation projects in American history." April 11, 2021 at 7:20 AM David. said... Eric Berger's A new documentary highlights the visionary behind space settlement reviews The High Frontier: The Untold Story of Gerard K. O'Neill: "O'Neill popularized the idea of not just settling space, but of doing so in free space rather than on the surface of other planets or moons. His ideas spread through the space-enthusiast community at a time when NASA was about to debut its space shuttle, which first flew in 1981. NASA had sold the vehicle as offering frequent, low-cost access to space. It was the kind of transportation system that allowed visionaries like O'Neill to think about what humans could do in space if getting there were cheaper. The concept of "O'Neill cylinders" began with a question he posed to his physics classes at Princeton: "Is a planetary surface the right place for an expanding industrial civilization?" As it turned out, following their analysis, the answer was no. Eventually, O'Neill and his students came to the idea of free-floating, rotating, cylindrical space colonies that could have access to ample solar energy." However attractive the concept is in the far future, I need to point out that pursuing it before the climate crisis has been satisfactorily resolved will make the lives of the vast majority of humanity worse for the benefit of a tiny minority. April 11, 2021 at 4:24 PM David. said... ‘No one was driving the car’: 2 men dead after fiery Tesla crash in Spring, officials say: "Harris County Precinct 4 Constable Mark Herman told KPRC 2 that the investigation showed “no one was driving” the fully-electric 2019 Tesla when the accident happened. There was a person in the passenger seat of the front of the car and in the rear passenger seat of the car." April 18, 2021 at 10:27 AM David. said... Timothy B. Lee's Consumer Reports shows Tesla Autopilot works with no one in the driver’s seat reports: "Tesla defenders also insisted that Autopilot couldn't have been active because the technology doesn't operate unless someone is in the driver's seat.
Consumer Reports decided to test this latter claim by seeing if it could get Autopilot to activate without anyone in the driver's seat. It turned out not to be very difficult. Sitting in the driver's seat, Consumer Reports' Jake Fisher enabled Autopilot and then used the speed dial on the steering wheel to bring the car to a stop. He then placed a weighted chain on the steering wheel (to simulate pressure from a driver's hands) and hopped into the passenger seat. From there, he could reach over and increase the speed using the speed dial. Autopilot won't function unless the driver's seatbelt is buckled, but it was also easy to defeat this check by threading the seatbelt behind the driver. ... the investigation makes clear that activating Autopilot without being in the driver's seat requires deliberately disabling safety measures. Fisher had to buckle the seatbelt behind himself, put a weight on the steering wheel, and crawl over to the passenger seat without opening any doors. Anybody who does that knows exactly what they're doing. Tesla fans argue that people who deliberately bypass safety measures like this have only themselves to blame if it leads to a deadly crash." Well, yes, but Musk's BS has been convincing them to try stunts like this for years. He has to be held responsible, and he has to disable "Full Self Driving" before some innocent bystanders get killed. April 22, 2021 at 2:57 PM David. said... This Automotive News editorial is right but misses the bigger picture: "Tesla's years of misleading consumers about its vehicles' "full self-driving" capabilities — or lack thereof — claimed two more lives this month. ... When critics say the term "autopilot" gives the impression that the car can drive without oversight, Tesla likes to argue that that's based on an erroneous understanding of airplanes' systems. But the company exploits consumers' overconfidence in that label with the way the feature is sold and promoted without correction among Tesla's fanatical online community. Those practices encourage misunderstanding and misuse. In public, Musk says the company is very close to full SAE Level 5 automated driving. In conversations with regulators, the company admits that Autopilot and Full Self-Driving are Level 2 driver-assist suites, not unlike those sold by many other automakers. This nation does not have a good track record of holding manufacturers accountable when their products are misused by the public, which is what happened in this case." It isn't just the Darwin Award winners at risk, it is innocent bystanders at risk. April 27, 2021 at 8:50 AM
blog-dshr-org-397 ---- DSHR's Blog: A Note On Blockchains DSHR's Blog I'm David Rosenthal, and this is a place to discuss the work I'm doing in Digital Preservation. Tuesday, October 6, 2020 A Note On Blockchains Blockchains have three components, a data structure, a set of replicas, and a consensus mechanism: The data structure is often said to provide immutability or to be tamper-proof, but this is wrong. It is made out of bits, and bits can be changed or destroyed. What it actually provides is tamper-evidence, revealing that the data structure has changed. If an unauthorized change to the data structure is detected the damage must be repaired. So there must be multiple replicas of the data structure to allow an undamaged replica to be copied to the damaged replica. The role of the consensus mechanism is to authorize changes to the data structure, and prevent unauthorized changes. A change is authorized if the consensus of the replicas agrees to it. Below the fold, some details. Data Structure The data structure used for blockchains is a form of Merkle or hash tree, published by Ralph Merkle in 1980.
In the blockchain application it is a linear chain to which fixed-size blocks are added at regular intervals. Each block contains the hash of its predecessor; a chain of blocks. Hash algorithms have a limited lifetime, but while the hash algorithm remains unbroken it is extremely difficult to change blocks in the chain while maintaining the same hash values. A change that does not maintain the same hash values is easy to detect. Replicas The set of replicas can be either closed, composed of only replicas approved by some authority, or open, in which case no approval is required for participation. In blockchain jargon, closed replica sets correspond to permissioned blockchains, and open replica sets to permissionless blockchains. Consensus Mechanism (Faults tolerated f vs. replicas required 3f+1: 1 → 4, 2 → 7, 3 → 10, 4 → 13, 5 → 16, 6 → 19.) An important result in theoretical computer science was published in The Byzantine Generals Problem by Lamport et al in 1982. They showed that the minimum size of a replica set to survive f simultaneous failures was 3f+1. Thus Byzantine Fault Tolerance (BFT) is the most efficient possible consensus mechanism in terms of number of replicas. BFT requires a closed replica set, and synchronized operation of the replicas, so can be used only in permissioned blockchains. If joining the replica set of a permissionless blockchain is free, it will be vulnerable to Sybil attacks, in which an attacker creates many apparently independent replicas which are actually under his sole control. If creating and maintaining a replica is free, anyone can authorize any change they choose simply by creating enough Sybil replicas. Defending against Sybil attacks requires that membership in a replica set be expensive. The cost of an attack is at least the membership cost of half the replica set, so that the attacker controls a majority of the replicas. Permissionless blockchains have implemented a number of ways to make it expensive to take part, including: Proof of Work (PoW), a concept originated by Cynthia Dwork and Moni Naor in 1992, in which the expensive resource is CPU cycles. This is the "mining" technique used by Bitcoin, and is the only technique that has been demonstrated to work well at scale. But at scale the cost and environmental damage is unsustainable; the top 5 cryptocurrencies are estimated to use as much energy as The Netherlands. At smaller scales it doesn't work well because renting 51% of the mining power is cheap enough to motivate attacks. 51% attacks have become endemic among the smaller alt-coins. For example, there were three successful attacks on Ethereum Classic in a single month. Proof of Stake (PoS), in which the expensive resource is capital tied up, or staked. Participants stand to lose their stake in case of detected misbehavior. The Ethereum blockchain has been trying to implement PoS for 5 years, so far without success. The technique has similar economic limits and vulnerabilities to PoW. Proofs of Time & Space (PoTS), advocated by Bram Cohen, in which the expensive resource is disk storage. Conclusion Eric Budish points out the fundamental problem with expensive defenses in The Economic Limits of Bitcoin and the Blockchain: From a computer security perspective, the key thing to note ... is that the security of the blockchain is linear in the amount of expenditure on mining power, ... In contrast, in many other contexts investments in computer security yield convex returns (e.g., traditional uses of cryptography) ...
analogously to how a lock on a door increases the security of a house by more than the cost of the lock. The difference between permissioned and permissionless blockchains is the presence or absence of a trusted authority controlling the replica set. A decision not to trust such an authority imposes enormous additional costs and performance penalties on the system because the permissionless consensus mechanism has to be expensive. Decentralization in Bitcoin and Ethereum Networks by Adem Efe Gencer et al compares the cost of a permissioned system using BFT to the actual Bitcoin PoW blockchain: a Byzantine quorum system of size 20 could achieve better decentralization than proof-of-work mining at a much lower resource cost. As an Englishman I appreciate understatement. By "much lower", they mean around 5 orders of magnitude lower. Posted by David. at 8:00 AM Labels: bitcoin 2 comments: David. said... Going from Bad to Worse: From Internet Voting to Blockchain Voting by Sunoo Park, Neha Narula, Michael Specter and Ronald L. Rivest argues that: "given the current state of computer security, any turnout increase derived from Internet- or blockchain-based voting would come at the cost of losing meaningful assurance that votes have been counted as they were cast, and not undetectably altered or discarded. This state of affairs will continue as long as standard tactics such as malware, zero days, and denial-of-service attacks continue to be effective. This article analyzes and systematizes prior research on the security risks of online and electronic voting, and show that not only do these risks persist in blockchain-based voting systems, but blockchains may introduce additional problems for voting systems." November 19, 2020 at 8:48 AM Michael Hogan said... Which is why voting systems still include paper records, and probably will always include paper records. It calls to mind my oft-stated admonition to amateur futurists, that all of the cool stuff in our increasingly digitized world still relies to far too great an extent on a technology commercialized in 1882 (burning fossil fuels in a boiler to spin a turbine-generator), and even the dominant battery chemistry is about 50 years old. Beware of the "TED talk" mindset - be on the lookout for the dirty old smelter behind that shiny penny. November 28, 2020 at 4:04 AM
blog-dshr-org-4 ---- DSHR's Blog I'm David Rosenthal, and this is a place to discuss the work I'm doing in Digital Preservation. Thursday, April 22, 2021 Dogecoin Disrupts Bitcoin! Two topics I've posted about recently, Elon Musk's cult and the illusory "prices" of cryptocurrencies, just intersected in spectacular fashion. On April 14 the Bitcoin "price" peaked at $63.4K. Early on April 15, the Musk cult saw this tweet from their prophet. Immediately, the Dogecoin "price" took off like a Falcon 9. A day later, Jemima Kelley reported that If you believe, they put a Dogecoin on the moon. That was to say that: Dogecoin — the crypto token that was started as a joke and that is the favourite of Elon Musk — is having a bit of a moment. And when we say a bit of a moment, we mean that it is on a lunar trajectory (in crypto talk: it is going to da moon).
At the time of writing this, it is up over 200 per cent in the past 24 hours — more than tripling in value (for those of you who need help on percentages, it is Friday afternoon after all). Over the past week it’s up more than 550 per cent (almost seven times higher!). The headlines tell the story — Timothy B. Lee's Dogecoin has risen 400 percent in the last week because why not and Joanna Ossinger's Dogecoin Rips in Meme-Fueled Frenzy on Pot-Smoking Holiday. The Dogecoin "price" graph Kelly posted was almost vertical. The same day, Peter Schiff, the notorious gold-bug, tweeted: So far in 2021 #Bitcoin has lost 97% of its value verses #Dogecoin. The market has spoken. Dogecoin is eating Bitcoin. All the Bitcoin pumpers who claim Bitcoin is better than gold because its price has risen more than gold's must now concede that Dogecoin is better than Bitcoin. Below the fold I look back at this revolution in crypto-land. Read more » Posted by David. at 9:00 AM 1 comment: Labels: bitcoin What Is The Point? During a discussion of NFTs, Larry Masinter pointed me to his 2012 proposal The 'tdb' and 'duri' URI schemes, based on dated URIs. The proposal's abstract reads: This document defines two URI schemes. The first, 'duri' (standing for "dated URI"), identifies a resource as of a particular time. This allows explicit reference to the "time of retrieval", similar to the way in which bibliographic references containing URIs are often written. The second scheme, 'tdb' ( standing for "Thing Described By"), provides a way of minting URIs for anything that can be described, by the means of identifying a description as of a particular time. These schemes were posited as "thought experiments", and therefore this document is designated as Experimental. As far as I can tell, this proposal went nowhere, but it raises a question that is also raised by NFTs. What is the point of a link that is unlikely to continue to resolve to the expected content? Below the fold I explore this question. Read more » Posted by David. at 8:00 AM No comments: Labels: personal digital preservation, web archiving Thursday, April 15, 2021 NFTs and Web Archiving One of the earliest observations of the behavior of the Web at scale was "link rot". There were a lot of 404s, broken links. Research showed that the half-life of Web pages was alarmingly short. Even in 1996 this problem was obvious enough for Brewster Kahle to found the Internet Archive to address it. From the Wikipedia entry for Link Rot: A 2003 study found that on the Web, about one link out of every 200 broke each week,[1] suggesting a half-life of 138 weeks. This rate was largely confirmed by a 2016–2017 study of links in Yahoo! Directory (which had stopped updating in 2014 after 21 years of development) that found the half-life of the directory's links to be two years.[2] One might have thought that academic journals were a relatively stable part of the Web, but research showed that their references decayed too, just somewhat less rapidly. A 2013 study found a half-life of 9.3 years. See my 2015 post The Evanescent Web. I expect you have noticed the latest outbreak of blockchain-enabled insanity, Non-Fungible Tokens (NFTs). Someone "paying $69M for a JPEG" or $560K for a New York Times column attracted a lot of attention. Follow me below the fold for the connection between NFTs, "link rot" and Web archiving. Read more » Posted by David. 
at 8:00 AM 2 comments: Labels: bitcoin, distributed web, web archiving Tuesday, April 13, 2021 Cryptocurrency's Carbon Footprint China’s bitcoin mines could derail carbon neutrality goals, study says and Bitcoin mining emissions in China will hit 130 million tonnes by 2024, the headlines say it all. Excusing this climate-destroying externality of Proof-of-Work blockchains requires a continuous flow of new misleading arguments. Below the fold I discuss one of the more recent novelties. Read more » Posted by David. at 8:00 AM 5 comments: Labels: bitcoin, security Tuesday, April 6, 2021 Elon Musk: Threat or Menace? Although both Tesla and SpaceX are major engineering achievements, Elon Musk seems completely unable to understand the concept of externalities, unaccounted-for costs that society bears as a result of these achievements. First, in Tesla: carbon offsetting, but in reverse, Jaime Powell reacted to Tesla taking $1.6B in carbon offsets which provided the only profit Tesla ever made and putting them into Bitcoin: Looked at differently, a single Bitcoin purchase at a price of ~$50,000 has a carbon footprint of 270 tons, the equivalent of 60 ICE cars. Tesla’s average selling price in the fourth quarter of 2020? $49,333. We’re not sure about you, but FT Alphaville is struggling to square the circle of “buy a Tesla with a bitcoin and create the carbon output of 60 internal combustion engine cars” with its legendary environmental ambitions. Unless, of course, that was never the point in the first place. Below the fold, more externalities Musk is ignoring. Read more » Posted by David. at 8:00 AM 5 comments: Labels: techno-hype Thursday, March 25, 2021 Internet Archive Storage The Internet Archive is a remarkable institution, which has become increasingly important during the pandemic. It has been for many years in the world's top 300 Web sites and is currently ranked #209, sustaining almost 60Gb/s outbound bandwidth from its collection of almost half a trillion archived Web pages and much other content. It does this on a budget of under $20M/yr, yet maintains 99.98% availability. Jonah Edwards, who runs the Core Infrastructure team, gave a presentation on the Internet Archive's storage infrastructure to the Archive's staff. Below the fold, some details and commentary. Read more » Posted by David. at 8:00 AM 1 comment: Labels: storage costs, storage failures, storage media Tuesday, March 16, 2021 Correlated Failures The invaluable statistics published by Backblaze show that, despite being built from technologies close to the physical limits (Heat-Assisted Magnetic Recording, 3D NAND Flash), modern digital storage media are extraordinarily reliable. However, I have long believed that the models that attempt to project the reliability of digital storage systems from the statistics of media reliability are wildly optimistic. They ignore foreseeable causes of data loss such as Coronal Mass Ejections and ransomware attacks, which cause correlated failures among the media in the system. No matter how many they are, if all replicas are destroyed or corrupted the data is irrecoverable. Modelling these "black swan" events is clearly extremely difficult, but much less dramatic causes are in practice important too. It has been known at least since Talagala's 1999 Ph.D. thesis that media failures in storage systems are significantly correlated, and at least since Jiang et al's 2008 Are Disks the Dominant Contributor for Storage Failures? 
A Comprehensive Study of Storage Subsystem Failure Characteristics that only about half the failures in storage systems are traceable to media failures. The rest happen in the pipeline from the media to the CPU. Because this typically aggregates data from many media components, it naturally causes correlations. As I wrote in 2015's Disk reliability, discussing Backblaze's experience of a 40% Annual Failure Rate (AFR) in over 1,100 Seagate 3TB drives: Alas, there is a long history of high failure rates among particular batches of drives. An experience similar to Backblaze's at Facebook is related here, with an AFR over 60%. My first experience of this was nearly 30 years ago in the early days of Sun Microsystems. Manufacturing defects, software bugs, mishandling by distributors, vibration resonance, there are many causes for these correlated failures. Despite plenty of anecdotes, there is little useful data on which to base models of correlated failures in storage systems. Below the fold I summarize and comment on an important paper by a team from the Chinese University of Hong Kong and Alibaba that helps remedy this. Read more » Posted by David. at 8:00 AM No comments: Labels: fault tolerance, storage failures, storage media
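The point about correlated failures can be made with a one-line calculation. This is a minimal sketch with invented numbers; none of these rates come from the post or from Backblaze's data. If replica failures were independent, three replicas would make loss vanishingly rare, but even a small probability of an event that destroys all replicas at once (ransomware, a coronal mass ejection, a bad drive batch) dominates the total risk.

    # Back-of-the-envelope: why correlated failures dominate data-loss risk.
    # All rates below are illustrative assumptions.
    replicas = 3
    p_fail = 0.02          # assumed independent annual failure probability per replica
    p_correlated = 0.001   # assumed annual probability of an event hitting all replicas at once

    p_loss_independent = p_fail ** replicas
    p_loss_with_correlation = p_correlated + (1 - p_correlated) * p_loss_independent

    print(f"independent failures only : {p_loss_independent:.2e}")      # 8.00e-06
    print(f"with a correlated threat  : {p_loss_with_correlation:.2e}") # ~1.0e-03, over 100x worse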
blog-dshr-org-5180 ---- DSHR's Blog: Cryptocurrency's Carbon Footprint DSHR's Blog I'm David Rosenthal, and this is a place to discuss the work I'm doing in Digital Preservation. Tuesday, April 13, 2021 Cryptocurrency's Carbon Footprint China’s bitcoin mines could derail carbon neutrality goals, study says and Bitcoin mining emissions in China will hit 130 million tonnes by 2024, the headlines say it all. Excusing this climate-destroying externality of Proof-of-Work blockchains requires a continuous flow of new misleading arguments. Below the fold I discuss one of the more recent novelties.

In Bitcoin and Ethereum Carbon Footprints – Part 2, Moritz Seibert claims the reason for mining is to get the mining reward: Bitcoin transactions themselves don’t cause a lot of power usage. Getting the network to accept a transaction consumes almost no power, but having ASIC miners grind through the mathematical ether to solve valid blocks does. Miners are incentivized to do this because they are compensated for it. Presently, that compensation includes a block reward which is paid in bitcoin (6.25 BTC per block) as well as a miner fee (transaction fee). Transaction fees are denominated in fractional bitcoins and paid by the initiator of the transaction. Today, about 15% of total miners’ rewards are transaction fees, and about 85% are block rewards.

So, he argues, Bitcoin's current catastrophic carbon footprint doesn't matter because, as the reward decreases, so will the carbon footprint: This also means that the power usage of the Bitcoin network won’t scale linearly with the number of transactions as the network becomes predominantly fee-based and less rewards-based (which causes a lot of power to be thrown at it in light of increasing BTC prices), and especially if those transactions take place on secondary layers. In other words, taking the ratio of “Bitcoin’s total power usage” to “Number of transactions” to calculate the “Power cost per transaction” falsely implies that all transactions hit the final settlement layer (they don’t) and disregards the fact that the final state of the Bitcoin base layer is a fee-based state which requires a very small fraction of Bitcoin’s overall power usage today (no more block rewards).

Seibert has some vague idea that there are implications of this not just for the carbon footprint but also for the security of the Bitcoin blockchain: Going forward however, miners’ primary revenue source will change from block rewards to the fees paid for the processing of transactions, which don’t per se cause high carbon emissions.
Bitcoin is set to become be a purely fee-based system (which may pose a risk to the security of the system itself if the overall hash rate declines, but that’s a topic for another article because a blockchain that is fully reliant on fees requires that BTCs are transacted with rather than held in Michael Saylor-style as HODLing leads to low BTC velocity, which does not contribute to security in a setup where fees are the only rewards for miners.) Lets leave aside the stunning irresponsibility of arguing that it is acceptable to dump huge amounts of long-lasting greenhouse gas into the atmosphere now because you believe that in the future you will dump less. How realistic is the idea that decreasing the mining reward will decrease the carbon footprint? The graph shows the history of the hash rate, which is a proxy for the carbon footprint. You can see the effect of the "halvening", when on May 11th 2020 the mining reward halved. There was a temporary drop, but the hash rate resumed its inexorable rise. This experiment shows that reducing the mining reward doesn't reduce the carbon footprint. So why does Seibert think that eliminating it will reduce the carbon footprint? The answer appears to be that Seibert thinks the purpose of mining is to create new Bitcoins, that the reason for the vast expenditure of energy is to make the process of creating new coins secure, and that it has nothing to do with the security of transactions. This completely misunderstands the technology. In The Economic Limits of Bitcoin and the Blockchain, Eric Budish examines the return on investment in two kinds of attacks on a blockchain like Bitcoin's. The simpler one is a 51% attack, in which an attacker controls the majority of the mining power. Budish explains what this allows the attacker to do: An attacker could (i) spend Bitcoins, i.e., engage in a transaction in which he sends his Bitcoins to some merchant in exchange for goods or assets; then (ii) allow that transaction to be added to the public blockchain (i.e., the longest chain); and then subsequently (iii) remove that transaction from the public blockchain, by building an alternative longest chain, which he can do with certainty given his majority of computing power. The merchant, upon seeing the transaction added to the public blockchain in (ii), gives the attacker goods or assets in exchange for the Bitcoins, perhaps after an escrow period. But, when the attacker removes the transaction from the public blockchain in (iii), the merchant effectively loses his Bitcoins, allowing the attacker to “double spend” the coins elsewhere. Such attacks are endemic among the smaller alt-coins; for example there were three successful attacks on Ethereum Classic in a single month last year. Clearly, Seibert's future "transaction only" Bitcoin must defend against them. There are two ways to mount a 51% attack, from the outside or from the inside. An outside attack requires more mining power than the insiders are using, whereas an insider attack only needs a majority of the mining power to conspire. Bitcoin miners collaborate in "mining pools" to reduce volatility of their income, and for many years it would have taken only three or so pools to conspire for a successful attack. But assuming insiders are honest, outsiders must acquire more mining power than the insiders are using. Clearly, Bitcoin insiders are using so much mining power that this isn't feasible. The point of mining isn't to create new Bitcoins. 
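A toy calculation makes the economics concrete. This is a minimal sketch with made-up numbers: the honest-mining spend, attack duration and double-spend value are invented, and the fee figures are round numbers of the same order as the snapshot quoted later in the post, not market data.

    # Back-of-the-envelope sketch of two points made in the post.
    # All numbers here are illustrative assumptions.

    def attack_profitable(honest_spend_per_day, attack_days, double_spend_gain):
        # Budish's point: the defence is only as strong as the ongoing spend,
        # so the attack cost scales linearly with what honest miners spend.
        attack_cost = honest_spend_per_day * attack_days
        return double_spend_gain > attack_cost

    # At Bitcoin's scale (assumed ~$50M/day of honest mining spend) a half-day
    # attack to double spend $10M does not pay...
    print(attack_profitable(50_000_000, 0.5, 10_000_000))   # False

    # ...but at a small alt-coin's scale (assumed ~$50K/day) the same double
    # spend is wildly profitable, which is why 51% attacks are endemic
    # among the smaller coins.
    print(attack_profitable(50_000, 0.5, 10_000_000))        # True

    # A fee-only Bitcoin with unchanged security and block size: fees must
    # replace the revenue the block reward used to provide.
    fee_share = 0.08                     # ~8% of miner revenue from fees (post's snapshot)
    current_avg_fee = 20                 # ~$20 average fee (post's snapshot)
    required_multiplier = 1 / fee_share  # fees must rise ~12x to keep miner revenue constant
    print("average fee in a fee-only system: about $%.0f" %
          (current_avg_fee * required_multiplier))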
Mining is needed to make the process of adding a block to the chain, and thus adding a set of transactions to the chain, so expensive that it isn't worth it for an attacker to subvert the process. The cost, and thus in the case of Proof of Work the carbon footprint, is the whole point. As Budish wrote: From a computer security perspective, the key thing to note ... is that the security of the blockchain is linear in the amount of expenditure on mining power, ... In contrast, in many other contexts investments in computer security yield convex returns (e.g., traditional uses of cryptography) — analogously to how a lock on a door increases the security of a house by more than the cost of the lock.

Lets consider the possible futures of a fee-based Bitcoin blockchain. It turns out that currently fee revenue is a smaller proportion of total miner revenue than Seibert claims. Here is the chart of total revenue (~$60M/day): And here is the chart of fee revenue (~$5M/day): Thus the split is about 8% fee, 92% reward:

- If security stays the same, blocksize stays the same, fees must increase to keep the cost of a 51% attack high enough. The chart shows the average fee hovering around $20, so the average cost of a single transaction would be over $240. This might be a problem for Seibert's requirement that "BTCs are transacted with rather than held".

- If blocksize stays the same, fees stay the same, security must decrease because the fees cannot cover the cost of enough hash power to deter a 51% attack. Similarly, in this case it would be 12 times cheaper to mount a 51% attack, which would greatly increase the risk of delivering anything in return for Bitcoin. It is already the case that users are advised to wait 6 blocks (about an hour) before treating a transaction as final. Waiting nearly half a day before finality would probably be a disincentive.

- If fees stay the same, security stays the same, blocksize must increase to allow for enough transactions so that their fees cover the cost of enough hash power to deter a 51% attack. Since 2017 Bitcoin blocks have been effectively limited to around 2MB, and the blockchain is now over one-third of a Terabyte, growing at over 25%/yr. Increasing the size limit to say 22MB would solve the long-term problem of a fee-based system at the cost of reducing miners' income in the short term by reducing the scarcity value of a slot in a block. Doubling the effective size of the block caused a huge controversy in the Bitcoin community for precisely this short vs. long conflict, so a much larger increase would be even more controversial. Not to mention that the size of the blockchain a year from now would be 3 times bigger, imposing additional storage costs on miners.

That is just the supply side. On the demand side it is an open question as to whether there would be 12 times the current demand for transactions costing $20 and taking an hour which, at least in the US, must each be reported to the tax authorities.

[Chart: Short vs. Long]

None of these alternatives look attractive. But there's also a second type of attack in Budish's analysis, which he calls "sabotage". He quotes Rosenfeld: In this section we will assume q < p [i.e., that the attacker does not have a majority]. Otherwise, all bets are off with the current Bitcoin protocol ... The honest miners, who no longer receive any rewards, would quit due to lack of incentive; this will make it even easier for the attacker to maintain his dominance. This will cause either the collapse of Bitcoin or a move to a modified protocol.
As such, this attack is best seen as an attempt to destroy Bitcoin, motivated not by the desire to obtain Bitcoin value, but rather wishing to maintain entrenched economical systems or obtain speculative profits from holding a short position. Short interest in Bitcoin is currently small relative to the total stock, but much larger relative to the circulating supply. Budish analyzes various sabotage attack cases, with a parameter ∆attack representing the proportion of the Bitcoin value destroyed by the attack: For example, if ∆attack = 1, i.e., if the attack causes a total collapse of the value of Bitcoin, the attacker loses exactly as much in Bitcoin value as he gains from double spending; in effect, there is no chance to “double” spend after all. ... However, ∆attack is something of a “pick your poison” parameter. If ∆attack is small, then the system is vulnerable to the double-spending attack ... and the implicit transactions tax on economic activity using the blockchain has to be high. If ∆attack is large, then a short time period of access to a large amount of computing power can sabotage the blockchain. The current cryptocurrency bubble ensures that everyone is making enough paper profits from the golden eggs to deter them from killing the goose that lays them. But it is easy to create scenarios in which a rush for the exits might make killing the goose seem like the best way out. Seibert's misunderstanding illustrates the fundamental problem with permissionless blockchains. As I wrote in A Note On Blockchains: If joining the replica set of a permissionless blockchain is free, it will be vulnerable to Sybil attacks, in which an attacker creates many apparently independent replicas which are actually under his sole control. If creating and maintaining a replica is free, anyone can authorize any change they choose simply by creating enough Sybil replicas. Defending against Sybil attacks requires that membership in a replica set be expensive. There are many attempts to provide less environmentally damaging ways to make adding a block to a blockchain expensive, but attempts to make adding a block cheaper are self-defeating because they make the blockchain less secure. There are two reasons why the primary use of a permissionless blockchain cannot be transactions as opposed to HODL-ing: The lack of synchronization between the peers means that transactions must necessarily be slow. The need to defend against Sybil attacks means either that transactions must necessarily be expensive, or that blocks must be impractically large. Posted by David. at 8:00 AM Labels: bitcoin, security 5 comments: David. said... Seibert apparently believes (a) that a fee-only Bitcoin network would be secure, used for large numbers of transactions, and have a low carbon footprint, and (b) that the network would have a low carbon footprint because most transactions would use the Lightning network. Ignoring the contradiction, anyone who believes that the Lightning network would do the bulk of the transactions needs to read the accounts of people actually trying to transact using it. David Gerard writes: "Crypto guy loses a bet, and tries to pay the bet using the Lightning Network. Hilarity ensues." Indeed, the archived Twitter thread from the loser is a laugh-a-minute read. April 20, 2021 at 7:16 PM David. said... 
Jaime Powell shreds another attempt at cryptocurrency carbon footprint gaslighting in The destructive green fantasy of the bitcoin fanatics: "It is in this context that we should consider the latest “research” from the good folks at ETF-house-come-fund manager ARK Invest and $113bn payment company Square. Titled “Bitcoin is Key to an Abundant, Clean Energy Future”, it does exactly what you’d expect it to. Which is to try justify, after the fact, bitcoin’s insane energy use. Why? Because both entities are deeply involved in this “space” and now need to a) feel better about themselves and b) guard against people going off crypto on the grounds that it is actually a Very Bad Thing. ... The white paper imagines bitcoin mining being a solution, alongside battery storage, for excess energy. It also imagines that if solar and wind prices continue to collapse, bitcoin could eventually transition to being completely renewable-powered in the future. “Imagines” is the key word here. Because in reality, bitcoin mining is quite the polluter. It’s estimated that 72 per cent of bitcoin mining is concentrated in China, where nearly two-thirds of all electricity is generated by coal power, according to a recent Bank of America report. In fact, mining uses coal power so aggressively that when one coal mine flooded and shut down in Xianjiang province over the weekend, one-third of all bitcoin’s computing power went offline." April 25, 2021 at 5:00 PM David. said... In Jack Dorsey and Elon Musk agree on bitcoin's green credentials the BBC reports on yet another of Elon Musk's irresponsible cryptocurrency tweets: "The tweet comes soon after the release of a White Paper from Mr Dorsey's digital payment services firm Square, and global asset management business ARK Invest. Entitled "Bitcoin as key to an abundant, clean energy future", the paper argues that "bitcoin miners are unique energy buyers", because they offer flexibility, pay in a cryptocurrency, and can be based anywhere with an internet connection." The BBC fails to point out that Musk and Dorsey are "talking their book"; Tesla invested $1.6B and Square $220M in Bitcoin. So they have over $1.8B reasons to worry about efforts to limit its carbon footprint. April 25, 2021 at 5:10 PM David. said... This comment has been removed by the author. April 25, 2021 at 5:45 PM David. said... Nathan J. Robinson's Why Cryptocurrency Is A Giant Fraud has an interesting footnote, discussing a "pseudoscholarly masterpiece" of Bitcoin puffery by Vijay Boyapati: "Interestingly, Boyapati cites Bitcoin’s high transaction fees as a feature rather than a bug: “A recent criticism of the Bitcoin network is that the increase in fees to transmit bitcoins makes it unsuitable as a payment system. However, the growth in fees is healthy and expected… A network with ‘low’ fees is a network with little security and prone to external censorship. Those touting the low fees of Bitcoin alternatives are unknowingly describing the weakness of these so-called ‘alt-coins.’” As you can see, this successfully makes the case that high fees are unavoidable, but it also undermines the reasons why any sane person would use this as currency rather than a speculative investment." Right! A permissionless blockchain has to be expensive to run if it is to be secure. Those costs have either to be borne, ultimately, by the blockchain's users, or dumped on the rest of us as externalities (e.g. the blockchain's carbon footprint, the shortage of GPUs, ...). 
April 25, 2021 at 5:55 PM

blog-dshr-org-6705 ---- DSHR's Blog: Dogecoin Disrupts Bitcoin! DSHR's Blog I'm David Rosenthal, and this is a place to discuss the work I'm doing in Digital Preservation. Thursday, April 22, 2021 Dogecoin Disrupts Bitcoin! Two topics I've posted about recently, Elon Musk's cult and the illusory "prices" of cryptocurrencies, just intersected in spectacular fashion. On April 14 the Bitcoin "price" peaked at $63.4K. Early on April 15, the Musk cult saw this tweet from their prophet. Immediately, the Dogecoin "price" took off like a Falcon 9.
A day later, Jemima Kelley reported that If you believe, they put a Dogecoin on the moon. That was to say that: Dogecoin — the crypto token that was started as a joke and that is the favourite of Elon Musk — is having a bit of a moment. And when we say a bit of a moment, we mean that it is on a lunar trajectory (in crypto talk: it is going to da moon). At the time of writing this, it is up over 200 per cent in the past 24 hours — more than tripling in value (for those of you who need help on percentages, it is Friday afternoon after all). Over the past week it’s up more than 550 per cent (almost seven times higher!). The headlines tell the story — Timothy B. Lee's Dogecoin has risen 400 percent in the last week because why not and Joanna Ossinger's Dogecoin Rips in Meme-Fueled Frenzy on Pot-Smoking Holiday. The Dogecoin "price" graph Kelly posted was almost vertical. The same day, Peter Schiff, the notorious gold-bug, tweeted: So far in 2021 #Bitcoin has lost 97% of its value verses #Dogecoin. The market has spoken. Dogecoin is eating Bitcoin. All the Bitcoin pumpers who claim Bitcoin is better than gold because its price has risen more than gold's must now concede that Dogecoin is better than Bitcoin. Below the fold I look back at this revolution in crypto-land. I'm writing on April 21, and the Bitcoin "price" is around $55K, about 87% of its peak on April 14. In the same period Dogecoin's "price" peaked at $0.37, and is now around $0.32, or 267% of its $0.12 "price" on April 14. There are some reasons for Bitcoin's slump apart from people rotating out of BTC into DOGE in response to Musk's tweet. Nivesh Rustgi reports: Bitcoin’s hashrate dropped 25% from all-time highs after an accident in the Xinjiang region’s mining industry caused flooding and a gas explosion, leading to 12 deaths with 21 workers trapped since. ... The leading Bitcoin mining data centers in the region have closed operations to comply with the fire and safety inspections. The Chinese central authority is conducting site inspections “on individual mining operations and related local government agencies,” tweeted Dovey Wan, partner at Primitive Crypto. ... The accident has reignited the centralization problems arising from China’s dominance of the Bitcoin mining sector, despite global expansion efforts. The drop in the hash rate had the obvious effects. David Gerard reports: The Bitcoin hash rate dropped from 220 exahashes per second to 165 EH/s. The rate of new blocks slowed. The Bitcoin mempool — the backlog of transactions waiting to be processed — has filled. Transaction fees peaked at just over $50 average on 18 April. The average BTC transaction fee is now just short of $60, with a median fee over $26! The BTC blockchain did around 350K transactions on April 15, but on April 16 it could only manage 190K. It is also true that DOGE had upward momentum before Musk's tweet. After being nearly flat for almost a month, it had already doubled since April 6. Kelly quotes David Kimberley at Freetrade: Dogecoin’s rise is a classic example of greater fool theory at play, Dogecoin investors are basically betting they’ll be able to cash out by selling to the next person wanting to invest. People are buying the cryptocurrency, not because they think it has any meaningful value, but because they hope others will pile in, push the price up and then they can sell off and make a quick buck. But when everyone is doing this, the bubble eventually has to burst and you’re going to be left short-changed if you don’t get out in time. 
And it’s almost impossible to say when that’s going to happen. Kelly also quotes Khadim Shubber explaining that this is all just entertainment: Bitcoin, and cryptocurrencies in general, are not directly analogous to the fairly mundane practice of buying a Lottery ticket, but this part of its appeal is often ignored in favour of more intellectual or high-brow explanations. It has all the hallmarks of a fun game, played out across the planet with few barriers to entry and all the joy and pain that usually accompanies gambling. There’s a single, addictive reward system: the price. The volatility of cryptocurrencies is often highlighted as a failing, but in fact it’s a key part of its appeal. Where’s the fun in an asset whose price snoozes along a predictable path? The rollercoaster rise and fall and rise again of the crypto world means that it’s never boring. If it’s down one day (and boy was it down yesterday) well, maybe the next day it’ll be up again. Note the importance of volatility. In a must-read interview that New York Magazine entitled BidenBucks Is Beeple Is Bitcoin Prof. George Galloway also stressed the importance of volatility: Young people want volatility. If you have assets and you’re already rich, you want to take volatility down. You want things to stay the way they are. But young people are willing to take risks because they can afford to lose everything. For the opportunity to double their money, they will risk losing everything. Imagine a person who has the least to lose: He’s in solitary confinement in a supermax-security prison. That person wants maximum volatility. He prays for such volatility, that there’s a revolution and they open the prison. People under the age of 40 are fed up. They have less than half of the economic security, as measured by the ratio of wealth to income, that their parents did at their age. Their share of overall wealth has crashed. A lot of them are bored. A lot of them have some stimulus money in their pocket. And in the case of GameStop, they did what’s kind of a mob short squeeze. ... I see crypto as a mini-revolution, just like GameStop. The central banks and governments are all conspiring to create more money to keep the shareholder class wealthy. Young people think, That’s not good for me, so I’m going to exit the ecosystem and I’m going to create my own currency. This all reinforces my skepticism about the "price" and "market cap" of cryptocurrencies. Posted by David. at 9:00 AM Labels: bitcoin 1 comment: David. said... Joe Weisenthal (@TheStalwart) tweeted: "WHY I LOVE THE DOGECOIN RALLY SO MUCH See all the serious stuff about decentralized finance, or stores of value, or people thirsting for alternatives for the dollar. Nobody can talk about it with a straight face when it comes to Dogecoin. ... But really, all the crypto talking points go out the window with Doge." April 26, 2021 at 12:19 PM Post a Comment Older Post Home Subscribe to: Post Comments (Atom) Blog Rules Posts and comments are copyright of their respective authors who, by posting or commenting, license their work under a Creative Commons Attribution-Share Alike 3.0 United States License. Off-topic or unsuitable comments will be deleted. DSHR DSHR in ANWR Recent Comments Full comments Blog Archive ▼  2021 (18) ▼  April (5) Dogecoin Disrupts Bitcoin! What Is The Point? NFTs and Web Archiving Cryptocurrency's Carbon Footprint Elon Musk: Threat or Menace? 
blog-dshr-org-7459 ---- DSHR's Blog: Techno-hype part 1 DSHR's Blog I'm David Rosenthal, and this is a place to discuss the work I'm doing in Digital Preservation. Tuesday, November 14, 2017 Techno-hype part 1 Don't, don't, don't, don't believe the hype! Public Enemy New technologies are routinely over-hyped because people under-estimate the gap between a technology that works and a technology that is in everyday use by normal people. You have probably figured out that I'm skeptical of the hype surrounding blockchain technology. Despite incident-free years spent routinely driving in company with Waymo's self-driving cars, I'm also skeptical of the self-driving car hype. Below the fold, an explanation. Clearly, self-driving cars driven by a trained self-driving car driver in Bay Area traffic work fine: We've known for several years now that Waymo's (previously Google's) cars can handle most road conditions without a safety driver intervening. Last year, the company reported that its cars could go about 5,000 miles on California roads, on average, between human interventions.
Crashes per 100M miles Waymo's cars are much safer than almost all human drivers: Waymo has logged over two million miles on U.S. streets and has only had fault in one accident, making its cars by far the lowest at-fault rate of any driver class on the road— about 10 times lower than our safest demographic of human drivers (60–69 year-olds) and 40 times lower than new drivers, not to mention the obvious benefits gained from eliminating drunk drivers. However, Waymo’s vehicles have a knack for getting hit by human drivers. When we look at total accidents (at fault and not), the Waymo accident rate is higher than the accident rate of most experienced drivers ... Most of these accidents are fender-benders caused by humans, with no fatalities or serious injuries. The leading theory is that Waymo’s vehicles adhere to the letter of traffic law, leading them to brake for things they are legally supposed to brake for (e.g., pedestrians approaching crosswalks). Since human drivers are not used to this lawful behavior, it leads to a higher rate of rear-end collisions (where the human driver is at-fault). Clearly, this is a technology that works. I would love it if my grand-children never had to learn to drive, but even a decade from now I think they will still need to. But, as Google realized some time ago, just being safer on average than most humans almost all the time is not enough for mass public deployment of self-driving cars. Back in June, John Markoff wrote: Three years ago, Google’s self-driving car project abruptly shifted from designing a vehicle that would drive autonomously most of the time while occasionally requiring human oversight, to a slow-speed robot without a brake pedal, accelerator or steering wheel. In other words, human driving was no longer permitted. The company made the decision after giving self-driving cars to Google employees for their work commutes and recording what the passengers did while the autonomous system did the driving. In-car cameras recorded employees climbing into the back seat, climbing out of an open car window, and even smooching while the car was in motion, according to two former Google engineers. “We saw stuff that made us a little nervous,” Chris Urmson, a roboticist who was then head of the project, said at the time. He later mentioned in a blog post that the company had spotted a number of “silly” actions, including the driver turning around while the car was moving. Johnny Luu, a spokesman for Google’s self-driving car effort, now called Waymo, disputed the accounts that went beyond what Mr. Urmson described, but said behavior like an employee’s rummaging in the back seat for his laptop while the car was moving and other “egregious” acts contributed to shutting down the experiment. Gareth Corfield at The Register adds: Google binned its self-driving cars' "take over now, human!" feature because test drivers kept dozing off behind the wheel instead of watching the road, according to reports. "What we found was pretty scary," Google Waymo's boss John Krafcik told Reuters reporters during a recent media tour of a Waymo testing facility. "It's hard to take over because they have lost contextual awareness." ... Since then, said Reuters, Google Waymo has focused on technology that does not require human intervention. Timothy B. Lee at Ars Technica writes: Waymo cars are designed to never have anyone touch the steering wheel or pedals. So the cars have a greatly simplified four-button user interface for passengers to use. 
There are buttons to call Waymo customer support, lock and unlock the car, pull over and stop the car, and start a ride. But, during a recent show-and-tell with reporters, they weren't allowed to press the "pull over" button: a Waymo spokesman tells Ars that the "pull over" button does work. However, the event had a tight schedule, and it would have slowed things down too much to let reporters push it. Google was right to identify the "hand-off" problem as essentially insoluble, because the human driver would have lost "situational awareness". Jean-Louis Gassée has an appropriately skeptical take on the technology, based on interviews with Chris Urmson: Google's Director of Self-Driving Cars from 2013 to late 2016 (he had joined the team in 2009). In a SXSW talk in early 2016, Urmson gives a sobering yet helpful vision of the project's future, summarized by Lee Gomes in an IEEE Spectrum article [as always, edits and emphasis mine]: "Not only might it take much longer to arrive than the company has ever indicated — as long as 30 years, said Urmson — but the early commercial versions might well be limited to certain geographies and weather conditions. Self-driving cars are much easier to engineer for sunny weather and wide-open roads, and Urmson suggested the cars might be sold for those markets first."

But the problem is actually much worse than either Google or Urmson say. Suppose, for the sake of argument, that self-driving cars three times as good as Waymo's are in wide use by normal people. A normal person would encounter a hand-off once in 15,000 miles of driving, or less than once a year. Driving would be something they'd be asked to do maybe 50 times in their life. Even if, when the hand-off happened, the human was not "climbing into the back seat, climbing out of an open car window, and even smooching" and had full "situational awareness", they would be faced with a situation too complex for the car's software. How likely is it that they would have the skills needed to cope, when the last time they did any driving was over a year ago, and on average they've only driven 25 times in their life? Current testing of self-driving cars hands off to drivers with more than a decade of driving experience, well over 100,000 miles of it. It bears no relationship to the hand-off problem with a mass deployment of self-driving technology.

Remember the crash of AF447? the aircraft crashed after temporary inconsistencies between the airspeed measurements – likely due to the aircraft's pitot tubes being obstructed by ice crystals – caused the autopilot to disconnect, after which the crew reacted incorrectly and ultimately caused the aircraft to enter an aerodynamic stall, from which it did not recover. This was a hand-off to a crew that was highly trained, but had never before encountered a hand-off during cruise. What this means is that unrestricted mass deployment of self-driving cars requires Level 5 autonomy:

Level 5 - Full Automation
- System capability: The driverless car can operate on any road and in any conditions a human driver could negotiate.
- Driver involvement: Entering a destination.

Note that Waymo is just starting to work with Level 4 cars (the link is to a fascinating piece by Alexis C. Madrigal on Waymo's simulation and testing program). There are many other difficulties on the way to mass deployment, outlined by Timothy B. Lee at Ars Technica.
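The arithmetic behind that argument is simple to check. A minimal sketch: the 5,000 miles between interventions and the "three times as good" multiplier come from the post, while the annual mileage and driving lifetime are my assumed round numbers.

    # Rough check of the hand-off arithmetic in the post.
    miles_between_interventions = 5_000 * 3   # hypothetical system 3x better than Waymo's figure
    annual_miles = 12_000                     # assumed average miles driven per year
    driving_years = 60                        # assumed driving lifetime

    handoffs_per_year = annual_miles / miles_between_interventions
    lifetime_handoffs = handoffs_per_year * driving_years

    print(f"hand-offs per year: {handoffs_per_year:.1f}")      # ~0.8, i.e. less than once a year
    print(f"hand-offs per lifetime: {lifetime_handoffs:.0f}")  # ~50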
Although Waymo is actually testing Level 4 cars in the benign environment of Phoenix, AZ: Waymo, the autonomous car company from Google’s parent company Alphabet, has started testing a fleet of self-driving vehicles without any backup drivers on public roads, its chief executive officer said Tuesday. The tests, which will include passengers within the next few months, mark an important milestone that brings autonomous vehicle technology closer to operating without any human intervention. But the real difficulty is this. The closer the technology gets to Level 5, the worse the hand-off problem gets, because the human has less experience. Incremental progress in deployments doesn't make this problem go away. Self-driving taxis in restricted urban areas maybe in the next five years; a replacement for the family car, don't hold your breath. My grand-children will still need to learn to drive. Posted by David. at 8:00 AM Labels: techno-hype 31 comments: David. said... Cecilia Kang's Where Self-Driving Cars Go To Learn looks at the free-for-all testing environment in Arizona: "Over the past two years, Arizona deliberately cultivated a rules-free environment for driverless cars, unlike dozens of other states that have enacted autonomous vehicle regulations over safety, taxes and insurance. Arizona took its anything-goes approach while federal regulators delayed formulating an overarching set of self-driving car standards, leaving a gap for states. The federal government is only now poised to create its first law for autonomous vehicles; the law, which echoes Arizona’s stance, would let hundreds of thousands of them be deployed within a few years and would restrict states from putting up hurdles for the industry." What could possibly go wrong? November 16, 2017 at 9:30 PM Mike K said... It seems to me that that there's a "good enough" solution for mass deployment before Stage 5 is in production, provided that the "pull over button" works and that in all situations where you invoke concern about a human-driver-takeover, the AI can reliably default to avoiding hitting anything while it decelerates. That is, if the AI realizes it doesn't know how to handle the situation normally, it accepts defeat and comes to stop. (That seems to be the norm during current testing, based on my read of Madrigal's Waymo article.) If that's the case, humans don't suddenly have to take over a moving vehicle that's already in a boundary situation. Instead, having stopped, the AI can then reassess (if the confounding factors have changed) or the human can slowly drive out of proximity. Or perhaps such situations become akin to a flat tire is now--some people are capable of recovering on their own, others wait for roadside assistance. Coming to a stop on, or even alongside, a highway is far from ideal, I concede, and will lead to more rear-enders as long as humans still drive some percentage of vehicles. But rear end accidents are far less likely to cause fatalities than other types (citation needed,) so that seems like an acceptable trade-off during a transitional period. All that said, I'm cautiously pessimistic about self-driving cars in our lifetimes. I'm more worried about bugs, outages, and hacking preventing widespread implementation. November 20, 2017 at 12:33 PM David. said... "how much preparation have federal transportation authorities carried out to meet the challenge of the advent of self-driving cars and trucks? 
Not nearly enough, according to a new 44-page report by the Government Accountability Office, a Congressional watchdog agency." reports Paul Feldman. And: "the U.S. House of Representatives has approved a bill allowing self-driving vehicles to operate on public roadways with minimal government supervision. Similar legislation has been OK’d by a Senate committee, but is currently stalled by a handful of senators concerned about safety provisions." December 11, 2017 at 7:13 AM David. said... In increasing order of skepticism, we have first A Decade after DARPA: Our View on the State of the Art in Self-Driving Cars by Bryan Salesky, CEO, Argo AI (Ford's self-driving effort): "Those who think fully self-driving vehicles will be ubiquitous on city streets months from now or even in a few years are not well connected to the state of the art or committed to the safe deployment of the technology." Second, After Peak Hype, Self-Driving Cars Enter the Trough of Disillusionment by Aarian Marshall at Wired using Gartner’s “hype cycle” methodology: "Volvo’s retreat is just the latest example of a company cooling on optimistic self-driving car predictions. In 2012, Google CEO Sergey Brin said even normies would have access to autonomous vehicles in fewer than five years—nope. Those who shelled out an extra $3,000 for Tesla’s Enhanced Autopilot are no doubt disappointed by its non-appearance, nearly six months after its due date. New Ford CEO Jim Hackett recently moderated expectations for the automaker’s self-driving service, which his predecessor said in 2016 would be deployed at scale by 2021. “We are going to be in the market with products in that time frame,” he told the San Francisco Chronicle. “But the nature of the romanticism by everybody in the media about how this robot works is overextended right now.”" And third Wired: Self Driving Car Hype Crashes Into Harsh Realities by Yves Smith at naked capitalism, which is the only piece to bring up the hand-off problem: "The fudge is to have a human at ready to take over the car in case it asks for help. First, as one might infer, the human who is suddenly asked to intervene is going to have to quickly asses the situation. The handoff delay means a slower response than if a human had been driving the entire time. Second, and even worse, the human suddenly asked to take control might not even see what the emergency need is. Third, the car itself might not recognize that it is about to get into trouble." All three pieces are worth reading. December 30, 2017 at 7:09 AM David. said... More skepticism from Christian Wolmar: “This is a fantasy that has not been thought through, and is being promoted by technology and auto manufacturers because tech companies have vast amounts of footloose capital they don’t know what to do with, and auto manufacturers are terrified they’re not on board with the new big thing,” he said. “So billions are being spent developing technology that nobody has asked for, that will not be practical, and that will have many damaging effects.” He has an entire book on the topic. January 11, 2018 at 8:21 AM David. said... Tim Bradshaw reports: "Autonomous vehicles are in danger of being turned into “weapons”, leading governments around the world to block cars operated by foreign companies, the head of Baidu’s self-driving car programme has warned. Qi Lu, chief operating officer at the Chinese internet group, said security concerns could become a problem for global carmakers and technology companies, including the US and China. 
“It has nothing to do with any particular government — it has to do with the very nature of autonomy,” he said on the sidelines of the Consumer Electronics Show last week. “You have an object that is capable of moving by itself. By definition, it is a weapon.” Charlie Stross figured this out ten years ago. January 15, 2018 at 8:40 AM David. said... “We will have autonomous cars on the road, I believe within the next 18 months,” [Uber CEO Khosrowshahi} said. ... for example, Phoenix, there will be 95% of cases where the company may not have everything mapped perfectly, or the weather might not be perfect, or there could be other factors that will mean Uber will opt to send a driver. “But in 5 percent of cases, we’ll send an autonomous car,” Khosrowshahi said, when everything’s just right, and still the user will be able to choose whether they get an AV or a regular car." reports Darrell Etherington at TechCrunch. Given that Uber loses $5B/yr and Khosrowshahi has 25 months to IPO it, you should treat everything he says as pre-IPO hype. January 23, 2018 at 2:01 PM David. said... Uber and Lyft want you banned from using your own self-driving car in urban areas is the title of a piece by Ethan Baron at siliconbeat. The geometric impossibility of replacing mass transit with fleets of autonomous cars is starting to sink in. February 4, 2018 at 5:06 PM David. said... Ross Marchand at Real Clear Policy looks into Waymo's reported numbers: "The company’s headline figures since 2015 are certainly encouraging, with “all reported disengagements” dropping from .80 per thousand miles (PTM) driven to .18 PTM. Broken down by category, however, this four-fold decrease in disengagements appears very uneven. While the rate of technology failures has fallen by more than 90 percent (from .64 to .06), unsafe driving rates decreased only by 25 percent (from .16 to .12). ... But the ability of cars to analyze situations on the road and respond has barely shown improvement since the beginning of 2016. In key categories, like “incorrect behavior prediction” and “unwanted maneuver of the vehicle,” Waymo vehicles actually did worse in 2017 than in 2016." February 19, 2018 at 11:30 AM David. said... And also The most cutting-edge cars on the planet require an old-fashioned handwashing: "For example, soap residue or water spots could effectively "blind" an autonomous car. A traditional car wash's heavy brushes could jar the vehicle's sensors, disrupting their calibration and accuracy. Even worse, sensors, which can cost over $100,000, could be broken. A self-driving vehicle's exterior needs to be cleaned even more frequently than a typical car because the sensors must remain free of obstructions. Dirt, dead bugs, bird droppings or water spots can impact the vehicle's ability to drive safely." February 23, 2018 at 7:24 AM David. said... "[California]’s Department of Motor Vehicles said Monday that it was eliminating a requirement for autonomous vehicles to have a person in the driver’s seat to take over in the event of an emergency. ... The new rules also require companies to be able to operate the vehicle remotely ... and communicate with law enforcement and other drivers when something goes wrong." reports Daisuke Wakabayashi at the NYT. Note that these are not level 5 autonomous cars, they are remote-controlled. February 26, 2018 at 8:37 PM David. said... "Cruise vehicles "can't easily handle two-way residential streets that only have room for one car to pass at a time. 
That's because Cruise cars treat the street as one lane and always prefer to be in the center of a lane, and oncoming traffic causes the cars to stop." Other situations that give Cruise vehicles trouble: - Distinguishing between motorcycles and bicycles - Entering tunnels, which can interfere with the cars' GPS sensors - U-turns - Construction zones" From Timothy B. Lee's New report highlights limitations of Cruise self-driving cars. It is true that GM's Cruise is trying to self-drive in San Francisco, which isn't an easy place for humans. But they are clearly a long way from Waymo's level, even allowing for the easier driving in Silicon Valley and Phoenix. March 14, 2018 at 5:29 PM David. said... "While major technology and car companies are teaching cars to drive themselves, Phantom Auto is working on remote control systems, often referred to as teleoperation, that many see as a necessary safety feature for the autonomous cars of the future. And that future is closer than you might think: California will allow companies to test autonomous vehicles without a safety driver — as long as the car can be operated remotely — starting next month." from John R. Quain's When Self-Driving Cars Can’t Help Themselves, Who Takes the Wheel?. So the car is going to call Tech Support and be told "All our operators are busy driving other cars. You call is important to us, please don't hang up." March 15, 2018 at 10:59 AM David. said... "Police in Tempe, Arizona, have released dash cam footage showing the final seconds before an Uber self-driving vehicle crashed into 49-year-old pedestrian Elaine Herzberg. She died at the hospital shortly afterward. ... Tempe police also released internal dash cam footage showing the car's driver, Rafaela Vasquez, in the seconds before the crash. Vasquez can be seen looking down toward her lap for almost five seconds before glancing up again. Almost immediately after looking up, she gets a look of horror on her face as she realizes the car is about to hit Herzberg." writes Timothy B. Lee at Ars Technica. In this case the car didn't hand off to the human, but even if it had the result would likely have been the same. March 22, 2018 at 6:17 AM David. said... Timothy B. Lee at Ars Technica has analyzed the video and writes Video suggests huge problems with Uber’s driverless car program: "The video shows that Herzberg crossed several lanes of traffic before reaching the lane where the Uber car was driving. You can debate whether a human driver should have been able to stop in time. But what's clear is that the vehicle's lidar and radar sensors—which don't depend on ambient light and had an unobstructed view—should have spotted her in time to stop. On top of that, the video shows that Uber's "safety driver" was looking down at her lap for nearly five seconds just before the crash. This suggests that Uber was not doing a good job of supervising its safety drivers to make sure they actually do their jobs." March 22, 2018 at 5:03 PM David. said... "In a blogpost, Tesla said the driver of the sport-utility Model X that crashed in Mountain View, 38-year-old Apple software engineer Wei Huang, “had received several visual and one audible hands-on warning earlier in the drive and the driver’s hands were not detected on the wheel for six seconds prior to the collision." reports The Guardian. The car tried to hand off to the driver but he didn't respond. March 31, 2018 at 8:43 PM David. said... 
“Technology does not eliminate error, but it changes the nature of errors that are made, and it introduces new kinds of errors,” said Chesley Sullenberger, the former US Airways pilot who landed a plane in the Hudson River in 2009 after its engines were struck by birds and who now sits on a Department of Transportation advisory committee on automation. “We have to realize that it’s not a panacea.” from the New York Times editorial The Bright, Shiny Distraction of Self-Driving Cars. April 1, 2018 at 8:29 PM David. said... In The way we regulate self-driving cars is broken—here’s how to fix it Timothy B. Lee sets out a very pragmatic approach to regulation of self-driving cars. Contrast this with the current rush to exempt them from regulations! For example: "Anyone can buy a conventional car and perform safety tests on it. Academic researchers, government regulators, and other independent experts can take a car apart, measure its emissions, probe it for computer security flaws, and subject it to crash tests. This means that if a car has problems that aren't caught (or are even covered up) by the manufacturer, they're likely to be exposed by someone else. But this kind of independent analysis won't be an option when Waymo introduces its driverless car service later this year. Waymo's cars won't be for sale at any price, and the company likely won't let customers so much as open the hood. This means that the public will be mostly dependent on Waymo itself to provide information about how its cars work." April 10, 2018 at 12:11 PM David. said... In People must retain control of autonomous vehicles Ashley Nunes, Bryan Reimer and Joseph F. Coughlin sound a warning against Level 5 self-driving vehicles and many strong cautions against rushed deployment of lower levels in two areas: Liability: "Like other producers, developers of autonomous vehicles are legally liable for damages that stem from the defective design, manufacture and marketing of their products. The potential liability risk is great for driverless cars because complex systems interact in ways that are unexpected." Safety: "Driverless cars should be treated much like aircraft, in which the involvement of people is required despite such systems being highly automated. Current testing of autonomous vehicles abides by this principle. Safety drivers are present, even though developers and regulators talk of full automation." April 11, 2018 at 2:06 PM David. said... Alex Roy's The Half-Life Of Danger: The Truth Behind The Tesla Model X Crash is a must-read deep dive into the details of the argument in this post, with specifics about Tesla's "Autopilot" and Cadillac's "SuperCruise": "As I stated a year ago, the more such systems substitute for human input, the more human skills erode, and the more frequently a 'failure' and/or crash is attributed to the technology rather than human ignorance of it. Combine the toxic marriage of human ignorance and skill degradation with an increasing number of such systems on the road, and the number of crashes caused by this interplay is likely to remain constant—or even rise—even if their crash rate declines." April 18, 2018 at 9:16 AM David. said... A collection of posts about Stanford's autonomous car research is here. See, in particular, Holly Russell's research on the hand-off problem. April 26, 2018 at 9:07 AM David. said... 
"All companies testing autonomous vehicles on [California]’s public roads must provide annual reports to the DMV about “disengagements” that occur when a human backup driver has to take over from the robotic system. The DMV told eight companies with testing permits to provide clarification about their reports." from Ethan Barron's Self-driving cars’ shortcomings revealed in DMV reports. The clarifications are interesting, including such things as: "delayed perception of a pedestrian walking into the street" "failed to give way to another vehicle trying to enter a lane" "trouble when other drivers behaved badly. Other drivers had failed to yield, run stop signs, drifted out of their own lane and cut in front aggressively" May 3, 2018 at 4:03 PM David. said... Angie Schmidt's How Uber’s Self-Driving System Failed to Brake and Avoid Killing Elaine Herzberg reports on the devastating NTSB report: "The report doesn’t assign culpability for the crash but it points to deficiencies in Uber’s self-driving car tests. Uber’s vehicle used Volvo software to detect external objects. Six seconds before striking Herzberg, the system detected her but didn’t identify her as a person. The car was traveling at 43 mph. The system determined 1.3 seconds before the crash that emergency braking would be needed to avert a collision. But the vehicle did not respond, striking Herzberg at 39 mph. NTSB writes: According to Uber, emergency braking maneuvers are not enabled while the vehicle is under computer control, to reduce the potential for erratic vehicle behavior. The vehicle operator is relied on to intervene and take action. The system is not designed to alert the operator. Amir Efrati at The Information cites two anonymous sources at Uber who say the company “tuned” its emergency brake system to be less sensitive to unidentified objects." People need to be jailed for this kind of irresponsibility. May 24, 2018 at 3:40 PM David. said... Timothy B. Lee's As Uber and Tesla struggle with driverless cars, Waymo moves forward stresses how far ahead Waymo is in (mostly) self-driving cars: "So Waymo's recently announced car deals—20,000 cars from Jaguar Land Rover, another 62,000 from Fiat Chrysler—are just the latest sign that Waymo is assembling all the pieces it will need for a full-scale commercial taxi service in the Phoenix area and likely other places not long after that. It would be foolish for Waymo to invest so heavily in all this infrastructure if its technology were still years away from being ready for commercial deployment. Those 23 rider support workers need customers to talk to. And, of course, Waymo needs to get those 82,000 Jaguar and Chrysler vehicles on the road to avoid losing millions of dollars on the investment. Throughout all this, Waymo has been testing its vehicles at a faster and faster pace. It took Waymo six months to go from 3 million testing miles in May 2017 to 4 million miles in November. Then it took around three months to reach 5 million miles in February, and less than three months to reach 6 million in early May." June 1, 2018 at 8:36 PM David. said... Timothy B. Lee's Why emergency braking systems sometimes hit parked cars and lane dividers makes the same point as my post, this time about "driver assistance" systems: "The fundamental issue here is that tendency to treat lane-keeping, adaptive cruise control, and emergency braking as independent systems. 
As we've seen, today's driver assistance systems have been created in a piecemeal fashion, with each system following a do-no-harm philosophy. They only intervene if they're confident they can prevent an accident—or at least avoid causing one. If they're not sure, they do nothing and let the driver make the decision. The deadly Tesla crash in Mountain View illustrates how dangerous this kind of system can be." Thus: "Once a driver-assistance system reaches a certain level of complexity, the assumption that it's safest for the system to do nothing no longer makes sense. Complex driver assistance systems can behave in ways that surprise and confuse drivers, leading to deadly accidents if the driver's attention wavers for just a few seconds. At the same time, by handling most situations competently, these systems can lull drivers into a false sense of security and cause them to pay less careful attention to the road." June 8, 2018 at 9:22 AM David. said... "[Drive.AI board member Andrew Ng] seems to be saying that he is giving up on the promise of self-driving cars seamlessly slotting into the existing infrastructure. Now he is saying that every person, every “bystander”, is going to be responsible for changing their behavior to accommodate imperfect self-driving systems. And they are all going to have to be trained! I guess that means all of us. Whoa!!!! The great promise of self-driving cars has been that they will eliminate traffic deaths. Now [Ng] is saying that they will eliminate traffic deaths as long as all humans are trained to change their behavior? What just happened? If changing everyone’s behavior is on the table then let’s change everyone’s behavior today, right now, and eliminate the annual 35,000 fatalities on US roads, and the 1 million annual fatalities world-wide. Let’s do it today, and save all those lives." From Bothersome Bystanders and Self Driving Cars, Rodney Brooks' awesome takedown of Andrew Ng's truly stupid comments reported in Russell Brandom's Self-driving cars are headed toward an AI roadblock: "There’s growing concern among AI experts that it may be years, if not decades, before self-driving systems can reliably avoid accidents. As self-trained systems grapple with the chaos of the real world, experts like NYU’s Gary Marcus are bracing for a painful recalibration in expectations, a correction sometimes called “AI winter.” That delay could have disastrous consequences for companies banking on self-driving technology, putting full autonomy out of reach for an entire generation." July 5, 2018 at 7:56 PM David. said... "Drive.ai plans to license its technology to others, and has struck a deal with Lyft, a ride-hailing firm, to operate vehicles in and around San Francisco. “I think the autonomous-vehicle industry should be upfront about recognising the limitations of today’s technology,” says Mr Ng. It is surely better to find pragmatic ways to work around those limitations than pretend they do not exist or promise that solving them will be easy." reports The Economist. They describe drive.ai's extremely constrained trial service: "Drive.ai, a startup, has deployed seven minivans to transport people within a limited area of the city that includes an office park and a retail area. ... All pick-ups and drop-offs happen at designated stops, to minimise disruption as passengers get on and off. ... The vans are painted a garish orange and clearly labelled as self-driving vehicles. ... 
Screens mounted on the vans’ exteriors let them communicate with pedestrians and other road users, ... Similarly, rather than trying to build a vehicle that can navigate roadworks (a notoriously difficult problem, given inconsistent signage), Drive.ai has arranged for the city authorities to tell it where any roadworks are each day, so that its vehicles can avoid them. ... Drive.ai will limit the service to daylight hours, which makes things simpler and safer. Each vehicle will initially have a safety driver, ... If a van gets confused it can stop and call for help: a remote supervisor then advises it how to proceed (rather than driving the vehicle remotely, which would not be safe, says Mr Ng).: It seems that Mr. Ng has learned from the response to his comments that it isn't our responsibility to avoid running into his cars. August 4, 2018 at 12:38 PM David. said... In Even self-driving leader Waymo is struggling to reach full autonomy Timothy B. Lee reports on the "launch" of Waymo's "public" "autonomous" taxi service: "In late September, a Waymo spokeswoman told Ars by email that the Phoenix service would be fully driverless and open to members of the public—claims I reported in this article. We now know that Waymo One won't be fully driverless; there will be a driver in the driver's seat. And Waymo One is open to the public in only the narrowest, most technical sense: initially it will only be available to early riders—the same people who have been participating in Waymo's test program for months." Even in the benign environment of Phoenix, trained self-driving car drivers are still needed: "Over the course of October and November, Randazzo spent three days observing Waymo's cars in action—either by following them on the roads or staking out the company's depot in Chandler. He posted his findings in a YouTube video. The findings suggest that Waymo's vehicles aren't yet ready for fully autonomous operation." December 7, 2018 at 11:03 AM David. said... Paris Marx writes in Self-Driving Cars Will Always Be Limited. Even the Industry Leader Admits it: "even Waymo’s CEO, John Krafcik, now admits that the self-driving car that can drive in any condition, on any road, without ever needing a human to take control — what’s usually called a “level 5” autonomous vehicle — will never exist. At the Wall Street Journal’s D.Live conference on November 13, Krafcik said that “autonomy will always have constraints.” It will take decades for self-driving cars to become common on roads, and even then they will not be able to drive in certain conditions, at certain times of the year, or in any weather. In short, sensors on autonomous vehicles don’t work well in snow or rain — and that may never change." January 8, 2019 at 6:12 AM David. said... Christian Wolmar's My speech on driverless cars at the Transportation Research Board, Washington DC, 15/1/19 is a must-read debunking of the autonomous car hype by a respected British transport journalist. Among his many points: "Michael DeKort, an aerospace engineer turned whistleblower wrote recently: ‘Handover cannot be made safe no matter what monitoring and notification system is used. That is because enough time cannot be provided to regain proper situational awareness in critical scenarios.’" No-one could have predicted ... January 20, 2019 at 6:06 AM David. said... 
Ashley Nunes' The Cost of Self-Driving Cars Will Be the Biggest Barrier to Their Adoption tackles the important question of whether, even if they can be made safe, self-driving cars can be affordable: "However, the systems underlying HAVs, namely sensors, radar, and communication devices, are costly compared to older (less safe) vehicles. This raises questions about the affordability of life-saving technology for those who need it most. While all segments of society are affected by road crashes, the risks are greatest for the poor. These individuals are more likely to die on the road partly because they own older vehicles that lack advanced safety features and have lower crash-test ratings. Some people have suggested that the inability to purchase HAVs outright may be circumvented by offering these vehicles for-hire. This setup, analogous to modern day taxis, distributes operating costs over a large number of consumers making mobility services more affordable. Self-driving technology advocates suggest that so-called robotaxis, operated by for-profit businesses, could produce considerable savings for consumers." Nunes computes that, even assuming the capital cost of a robotaxi is a mere $15K, the answer is public subsidy: "consumer subsidies will be crucial to realizing the life-saving benefits of this technology. Although politically challenging, public revenues already pay for a portion of road crash-related expenditures. In the United States alone, this amounts to $18 billion, the equivalent of over $156 in added taxes for every household." But to justify the subsidy, they have to be safe. Which brings us back to the hand-off problem. March 14, 2019 at 7:05 AM
blog-dshr-org-9208 ---- DSHR's Blog: NFTs and Web Archiving Thursday, April 15, 2021 NFTs and Web Archiving One of the earliest observations of the behavior of the Web at scale was "link rot". There were a lot of 404s, broken links. Research showed that the half-life of Web pages was alarmingly short. Even in 1996 this problem was obvious enough for Brewster Kahle to found the Internet Archive to address it. From the Wikipedia entry for Link Rot: A 2003 study found that on the Web, about one link out of every 200 broke each week,[1] suggesting a half-life of 138 weeks. This rate was largely confirmed by a 2016–2017 study of links in Yahoo! Directory (which had stopped updating in 2014 after 21 years of development) that found the half-life of the directory's links to be two years.[2] One might have thought that academic journals were a relatively stable part of the Web, but research showed that their references decayed too, just somewhat less rapidly. A 2013 study found a half-life of 9.3 years. See my 2015 post The Evanescent Web. I expect you have noticed the latest outbreak of blockchain-enabled insanity, Non-Fungible Tokens (NFTs). Someone "paying $69M for a JPEG" or $560K for a New York Times column attracted a lot of attention. Follow me below the fold for the connection between NFTs, "link rot" and Web archiving.
Kahle's idea for addressing "link rot", which became the Wayback Machine, was to make a copy of the content at some URL, say: http://www.example.com/page.html keep the copy for posterity, and re-publish it at a URL like: https://web.archive.org/web/19960615083712/http://www.example.com/page.html What is the difference between the two URLs? The original is controlled by Example.Com, Inc.; they can change or delete it on a whim. The copy is controlled by the Internet Archive, whose mission is to preserve it unchanged "for ever". The original is subject to "link rot", the second is, one hopes, not subject to "link rot". The Wayback Machine's URLs have three components: https://web.archive.org/web/ locates the archival copy at the Internet Archive. 19960615083712 indicates that the copy was made on 15th June, 1996 at 8:37:12. http://www.example.com/page.html is the URL from which the copy was made. The fact that the archival copy is at a different URL from the original causes a set of problems that have bedevilled Web archiving. One is that, if the original goes away, all the links that pointed to it break, even though there may be an archival copy to which they could point to fulfill the intent of the link creator. Another is that, if the content at the original URL changes, the link will continue to resolve but the content it returns may no longer reflect the intent of the link creator, although there may be an archival copy that does. Even in the early days of the Web it was evident that Web pages changed and vanished at an alarming rate. The point is that the meaning of a generic Web URL is "whatever content, or lack of content, you find at this location". That is why URL stands for Uniform Resource Locator. Note the difference with URI, which stands for Uniform Resource Identifier. Anyone can create a URL or URI linking to whatever content they choose, but doing so provides no rights in or control over the linked-to content. In People's Expensive NFTs Keep Vanishing. This Is Why, Ben Munster reports that: over the past few months, numerous individuals have complained about their NFTs going “missing,” “disappearing,” or becoming otherwise unavailable on social media. This despite the oft-repeated NFT sales pitch: that NFT artworks are logged immutably, and irreversibly, onto the Ethereum blockchain. So NFTs have the same problem that Web pages do. Isn't the blockchain supposed to make things immortal and immutable? Kyle Orland's Ars Technica’s non-fungible guide to NFTs provides an over-simplified explanation: When NFT’s are used to represent digital files (like GIFs or videos), however, those files usually aren’t stored directly “on-chain” in the token itself. Doing so for any decently sized file could get prohibitively expensive, given the cost of replicating those files across every user on the chain. Instead, most NFTs store the actual content as a simple URI string in their metadata, pointing to an Internet address where the digital thing actually resides. NFTs are just links to the content they represent, not the content itself. The Bitcoin blockchain actually does contain some images, such as this ASCII portrait of Len Sassaman and some pornographic images. But the blocks of the Bitcoin blockchain were originally limited to 1MB and are now effectively limited to around 2MB, enough space for small image files. What’s the Maximum Ethereum Block Size? explains: Instead of a fixed limit, Ethereum block size is bound by how many units of gas can be spent per block.
This limit is known as the block gas limit ... At the time of writing this, miners are currently accepting blocks with an average block gas limit of around 10,000,000 gas. Currently, the average Ethereum block size is anywhere between 20 to 30 kb in size. That's a little out-of-date. Currently the block gas limit is around 12.5M gas per block and the average block is about 45KB. Nowhere near enough space for a $69M JPEG. The NFT for an artwork can only be a link. Most NFTs are ERC-721 tokens, providing the optional Metadata extension: /// @title ERC-721 Non-Fungible Token Standard, optional metadata extension /// @dev See https://eips.ethereum.org/EIPS/eip-721 /// Note: the ERC-165 identifier for this interface is 0x5b5e139f. interface ERC721Metadata /* is ERC721 */ { /// @notice A descriptive name for a collection of NFTs in this contract function name() external view returns (string _name); /// @notice An abbreviated name for NFTs in this contract function symbol() external view returns (string _symbol); /// @notice A distinct Uniform Resource Identifier (URI) for a given asset. /// @dev Throws if `_tokenId` is not a valid NFT. URIs are defined in RFC /// 3986. The URI may point to a JSON file that conforms to the "ERC721 /// Metadata JSON Schema". function tokenURI(uint256 _tokenId) external view returns (string); } The Metadata JSON Schema specifies an object with three string properties: name: "Identifies the asset to which this NFT represents" description: "Describes the asset to which this NFT represents" image: "A URI pointing to a resource with mime type image/* representing the asset to which this NFT represents. Consider making any images at a width between 320 and 1080 pixels and aspect ratio between 1.91:1 and 4:5 inclusive." Note that the JSON metadata is not in the Ethereum blockchain, it is only pointed to by the token on the chain. If the art-work is the "image", it is two links away from the blockchain. So, given the evanescent nature of Web links, the standard provides no guarantee that the metadata exists, or is unchanged from when the token was created. Even if it is, the standard provides no guarantee that the art-work exists or is unchanged from when the token is created. Caveat emptor — Absent unspecified actions, the purchaser of an NFT is buying a supposedly immutable, non-fungible object that points to a URI pointing to another URI. In practice both are typically URLs. The token provides no assurance that either of these links resolves to content, or that the content they resolve to at any later time is what the purchaser believed at the time of purchase. There is no guarantee that the creator of the NFT had any copyright in, or other rights to, the content to which either of the links resolves at any particular time. There are thus two issues to be resolved about the content of each of the NFT's links: Does it exist? I.e. does it resolve to any content? Is it valid? I.e. is the content to which it resolves unchanged from the time of purchase? These are the same questions posed by the Holy Grail of Web archiving, persistent URLs. Assuming existence for now, how can validity be assured? There have been a number of systems that address this problem by switching from naming files by their location, as URLs do, to naming files by their content by using the hash of the content as its name. 
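As a concrete illustration of naming by content rather than by location, here is a minimal sketch in Python; it is an illustration only, since real systems such as BitTorrent and IPFS use their own hash formats (info-hashes, multihash-based CIDs) rather than bare SHA-256 hex digests:

import hashlib

def content_address(data: bytes) -> str:
    # The name is derived from the bytes themselves, not from where they are stored.
    return hashlib.sha256(data).hexdigest()

def verify(name: str, data: bytes) -> bool:
    # A copy fetched from anywhere can be checked against its name.
    return content_address(data) == name

artwork = b"...image bytes..."           # stand-in for the actual file
name = content_address(artwork)
assert verify(name, artwork)             # an intact copy matches its name
assert not verify(name, artwork + b"!")  # any change to the content changes its name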
The idea was the basis for Bram Cohen's highly successful BitTorrent — it doesn't matter where the data comes from provided its integrity is assured because the hash in the name matches the hash of the content. The content-addressable file system most used for NFTs is the Interplanetary File System (IPFS). From its Wikipedia page: As opposed to a centrally located server, IPFS is built around a decentralized system[5] of user-operators who hold a portion of the overall data, creating a resilient system of file storage and sharing. Any user in the network can serve a file by its content address, and other peers in the network can find and request that content from any node who has it using a distributed hash table (DHT). In contrast to BitTorrent, IPFS aims to create a single global network. This means that if Alice and Bob publish a block of data with the same hash, the peers downloading the content from Alice will exchange data with the ones downloading it from Bob.[6] IPFS aims to replace protocols used for static webpage delivery by using gateways which are accessible with HTTP.[7] Users may choose not to install an IPFS client on their device and instead use a public gateway. If the purchaser gets both the NFT's metadata and the content to which it refers via IPFS URIs, they can be assured that the data is valid. What do these IPFS URIs look like? The (excellent) IPFS documentation explains: https://ipfs.io/ipfs/ # e.g https://ipfs.io/ipfs/Qme7ss3ARVgxv6rXqVPiikMJ8u2NLgmgszg13pYrDKEoiu Browsers that support IPFS can redirect these requests to your local IPFS node, while those that don't can fetch the resource from the ipfs.io gateway. You can swap out ipfs.io for your own http-to-ipfs gateway, but you are then obliged to keep that gateway running forever. If your gateway goes down, users with IPFS aware tools will still be able to fetch the content from the IPFS network as long as any node still hosts it, but for those without, the link will be broken. Don't do that. Note the assumption here that the ipfs.io gateway will be running forever. Note also that only some browsers are capable of accessing IPFS content without using a gateway. Thus the ipfs.io gateway is a single point of failure, although the failure is not complete. In practice NFTs using IPFS URIs are dependent upon the continued existence of Protocol Labs, the organization behind IPFS. The ipfs.io URIs in the NFT metadata are actually URLs; they don't point to IPFS, but to a Web server that accesses IPFS. Pointing to the NFT's metadata and content using IPFS URIs assures their validity but does it assure their existence? The IPFS documentation's section Persistence, permanence, and pinning explains: Nodes on the IPFS network can automatically cache resources they download, and keep those resources available for other nodes. This system depends on nodes being willing and able to cache and share resources with the network. Storage is finite, so nodes need to clear out some of their previously cached resources to make room for new resources. This process is called garbage collection. To ensure that data persists on IPFS, and is not deleted during garbage collection, data can be pinned to one or more IPFS nodes. Pinning gives you control over disk space and data retention. As such, you should use that control to pin any content you wish to keep on IPFS indefinitely. To assure the existence of the NFT's metadata and content they must both be not just written to IPFS but also pinned to at least one IPFS node. 
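For example, here is a minimal sketch of adding an NFT's metadata file to one's own node and pinning it, assuming a local IPFS (Kubo) daemon exposing its RPC API on the default port 5001; "metadata.json" is a hypothetical file name:

import requests

API = "http://127.0.0.1:5001/api/v0"  # assumed: default RPC endpoint of a local IPFS (Kubo) daemon

def add_and_pin(path: str) -> str:
    # Add the file to the local node; the response includes its content identifier (CID).
    with open(path, "rb") as f:
        resp = requests.post(f"{API}/add", files={"file": f})
    resp.raise_for_status()
    cid = resp.json()["Hash"]
    # Pin the CID so the node's garbage collection will not discard it.
    requests.post(f"{API}/pin/add", params={"arg": cid}).raise_for_status()
    return cid

print("ipfs://" + add_and_pin("metadata.json"))  # hypothetical metadata file

Pinning to a single node you run yourself only helps for as long as that node stays up and keeps the pin, which is why the IPFS documentation goes on to recommend pinning services: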
To ensure that your important data is retained, you may want to use a pinning service. These services run lots of IPFS nodes and allow users to pin data on those nodes for a fee. Some services offer free storage-allowance for new users. Pinning services are handy when: You don't have a lot of disk space, but you want to ensure your data sticks around. Your computer is a laptop, phone, or tablet that will have intermittent connectivity to the network. Still, you want to be able to access your data on IPFS from anywhere at any time, even when the device you added it from is offline. You want a backup that ensures your data is always available from another computer on the network if you accidentally delete or garbage-collect your data on your own computer. Thus to assure the existence of the NFT's metadata and content pinning must be rented from a pinning service, another single point of failure. In summary, it is possible to take enough precautions and pay enough ongoing fees to be reasonably assured that your $69M NFT and its metadata and the JPEG it refers to will remain accessible. Whether in practice these precautions are taken is definitely not always the case. David Gerard reports: But functionally, IPFS works the same way as BitTorrent with magnet links — if nobody bothers seeding your file, there’s no file there. Nifty Gateway turn out not to bother to seed literally the files they sold, a few weeks later. [Twitter; Twitter] Anil Dash claims to have invented, with Kevin McCoy, the concept of NFTs referencing Web URLs in 2014. He writes in his must-read NFTs Weren’t Supposed to End Like This: Seven years later, all of today’s popular NFT platforms still use the same shortcut. This means that when someone buys an NFT, they’re not buying the actual digital artwork; they’re buying a link to it. And worse, they’re buying a link that, in many cases, lives on the website of a new start-up that’s likely to fail within a few years. Decades from now, how will anyone verify whether the linked artwork is the original? All common NFT platforms today share some of these weaknesses. They still depend on one company staying in business to verify your art. They still depend on the old-fashioned pre-blockchain internet, where an artwork would suddenly vanish if someone forgot to renew a domain name. “Right now NFTs are built on an absolute house of cards constructed by the people selling them,” the software engineer Jonty Wareing recently wrote on Twitter. My only disagreement with Dash is that, as someone who worked on archiving the "old-fashioned pre-blockchain internet" for two decades, I don't believe that there is a new-fangled post-blockchain Internet that makes the problems go away. And neither does David Gerard: The pictures for NFTs are often stored on the Interplanetary File System, or IPFS. Blockchain promoters talk like IPFS is some sort of bulletproof cloud storage that works by magic and unicorns. Posted by David. at 8:00 AM Labels: bitcoin, distributed web, web archiving 2 comments: David. said... Kal Rustiala & Christopher Jon Sprigman's The One Redeeming Quality of NFTs Might Not Even Exist explains: "once you understand what the NFT is and how it actually works, you can see that it does nothing to permit the buyer, as the New Yorker put it, to own a “digital Beanie Baby” with only one existing copy. In fact, the NFT may make the authenticity question even more difficult to resolve." 
They quote David Hockney agreeing with David Gerard: "On an art podcast, Hockney recently said, “What is it that they’re owning? I don’t really know.” NFTs, Hockney said, are the domain of “international crooks and swindlers.” Hockney may have a point. If you look at them closely, NFTs do almost nothing to guarantee authenticity. In fact, for reasons we’ll explain, NFTs may actually make the problem of authenticity in digital art worse." April 20, 2021 at 7:58 PM David. said... Who could have predicted counterfeit NFTs? Tim Schneider's The Gray Market: How a Brazen Hack of That $69 Million Beeple Revealed the True Vulnerability of the NFT Market (and Other Insights) reports that: "In the opening days of April, an artist operating under the pseudonym Monsieur Personne (“Mr. Nobody”) tried to short-circuit the NFT hype machine by unleashing “sleepminting,” a process that complicates, if not corrodes, one of the value propositions underlying non-fungible tokens. ... Sleepminting enables him to mint NFTs for, and to, the crypto wallets of other artists, then transfer ownership back to himself without their consent or knowing participation. Nevertheless, each of these transactions appears as legitimate on the blockchain record as if the unwitting artist had initiated them on their own, opening up the prospect of sophisticated fraud on a mass scale." And it is arguably legal because NFTs are just a (pair of) links: "Personne told me that, after being “thoroughly consulted and advised by personal lawyers and specialist law firms,” he is confident there are “little to no legal repercussions for sleepminting.” His argument is that ERC721 smart contracts only contain a link pointing to a JSON (Javascript Object Notation) file, which in turn points to a “publicly available and hosted digital asset file”—here, Beeple’s Everydays image." April 23, 2021 at 2:07 PM
blog-dshr-org-9536 ---- DSHR's Blog: The Evanescent Web Tuesday, February 10, 2015 The Evanescent Web Papers drawing attention to the decay of links in academic papers have quite a history; I blogged about three relatively early ones six years ago. Now Martin Klein and a team from the Hiberlink project have taken the genre to a whole new level with a paper in PLoS One entitled Scholarly Context Not Found: One in Five Articles Suffers from Reference Rot. Their dataset is 2-3 orders of magnitude bigger than previous studies, their methods are far more sophisticated, and they study both link rot (links that no longer resolve) and content drift (links that now point to different content). There's a summary on the LSE's blog. Below the fold, some thoughts on the Klein et al paper. As regards link rot, they write: In order to combat link rot, the Digital Object Identifier (DOI) was introduced to persistently identify journal articles.
In addition, the DOI resolver for the URI version of DOIs was introduced to ensure that web links pointing at these articles remain actionable, even when the articles change web location. But even when used correctly, such as http://dx.doi.org/10.1371/journal.pone.0115253, DOIs introduce a single point of failure. This became obvious on January 20th when the doi.org domain name briefly expired. DOI links all over the Web failed, illustrating yet another fragility of the Web. It hasn't been a good time for access to academic journals for other reasons either. Among the publishers unable to deliver content to their customers in the last week or so were Elsevier, Springer, Nature, HighWire Press and Oxford Art Online. I've long been a fan of Herbert van de Sompel's work, especially Memento. He's a co-author on the paper and we have been discussing it. Unusually, we've been disagreeing. We completely agree on the underlying problem of the fragility of academic communication in the Web era as opposed to its robustness in the paper era. Indeed, in the introduction of another (but much less visible) recent paper entitled Towards Robust Hyperlinks for Web-Based Scholarly Communication Herbert and his co-authors echo the comparison between the paper and Web worlds from the very first paper we published on the LOCKSS system a decade and a half ago. Nor am I critical of the research underlying the paper, which is clearly of high quality and which reveals interesting and disturbing properties of Web-based academic communication. All I'm disagreeing with Herbert about is the way the research is presented in the paper. My problem with the presentation is that this paper, which has a far higher profile than other recent publications in this area, and which comes at a time of unexpectedly high visibility for web archiving, seems to me to be excessively optimistic, and to fail to analyze the roots of the problem it is addressing. It thus fails to communicate the scale of the problem. The paper is, for very practical reasons of publication in a peer-reviewed journal, focused on links from academic papers to the web-at-large. But I see it as far too optimistic in its discussion of the likely survival of the papers themselves, and the other papers they link to (see Content Drift below). I also see it as far too optimistic in its discussion of proposals to fix the problem of web-at-large references that it describes (see Dependence on Authors below). All the proposals depend on actions being taken either before or during initial publication by either the author or the publisher. There is evidence in the paper itself (see Getting Links Right below) that neither authors nor publishers can get DOIs right. Attempts to get authors to deposit their papers in institutional repositories notoriously fail. The LOCKSS team has met continual frustration in getting publishers to make small changes to their publishing platforms that would make preservation easier, or in some cases even possible. Viable solutions to the problem cannot depend on humans to act correctly. Neither authors nor publishers have anything to gain from preservation of their work. In addition, the paper fails to even mention the elephant in the room, the fact that both the papers and the web-at-large content are copyright. The archives upon which the proposed web-at-large solutions rest, such as the Internet Archive, are themselves fragile. 
Not just for the normal economic and technical reasons we outlined nearly a decade ago, but because they operate under the DMCA's "safe harbor" provision and thus must take down content upon request from a claimed copyright holder. The archives such as Portico and LOCKSS that preserve the articles themselves operate instead with permission from the publisher, and thus must impose access restrictions. This is the root of the problem. In the paper world in order to monetize their content the copyright owner had to maximize the number of copies of it. In the Web world, in order to monetize their content the copyright owner has to minimize the number of copies. Thus the fundamental economic motivation for Web content militates against its preservation in the ways that Herbert and I would like. None of this is to suggest that developing and deploying partial solutions is a waste of time. It is what I've been doing the last quarter of my life. There cannot be a single comprehensive technical solution. The best we can do is to combine a diversity of partial solutions. But we need to be clear that even if we combine everything anyone has worked on we are still a long way from solving the problem. Now for some details. Content Drift As regards content drift, they write: Content drift is hardly a matter of concern for references to journal articles, because of the inherent fixity that, especially PDF-formated, articles exhibit. Nevertheless, special-purpose solutions for long-term digital archiving of the digital journal literature, such as LOCKSS, CLOCKSS, and Portico, have emerged to ensure that articles and the articles they reference can be revisited even if the portals that host them vanish from the web. More recently, the Keepers Registry has been introduced to keep track of the extent to which the digital journal literature is archived by what memory organizations. These combined efforts ensure that it is possible to revisit the scholarly context that consists of articles referenced by a certain article long after its publication. While I understand their need to limit the scope of their research to web-at-large resources, the last sentence is far too optimistic. First, research using the Keepers Registry and other resources shows that at most 50% of all articles are preserved. So future scholars depending on archives of digital journals will encounter large numbers of broken links. Second, even the 50% of articles that are preserved may not be accessible to a future scholar. CLOCKSS is a dark archive and is not intended to provide access to future scholars unless the content is triggered. Portico is a subscription archive, future scholars' institutions may not have a subscription. LOCKSS provides access only to readers at institutions running a LOCKSS box. These restrictions are a response to the copyright on the content and are not susceptible to technical fixes. Third, the assumption that journal articles exhibit "inherent fixity" is, alas, outdated. Both the HTML and PDF versions of articles from state-of-the-art publishing platforms contain dynamically generated elements, even when they are not entirely generated on-the-fly. The LOCKSS system encounters this on a daily basis. As each LOCKSS box collects content from the publisher independently, each box gets content that differs in unimportant respects. For example, the HTML content is probably personalized ("Welcome Stanford University") and updated ("Links to this article"). 
PDF content is probably watermarked ("Downloaded by 192.168.1.100"). Content elements such as these need to be filtered out of the comparisons between the "same" content at different LOCKSS boxes. One might assume that the words, figures, etc. that form the real content of articles do not drift, but in practice it would be very difficult to validate this assumption. Soft-404 Responses I've written before about the problems caused for archiving by "soft-403 and soft-404" responses by Web servers. These result from Web site designers who believe their only audience is humans, so instead of providing the correct response code when they refuse to supply content, they return a pretty page with a 200 response code indicating valid content. The valid content is a refusal to supply the requested content. Interestingly, PubMed is an example, as I discovered when clicking on the (broken) PubMed link in the paper's reference 58. Klein et al define a live web page thus: On the one hand, the HTTP transaction chain could end successfully with a 2XX-level HTTP response code. In this case we declared the URI to be active on the live web. Their estimate of the proportion of links which are still live is thus likely to be optimistic, as they are likely to have encountered at least soft-404s if not soft-403s. Getting Links Right Even when the dx.doi.org resolver is working, its effectiveness in persisting links depends on its actually being used. Klein et al discover that in many cases it isn't: one would assume that URI references to journal articles can readily be recognized by detecting HTTP URIs that carry a DOI, e.g., http://dx.doi.org/10.1007/s00799-014-0108-0. However, it turns out that references rather frequently have a direct link to an article in a publisher's portal, e.g. http://link.springer.com/article/10.1007%2Fs00799-014-0108-0, instead of the DOI link. The direct link may well survive relocation of the content within the publisher's site. But journals are frequently bought and sold between publishers, causing the link to break. I believe there are two causes for these direct links, publisher's platforms inserting them so as not to risk losing the reader, but more importantly the difficulty for authors to create correct links. Cutting and pasting from the URL bar in their browser necessarily gets the direct link, creating the correct one via dx.doi.org requires the author to know that it should be hand-edited, and to remember to do it. Attempts to ensure linked materials are preserved suffer from a similar problem: The solutions component of Hiberlink also explores how to best reference archived snapshots. The common and obvious approach, followed by Webcitation and Perma.cc, is to replace the original URI of the referenced resource with the URI of the Memento deposited in a web archive. This approach has several drawbacks. First, through removal of the original URI, it becomes impossible to revisit the originally referenced resource, for example, to determine what its content has become some time after referencing. Doing so can be rather relevant, for example, for software or dynamic scientific wiki pages. Second, the original URI is the key used to find Mementos of the resource in all web archives, using both their search interface and the Memento protocol. Removing the original URI is akin to throwing away that key: it makes it impossible to find Mementos in web archives other than the one in which the specific Memento was deposited. 
This means that the success of the approach is fully dependent on the long term existence of that one archive. If it permanently ceases to exist, for example, as a result of legal or financial pressure, or if it becomes temporally inoperative as a result of technical failure, the link to the Memento becomes rotten. Even worse, because the original URI was removed from the equation, it is impossible to use other web archives as a fallback mechanism. As such, in the approach that is currently common, one link rot problem is replaced by another. The paper, and a companion paper, describe Hiberlink's solution, which is to decorate the link to the original resource with an additional link to its archived Memento. Rene Voorburg of the KB has extended this by implementing robustify.js:  robustify.js checks the validity of each link a user clicks. If the linked page is not available, robustify.js will try to redirect the user to an archived version of the requested page. The script implements Herbert Van de Sompel's Memento Robust Links - Link Decoration specification (as part of the Hiberlink project) in how it tries to discover an archived version of the page. As a default, it will use the Memento Time Travel service as a fallback. You can easily implement robustify.js on your web pages in so that it redirects pages to your preferred web archive. Note, however, that soft-403s and soft-404s pose the same problem for robustify.js as they do for all Web archiving technologies. Dependence on Authors Many of the solutions that have been proposed to the problem of reference rot also suffer from dependence on authors: Webcitation was a pioneer in this problem domain when, years ago, it introduced the service that allows authors to archive, on demand, web resources they intend to reference. ... But Webcitation has not been met with great success, possibly the result of a lack of authors' awareness regarding reference rot, possibly because the approach requires an explicit action by authors, likely because of both. Webcitation is not the only one: To a certain extent, portals like FigShare and Zenodo play in this problem domain as they allow authors to upload materials that might otherwise be posted to the web at large. The recent capability offered by these systems that allows creating a snapshot of a GitHub repository, deposit it, and receive a DOI in return, serves as a good example. The main drivers for authors to do so is to contribute to open science and to receive a citable DOI, and, hence potentially credit for the contribution. But the net effect, from the perspective of the reference rot problem domain, is the creation of a snapshot of an otherwise evolving resource. Still, these services target materials created by authors, not, like web archives do, resources on the web irrespective of their authorship. Also, an open question remains to which extent such portals truly fulfill a long term archival function rather than being discovery and access environments. Hiberlink is trying to reduce this dependence: In the solutions thread of Hiberlink, we explore pro-active archiving approaches intended to seamlessly integrate into the life cycle of an article and to require less explicit intervention by authors. One example is an experimental Zotero extension that archives web resources as an author bookmarks them during note taking. 
Another is HiberActive, a service that can be integrated into the workflow of a repository or a manuscript submission system and that issues requests to web archives to archive all web at large resources referenced in submitted articles. But note that these services (and Voorburg's) depend on the author or the publisher installing them. Experience shows that authors are focused on getting their current paper accepted, large publishers are reluctant to implement extensions to their publishing platforms that offer no immediate benefit, and small publishers lack the expertise to do so. Ideally, these services would be back-stopped by a service that scanned recently-published articles for web-at-large links and submitted them for archiving, thus requiring no action by author or publisher. The problem is that doing so requires the service to have access to the content as it is published. The existing journal archiving services, LOCKSS, CLOCKSS and Portico, have such access to about half the published articles, and could in principle be extended to perform this service. In practice doing so would need at least modest funding. The problem isn't as simple as it appears at first glance, even for the articles that are archived. For those that aren't, primarily from less IT-savvy authors and small publishers, the outlook is bleak.

Archiving

Finally, the solutions assume that submitting a URL to an archive is enough to ensure preservation. It isn't. The referenced web site might have a robots.txt policy preventing collection. The site might have crawler traps, exceed the archive's crawl depth, or use Javascript in ways that prevent the archive collecting a usable representation. Or the archive may simply not process the request in time to avoid content drift or link rot.

Acknowledgement

I have to thank Herbert van de Sompel for greatly improving this post through constructive criticism. But it remains my opinion alone. Update: Fixed broken link to Geoff Bilder post at Crossref flagged by Rob Baxter in comments to a December 2016 post on a similar topic.

Posted by David. at 8:00 AM Labels: digital preservation, e-journals, memento 10 comments: rv said... "Note, however, that soft-403s and soft-404s pose the same problem for robustify.js as they do for all Web archiving technologies." I just uploaded a new version of the robustify.js helper script (https://github.com/renevoorburg/robustify.js) that attempts to recognize soft-404s. It does so by forcing a '404' with a random request and comparing the results of that with the results of the original request (using fuzzy hashing). It seems to work very well but I am missing a good test set of soft 404's. February 10, 2015 at 1:14 PM David. said... Good idea, René! February 10, 2015 at 2:51 PM Unknown said... As ever, a good and challenging read. Although I am not one of the authors of the paper you review, I have been involved in a lot of the underlying thinking as one of the PIs in the project, described at Hiberlink.org, and would like to add a few comments, especially on the matter of potential remedy. We were interested in the prospect of change & intervention in three simple workflows (for the author; for the issuing body; for the hapless library/repository) in order to enable transactional archiving of referenced content - reasoning that it was best that this was done as early as possible after the content on the web was regarded as important, and also that such archiving was best done when the actor in question had their mind in gear.
The prototyping using Zotero and OJS was done via plug-ins because, having access to the source code, our colleague Richard Wincewicz could mock this up as a demonstrator. One strategy was that this would then invite 'borrowing' of the functionality (of snapshot/DateTimeStamp/archive/'decorate' with DateTimeStamp of URI within the citation) by commercial reference managers and editorial software so that authors and/or publishers (editors?) did not have to do something special. Reference rot is a function of time: the sooner the fish (fruit?) is flash frozen the less chance it has to rot. However, immediate post-publication remedy is better than none. The suggestion that there is a pro-active fix for content ingested into LOCKSS, CLOCKSS and Portico (and other Keepers of digital content) by archiving of references is very much welcomed. This is part of our thinking for remodelling Repository Junction Broker, which supports machine ingest into institutional repositories, but what you suggest could have greater impact. February 12, 2015 at 6:27 AM Martin Klein said... A comment on the issue of soft404s: Your point is well taken and the paper's methodology section would clearly have benefited from mentioning this detriment and why we chose to not address it. My co-authors and I are very well aware of the soft404 issue, common approaches to detect them (such as introduced in [1] and [2]), and have, in fact, applied such methods in the past [3]. However, given the scale of our corpus of 1 million URIs, and the soft404 ratio found in previous studies (our [3] found a ratio of 0.36% and [4] found 3.41%), we considered checking for soft404s too expensive in light of potential return. Especially since, as you have pointed out in the past [5], web archives also archive soft404s, we would have had to detect soft404s on the live web as well as in web archives. Regardless, I absolutely agree that our reference rot numbers for links to web at large resources likely represent a lower bound. It would be interesting to investigate the ratio of soft404s and build a good size corpus to evaluate common and future detection approaches. The soft404 on the paper's reference 58 (which is introduced by the publisher) seems to "only" be a function of the PubMed search as a request for [6] returns a 404. [1] http://dx.doi.org/10.1145/988672.988716 [2] http://dx.doi.org/10.1145/1526709.1526886 [3] http://arxiv.org/abs/1102.0930 [4] http://dx.doi.org/10.1007/978-3-642-33290-6_22 [5] http://blog.dshr.org/2013/04/making-memento-succesful.html [6] http://www.ncbi.nlm.nih.gov/pubmed/aodfhdskjhfsjkdhfskldfj February 13, 2015 at 2:47 PM David. said... Peter Burnhill supports the last sentence of my post with this very relevant reference: thoughts of (Captain) Clarence Birdseye Some advice on quick freezing references to Web caught resources: Better done when references are noted (by the author), and then could be re-examined at point of issue (by the editor / publisher). When delivered by the crate (onto digital shelves) the rot may have set in for some of these fish ... March 1, 2015 at 6:39 AM David. said... Geoffrey Bilder has a very interesting and detailed first instalment of a multi-part report on the DOI outage that is well worth reading. April 20, 2015 at 7:24 PM David. said... As reported on the UK Serials Group listserv, UK Elsevier subscribers encountered a major outage last weekend due to "unforeseen technical issues". June 9, 2015 at 9:23 AM David. said... The outages continued sporadically through Tuesday.
This brings up another issue about the collection of link rot statistics. The model behind these studies so far is that a Web resource appears at some point in time, remains continually accessible for a period, then becomes inaccessible and remains inaccessible "for ever". Clearly, the outages noted here show that this isn't the case. Between the resource's first appearance and its last, there is some (probably time-varying) probability, less than 1, that it is available. June 10, 2015 at 7:29 AM David. said... Timothy Geigner at TechDirt supplies the canonical example of why depending on the DMCA "safe harbor" is risky for preservation. Although in this case the right thing happened in response to a false DMCA takedown notice, detecting them is between difficult and impossible. July 10, 2015 at 5:15 PM David. said... Herbert Van de Sompel, Martin Klein and Shawn Jones revisit the issue of why DOIs are not in practice used to refer to articles in a poster for WWW2016 Persistent URIs Must Be Used To Be Persistent. Note that this link is not a DOI, in this case because the poster doesn't have one (yet?). March 1, 2016 at 6:11 AM
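The soft-404 detection heuristic described in the post and in Voorburg's comment above (force a miss with a deliberately bogus request and compare it with the response for the real URL) can be sketched in a few lines of Python. This is only a minimal illustration of the idea, not Voorburg's actual robustify.js helper; it assumes the third-party requests library and uses the standard library's difflib as a crude stand-in for the fuzzy hashing he mentions, with an arbitrary similarity threshold.

import difflib
import uuid
import requests

def looks_like_soft_404(url, timeout=10):
    # A soft 404 is an error page served with a 200 status code.
    real = requests.get(url, timeout=timeout)
    if real.status_code != 200:
        return False  # a hard 4xx/5xx is not a *soft* error
    # Request a path on the same site that almost certainly does not exist.
    bogus = url.rstrip('/') + '/' + uuid.uuid4().hex
    probe = requests.get(bogus, timeout=timeout)
    if probe.status_code != 200:
        return False  # the site returns real error codes, so the 200 above is credible
    # If the requested page looks much like the page served for a nonsense URL,
    # the 200 response is probably a soft 404 (or soft 403).
    similarity = difflib.SequenceMatcher(None, real.text, probe.text).ratio()
    return similarity > 0.9

The 0.9 threshold is arbitrary; a real checker would want proper fuzzy hashing, the same treatment for soft-403s, and a corpus of known soft 404s to tune against (the test set Voorburg says he lacks).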
blog-dshr-org-9748 ---- DSHR's Blog: The Bitcoin "Price"
Thursday, January 14, 2021 The Bitcoin "Price" Jemima Kelly writes No, bitcoin is not "the ninth-most-valuable asset in the world" and it's a must-read. Below the fold, some commentary. The "price" of BTC in USD has quadrupled in the last three months, and thus its "market cap" has sparked claims that it is the 9th most valuable asset in the world. Kelly explains the math: Just like you would calculate a company's market capitalisation by multiplying its stock price by the number of shares outstanding, with bitcoin you just multiply its price by its total "supply" of coins (ie, the number of coins that have been mined since the first one was in January 2009). Simples! If you do that sum, you'll see that you get to a very large number — if you take the all-time-high of $37,751 and multiply that by the bitcoin supply (roughly 18.6m) you get to just over $665bn. And, if that were accurate and representative and if you could calculate bitcoin's value in this way, that would place it just below Tesla and Alibaba in terms of its "market value". (On Wednesday!) Then Kelly starts her critique, which is quite different from mine in Stablecoins: In the context of companies, the "market cap" can be thought of as loosely representing what someone would have to pay to buy out all the shareholders in order to own the company outright (though in practice the shares have often been over- or undervalued by the market, so shareholders are often offered a premium or a discount). Companies, of course, have real-world assets with economic value. And there are ways to analyse them to work out whether they are over- or undervalued, such as price-to-earnings ratios, net profit margins, etc. With bitcoin, the whole value proposition rests on the idea of the network. If you took away the coinholders there would be literally nothing there, and so bitcoin's value would fall to nil. Trying to value it by talking about a "market cap" therefore makes no sense at all. Secondly, she takes aim at the circulating BTC supply: Another problem is that although 18.6m bitcoins have indeed been mined, far fewer can actually be said to be "in circulation" in any meaningful way. For a start, it is estimated that about 20 per cent of bitcoins have been lost in various ways, never to be recovered. Then there are the so-called "whales" that hold most of the bitcoin, whose dominance of the market has risen in recent months. The top 2.8 per cent of bitcoin addresses now control 95 per cent of the supply (including many that haven't moved any bitcoin for the past half-decade), and more than 63 per cent of the bitcoin supply hasn't been moved for the past year, according to recent estimates.
The small circulating supply means that BTC liquidity is an illusion: the idea that you can get out of your bitcoin position at any time and the market will stay intact is frankly a nonsense. And that's why the bitcoin religion's "HODL" mantra is so important to be upheld, of course. Because if people start to sell, bad things might happen! And they sometimes do. The excellent crypto critic Trolly McTrollface (not his real name, if you're curious) pointed out on Twitter that on Saturday a sale of just 150 bitcoin resulted in a 10 per cent drop in the price. And there are a lot of "whales" HODL-ing. If one decides to cash out, everyone will get trampled in the rush for the exits: More than 2,000 wallets contain over 1,000 bitcoin in them. What would happen to the price if just one of those tried to unload their coins on to the market at once? It wouldn't be pretty, we would wager. What we call the "bitcoin price" is in fact only the price of the very small number of bitcoins that wash around the retail market, and doesn't represent the price that 18.6m bitcoins would actually be worth, even if they were all actually available. Note that Kelly's critique implicitly assumes that BTC is priced in USD, not in the mysteriously inflatable USDT. The graph shows that the vast majority of the "very small number of bitcoins that wash around the retail market" are traded for, and thus priced in, USDT. So the actual number of bitcoins being traded for real money is a small fraction of a very small number. Bitfinex & Tether have agreed to comply with the New York Supreme Court and turn over their financial records to the New York Attorney General by 15th January. If they actually do, and the details of what is actually backing the current stock of nearly 24 billion USDT become known, things could get rather dynamic. As Tim Swanson explains in Parasitic Stablecoins, the 24B USD are notionally in a bank account, and the solvency of that account is not guaranteed by any government deposit insurance. So even if there were a bank account containing 24B USD, if there is a rush for the exits the bank holding that account could well go bankrupt. To give a sense of scale, the 150 BTC sale that crashed the "price" by 10% represents (150 / 6.25) / 6 = 4 hours of mining reward. If miners were cashing out their rewards, they would be selling 900BTC or $36M/day. In the long term, the lack of barriers to entry means that the margins on mining are small. But in the short term, mining capacity can't respond quickly to large changes in the "price". It certainly can't increase four times in three months. Let's assume that three months ago, when 1BTC≈10,000USDT, the BTC ecosystem was in equilibrium with the mining rewards plus fees slightly more than the cost of mining. While the BTC "price" has quadrupled, the hash rate and thus the cost of mining has oscillated between 110M and 150M TeraHash/s. It hasn't increased significantly, so miners now need to sell only about 225BTC or $9M/day to cover their costs. (A worked sketch of this arithmetic appears after the comments.) With the price soaring, they have an incentive to HODL their rewards. Posted by David. at 8:00 AM Labels: bitcoin 23 comments: David. said... Alex Pickard was an early buyer of BTC, and became a miner in 2017. But the scales have fallen from his eyes. In Bitcoin: Magic Internet Money he explains that BTC is useless for anything except speculation: "Essentially overnight it became “digital gold” with no use other than for people to buy and hodl ...
and hope more people would buy and hodl, and increase the price of BTC until everyone on earth sells their fiat currency for BTC, and then…? Well, what exactly happens then, when BTC can only handle about 350,000 transactions per day and 7.8 billion people need to buy goods and services?" And he is skeptical that Tether will survive: "If Tether continues as a going concern, and if the rising price of BTC is linked to USDT issuance, then BTC will likely continue to mechanically build a castle to the sky. I have shown how BTC price increases usually follow USDT issuance. In late 2018, when roughly 1 billion USDT were redeemed, the price of BTC subsequently fell by over 50%. Now, imagine what would happen if Tether received a cease-and-desist order, and its bank accounts were seized. Today’s digital gold would definitely lose its luster." January 14, 2021 at 10:10 AM David. said... The saga of someone trying to turn "crypto" into "fiat". January 14, 2021 at 3:39 PM David. said... An anonymous Bitcoin HODL-er finally figured out the Tether scam and realized his winnings. His must-read account is The Bit Short: Inside Crypto’s Doomsday Machine: "The legitimate crypto exchanges, like Coinbase and Bitstamp, clearly know to stay far away from Tether: neither supports Tether on their platforms. And the feeling is mutual! Because if Tether Ltd. were ever to allow a large, liquid market between Tethers and USD to develop, the fraud would instantly become obvious to everyone as the market-clearing price of Tether crashed far below $1. Kraken is the biggest USD-banked crypto exchange on which Tether and US dollars trade freely against each other. The market in that trading pair on Kraken is fairly modest — about $16M worth of daily volume — and Tether Ltd. surely needs to keep a very close eye on its movements. In fact, whenever someone sells Tether for USD on Kraken, Tether Ltd. has no choice but to buy it — to do otherwise would risk letting the peg slip, and unmask the whole charade. My guess is that maintaining the Tether peg on Kraken represents the single biggest ongoing capital expense of this entire fraud. If the crooks can’t scrape together enough USD to prop up the Tether peg on Kraken, then it’s game over, and the whole shambles collapses. And that makes it the fraud’s weak point." January 18, 2021 at 8:22 AM David. said... Tether's bank is Deltec, in the Bahamas. The anonymous Bitcoin HODL-er points out that: "Bahamas discloses how much foreign currency its domestic banks hold each month." As of the end of September 2020, all Bahamian banks in total held about $5.3B USD worth of foreign currency. At that time there were about 15.5B USDT in circulation. Even if we assume that Deltec held all of it, USDT was only 34% backed by actual money. January 18, 2021 at 9:14 AM David. said... David Gerard's Tether printer go brrrrr — cryptocurrency’s substitute dollar problem collects a lot of nuggets about Tether, but also this: "USDC loudly touts claims that it’s well-regulated, and implies that it’s audited. But USDC is not audited — accountants Grant Thornton sign a monthly attestation that Centre have told them particular things, and that the paperwork shows the right numbers. An audit would show for sure whether USDC’s reserve was real money, deposited by known actors — and not just a barrel of nails with a thin layer of gold and silver on top supplied by dubious entities. But, y’know, it’s probably fine and you shouldn’t worry." February 3, 2021 at 3:05 PM David. said... 
In 270 addresses are responsible for 55% of all cryptocurrency money laundering, Catalin Cimpanu discusses a report from Chainalysis: "1,867 addresses received 75% of all criminally-linked cryptocurrency funds in 2020, a sum estimated at around $1.7 billion. ... The company believes that the cryptocurrency-related money laundering field is now in a vulnerable position where a few well-orchestrated law enforcement actions against a few cryptocurrency operators could cripple the movement of illicit funds of many criminal groups at the same time. Furthermore, additional analysis also revealed that many of the services that play a crucial role in money laundering operations are also second-tier services hosted at larger legitimate operators. In this case, a law enforcement action wouldn't even be necessary, as convincing a larger company to enforce its anti-money-laundering policies would lead to the shutdown of many of today's cryptocurrency money laundering hotspots." February 15, 2021 at 12:24 PM David. said... In Bitcoin is now worth $50,000 — and it's ruining the planet faster than ever, Eric Holthaus points out the inevitable result of the recent spike in BTC: "The most recent data, current as of February 17 from the University of Cambridge shows that Bitcoin is drawing about 13.62 Gigawatts of electricity, an annualized consumption of 124 Terawatt-hours – about a half-percent of the entire world's total – or about as much as the entire country of Pakistan. Since most electricity used to mine Bitcoin comes from fossil fuels, Bitcoin produces a whopping 37 million tons of carbon dioxide annually, about the same amount as Switzerland does by simply existing." February 21, 2021 at 12:19 PM David. said... In Elon Musk wants clean power, but Tesla's dealing in environmentally dirty bitcoin, Reuters notes that: "Tesla boss Elon Musk is a poster child of low-carbon technology. Yet the electric carmaker's backing of bitcoin this week could turbocharge global use of a currency that's estimated to cause more pollution than a small country every year. Tesla revealed on Monday it had bought $1.5 billion of bitcoin and would soon accept it as payment for cars, sending the price of the cryptocurrency through the roof. ... The digital currency is created via high-powered computers, an energy-intensive process that currently often relies on fossil fuels, particularly coal, the dirtiest of them all." But Reuters fails to ask where the $1.5B that spiked BTC's "price" came from. It wasn't Musk's money, it was the Tesla shareholders' money. And how did they get it? By selling carbon offsets. So Musk is taking subsidies intended to reduce carbon emissions and using them to generate carbon emissions. February 21, 2021 at 12:30 PM David. said... One flaw in Eric Holthaus' Bitcoin is now worth $50,000 — and it's ruining the planet faster than ever is that while he writes: "There are decent alternatives to Bitcoin for people still convinced by the potential social benefits of cryptocurrencies. Ethereum, the world's number two cryptocurrency, is currently in the process of converting its algorithm from one that's fundamentally competitive (proof-of-work, like Bitcoin uses) to one that's collaborative (proof-of-stake), a move that will conserve more than 99% of its electricity use." He fails to point out that (a) Ethereum has been trying to move to proof-of-stake for many years without success, and (b) there are a huge number of other proof-of-work cryptocurrencies that, in aggregate, also generate vast carbon emissions.
February 21, 2021 at 12:57 PM David. said... Four posts worth reading inspired by Elon Musk's pump-and-HODL of Bitcoin. First, Jamie Powell's Tesla and bitcoin: the accounting explains how $1.5B of BTC will further obscure the underlying business model of Tesla. Of course, if investors actually understood Tesla's business model they might not be willing to support a PE of, currently, 1,220.78, so the obscurity may be the reason for the HODL. Second, Izabella Kaminska's What does institutional bitcoin mean? looks at the investment strategies hedge funds like Blackrock will use as they "dabble in Bitcoin". It involves the BTC futures market being in contango and is too complex to extract but well worth reading. Third, David Gerard's Number go up with Tether — Musk and Bitcoin set the world on fire points out that Musk's $1.5B only covers 36 hours of USDT printing: "Tether has given up caring about plausible appearances, and is now printing a billion tethers at a time. As I write this, Tether states its reserve as $34,427,896,266.91 of book value. That's $34.4 billion — every single dollar of which is backed by … pinky-swears, maybe? Tether still won't reveal what they're claiming to constitute backing reserves." Fourth, in Bitcoin's 'Elon Musk pump' rally to $48K was exclusively driven by whales, Joseph Young writes: "In recent months, so-called “mega whales” sold large amounts of Bitcoin between $33,000 and $40,000. Orders ranging from $1 million to $10 million rose significantly across major cryptocurrency exchanges, including Binance. But as the price of Bitcoin began to consolidate above $33,000 after the correction from $40,000, the buyer demand from whales surged once again. Analysts at “Material Scientist” said that whales have been showing unusually large volume, around $150 million in 24 hours. This metric shows that whales are consistently accumulating Bitcoin in the aftermath of the news that Tesla bought $1.5 billion worth of BTC." February 21, 2021 at 4:49 PM David. said... Ethereum consumes about 22.5TWh/yr - much less than Bitcoin's 124TWh/yr, but still significant. It will continue to waste power until the switch to proof-of-stake, underway for the past 7 years, finally concludes. Don't hold your breath. February 22, 2021 at 10:21 AM David. said... The title of Jemima Kelly's Hey Citi, your bitcoin report is embarrassingly bad says all that needs to be said, but her whole post is a fun read. March 2, 2021 at 9:26 AM David. said... Jemima Kelly takes Citi's embarrassing "bitcoin report" to the woodshed again in The many chart crimes of *that* Citi bitcoin report: "Not only was this "report" actually just a massive bitcoin-shilling exercise, it also contained some really quite embarrassing errors from what is meant to be one of the top banks in the world (and their "premier thought leadership" division at that). The error that was probably most shocking was the apparent failure of the six Citi analysts who authored the report to grasp the difference between basis points and percentage points." March 3, 2021 at 6:43 AM David. said... Adam Tooze's Talking (and reading) about Bitcoin is an economist's view of Bitcoin: "To paraphrase Gramsci, crypto is the morbid symptom of an interregnum, an interregnum in which the gold standard is dead but a fully political money that dares to speak its name has not yet been born. Crypto is the libertarian spawn of neoliberalism's ultimately doomed effort to depoliticize money."
Tooze quotes Izabella Kaminska contrasting the backing of "fiat" by the requirement to pay tax with Bitcoin: "Private “hackers” routinely raise revenue from stealing private information and then demanding cryptocurrency in return. The process is known as a ransom attack. It might not be legal. It might even be classified as extortion or theft. But to the mindset of those who oppose “big government” or claim that “tax is theft”, it doesn’t appear all that different. A more important consideration is which of these entities — the hacker or a government — is more effective at enforcing their form of “tax collection” upon the system. The government, naturally, has force, imprisonment and the law on its side. And yet, in recent decades, that hasn’t been quite enough to guarantee effective tax collection from many types of individuals or corporations. Hackers, at a minimum, seem at least comparably effective at extracting funds from rich individuals or multinational organisations. In many cases, they also appear less willing to negotiate or to cut deals." March 5, 2021 at 8:49 AM David. said... IBM Blockchain Is a Shell of Its Former Self After Revenue Misses, Job Cuts: Sources by Ian Allison is the semi-official death-knell for IBM's Hyperledger: "IBM has cut its blockchain team down to almost nothing, according to four people familiar with the situation. Job losses at IBM (NYSE: IBM) escalated as the company failed to meet its revenue targets for the once-fêted technology by 90% this year, according to one of the sources." David Gerard comments: "Hyperledger was a perfect IBM project — a Potemkin village open source project, where all the work was done in an IBM office somewhere." March 5, 2021 at 2:23 PM David. said... Ketan Joshi's Bitcoin is a mouth hungry for fossil fuels is a righteous rant about cryptocurrencies' energy usage: "I think the story of Bitcoin isn’t a sideshow to climate; it’s actually a very significant and central force that will play a major role in dragging down the accelerating pace of positive change. This is because it has an energy consumption problem, it has a fossil fuel industry problem, and it has a deep cultural / ideological problem. All three, in symbiotic concert, position Bitcoin to stamp out the hard-fought wins of the past two decades, in climate. Years of blood, sweat and tears – in activism, in technological development, in policy and regulation – extinguished by a bunch of bros with laser-eye profile pictures." March 16, 2021 at 10:57 AM David. said... The externalities of cryptocurrencies, and bitcoin in particular, don't just include ruining the climate, but also ruining the lives of vulnerable elderly who have nothing to do with "crypto". Mark Rober's fascinating video Glitterbomb Trap Catches Phone Scammer (who gets arrested) reveals that Indian phone scammers transfer their ill-gotten gains from stealing the life savings of elderly victims from the US to India using Bitcoin. March 19, 2021 at 6:19 PM David. said... The subhead of Noah Smith's Bitcoin Miners Are on a Path to Self-Destruction is: "Producing the cryptocurrency is a massive drain on global power and computer chip supplies. Another way is needed before countries balk." March 26, 2021 at 11:50 AM David. said... In Before Bitfinex and Tether, Bennett Tomlin pulls together the "interesting" backgrounds of the "trustworthy" people behind Bitfinex & Tether. March 29, 2021 at 4:14 PM David. said... 
David Gerard reports that: "Coinbase has had to pay a $6.5 million fine to the CFTC for allowing an unnamed employee to wash-trade Litecoin on the platform. On some days, the employee's wash-trading was 99% of the Litecoin/Bitcoin trading pair's volume. Coinbase also operated two trading bots, "Hedger and Replicator," which often matched each others' orders, and reported these matches to the market." As he says: "If Coinbase — one of the more regulated exchanges — did this, just think what the unregulated exchanges get up to." Especially with the "trustworthy" characters running the unregulated exchanges. March 29, 2021 at 4:19 PM David. said... Martin C. W. Walker and Winnie Mosioma's Regulated cryptocurrency exchanges: sign of a maturing market or oxymoron? examines the (mostly lack of) regulation of exchanges and concludes: "In general, cryptocurrencies lack anyone that is genuinely accountable for core processes such as transfers of ownership, trade validation and creation of cryptocurrencies. A concern that can ultimately only be dealt with by acceptance of the situation or outright bans. However, the almost complete lack of regulation of the highly centralised cryptocurrency exchanges should be an easier-to-fill gap. Regulated entities relying on prices from "exchanges" for accounting or calculation of the value of futures contracts are clearly putting themselves at significant risk." Coinbase just filed for a $65B direct listing despite having just been fined $6.5M for wash-trading Litecoin. April 14, 2021 at 12:10 PM David. said... Izabella Kaminska outlines the risks underlying Coinbase's IPO in Why Coinbase's stellar earnings are not what they seem. The sub-head is: "It's easy to be profitable if your real unique selling point is being a beneficiary of regulatory arbitrage." And she concludes: "Coinbase may be a hugely profitable business, but it may also be a uniquely risky one relative to regulated trading venues such as the CME or ICE, neither of which are allowed to take principal positions to facilitate liquidity on their platforms. Instead, they rely on third party liquidity providers. Coinbase, however, is not only known to match client transactions on an internalised "offchain" basis (that is, not via the primary blockchain) but also to square-off residual unmatched positions via bilateral relationships in crypto over-the-counter markets, where it happens to have established itself as a prominent market maker. It's an ironic state of affairs because the netting processes that are at the heart of this system expose Coinbase to the very same risks that real-time gross settlement systems (such as bitcoin) were meant to vanquish." April 16, 2021 at 1:24 PM David. said... Nathan J. Robinson hits the nail on the head with Why Cryptocurrency Is A Giant Fraud: "You may have ignored Bitcoin because the evangelists for it are some of the most insufferable people on the planet—and you may also have kicked yourself because if you had listened to the first guy you met who told you about Bitcoin way back, you'd be a millionaire today. But now it's time to understand: is this, as its proponents say, the future of money?" and: "But as is generally the case when someone is trying to sell you something, the whole thing should seem extremely fishy. In fact, much of the cryptocurrency pitch is worse than fishy. It's downright fraudulent, promising people benefits that they will not get and trying to trick them into believing in and spreading something that will not do them any good.
When you examine the actual arguments made for using cryptocurrencies as currency, rather than just being wowed by the complex underlying system and words like "autonomy," "global," and "seamless," the case for their use by most people collapses utterly. Many believe in it because they have swallowed libertarian dogmas that do not reflect how the world actually works." Robinson carefully dismantles the idea that cryptocurrencies offer "security", "privacy", "convenience", and many of the other arguments for them. The whole article is well worth reading. April 25, 2021 at 5:34 PM
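The back-of-the-envelope mining arithmetic in the post above (150 BTC as four hours of block rewards, roughly 900 BTC or $36M/day if all rewards were sold, and about 225 BTC or $9M/day needed to cover costs) can be reproduced in a few lines of Python. A minimal sketch: the block reward, block rate and prices are the round numbers used in the post, not live data, and the $40,000 figure is simply the quadrupled equilibrium price implied by the post's $36M/day and $9M/day numbers.

BLOCK_REWARD_BTC = 6.25   # coinbase reward per block at the time of the post
BLOCKS_PER_HOUR = 6       # one block roughly every ten minutes
OLD_PRICE_USD = 10_000    # assumed equilibrium price three months earlier
NEW_PRICE_USD = 40_000    # the roughly quadrupled price implied by the post

# The 150 BTC sale expressed as hours of mining reward: (150 / 6.25) / 6 = 4.
hours_of_reward = (150 / BLOCK_REWARD_BTC) / BLOCKS_PER_HOUR

# Daily reward issuance, and its value if miners sold it all at the new price.
btc_per_day = BLOCK_REWARD_BTC * BLOCKS_PER_HOUR * 24        # 900 BTC/day
rewards_usd_per_day = btc_per_day * NEW_PRICE_USD            # $36M/day

# If costs roughly matched rewards at the old equilibrium price, daily costs are
# about 900 * $10,000 = $9M, which at the new price requires selling only:
cost_usd_per_day = btc_per_day * OLD_PRICE_USD               # $9M/day
btc_to_sell_per_day = cost_usd_per_day / NEW_PRICE_USD       # 225 BTC/day

print(hours_of_reward, btc_per_day, rewards_usd_per_day, btc_to_sell_per_day)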
blog-dshr-org-9954 ---- DSHR's Blog: What Is The Point?
Thursday, April 22, 2021 What Is The Point? During a discussion of NFTs, Larry Masinter pointed me to his 2012 proposal The 'tdb' and 'duri' URI schemes, based on dated URIs. The proposal's abstract reads: This document defines two URI schemes. The first, 'duri' (standing for "dated URI"), identifies a resource as of a particular time. This allows explicit reference to the "time of retrieval", similar to the way in which bibliographic references containing URIs are often written. The second scheme, 'tdb' (standing for "Thing Described By"), provides a way of minting URIs for anything that can be described, by the means of identifying a description as of a particular time. These schemes were posited as "thought experiments", and therefore this document is designated as Experimental. As far as I can tell, this proposal went nowhere, but it raises a question that is also raised by NFTs. What is the point of a link that is unlikely to continue to resolve to the expected content? Below the fold I explore this question.

I think there are two main reasons why duri: went nowhere:

The duri: concept implies that Web content in general is not static, but it is actually much more dynamic than that. Even the duri: specification admits this: There are many URIs which are, unfortunately, not particularly "uniform", in the sense that two clients can observe completely different content for the same resource, at exactly the same time. Personalization, advertisements, geolocation, watermarks, all make it very unlikely that either several clients accessing the same URI at the same time, or a single client accessing the same URI at different times, would see the same content.

When this proposal was put forward in 2012, it was competing with a less elegant but much more useful competitor that had been in use for 16 years. The duri: specification admits that: There are no direct resolution servers or processes for 'duri' or 'tdb' URIs. However, a 'duri' URI might be "resolvable" in the sense that a resource that was accessed at a point in time might have the result of that access cached or archived in an Internet archive service. See, for example, the "Internet Archive" project But the duri: URI doesn't provide the information needed to resolve to the "cached or archived" content. The Internet Archive's Wayback Machine uses URIs which, instead of the prefix duri:[datetime]:, have the prefix https://web.archive.org/web/[datetime]/. This is more useful, both because browsers will actually resolve these URIs, and because they resolve to a service devoted to delivering the content of the URI at the specified time. The competition for duri: was not merely long established, but also actually did what users presumably wanted, which was to resolve to the content of the specified URL at the specified time. It is true that a user creating a Wayback Machine URL, perhaps using the "Save Page Now" button, would preserve the content accessed by the Wayback Machine's crawler, which might be different from that accessed by the user themselves. But the user could compare the two versions at the time of creation, and avoid using the created Wayback Machine URL if the differences were significant (a sketch of such a comparison follows the post). Publishing a Wayback Machine URL carries an implicit warranty that the creator regarded any differences as insignificant.
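The mapping described above is mechanical enough to show directly. This is a minimal sketch, not part of Masinter's proposal or of the Internet Archive's own tooling; the helper name is mine, and it assumes the Wayback Machine's usual 14-digit YYYYMMDDhhmmss timestamps.

from datetime import datetime, timezone

def wayback_url(original_url, when=None):
    # Build the resolvable counterpart of a duri:[datetime]:[URL] reference,
    # i.e. https://web.archive.org/web/[datetime]/[URL].
    when = when or datetime.now(timezone.utc)
    timestamp = when.strftime('%Y%m%d%H%M%S')  # 14-digit Wayback-style datetime
    return f'https://web.archive.org/web/{timestamp}/{original_url}'

# For example, a reference to the state of a DOI link as of 1 February 2015:
print(wayback_url('http://dx.doi.org/10.1007/s00799-014-0108-0',
                  datetime(2015, 2, 1, tzinfo=timezone.utc)))

Such a URL only resolves usefully if the Wayback Machine actually holds a capture; in my experience it redirects to the capture closest to the requested timestamp, which is exactly the service that a bare duri: reference lacked.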
The history of duri: suggests that there isn't a lot of point in "durable" URIs lacking an expectation that they will continue to resolve to the original content. NFTs have the expectation, but lack the mechanism necessary to satisfy the expectation. Posted by David. at 8:00 AM Labels: personal digital preservation, web archiving
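The comparison step the post describes (checking, at creation time, whether the live page still matches its archived snapshot) can also be sketched briefly. Again this is only an illustration under assumptions: it uses the third-party requests library, a crude difflib similarity ratio rather than any real content-drift measure, and an arbitrary threshold.

import difflib
import requests

def classify_reference(original_url, archived_url, timeout=10, threshold=0.8):
    # Distinguish the two failure modes discussed in the post:
    # link rot (the original no longer resolves) and content drift
    # (it resolves, but to something substantially different).
    try:
        live = requests.get(original_url, timeout=timeout)
    except requests.RequestException:
        return 'rotted'
    if live.status_code != 200:
        return 'rotted'
    archived = requests.get(archived_url, timeout=timeout)
    similarity = difflib.SequenceMatcher(None, live.text, archived.text).ratio()
    return 'intact' if similarity >= threshold else 'drifted'

Note that a soft 404 would show up here as 'drifted' rather than 'rotted', which is one more reason the status code alone cannot be trusted.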
blog-esilibrary-com-6095 ---- Equinox Open Library Initiative
Equinox provides innovative open source software for libraries of all types. Extraordinary service. Exceptional value. As a 501(c)(3) nonprofit corporation, Equinox supports library automation by investing in open source software and providing technology services for libraries.

News & Events: Press Release: Equinox Open Library Initiative Awards Center for Khmer Studies the Equinox Open Source Grant. Press Release: Equinox Open Library Initiative Awards Vermont Jazz Center the Equinox Open Source Grant. Press Release: Equinox Launches New Website Featuring Open Source Library Products, Services, and Education.

Products & Services: Koha is the first free and open source library automation package. Equinox's team includes some of Koha's core developers. Evergreen is a unique and powerful open source ILS designed to support large, dispersed, and multi-tiered library networks. Equinox provides ongoing educational opportunities through equinoxEDU, including live webinars, workshops, and online resources. Fulfillment is an open source interlibrary loan management system. Fulfillment can be used alongside or in connection with any integrated library system. CORAL is an open source electronic resources management system. Its interoperable modules allow libraries to streamline their management of electronic resources. Customized For Your Library: Consulting, Migration, Development, Hosting & Support, Training & Education.

Why Choose Equinox? Equinox is different from most ILS providers. As a non-profit organization, our guiding principle is to provide a transparent, open software development process, and we release all code developed to publicly available repositories. Equinox is experienced with serving libraries of all types in the United States and internationally. We've supported and migrated libraries of all sizes, from single library sites to full statewide implementations. Equinox is technically proficient, with skilled project managers, software developers, and data services staff ready to assist you. We've helped libraries automating for the first time and those migrating from legacy ILS systems. Equinox knows libraries. More than fifty percent of our team are professional librarians with direct experience working in academic, government, public and special libraries. We understand the context and ecosystem of library software.

Working with Equinox has been like night and day. It's amazing to have a system so accessible to our patrons and easy to use. It has super-charged our library lending power! Brooke Matson, Executive Director, Spark Central. Equinox Open Library Initiative hosts Evergreen for the SCLENDS library consortium. Their technical support has been both prompt, responsive, and professional in reacting to our support requests during COVID-19. They have been a valuable consortium partner in meeting the needs of the member libraries and their patrons. Chris Yates, South Carolina State Library. Working with Equinox was great! They were able to migrate our entire consortium with no down time during working hours. The Equinox team went the extra mile in helping Missouri Evergreen. Colleen Knight, Missouri Evergreen.

Events: Open Source Twitter Chat with Guest Moderator Becky Yoose #ChatOpenS: Join us on Twitter with the hashtag #ChatOpenS as we discuss cybersecurity with Becky Yoose of LDH Consulting Services. Open Source Twitter Chat with Rogan Hamby #ChatOpenS, 03/17/2021: Join us on Twitter @EquinoxOLI and the #ChatOpenS hashtag from 12-1pm EDT as we discuss all things #opensource & libraries. Moderated by Rogan Hamby, Data and Project Analyst for EquinoxEDU: Spotlight on Evergreen 3.6, 03/05/2021: Join us for an EquinoxEDU: Spotlight session on new features in the Evergreen ILS! In this live webinar we will highlight some of the newest features in version 3.6.

Equinox Open Library Initiative Inc. is a 501(c)3 corporation devoted to the support of open source software for public libraries, academic libraries, school libraries, and special libraries. As the successor to Equinox Software, Inc., Equinox provides exceptional service and technical expertise delivered by experienced librarians and technical staff. Equinox offers affordable, customized consulting services, software development, hosting, training, and technology support for libraries of all sizes and types. Contact Us: info@equinoxoli.org, 877.OPEN.ILS (877.673.6457), +1.770.709.5555, PO Box 69, Norcross, GA 30091.
Privacy Policy  |   Terms of Use  |   Equinox Library Services Canada  |   Site Map Skip to content Open toolbar Accessibility Tools Increase Text Decrease Text Grayscale High Contrast Negative Contrast Light Background Links Underline Readable Font Reset blog-esilibrary-com-7328 ---- Equinox Open Library Initiative Skip to content Facebook-f Twitter Linkedin-in Vimeo About Our Team Newsroom Events History Ethics Disclosures Products Evergreen Koha Fulfillment CORAL Services Consulting Migration Development Hosting & Support Training & Education Learn equinoxEDU Tips & Tricks Conference Presentations Collaborate Communities Partnerships Grants We Provide Connect Sales Support Donate Social Media × About Our Team Newsroom Events History Ethics Disclosures Products Evergreen Koha Fulfillment CORAL Services Consulting Migration Development Hosting & Support Training & Education Learn equinoxEDU Tips & Tricks Conference Presentations Collaborate Communities Partnerships Grants We Provide Connect Sales Support Donate Social Media About Our Team Newsroom Events History Ethics Disclosures Products Evergreen Koha Koha On Demand Koha Dedicated Hosting Fulfillment CORAL SubjectsPlus Services Consulting Workflow and Advanced ILS Consultation Data Services Web Design IT Consultation Migration Development Hosting & Support Sequoia Training & Education Learn equinoxEDU Tips & Tricks Conference Presentations Resource Library Collaborate Communities Evergreen Koha CORAL Equinox Grants Connect Sales Support Donate Contact Us × About Our Team Newsroom Events History Ethics Disclosures Products Evergreen Koha Koha On Demand Koha Dedicated Hosting Fulfillment CORAL SubjectsPlus Services Consulting Workflow and Advanced ILS Consultation Data Services Web Design IT Consultation Migration Development Hosting & Support Sequoia Training & Education Learn equinoxEDU Tips & Tricks Conference Presentations Resource Library Collaborate Communities Evergreen Koha CORAL Equinox Grants Connect Sales Support Donate Contact Us About Our Team Newsroom Events History Ethics Disclosures Products Evergreen Koha Fulfillment CORAL Services Consulting Migration Development Hosting & Support Training & Education Learn equinoxEDU Tips & Tricks Conference Presentations Collaborate Communities Partnerships Grants We Provide Connect Sales Support Donate Social Media × About Our Team Newsroom Events History Ethics Disclosures Products Evergreen Koha Fulfillment CORAL Services Consulting Migration Development Hosting & Support Training & Education Learn equinoxEDU Tips & Tricks Conference Presentations Collaborate Communities Partnerships Grants We Provide Connect Sales Support Donate Social Media Equinox provides innovative open source software for libraries of all types. Extraordinary service. Exceptional value. As a 501(c)(3) nonprofit corporation, Equinox supports library automation by investing in open source software and providing technology services for libraries. Products Services Ask Us How » About Equinox » News & Events Press Release Equinox Open Library Initiative Awards Center for Khmer Studies the Equinox Open Source Grant Learn More » Press Release Equinox Open Library Initiative Awards Vermont Jazz Center the Equinox Open Source Grant Learn More » Press Release Equinox Launches New Website Featuring Open Source Library Products, Services, and Education Learn More » Products & Services Koha is the first free and open source library automation package. Equinox’s team includes some of Koha’s core developers. 
Evergreen is a unique and powerful open source ILS designed to support large, dispersed, and multi-tiered library networks.
Equinox provides ongoing educational opportunities through equinoxEDU, including live webinars, workshops, and online resources.
Fulfillment is an open source interlibrary loan management system. Fulfillment can be used alongside or in connection with any integrated library system.
CORAL is an open source electronic resources management system. Its interoperable modules allow libraries to streamline their management of electronic resources.
Customized For Your Library: Consulting · Migration · Development · Hosting & Support · Training & Education
Why Choose Equinox?
Equinox is different from most ILS providers. As a non-profit organization, our guiding principle is to provide a transparent, open software development process, and we release all code developed to publicly available repositories. Equinox is experienced with serving libraries of all types in the United States and internationally. We've supported and migrated libraries of all sizes, from single library sites to full statewide implementations. Equinox is technically proficient, with skilled project managers, software developers, and data services staff ready to assist you. We've helped libraries automate for the first time and those migrating from legacy ILS systems. Equinox knows libraries. More than fifty percent of our team are professional librarians with direct experience working in academic, government, public and special libraries. We understand the context and ecosystem of library software.
Sign up today for news & updates!
"Working with Equinox has been like night and day. It's amazing to have a system so accessible to our patrons and easy to use. It has super-charged our library lending power!" – Brooke Matson, Executive Director, Spark Central
"Equinox Open Library Initiative hosts Evergreen for the SCLENDS library consortium. Their technical support has been both prompt, responsive, and professional in reacting to our support requests during COVID-19. They have been a valuable consortium partner in meeting the needs of the member libraries and their patrons." – Chris Yates, South Carolina State Library
"Working with Equinox was great! They were able to migrate our entire consortium with no down time during working hours. The Equinox team went the extra mile in helping Missouri Evergreen." – Colleen Knight, Missouri Evergreen
Twitter
Equinox OLI @EquinoxOLI · 13h: Our latest: "Equinox Open Library Initiative Awards Vermont Jazz Center the Equinox Open Source Grant." Read more: https://bit.ly/3xjCLFT #equinoxgrant #grant #opensource #kohails #oss @vtjazz
Equinox OLI retweeted Newswire @inewswire · 20 Apr: Equinox Open Library Initiative celebrates 15 years as a small business delivering 'Extraordinary Service.
Exceptional Value' to libraries worldwide: https://www.newswire.com/news/equinox-launches-new-website-featuring-open-source-library-products-21368731
Equinox OLI @EquinoxOLI · 20 Apr: equinoxEDU: Spotlight on the @EvergreenILS Booking Module registration is open! Save your spot for April 28, 1-2pm EDT. Free & open: https://bit.ly/3ahwbGl
blog-iandavis-com-2237 ---- Internet Alchemy, the blog of Ian Davis. Internet Alchemy est.
1999.
Mon, Oct 23, 2017
Serverless: why microfunctions > microservices
This post follows on from a post I wrote a couple of years back called Why Service Architectures Should Focus on Workflows. In that post I attempted to describe the fragility of microservice systems that were simply translating object-oriented patterns to the new paradigm. These systems were migrating domain models and their interactions from in-memory objects to separate networked processes. They were replacing in-process function calls with cross-network RPC calls, adding latency and infrastructure complexity. The goal was scalability and flexibility but, I argued, the entity modelling approach introduced new failure modes. I suggested a solution: instead of carving up the domain by entity, focus on the workflows.
If I were writing that post today I would say "focus on the functions" because the future is serverless functions, not microservices. Or, more brashly: microfunctions > microservices.
The industry has moved apace in the last 3 years with a focus on solving the infrastructure challenges caused by running hundreds of intercommunicating microservices. Containers have matured and become the de facto standard for the unit of microservice deployment, with management platforms such as Kubernetes to orchestrate them and frameworks like gRPC for robust interservice communication. The focus still tends to be on interacting entities though: when placing an order the "order service" talks to the "customer service", which reserves items by talking to the "stock service" and the "payment service", which talks to the "payment gateway" after first checking with the "fraud service". When the order needs to be shipped, the "shipping service" asks the "order service" for orders that need to be fulfilled, tells the "stock service" to remove the reservation, and then asks the "customer service" to locate the customer, and so on. All of these services are likely to be persisting state in various backend databases. Microservices are organized as vertical slices through the domain.
The same problems still exist: if the customer service is overwhelmed by the shipping service then the order service can't take new orders. The container manager will, of course, scale up the number of customer service instances and register them with the appropriate load balancers, discovery servers, monitoring and logging. However, it cannot easily cope with a critical failure in this service, perhaps caused by a repeated bad request that panics the service and prevents multiple dependent services from operating properly. Failures and slowdowns in response times are handled within client services through backoff strategies, circuit breakers and retries. The system as a whole increases in complexity but remains fragile.
By contrast, in a serverless architecture, the emphasis is on the functions of the system. For this reason serverless is sometimes called FaaS – Functions as a Service. Systems are decomposed into functions that encapsulate a single task in a single process. Instead of each request involving the orchestration of multiple services, the request uses an instance of the appropriate function. Rather than the domain model being exploded into separate networked processes, its entities are provided in code libraries compiled into the function at build time. Calls to entity methods are in-process, so they don't pay the network latency or reliability taxes. In this paradigm the "place order" function simply calls methods on customer, stock and payment objects, which may then interact with the various backend databases directly. Instead of a dozen networked RPC calls, the function relies on 2-3 database calls.
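To make that concrete, here is a minimal sketch of what a "place order" microfunction might look like. This is not code from the original post: the handler shape, the module-level helpers standing in for the compiled-in domain libraries, and the event fields are all illustrative assumptions.

```python
# Minimal sketch of a "place order" microfunction (illustrative only).
# The domain logic lives in libraries linked into the function at build
# time, so the calls below are ordinary in-process function calls; only
# the datastore/gateway round-trips would cross the network.

from dataclasses import dataclass


@dataclass
class Order:
    customer_id: str
    items: dict        # sku -> quantity
    total_cents: int


# Stand-ins for the compiled-in domain libraries (hypothetical names).
def reserve_stock(items: dict) -> bool:
    # would be one database call: atomically decrement available stock
    return True


def charge_customer(customer_id: str, amount_cents: int) -> bool:
    # would be one call to the payments backend
    return True


def save_order(order: Order) -> str:
    # would be one database call: persist the order and return its id
    return "order-123"


def place_order(event: dict) -> dict:
    """Entry point the FaaS platform invokes once per request."""
    order = Order(
        customer_id=event["customer_id"],
        items=event["items"],
        total_cents=event["total_cents"],
    )
    if not reserve_stock(order.items):
        return {"status": 409, "body": "out of stock"}
    if not charge_customer(order.customer_id, order.total_cents):
        return {"status": 402, "body": "payment declined"}
    return {"status": 201, "body": save_order(order)}


if __name__ == "__main__":
    print(place_order({"customer_id": "c1",
                       "items": {"sku-9": 2},
                       "total_cents": 4200}))
```

Because the whole workflow lives in one process, a crash or bad deploy here is confined to order placement; functions handling other workflows keep running.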
Additionally, if a function is particularly hot it can be scaled directly without affecting the operation of other functions and, crucially, it can fail completely without taking down other functions. (Modulo the reliability of databases, which affect both styles of architecture identically.) Microfunctions are horizontal slices through the domain.
The advantages I wrote last time still hold up when translated to serverless terminology:
- Deploying or retiring a function becomes as simple as switching it on or off, which leads to greater freedom to experiment.
- Scaling a function is limited to scaling a single type of process horizontally, and the costs of doing this can be cleanly evaluated.
- The system as a whole becomes much more robust. When a function encounters problems it is limited to a single workflow, such as issuing invoices. Other functions can continue to operate independently.
- Latency, bandwidth use and reliability are all improved because there are fewer network calls. The function still relies on the database and other support systems such as lock servers, but most of the data flow is controlled in-process.
- The unit of testing and deployment is a single function, which reduces the complexity and cost of maintenance.
One major advantage that I missed is the potential for extreme cost savings through scale, particularly the scale attainable by running on public shared infrastructure. Since all the variability of microservice deployment configurations is abstracted away into a simple request/response interface, the microfunctions can be run as isolated shared-nothing processes, billed only for the resources they use in their short lifetime. Anyone who has costed for redundant microservices simply for basic resilience will appreciate the potential here.
Although there are a number of cloud providers in this space (AWS Lambda, Google Cloud Functions, Azure Functions), serverless is still an emerging paradigm with the problems that come with immaturity. Adrian Colyer recently summarized an excellent paper and presentation dealing with the challenges of building serverless systems which highlights many of these, including the lack of service level agreements and loose performance guarantees. It seems almost certain though that these will improve as the space matures and overtakes the microservice paradigm.
Other posts tagged as architecture, distributed-systems, technology, serverless, faas
Earlier Posts: Gorecipes: Fin (Wed, Mar 30 2016) · Another Blog Refresh (Sun, Feb 22 2015) · Why Service Architectures Should Focus on Workflows (Mon, Mar 31 2014) · Help me crowdfund my game Amberfell (Mon, Nov 12 2012)
blog-librarything-com-3311 ---- The Thingology Blog
New Syndetics Unbound Feature: Mark and Boost Electronic Resources
ProQuest and LibraryThing have just introduced a major new feature to our catalog-enrichment suite, Syndetics Unbound, to meet the needs of libraries during the COVID-19 crisis. Our friends at ProQuest blogged about it briefly on the ProQuest blog.
This blog post goes into greater detail about what we did, how we did it, and what […] Introducing Syndetics Unbound Short Version Today we’re going public with a new product for libraries, jointly developed by LibraryThing and ProQuest. It’s called Syndetics Unbound, and it makes library catalogs better, with catalog enrichments that provide information about each item, and jumping-off points for exploring the catalog. To see it in action, check out the Hartford Public Library […] ALAMW 2016 in Boston (and Free Passes)! Abby and KJ will be at ALA Midwinter in Boston this weekend, showing off LibraryThing for Libraries. Since the conference is so close to LibraryThing headquarters, chances are good that a few other LT staff members may appear, as well! Visit Us. Stop by booth #1717 to meet Abby & KJ (and potential mystery guests!), […] For ALA 2015: Three Free OPAC Enhancements For a limited time, LibraryThing for Libraries (LTFL) is offering three of its signature enhancements for free! There are no strings attached. We want people to see how LibraryThing for Libraries can improve your catalog. Check Library. The Check Library button is a “bookmarklet” that allows patrons to check if your library has a book […] ALA 2015 in San Francisco (Free Passes) Our booth. But this is Kate, not Tim or Abby. She had the baby. Tim and I are headed to San Francisco this weekend for the ALA Annual Conference. Visit Us. Stop by booth #3634 to talk to us, get a demo, and learn about all the new and fun things we’re up to with […] New “More Like This” for LibraryThing for Libraries We’ve just released “More Like This,” a major upgrade to LibraryThing for Libraries’ “Similar items” recommendations. The upgrade is free and automatic for all current subscribers to LibraryThing for Libraries Catalog Enhancement Package. It adds several new categories of recommendations, as well as new features. We’ve got text about it below, but here’s a short […] Subjects and the Ship of Theseus I thought I might take a break to post an amusing photo of something I wrote out today: The photo is a first draft of a database schema for a revamp of how LibraryThing will do library subjects. All told, it has 26 tables. Gulp. About eight of the tables do what a good cataloging […] LibraryThing Recommends in BiblioCommons Does your library use BiblioCommons as its catalog? LibraryThing and BiblioCommons now work together to give you high-quality reading recommendations in your BiblioCommons catalog. You can see some examples here. Look for “LibraryThing Recommends” on the right side. Not That Kind of Girl (Daniel Boone Regional Library) Carthage Must Be Destroyed (Ottowa Public Library) The […] NEW: Annotations for Book Display Widgets Our Book Display Widgets is getting adopted by more and more libraries, and we’re busy making it better and better. Last week we introduced Easy Share. This week we’re rolling out another improvement—Annotations! Book Display Widgets is the ultimate tool for libraries to create automatic or hand-picked virtual book displays for their home page, blog, […] Send us a programmer, win $1,000 in books. We just posted a new job post Job: Library Developer at LibraryThing (Telecommute). To sweeten the deal, we are offering $1,000 worth of books to the person who finds them. That’s a lot of books. Rules! You get a $1,000 gift certificate to the local, chain or online bookseller of your choice. 
To qualify, you […] blog-librarything-com-8946 ---- The Thingology Blog The LibraryThing Blog Thingology Monday, April 20th, 2020 New Syndetics Unbound Feature: Mark and Boost Electronic Resources ProQuest and LibraryThing have just introduced a major new feature to our catalog-enrichment suite, Syndetics Unbound, to meet the needs of libraries during the COVID-19 crisis. Our friends at ProQuest blogged about it briefly on the ProQuest blog. This blog post goes into greater detail about what we did, how we did it, and what efforts like this may mean for library catalogs in the future. What it Does The feature, “Mark and Boost Electronic Resources,” turns Syndetics Unbound from a general catalog enrichment tool to one focused on your library’s electronic resources—the resources patrons can access during a library shutdown. We hope it encourages libraries to continue to promote their catalog, the library’s own and most complete collection repository, instead of sending patrons to a host of partial, third-party eresource platforms. The new feature marks the library’s electronic resources and “boosts,” or promotes, them in Syndetics Unbound’s discovery enhancements, such as “You May Also Like,” “Other Editions,” “Tags” and “Reading Levels.” Here’s a screenshot showing the feature in action. How it Works The feature is composed of three settings. By default, they all turn on together, but they can be independently turned off and on. Boost electronic resources chooses to show electronic editions of an item where they exist, and boosts such items within discovery elements. Mark electronic resources with an “e” icon marks all electronic resources—ebooks, eaudio, and streaming video. Add electronic resources message at top of page adds a customizable message to the top of the Syndetics Unbound area. “Mark and Boost Electronic Holdings” works across all enrichments. It is particularly important for “Also Available As” which lists all the other formats for a given title. Enabling this feature sorts electronic resources to the front of the list. We also suggest that, for now, libraries may want to put “Also Available As” at the top of their enrichment order. Why We Did It Your catalog is only as good as your holdings. Faced with a world in which physical holdings are off-limits and electronic resources essential, many libraries have discouraged use of the catalog, which is dominated by non-digital resources, in favor of linking directly to Overdrive, Hoopla, Freegal and so forth. Unfortunately, these services are silos, containing only what you bought from that particular vendor. “Mark and Boost Electronic Resources” turns your catalog toward digital resources, while preserving what makes a catalog important—a single point of access to ALL library resources, not a vendor silo. Maximizing Your Electronic Holdings To make the best use of “Mark and Boost Electronic Resources,” we need to know about all your electronic resources. Unfortunately, some systems separate MARC holdings and electronic holdings; all resources appear in the catalog, but only some are available for export to Syndetics Unbound. Other libraries send us holding files with everything, but they are unable to send us updates every time new electronic resources are added. To address this issue, we have therefore advanced a new feature—”Auto-discover electronic holdings.” Turn this on and we build up an accurate representation of your library’s electronic resource holdings, without requiring any effort on your part. 
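The three settings (plus auto-discovery) described above boil down to a few flags and an ordering rule. The sketch below is purely illustrative: the setting names, data shapes, and sort rule are assumptions made for this example, not Syndetics Unbound's actual configuration or API, but they mirror the described behavior of marking electronic items and sorting them to the front of "Also Available As."

```python
# Illustrative sketch only: hypothetical setting names and data shapes
# modeling the behavior described in the post, not Syndetics Unbound's
# real configuration or API.

settings = {
    "boost_electronic_resources": True,   # prefer e-editions in discovery elements
    "mark_electronic_resources": True,    # show an "e" icon on ebooks/eaudio/video
    "electronic_resources_message": "Looking for something to read right now? "
                                    "Items marked with an 'e' are available online.",
    "auto_discover_electronic_holdings": True,
}


def order_editions(editions, settings):
    """Sort electronic editions to the front when boosting is enabled."""
    if not settings["boost_electronic_resources"]:
        return list(editions)
    # sorted() is stable, so ties keep their original relative order
    return sorted(editions, key=lambda e: not e.get("electronic", False))


editions = [
    {"format": "Hardcover", "electronic": False},
    {"format": "eBook", "electronic": True},
    {"format": "Audio CD", "electronic": False},
    {"format": "eAudio", "electronic": True},
]

print([e["format"] for e in order_editions(editions, settings)])
# -> ['eBook', 'eAudio', 'Hardcover', 'Audio CD']
```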
Adapting to Change “Mark and Boost Electronic Resources” is our first feature change to address the current crisis. But we are eager to do others, and to adapt the feature over time, as the situation develops. We are eager to get feedback from librarians and patrons! — The ProQuest and LibraryThing teams Labels: new features, new product, Syndetics Unbound posted by Tim @3:12 pm 0 Comments » Share Thursday, October 27th, 2016 Introducing Syndetics Unbound Short Version Today we’re going public with a new product for libraries, jointly developed by LibraryThing and ProQuest. It’s called Syndetics Unbound, and it makes library catalogs better, with catalog enrichments that provide information about each item, and jumping-off points for exploring the catalog. To see it in action, check out the Hartford Public Library in Hartford, CT. Here are some sample links: The Raven Boys by Maggie Stiefvater Alexander Hamilton by Ron Chernow Faithful Place by Tana French We’ve also got a press release and a nifty marketing site. UPDATE: Webinars Every Week! We’re now having weekly webinars, in which you can learn all about Syndetics Unbound, and ask us questions. Visit ProQuest’s WebEx portal to see the schedule and sign up! Long Version The Basic Idea Syndetics Unbound aims to make patrons happier and increase circulation. It works by enhancing discovery within your OPAC, giving patrons useful information about books, movies, music, and video games, and helping them find other things they like. This means adding elements like cover images, summaries, recommendations, series, tags, and both professional and user reviews. In one sense, Syndetics Unbound combines products—the ProQuest product Syndetics Plus and the LibraryThing products LibraryThing for Libraries and Book Display Widgets. In a more important sense, however, it leaps forward from these products to something new, simple, and powerful. New elements were invented. Static elements have become newly dynamic. Buttons provide deep-dives into your library’s collection. And—we think—everything looks better than anything Syndetics or LibraryThing have done before! (That’s one of only two exclamation points in this blog post, so we mean it.) Simplicity Syndetics Unbound is a complete and unified solution, not a menu of options spread across one or even multiple vendors. This simplicity starts with the design, which is made to look good out of the box, already configured for your OPAC and look. The installation requirements for Syndetics Unbound are minimal. If you already have Syndetics Plus or LibraryThing for Libraries, you’re all set. If you’ve never been a customer, you only need to add a line of HTML to your OPAC, and to upload your holdings. Although it’s simple, we didn’t neglect options. Libraries can reorder elements, or drop them entirely. We expect libraries will pick and choose, and evaluate elements according to patron needs, or feedback from our detailed usage stats. Libraries can also tweak the look and feel with custom CSS stylesheets. And simplicity is cheap. To assemble a not-quite-equivalent bundle from ProQuest’s and LibraryThing’s separate offerings would cost far more. We want everyone who has Syndetics Unbound to have it in its full glory. Comprehensiveness and Enrichments Syndetics Unbound enriches your catalog with some sixteen enrichments, but the number is less important than the options they encompass. 
These include both professional and user-generated content, information about the item you’re looking at, and jumping-off points to explore similar items. Quick descriptions of the enrichments: Boilterplate covers for items without covers. Premium Cover Service. Syndetics offers the most comprehensive cover database in existence for libraries—over 25 million full-color cover images for books, videos, DVDs, and CDs, with thousands of new covers added every week. For Syndetics Unbound, we added boilerplate covers for items that don’t have a cover, which include the title, author, and media type. Summaries. Over 18 million essential summaries and annotations, so patrons know what the book’s about. About the Author. This section includes the author biography and a small shelf of other items by the author. The section is also adorned by a small author photo—a first in the catalog, although familiar elsewhere on the web. Look Inside. Includes three previous Syndetics enrichments—first chapters or excerpts, table of contents and large-size covers—newly presented as a “peek inside the book” feature. Series. Shows a book’s series, including reading order. If the library is missing part of the series, those covers are shown but grayed out. You May Also Like. Provides sharp, on-the-spot readers advisory in your catalog, with the option to browse a larger world of suggestions, drawn from LibraryThing members and big-data algorithms. In this and other enrichments, Syndetics Unbound only recommends items that your library owns. The Syndetics Unbound recommendations cover far more of your collection than any similar service. For example, statistics from the Hartford Public Library show this feature on 88% of items viewed. Professional Reviews includes more than 5.4 million reviews from Library Journal, School Library Journal, New York Times, The Guardian, The Horn Book, BookList, BookSeller + Publisher Magazine, Choice, Publisher’s Weekly, and Kirkus. A la carte review sources include Voice of Youth Advocates: VOYA, Doody’s Medical Reviews and Quill and Quire. Reader Reviews includes more than 1.5 million vetted, reader reviews from LibraryThing members. It also allows patrons and librarians to add their own ratings and reviews, right in your catalog, and then showcase them on a library’s home page and social media. Also Available As helps patrons find other available formats and versions of a title in your collection, including paper, audio, ebook, and translations. Exploring the tag system Tags rethinks LibraryThing’s celebrated tag clouds—redesigning them toward simplicity and consistency, and away from the “ransom note” look of most clouds. As data, tags are based on over 131 million tags created by LibraryThing members, and hand-vetted by our staff librarians for quality. A new exploration interface allows patrons to explore what LibraryThing calls “tag mashes”—finding books by combinations of tags—in a simple faceted way. I’m going to be blogging about the redesign of tag clouds in the near future. Considering dozens of designs, we decided on a clean break with the past. (I expect it will get some reactions.) Book Profile is a newly dynamic version of what Bowker has done for years—analyzing thousands of new works of fiction, short-story collections, biographies, autobiographies, and memoirs annually. Now every term is clickable, and patrons can search and browse over one million profiles. 
Explore Reading Levels Reading Level is a newly dynamic way to see and explore other books in the same age and grade range. Reading Level also includes Metametrics Lexile® Framework for Reading. Click the “more” button to get a new, super-powered reading-level explorer. This is one my favorite features! (Second and last exclamation point.) Awards highlights the awards a title has won, and helps patrons find highly-awarded books in your collection. Includes biggies like the National Book Award and the Booker Prize, but also smaller awards like the Bram Stoker Award and Oklahoma’s Sequoyah Book Award. Browse Shelf gives your patrons the context and serendipity of browsing a physical shelf, using your call numbers. Includes a mini shelf-browser that sits on your detail pages, and a full-screen version, launched from the detail page. Video and Music adds summaries and other information for more than four million video and music titles including annotations, performers, track listings, release dates, genres, keywords, and themes. Video Games provides game descriptions, ESRB ratings, star ratings, system requirements, and even screenshots. Book Display Widgets. Finally, Syndetics Unbound isn’t limited to the catalog, but includes the LibraryThing product Book Display Widgets—virtual book displays that go on your library’s homepage, blog, LibGuides, Facebook, Twitter, Pinterest, or even in email newsletters. Display Widgets can be filled with preset content, such as popular titles, new titles, DVDs, journals, series, awards, tags, and more. Or you point them at a web page, RSS feed, or list of ISBNs, UPCs, or ISSNs. If your data is dynamic, the widget updates automatically. Here’s a page of Book Display Widget examples. Find out More Made it this far? You really need to see Syndetics Unbound in action. Check it Out. Again, here are some sample links of Syndetics Unbound at Hartford Public Library in Hartford, CT: The Raven Boys by Maggie Stiefvater, Alexander Hamilton by Ron Chernow, Faithful Place by Tana French. Webinars. We hold webinars every Tuesday and walk you through the different elements and answer questions. To sign up for a webinar, visit this Webex page and search for “Syndetics Unbound.” Interested in Syndetics Unbound at your library? Go here to contact a representative at ProQuest. Or read more about at the Syndetics Unbound website. Or email us at ltflsupport@librarything.com and we’ll help you find the right person or resource. Labels: librarything for libraries, new feature, new features, new product posted by Tim @10:45 am 4 Comments » Share Thursday, January 7th, 2016 ALAMW 2016 in Boston (and Free Passes)! Abby and KJ will be at ALA Midwinter in Boston this weekend, showing off LibraryThing for Libraries. Since the conference is so close to LibraryThing headquarters, chances are good that a few other LT staff members may appear, as well! Visit Us. Stop by booth #1717 to meet Abby & KJ (and potential mystery guests!), get a demo, and learn about all the new and fun things we’re up to with LibraryThing for Libraries, TinyCat, and LibraryThing. Get in Free. Are you in the Boston area and want to go to ALAMW? We have free exhibit only passes. Click here to sign up and get one! Note: It will get you just into the exhibit hall, not the conference sessions themselves. 
Labels: Uncategorized posted by Kate @4:05 pm 0 Comments » Share Thursday, June 25th, 2015 For ALA 2015: Three Free OPAC Enhancements For a limited time, LibraryThing for Libraries (LTFL) is offering three of its signature enhancements for free! There are no strings attached. We want people to see how LibraryThing for Libraries can improve your catalog. Check Library. The Check Library button is a “bookmarklet” that allows patrons to check if your library has a book while on Amazon and most other book websites. Unlike other options, LibraryThing knows all of the editions out there, so it finds the edition your library has. Learn more about Check Library Other Editions Let your users know everything you have. Don’t let users leave empty-handed when the record that came up is checked out. Other editions links all your holdings together in a FRBR model—paper, audiobook, ebook, even translations. Lexile Measures Put MetaMetrics’ The Lexile Framework® for Reading in your catalog, to help librarians and patrons find material based on reading level. In addition to showing the Lexile numbers, we also include an interactive browser. Easy to Add LTFL Enhancements are easy to install and can be added to every major ILS/OPAC system and most of the minor ones. Enrichments can be customized and styled to fit your catalog, and detailed usage reporting lets you know how they’re doing. See us at ALA. Stop by booth 3634 at ALA Annual this weekend in San Francisco to talk to Tim and Abby and see how these enhancements work. If you need a free pass to the exhibit hall, details are in this blog post. Sign up We’re offering these three enhancements free, for at least two years. We’ll probably send you links showing you how awesome other enhancements would look in your catalog, but that’s it. Find out more http://www.librarything.com/forlibraries or email Abby Blachly at abby@librarything.com. Labels: alaac15, Lexile measures, librarything for libraries, ltfl posted by Abby @1:31 pm 0 Comments » Share Tuesday, June 23rd, 2015 ALA 2015 in San Francisco (Free Passes) Our booth. But this is Kate, not Tim or Abby. She had the baby. Tim and I are headed to San Francisco this weekend for the ALA Annual Conference. Visit Us. Stop by booth #3634 to talk to us, get a demo, and learn about all the new and fun things we’re up to with LibraryThing for Libraries! Stay tuned this week for more announcements of what we’ll be showing off. No, really. It’s going to be awesome. Get in Free. In the SF area and want to go to ALA? We have free exhibit only passes. Click here to sign up and get one. It will get you just into the exhibit hall, not the conference sessions themselves. Labels: ala, alaac15 posted by Abby @2:17 pm 4 Comments » Share Monday, February 9th, 2015 New “More Like This” for LibraryThing for Libraries We’ve just released “More Like This,” a major upgrade to LibraryThing for Libraries’ “Similar items” recommendations. The upgrade is free and automatic for all current subscribers to LibraryThing for Libraries Catalog Enhancement Package. It adds several new categories of recommendations, as well as new features. We’ve got text about it below, but here’s a short (1:28) video: What’s New Similar items now has a See more link, which opens More Like This. Browse through different types of recommendations, including: Similar items More by author Similar authors By readers Same series By tags By genre You can also choose to show one or several of the new categories directly on the catalog page. 
Click a book in the lightbox to learn more about it—a summary when available, and a link to go directly to that item in the catalog. Rate the usefulness of each recommended item right in your catalog—hovering over a cover gives you buttons that let you mark whether it's a good or bad recommendation.
Try it Out! Click "See more" to open the More Like This browser in one of these libraries: Spokane County Library District · Arapahoe Public Library · Waukegan Public Library · Cape May Public Library · SAILS Library Network
Find out more: Find more details for current customers on what's changing and what customizations are available on our help pages. For more information on LibraryThing for Libraries or if you're interested in a free trial, email abby@librarything.com, visit http://www.librarything.com/forlibraries, or register for a webinar.
Labels: librarything for libraries, ltfl, recommendations, similar books posted by Abby @2:02 pm 2 Comments » Share
Thursday, February 5th, 2015
Subjects and the Ship of Theseus
I thought I might take a break to post an amusing photo of something I wrote out today: the photo is a first draft of a database schema for a revamp of how LibraryThing will do library subjects. All told, it has 26 tables. Gulp.
About eight of the tables do what a good cataloging system would do:
- Distinguishes the various subject systems (LCSH, Medical Subjects, etc.)
- Preserves the semantic richness of subject cataloging, including the stuff that never makes it into library systems.
- Breaks subjects into their facets (e.g., "Man-woman relationships — Fiction" has two subject facets).
Most of the tables, however, satisfy LibraryThing's unusual core commitments: to let users do their own thing, like their own little library, but also to let them benefit from and participate in the data and contributions of others.(1) So it:
- Links to subjects from various "levels," including book-level, edition-level, ISBN-level and work-level.
- Allows members to use their own data, or "inherit" subjects from other levels.
- Allows for members to "play librarian," improving good data and suppressing bad data.(2)
- Allows for real-time, fully reversible aliasing of subjects and subject facets.
The last is perhaps the hardest. Nine years ago (!) I compared LibraryThing to the "Ship of Theseus," a ship which is "preserved" although its components are continually changed. The same goes for much of its data, although "shifting sands" might be a better analogy. Accounting for this makes for some interesting database structures, and interesting programming. Not every system at LibraryThing does this perfectly. But I hope this structure will help us do that better for subjects.(3)
Weird as all this is, I think it's the way things are going. At present most libraries maintain their own data, which, while generally copied from another library, is fundamentally siloed. Like an evolving species, library records descend from each other; they aren't dynamically linked. The data inside the records are siloed as well, trapped in a non-relational model. The profession that invented metadata, and indeed invented sharing metadata, is, at least as far as its catalogs go, far behind.
Eventually that will end. It may end in a "Library Goodreads," every library sharing the same data, with global changes possible, but reserved for special catalogers. But my bet is on a more LibraryThing-like future, where library systems will both respect local cataloging choices and, if they like, benefit instantly from improvements made elsewhere in the system. When that future arrives, we've got the schema!
1. I'm betting another ten tables are added before the system is complete.
2. The system doesn't presume whether changes will be made unilaterally, or voted on. Voting, like much else, exists in a separate system, even if it ends up looking like part of the subject system.
3. This is a long-term project. Our first steps are much more modest–the tables have an order-of-use, not shown. First off we're going to duplicate the current system, but with appropriate character sets and segmentation by thesaurus and language.
Labels: cataloging, subjects posted by Tim @7:44 pm 3 Comments » Share
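To give a feel for what "levels" and reversible aliasing imply at the table level, here is a deliberately tiny sketch. It is not LibraryThing's schema (the post's draft runs to 26 tables); the table and column names are assumptions chosen only to illustrate two of the commitments above: facets stored separately from full headings, and aliasing recorded as reversible data rather than destructive edits.

```python
# Tiny illustrative subject schema (NOT LibraryThing's actual 26 tables);
# table and column names are assumptions made for this sketch.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE thesaurus     (id INTEGER PRIMARY KEY, name TEXT);            -- LCSH, MeSH, ...
CREATE TABLE subject       (id INTEGER PRIMARY KEY, thesaurus_id INTEGER,
                            heading TEXT);                                 -- the full heading
CREATE TABLE facet         (id INTEGER PRIMARY KEY, subject_id INTEGER,
                            position INTEGER, text TEXT);                  -- split subdivisions
CREATE TABLE item_subject  (item_id INTEGER, subject_id INTEGER,
                            level TEXT);                                   -- book/edition/ISBN/work
CREATE TABLE subject_alias (from_id INTEGER, to_id INTEGER,
                            created_at TEXT, reverted_at TEXT);            -- reversible aliasing
""")

conn.execute("INSERT INTO thesaurus VALUES (1, 'LCSH')")
conn.execute("INSERT INTO subject VALUES (1, 1, 'Man-woman relationships -- Fiction')")
conn.executemany("INSERT INTO facet VALUES (?, 1, ?, ?)",
                 [(1, 0, "Man-woman relationships"), (2, 1, "Fiction")])

print(conn.execute("SELECT text FROM facet ORDER BY position").fetchall())
# -> [('Man-woman relationships',), ('Fiction',)]
```

An alias row pointing one subject at another can be applied at query time and undone later by stamping reverted_at, which is one way to preserve the post's "Ship of Theseus" property: the data keeps changing while nothing is destructively overwritten.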
Tuesday, January 20th, 2015
LibraryThing Recommends in BiblioCommons
Does your library use BiblioCommons as its catalog? LibraryThing and BiblioCommons now work together to give you high-quality reading recommendations in your BiblioCommons catalog. You can see some examples here. Look for "LibraryThing Recommends" on the right side: Not That Kind of Girl (Daniel Boone Regional Library) · Carthage Must Be Destroyed (Ottawa Public Library) · The Martian (Edmonton Public Library) · Little Bear (West Vancouver Memorial Library) · Station Eleven (Chapel Hill Public Library) · The Brothers Karamazov (Calgary Public Library)
Quick facts: As with all LibraryThing for Libraries products, LibraryThing Recommends only recommends other books within a library's catalog. LibraryThing Recommends stretches across media, providing recommendations not just for print titles, but also for ebooks, audiobooks, and other media. LibraryThing Recommends shows up to two titles up front, with up to three displayed under "Show more." Recommendations come from LibraryThing's recommendations system, which draws on hundreds of millions of data points in readership patterns, tags, series, popularity, and other data.
Not using BiblioCommons? Well, you can get LibraryThing recommendations—and much more—integrated in almost every catalog (OPAC and ILS) on earth, with all the same basic functionality, like recommending only books in your catalog, as well as other LibraryThing for Libraries features, like reviews, series and tags. Check out some examples on different systems here: SirsiDynix Enterprise (Saint Louis Public Library) · SirsiDynix Horizon Information Portal (Hume Libraries) · SirsiDynix eLibrary (Spokane County Public Library) · III Encore (Arapahoe Public Library) · III WebPac Pro (Waukegan Public Library) · Polaris (Cape May County Library) · Ex Libris Voyager (University of Wisconsin-Eau Claire)
Interested? BiblioCommons: email info@bibliocommons.com or visit http://www.bibliocommons.com/AugmentedContent. See the full specifics here. Other Systems: email abby@librarything.com or visit http://www.librarything.com/forlibraries.
Labels: Uncategorized posted by Tim @12:43 pm 0 Comments » Share
Thursday, October 16th, 2014
NEW: Annotations for Book Display Widgets
Our Book Display Widgets is getting adopted by more and more libraries, and we're busy making it better and better. Last week we introduced Easy Share. This week we're rolling out another improvement—Annotations! Book Display Widgets is the ultimate tool for libraries to create automatic or hand-picked virtual book displays for their home page, blog, Facebook or elsewhere. Annotations allows libraries to add explanations for their picks.
Some Ways to Use Annotations
1. Explain Staff Picks right on your homepage.
2. Let students know if a book is reserved for a particular class.
3. Add context for special collections displays.
How it Works: Check out the LibraryThing for Libraries Wiki for instructions on how to add Annotations to your Book Display Widgets. It's pretty easy.
Interested? Watch a quick screencast explaining Book Display Widgets and how you can use them. Find out more about LibraryThing for Libraries and Book Display Widgets. And sign up for a free trial of either by contacting ltflsupport@librarything.com.
Labels: Book Display Widgets, librarything for libraries, new feature, new features, widgets posted by KJ @10:21 am 0 Comments » Share
Tuesday, October 14th, 2014
Send us a programmer, win $1,000 in books.
We just posted a new job post: Job: Library Developer at LibraryThing (Telecommute). To sweeten the deal, we are offering $1,000 worth of books to the person who finds them. That's a lot of books.
Rules! You get a $1,000 gift certificate to the local, chain or online bookseller of your choice. To qualify, you need to connect us to someone. Either you introduce them to us—and they follow up by applying themselves—or they mention your name in their email ("So-and-so told me about this"). You can recommend yourself, but if you found out about it from someone else, we hope you'll do the right thing and make them the beneficiary.
Small print: Our decision is final, incontestable, irreversible and completely dictatorial. It only applies when an employee is hired full-time, not part-time, contract or for a trial period. If we don't hire someone for the job, we don't pay. The contact must happen in the next month. If we've already been in touch with the candidate, it doesn't count. Void where prohibited. You pay taxes, and the insidious hidden tax of shelving. Employees and their families are eligible to win, provided they aren't work contacts. Tim is not.
» Job: Library Developer at LibraryThing (Telecommute)
Labels: jobs posted by Tim @10:04 am 1 Comment » Share
Thingology is LibraryThing's ideas blog, on the philosophy and methods of tags, libraries and suchnot.
blog-library-villanova-edu-2758 ---- Falvey Memorial Library Blog: The collection of blogs published by Falvey Memorial Library, Villanova University
blog-library-villanova-edu-6016 ---- Falvey Memorial Library :: The collection of blogs published by Falvey Memorial Library, Villanova University
Falvey Library Blogs
Dig Deeper: Award-Winning Children's Author Beverly Cleary (April 27, 2021 | Library News | Beverly Cleary, Dig Deeper, Library Resources) Disappointed with the children's books she read growing up, Beverly Cleary was determined to tell stories kids could relate to. "I wanted to read funny stories about the sort of children I knew," she wrote, "and I decided ...
In Praise of Scrapple (April 27, 2021 | Blue Electrode: Sparking between Silicon and Paper | digital library, Distinctive Collections, national poetry month, poems) In honor of National Poetry Month, I thought I would share this poem by Philadelphia poet and Villanova alumnus Thomas Augustine Daly (1871-1948). The poem appears in McAroni Ballads and Other Verses (1919), newly digitized in our Digital Library ...
From The Archives: Owl Hop (April 26, 2021 | Blue Electrode: Sparking between Silicon and Paper | Distinctive Collections, University Archives) When you step onto campus, you'll discover Villanova's many unique traditions. Some you may find are as old as the University itself and others are much more recent—but they all play an important role in the life of Villanova students. ...
New Resource: Eighteenth Century Collections Online (April 26, 2021 | Library News | Eighteenth Century Collections Online, Gale Primary Sources, Library Resources) Eighteenth Century Collections Online is broken into two parts and offers full text access to nearly every English-language and foreign-language title printed in the United Kingdom, alongside thousands of works published in the Americas, between ...
An Evening with Sr. Thea Bowman (1937-1990): Songs, Service, Struggle on April 27 (April 26, 2021 | Library News | African American Spirituality, African American Studies, Biblical interpretation, black history, Campus Ministry, Sr. Thea Bowman) The Villanova campus community is invited to join Campus Ministry for an evening of prayer and reflection, April 27, 7-8:15 p.m., with the song and spirit of Sr. Thea Bowman, FSPA. Presenters Rev. Naomi Washington-Leapheart and Michelle Sherman ...
Happy World Book Day and Shakespeare Day (April 23, 2021 | Library News | "William Shakespeare", book, falvey memorial library, Photo Friday, World Book Day) Happy World Book Day and Shakespeare Day! To celebrate the Bard's many contributions to culture and language, we wanted to share this striking edition that is contained in our physical collection. While the collection indeed contains several ...
Content Roundup – Third Week – 2021 (April 23, 2021 | Blue Electrode: Sparking between Silicon and Paper | Content Roundup) This week sees the addition of materials digitized recently, including more Dime Novels and Story Papers and newly acquired letters written from William T. Sherman to Mrs. Mary C. Audenried, widow of Sherman's longtime aide-de-camp. Dime Novel ...
Villanova Open Educational Resource (OER) Adoption Grant (April 22, 2021 | Library News) The Affordable Materials Project (AMP) is offering 5 grants in the amount of $500 to tenure track or continuing faculty to encourage the adoption of an open educational resource (OER) as the primary course material for a class offered in the 2021 ...
TBT: 2019 Climate Strikes (April 22, 2021 | Library News | climate change, Earth Day, Earth Week, Earth Week 2021, TBT, Throwback, throwback Thursday) Here comes a BONUS TBT in honor of Earth Day! The photos featured here come from the March 15, 2019 Climate Strike at Villanova. This was just one of many climate strikes taking place on college campuses across the country. These strikes were ...
blog-libux-co-7064 ---- Library User Experience Community - Medium: A blog and slack community organized around design and the user experience in libraries, non-profits, and the higher-ed web.
A Library System for the Future: This is a what-if story. ...
Alexa, get me the articles (voice interfaces in academia): Thinking about interfaces has led me down a path of all sorts of exciting/mildly terrifying ways of interacting with our devices — from…
Accessibility Information on Library Websites · Is autocomplete on your library home page? · Writing for the User Experience with Rebecca Blakiston · First look at Primo's new user interface · What users expect · Write for LibUX · On the User Experience of Ebooks · Unambitious and incapable men in librarianship
blog-libux-co-8185 ---- Library User Experience Community Homepage: Practical Design Thinking for Libraries · Guest Write ( - we pay!) · Our Slack Community
A Library System for the Future: This is a what-if story. (Kelly Dagan, Feb 25, 2018)
Latest
Alexa, get me the articles (voice interfaces in academia): Thinking about interfaces has led me down a path of all sorts of exciting/mildly terrifying ways of interacting with our devices — from… (Kelly Dagan, Feb 11, 2018)
Accessibility Information on Library Websites: An important part of making your library accessible is advertising that your library's spaces and services are accessible and inclusive. (Carli Spina, Nov 17, 2017)
Is autocomplete on your library home page?
Literature and some testing I’ve done this semester convinces me that autocomplete fundamentally improves the user experience Jaci Paige WilkinsonAug 20, 2017 Writing for the User Experience with Rebecca Blakiston Writing for the User Experience with Rebecca Blakiston 53:25 | Rebecca Blakiston — author of books on usability testing and writing with clarity; Library Journal mover and shaker — talks shop in… Michael SchofieldAug 1, 2017 Write for LibUX Write for LibUX We should aspire to push the #libweb forward by creating content that sets the bar for the conversation way up there, and I would love your… Michael SchofieldApr 28, 2017 First look at Primo’s new user interface First look at Primo’s new user interface Impressions of some key innovations of Primo’s new UI as well as challenges involved making customizations. Ron GilmourFeb 27, 2017 Today, I learned about the Accessibility Tree Today, I learned about the Accessibility Tree If you didn’t think your grip on web accessibility could get any looser. Michael SchofieldFeb 18, 2017 What users expect What users expect We thought it would be fun to emulate some of our favorite sites in a lightweight concept discovery layer we call Libre. Trey GordnerJan 29, 2017 Critical Librarianship in the Design of Libraries Critical Librarianship in the Design of Libraries Design decisions position libraries to more deliberately influence the user experience toward advocacy — such as communicating moral or… Michael SchofieldJan 10, 2017 The Non-Reader Persona The Non-Reader Persona Michael SchofieldDec 1, 2016 IU Libraries’ Redesign and the descending hero search IU Libraries’ Redesign and the descending hero search Michael SchofieldAug 8, 2016 Accessible, sort of — #a11eh Michael SchofieldJul 21, 2016 Create Once, Publish Everywhere Create Once, Publish Everywhere Michael SchofieldJul 17, 2016 Web education must go further than a conference budget Michael SchofieldMay 8, 2016 Blur the Line Between the Website and the Building Michael SchofieldNov 2, 2015 Say “Ok Library” Say “Ok Library” Michael SchofieldOct 28, 2015 Unambitious and incapable men in librarianship Unambitious and incapable men in librarianship Michael SchofieldOct 25, 2015 On the User Experience of Ebooks On the User Experience of Ebooks So, when it comes to ebooks I am in the minority: I prefer them to the real thing. The aesthetic or whats-it about the musty trappings of… Michael SchofieldOct 5, 2015 About Library User Experience CommunityLatest StoriesArchiveAbout MediumTermsPrivacy blog-openlibrary-org-1866 ---- The Open Library Blog The Open Library Blog A web page for every book Introducing the Open Library Explorer Try it here! If you like it, share it. Bringing 100 Years of Librarian-Knowledge to Life By Nick Norman with Drini Cami & Mek At the Library Leaders Forum 2020 (demo), Open Library unveiled the beta for what it’s calling the Library Explorer: an immersive interface which powerfully recreates and enhances the experience of navigating […] Importing your Goodreads & Accessing them with Open Library’s APIs by Mek Today Joe Alcorn, founder of readng, published an article (https://joealcorn.co.uk/blog/2020/goodreads-retiring-API) sharing news with readers that Amazon’s Goodreads service is in the process of retiring their developer APIs, with an effective start date of last Tuesday, December 8th, 2020. 
The topic stirred discussion among developers and book lovers alike, making the front-page of the […] On Bookstores, Libraries & Archives in the Digital Age The following was a guest post by Brewster Kahle on Against The Grain (ATG) – Linking Publishers, Vendors, & Librarians By: Brewster Kahle, Founder & Digital Librarian, Internet Archive​​​​​​​ ​​​Back in 2006, I was honored to give a keynote at the meeting of the Society of American Archivists, when the president of the Society presented me with a […] Amplifying the voices behind books Exploring how Open Library uses author data to help readers move from imagination to impact By Nick Norman, Edited by Mek & Drini According to René Descartes, a creative mathematician, “The reading of all good books is like a conversation with the finest [people] of past centuries.” If that’s true, then who are some of […] Giacomo Cignoni: My Internship at the Internet Archive This summer, Open Library and the Internet Archive took part in Google Summer of Code (GSoC), a Google initiative to help students gain coding experience by contributing to open source projects. I was lucky enough to mentor Giacomo while he worked on improving our BookReader experience and infrastructure. We have invited Giacomo to write a […] Google Summer of Code 2020: Adoption by Book Lovers by Tabish Shaikh & Mek OpenLibrary.org,the world’s best-kept library secret: Let’s make it easier for book lovers to discover and get started with Open Library. Hi, my name is Tabish Shaikh and this summer I participated in the Google Summer of Code program with Open Library to develop improvements which will help book lovers discover […] Open Library for Language Learners By Guyrandy Jean-Gilles 2020-07-21 A quick browse through the App Store and aspiring language learners will find themselves swimming in useful programs. But for experienced linguaphiles, the never-ending challenge is finding enough raw content and media to consume in their adopted tongue. Open Library can help. Earlier this year, Open Library added reading levels to […] Meet the Librarians of Open Library By Lisa Seaberg Are you a book lover looking to contribute to a warm, inclusive library community? We’d love to work with you: Learn more about Volunteering @ Open Library Behind the scenes of Open Library is a whole team of developers, data scientists, outreach experts, and librarians working together to make Open Library better […] Re-thinking Open Library’s Book Pages by Mek Karpeles, Tabish Shaikh We’ve redesigned our Book Pages: Before →After. Please share your feedback with us. A web page for every book… This is the mission of Open Library: a free, inclusive, online digital library catalog which helps readers find information about any book ever published. Millions of books in Open Library’s catalog […] Reading Logs: Going Public & Helping Book Lovers Share Hi book lovers, Starting 2020-05-26, Reading Logs for new Open Library accounts will be public by default. Readers may go here to view or manage their Reading Log privacy preferences. This will not affect the privacy of your reading history — only books which you explicitly mark as Want to Read, Currently Reading, or Already […] blog-openlibrary-org-2691 ---- The Open Library Blog | A web page for every book The Open Library Blog A web page for every book Skip to content About « Older posts Introducing the Open Library Explorer By mek | Published: December 16, 2020 Try it here! If you like it, share it. 
Bringing 100 Years of Librarian-Knowledge to Life By Nick Norman with Drini Cami & Mek At the Library Leaders Forum 2020 (demo), Open Library unveiled the beta for what it’s calling the Library Explorer: an immersive interface which powerfully recreates and enhances the experience of navigating a physical library. If the tagline doesn’t grab your attention, wait until you see it in action: Drini showcasing Library Explorer at the Library Leaders Forum Get Ready to Explore In this article, we’ll give you a tour of the Open Library Explorer and teach you how one may take full advantage of its features. You’ll also get a crash course on the 100+ years of library history which led to its innovation and an opportunity to test-drive it for yourself. So let’s get started!   What better way to set the stage than by taking a trip down memory lane to the last time you were able to visit your local public library. As you pass the front desk, a friendly librarian scribbles some numbers on a piece of paper which they hand to you and points you towards a relevant section. With the list of library call numbers in your hand as your compass, you eagerly make your way through waves of towering bookshelves. Suddenly, you depart from reality and find yourself navigating through a sea of books, discovering treasures you didn’t even know existed. Library photo courtesy of pixy.org/5775865/ Before you know it, one book gets stuffed under one arm, two more books go under your other arm, and a few more books get positioned securely between your knees. You’re doing the math to see how close you are to your check-out limit. Remember those days? What if you could replicate that same library experience and access it every single day, from the convenience of your web browser? Well, thanks to the new Open Library Explorer, you can experience the joys of a physical library right in your web browser, as well as leverage superpowers which enable you to explore in ways which may have previously been impossible. Before we dive into the bells-and-whistles of the Library Explorer, it’s worth learning how and why such innovations came to be. Who needs Library Explorer? This year we’ve seen systems stressed to their max due to the COVID-19 pandemic. With libraries and schools closing their doors globally and stay-at-home orders hampering our access, there has been a paradigm shift in the needs of researchers, educators, students, and families to access fundamental resources online. Getting this information online is a challenge in and of itself. Making it easy to discover and use materials online is another entirely. How does one faithfully compress the entire experience of a reliable, unbiased, expansive public library and its helpful, friendly staff into a 14” computer screen? Some sites, like Netflix or YouTube, solve this problem with recommendation engines that populate information based on what people have previously seen or searched. Consequently, readers may unknowingly find themselves caught in a sort of “algorithmic bubble.” An algorithmic bubble (or “filter bubble”) is a state of intellectual or informational isolation that’s perpetuated by personalized content. Algorithmic bubbles can make it difficult for users to access information beyond their own opinions—effectively isolating them in their own cultural or ideological silos.  Drini Cami, the creator of Library Explorer, says that users’ caught inside these algorithmic bubbles “won’t be exposed to information that is completely foreign to [them]. 
There is no way to systematically and feasibly explore.” Hence, the Library Explorer grew out of a need to discover information without the constraints of algorithmic bubbles. As readers are exposed to more information, the question becomes: how can readers fully explore swaths of new information and still enjoy the experience? Let’s take a look at how the Library Explorer tackles that half of the problem. Humanity’s Knowledge Brought to Life Earlier this year, Open Library added the ability to search materials by both Dewey Decimal Classification and Library of Congress Classification. These systems embed over 100 years of librarian experience and provide a systematized approach to sorting through the entirety of humanity’s knowledge embedded in books. It is important to note that the systematization of knowledge alone does not necessarily make it easily discoverable. This is what makes the Library Explorer so special. Its digital interface opens the door for readers to seamlessly navigate centuries of books anywhere online. Thanks to innovations such as the Library Explorer, readers can explore more books and access more knowledge with a better experience. A tour of Library Explorer’s features If you’re pulling up a chair for the first time, the Library Explorer presents you with tall, clickable bookshelves situated across your screen. Each shelf has its own identity that can morph into new classes of books and subject categories with a single click. And that’s only the beginning of what it offers. In addition to those smart filters, the Library Explorer wants you to steer the ship… not the other way around. In other words, you can personalize single rows of books, expand entire shelves, or construct an entire library experience that revolves around your exact interests. You can custom-tailor your own personal library from the comfort of your device, wherever you may be. Quick question: as a kid, did you ever lay out your newly checked-out library books on your bed to admire them? Well, the creators behind the Library Explorer found a way to mimic that same experience. If you so choose, you can zoom out of the Library Explorer interface to get a complete view of the library you’ve constructed. Let’s explore one more set of cool features the Library Explorer offers by clicking on the “Filter” icon at the bottom of the page. By selecting “Juvenile,” you can instantly transform your entire library into a children’s library, but keep all the useful organization and structure provided by the bookshelves. It’s as if your own personal librarian ran in at lightning speed and removed every book from each shelf that didn’t meet your criteria. Or you may type in “subject:biography” and suddenly your entire library shows you a tailored collection of just biographies on every subject. The sky is the limit. If you click on the Settings tab, you’re given several options to customize the look and feel of your personal Library Explorer. You can switch between using Library of Congress or Dewey Decimal classification to organize your shelves. You can also choose from a variety of delightful options to see your books in 3D. Each book has the correct thickness, determined by its actual number of pages. To see your favorite book in 3D, click the settings icon at the bottom of the screen and then press the 3D button. Library Explorer’s 3D view Maybe you’ve experienced a time when you had limited space in your book bag.
Perhaps because of that, you chose to wait on checking out heavier books. Or, maybe you judged a book’s strength of knowledge based on its thickness. If that’s you, guess what? The Open Library Explorer lets you do that.  It gets personal… The primary goal of the Library Explorer was to create an experimental interface that ‘opens the door’ for readers to locate new books and engage with their favorite books. The Library Explorer is one of many steps that both the Internet Archive and the Open Library have made towards making knowledge easy to discover. As you know, such innovation couldn’t be possible without people who believe in the necessity of reading. Here is a list of the names of those who contributed to the creation of the Library Explorer: Drini Cami, Open Library Developer and Library Explorer Creator Mek Karpeles, Open Library Program Lead Jim Shelton, UX Designer, Internet Archive Ziyad Basheer, Product Designer Tinnei Pang, Illustrator and Product Designer James Hill-Khurana, Product Designer Nick Norman, Open Library Storyteller & Volunteer Communications Lead  Well, this is the moment you’ve been waiting for. Go here and give the Library Explorer a beta test-run. Also, follow @OpenLibrary on Twitter to learn about other features as soon as they’re released. But before you go… in the comments below, tell us your favorite library experience. We’d love to hear! Posted in Uncategorized | Comments closed Importing your Goodreads & Accessing them with Open Library’s APIs By mek | Published: December 13, 2020 by Mek Today Joe Alcorn, founder of readng, published an article (https://joealcorn.co.uk/blog/2020/goodreads-retiring-API) sharing news with readers that Amazon’s Goodreads service is in the process of retiring their developer APIs, with an effective start date of last Tuesday, December 8th, 2020. A screenshot taken from Joe Alcorn’s post The topic stirred discussion among developers and book lovers alike, making the front-page of the popular Hacker News website. Hacker News at 2020-12-13 1:30pm Pacific. The Importance of APIs For those who are new to the term, an API is a method of accessing data in a way which is designed for computers to consume rather than people. APIs often allow computers to subscribe to (i.e. listen for) events and then take actions. For example, let’s say you wanted to tweet every time your favorite author published a new book. One could sit on Goodreads and refresh the website every fifteen minutes. Or, one might write a twitter bot which automatically connects to Goodreads and checks real-time data using its API. In fact, the reason why Twitter bots work, is that they use Twitter’s API, a mechanism which lets specially designed computer programs submit tweets to the platform. As one of the more popular book services online today, tens of thousands of readers and organizations rely on Amazon’s Goodreads APIs to lookup information about books and to power their book-related applications across the web. Some authors rely on the data to showcase their works on their personal homepages, online book stores to promote their inventory, innovative new services like thestorygraph are using this data to help readers discover new insights, and even librarians and scholastic websites rely on book data APIs to make sure their catalog information is as up to date and accurate as possible for their patrons. 
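To make the bot example above concrete, here is a minimal sketch (added for illustration, not part of the original post) of the same idea built on Open Library's public author works endpoint rather than the retiring Goodreads API. The author key, the polling interval, and the assumption that the endpoint returns an "entries" list of works are all illustrative assumptions to verify against the current API documentation.

```python
# A minimal sketch (not from the original post) of the "notify me when my
# favorite author publishes something new" idea, using Open Library's
# author works endpoint instead of the retired Goodreads API.
import time
import requests

AUTHOR_KEY = "OL23919A"   # illustrative/assumed author key
POLL_SECONDS = 15 * 60    # check every fifteen minutes, as in the post's example

def fetch_work_keys(author_key: str) -> set:
    """Return the set of work identifiers currently listed for an author."""
    url = f"https://openlibrary.org/authors/{author_key}/works.json?limit=1000"
    response = requests.get(url, timeout=30)
    response.raise_for_status()
    return {entry["key"] for entry in response.json().get("entries", [])}

def watch(author_key: str) -> None:
    seen = fetch_work_keys(author_key)
    while True:
        time.sleep(POLL_SECONDS)
        current = fetch_work_keys(author_key)
        for new_key in current - seen:
            # A real bot would post to Twitter here; this sketch just prints.
            print(f"New work detected: https://openlibrary.org{new_key}")
        seen = current

if __name__ == "__main__":
    watch(AUTHOR_KEY)
```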
For years, the Open Library team has been enthusiastic about sharing the book space with friends like Goodreads, who have historically shown great commitment by enabling patrons to control (download and export) their own data and enabling developers to create flourishing ecosystems which promote books and readership through their APIs. When it comes to serving an audience of book lovers, there is no “one size fits all,” and we’re glad so many different platforms and APIs exist to provide experiences which meet the needs of different communities. And we’d like to do our part to keep the landscape flourishing. “The sad thing is it [retiring their APIs] really only hurts the hobbyist projects and Goodreads users themselves.” — Joe Alcorn Picture of Aaron Swartz by Noah Berger/Landov from thedailybeast At Open Library, our top priority is pursuing Aaron Swartz’s original mission: to serve as an open book catalog for the public (one page for every book ever published) and ensure our community always has free, open data to unlock a world of possibilities. A world which believes in the power of reading to preserve our cultural heritage and empower education and understanding. We sincerely hope that Amazon will decide it’s in Goodreads’ best interests to reinstate their APIs. But either way, Open Library is committed to helping readers, developers, and all book lovers have autonomy over their data and direct access to the data they rely on. One reason patrons appreciate Open Library is that it aligns with their values. Imports & Exports In August 2020, one of our Google Summer of Code contributors, Tabish Shaikh, helped us implement an export option for Open Library Reading Logs to help everyone retain full control of their book data. We also created a Goodreads import feature to help patrons who may want an easy way to check which Goodreads titles may be available to borrow from the Internet Archive’s Controlled Digital Lending program via openlibrary.org, and to help patrons organize all their books in one place. We didn’t make a fuss about this feature at the time, because we knew patrons have a lot of options. But things can change quickly, and we want patrons to be able to make that decision for themselves. For those who may not have known, Amazon’s Goodreads website provides an option for downloading/exporting a list of books from one’s bookshelves. You may find instructions on this Goodreads export process here. Open Library’s Goodreads importer enables patrons to take this exported dump of their Goodreads bookshelves and automatically add matching titles to their Open Library Reading Logs. The Goodreads import feature from https://openlibrary.org/account/import Known issues. Currently, Open Library’s Goodreads Importer only works for (a) titles that are in the Open Library catalog and (b) titles which are new enough to have ISBNs. Our staff and community are committed to continuing to improve our catalog to include more titles (we added more than 1M titles this year), and we plan to improve our importer to support other ID types like OCLC and LOC. APIs & Data Developers and book lovers who have been relying on Amazon’s Goodreads APIs are not out of luck. There are several wonderful services, many of them open source, including Open Library, which offer free APIs: Wikidata.org (by the same group who brought us Wikipedia) is a treasure trove of metadata on Authors and Books. Open Library gratefully leverages this powerful resource to enrich our pages.
Inventaire.io is a wonderful service which uses Wikidata and Open Library data (API: api.inventaire.io) Bookbrainz.org (by the group who runs Musicbrainz) is an up-and-coming catalog of books WorldCat by OCLC offers various metadata APIs Did we miss any? Please let us know! We’d love to work together, build stronger integrations with, and support other book-loving services. Open Library’s APIs. And of course, Open Library has a free, open Book API which spans nearly 30 million books. Bulk Data. If you need access to all our data, Open Library releases a free monthly bulk data dump of Authors, Books, and more. Spoiler: Everything on Open Library is an API! One of my favorite parts of Open Library is that practically every page is an API. All that is required is adding “.json” to the end. Here are some examples: Search https://openlibrary.org/search?q=lord+of+the+rings is our search page for humans… https://openlibrary.org/search.json?q=lord+of+the+rings is our Search API! Books https://openlibrary.org/books/OL25929351M/Harry_Potter_and_the_Methods_of_Rationality is the human page for Harry Potter and the Methods of Rationality… https://openlibrary.org/books/OL25929351M.json is its API! Authors https://openlibrary.org/authors/OL2965893A/Rik_Roots is a human-readable author page… https://openlibrary.org/authors/OL2965893A.json and here is the API! Did We Mention: Full-text Search over 4M Books? Major hat tip to the Internet Archive’s Giovanni Damiola for this one: Folks may also appreciate the ability to full-text search across 4M of the Internet Archive’s books (https://blog.openlibrary.org/2018/07/14/search-full-text-within-4m-books) on Open Library: You can try it directly here: http://openlibrary.org/search/inside?q=thanks%20for%20all%20the%20fish As per usual, nearly all Open Library URLs are themselves APIs, e.g.: http://openlibrary.org/search/inside.json?q=thanks%20for%20all%20the%20fish Get Involved Questions? Open Library is a free, open-source, nonprofit project run by the Internet Archive. We do our development transparently in public (here’s our code), and our community, spanning more than 40 volunteers, meets every week, Tuesday @ 11:30am Pacific. Please contact us to join our call and participate in the process. Bugs? If something isn’t working as expected, please let us know by opening an issue or joining our weekly community calls. Want to share thanks? Please follow up on twitter: https://twitter.com/openlibrary and let us know how you’re using our APIs! Thank you A special thank you to our lead developers Drini Cami, Chris Clauss, and one of our lead volunteer engineers, Aaron, for spending their weekend helping fix a Python 3 bug which was temporarily preventing Goodreads imports from succeeding. A Decentralized Future The Internet Archive has a history of cultivating and supporting the decentralized web. We operate a decentralized version of archive.org and host regular meetups and summits to galvanize the distributed web community. In the future, we can imagine a world where no single website controls all of your data, but rather patrons can participate in a decentralized, distributed network. You may be interested to try Bookwyrm, an open-source decentralized project by Mouse, a former engineer on the Internet Archive’s Archive-It team.
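Before moving on, here is a small sketch (an editorial illustration, not from the original post) of the ".json suffix" pattern described above, using the Search and Books URLs listed earlier. The response field names used (docs, title, key) reflect the public Search API as commonly documented and should be treated as assumptions to verify.

```python
# A small sketch of the ".json suffix" pattern: the same URLs shown for
# humans become APIs when ".json" is appended.
import requests

def search_books(query: str, limit: int = 5) -> list:
    """Call the Search API and return the list of result documents."""
    resp = requests.get(
        "https://openlibrary.org/search.json",
        params={"q": query, "limit": limit},
        timeout=30,
    )
    resp.raise_for_status()
    return resp.json().get("docs", [])

def get_record(key: str) -> dict:
    """Fetch a work, edition, or author record; key looks like '/books/OL25929351M'."""
    resp = requests.get(f"https://openlibrary.org{key}.json", timeout=30)
    resp.raise_for_status()
    return resp.json()

if __name__ == "__main__":
    for doc in search_books("lord of the rings"):
        print(doc.get("title"), "->", doc.get("key"))
```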
Posted in Uncategorized | Comments closed On Bookstores, Libraries & Archives in the Digital Age By Brewster Kahle | Published: October 7, 2020 The following was a guest post by Brewster Kahle on Against The Grain (ATG) – Linking Publishers, Vendors, & Librarians On Bookstores, Libraries & Archives in the Digital Age-An ATG Guest Post See the original article here on ATG’s website By: Brewster Kahle, Founder & Digital Librarian, Internet Archive​​​​​​​ ​​​Back in 2006, I was honored to give a keynote at the meeting of the Society of American Archivists, when the president of the Society presented me with a framed blown-up letter “S.”  This was an inside joke about the Internet Archive being named in the singular, Archive, rather than the plural Archives. Of course, he was right, as I should have known all along. The Internet Archive had long since grown out of being an “archive of the Internet”—a singular collection, say of web pages—to being “archives on the Internet,” plural.  My evolving understanding of these different names might help focus a discussion that has become blurry in our digital times: the difference between the roles of publishers, bookstores, libraries, archives, and museums. These organizations and institutions have evolved with different success criteria, not just because of the shifting physical manifestation of knowledge over time, but because of the different roles each group plays in a functioning society. For the moment, let’s take the concepts of Library and Archive. The traditional definition of a library is that it is made up of published materials, while an archive is made up of unpublished materials. Archives play an important function that must be maintained—we give frightfully little attention to collections of unpublished works in the digital age. Think of all the drafts of books that have disappeared once we started to write with word processors and kept the files on fragile computer floppies and disks. Think of all the videotapes of lectures that are thrown out or were never recorded in the first place.  Bookstores: The Thrill of the Hunt Let’s try another approach to understanding distinctions between bookstores, libraries and archives. When I was in my 20’s living in Boston—before Amazon.com and before the World Wide Web (but during the early Internet)—new and used bookstores were everywhere. I thought of them as catering to the specialized interests of their customers: small, selective, and only offering books that might sell and be taken away, with enough profit margin to keep the store in business. I loved them. I especially liked the used bookstore owners—they could peer into my soul (and into my wallet!) to find the right book for me. The most enjoyable aspect of the bookstore was the hunt—I arrived with a tiny sheet of paper in my wallet with a list of the books I wanted, would bring it out and ask the used bookstore owners if I might go home with a bargain. I rarely had the money to buy new books for myself, but I would give new books as gifts. While I knew it was okay to stay for awhile in the bookstore just reading, I always knew the game. Libraries: Offering Conversations not Answers The libraries that I used in Boston—MIT Libraries, Harvard Libraries, the Boston Public Library—were very different. I knew of the private Boston Athenæum but I was not a member, so I could not enter. Libraries for me seemed infinite, but still tailored to individual interests. 
They had what was needed for you to explore and if they did not have it, the reference librarian would proudly proclaim: “We can get it for you!” I loved interlibrary loans—not so much in practice, because it was slow, but because they gave you a glimpse of a network of institutions sharing what they treasured with anyone curious enough to want to know more. It was a dream straight out of Borges’ imagination (if you have not read Borges’ short stories, they are not to be missed, and they are short. I recommend you write them on the little slip of paper you keep in your wallet.) I couldn’t afford to own many of the books I wanted, so it turned off that acquisitive impulse in me. But the libraries allowed me to read anything, old and new. I found I consumed library books very differently. I rarely even brought a book from the shelf to a table; I would stand, browse, read, learn and search in the aisles. Dipping in here and there. The card catalog got me to the right section and from there I learned as I explored.  Libraries were there to spark my own ideas. The library did not set out to tell a story as a museum would. It was for me to find stories, to create connections, have my own ideas by putting things together. I would come to the library with a question and end up with ideas.  Rarely were these facts or statistics—but rather new points of view. Old books, historical newspapers, even the collection of reference books all illustrated points of view that were important to the times and subject matter. I was able to learn from others who may have been far away or long deceased. Libraries presented me with a conversation, not an answer. Good libraries cause conversations in your head with many writers. These writers, those librarians, challenged me to be different, to be better.  Staying for hours in a library was not an annoyance for the librarians—it was the point. Yes, you could check books out of the library, and I would, but mostly I did my work in the library—a few pages here, a few pages there—a stack of books in a carrel with index cards tucked into them and with lots of handwritten notes (uh, no laptops yet). But libraries were still specialized. To learn about draft resisters during the Vietnam War, I needed access to a law library. MIT did not have a law collection and this was before Lexis/Nexis and Westlaw. I needed to get to the volumes of case law of the United States.  Harvard, up the road, had one of the great law libraries, but as an MIT student, I could not get in. My MIT professor lent me his ID that fortunately did not include a photo, so I could sneak in with that. I spent hours in the basement of Harvard’s Law Library reading about the cases of conscientious objectors and others.  But why was this library of law books not available to everyone? It stung me. It did not seem right.  A few years later I would apply to library school at Simmons College to figure out how to build a digital library system that would be closer to the carved words over the Boston Public Library’s door in Copley Square:  “Free to All.”   Archives: A Wonderful Place for Singular Obsessions When I quizzed the archivist at MIT, she explained what she did and how the MIT Archives worked. I loved the idea, but did not spend any time there—it was not organized for the busy undergraduate. The MIT Library was organized for easy access; the MIT Archives included complete collections of papers, notes, ephemera from others, often professors. 
It struck me that the archives were collections of collections. Each collection faithfully preserved and annotated.  I think of them as having advertisements on them, beckoning the researcher who wants to dive into the materials in the archive and the mindset of the collector. So in this formulation, an archive is a collection, archives are collections of collections.  Archivists are presented with collections, usually donations, but sometimes there is some money involved to preserve and catalog another’s life work. Personally, I appreciate almost any evidence of obsession—it can drive toward singular accomplishments. Archives often reveal such singular obsessions. But not all collections are archived, as it is an expensive process. The cost of archiving collections is changing, especially with digital materials, as is cataloging and searching those collections. But it is still expensive. When the Internet Archive takes on a physical collection, say of records, or old repair manuals, or materials from an art group, we have to weigh the costs and the potential benefits to researchers in the future.  Archives take the long view. One hundred years from now is not an endpoint, it may be the first time a collection really comes back to light. Digital Libraries: A Memex Dream, a Global Brain So when I helped start the Internet Archive, we wanted to build a digital library—a “complete enough” collection, and “organized enough” that everything would be there and findable. A Universal Library. A Library of Alexandria for the digital age. Fulfilling the memex dream of Vanevar Bush (do read “As We May Think“), of Ted Nelson‘s Xanadu, of Tim Berners-Lee‘s World Wide Web, of Danny Hillis‘ Thinking Machine, Raj Reddy’s Universal Access to All Knowledge, and Peter Russell’s Global Brain. Could we be smarter by having people, the library, networks, and computers all work together?  That is the dream I signed on to.  I dreamed of starting with a collection—an Archive, an Internet Archive. This grew to be  a collection of collections: Archives. Then a critical mass of knowledge complete enough to inform citizens worldwide: a Digital Library. A library accessible by anyone connected to the Internet, “Free to All.” About the Author: Brewster Kahle, Founder & Digital Librarian, Internet Archive Brewster Kahle A passionate advocate for public Internet access and a successful entrepreneur, Brewster Kahle has spent his career intent on a singular focus: providing Universal Access to All Knowledge. He is the founder and Digital Librarian of the Internet Archive, one of the largest digital libraries in the world, which serves more than a million patrons each day. Creator of the Wayback Machine and lending millions of digitized books, the Internet Archive works with more than 800 library and university partners to create a free digital library, accessible to all. Soon after graduating from the Massachusetts Institute of Technology where he studied artificial intelligence, Kahle helped found the company Thinking Machines, a parallel supercomputer maker. He is an Internet pioneer, creating the Internet’s first publishing system called Wide Area Information Server (WAIS). In 1996, Kahle co-founded Alexa Internet, with technology that helps catalog the Web, selling it to Amazon.com in 1999.  
Elected to the Internet Hall of Fame, Kahle is also a Fellow of the American Academy of Arts and Sciences, a member of the National Academy of Engineering, and holds honorary library doctorates from Simmons College and University of Alberta. Posted in Discussion, Librarianship, Uncategorized | Comments closed Amplifying the voices behind books By mek | Published: September 2, 2020 Exploring how Open Library uses author data to help readers move from imagination to impact By Nick Norman, Edited by Mek & Drini Image Source: Pexels / Pixabay from popsugar According to René Descartes, a creative mathematician, “The reading of all good books is like a conversation with the finest [people] of past centuries.” If that’s true, then who are some of the people you’re talking to? If you’re not sure how to answer that question, you’ll definitely appreciate the ‘Author Stats’ feature developed by Open Library. A deep dive into author stats Author stats give readers clear insights about their favorite authors that go much deeper than the front cover: such as birthplace, gender, works by time, ethnicity, and country of citizenship. These bits and pieces of knowledge about authors can empower readers in some dynamic ways. But how exactly? To answer that question, consider a reader who’s passionate about the topic of cultural diversity. However, after the reader examines their personalized author stats, they realize that their reading history lacks diversity. This doesn’t mean the reader isn’t passionate about cultural diversity; rather, author stats empowers the reader to pinpoint specific stats that can be diversified. Take a moment … or a day, and think about all the books you’ve read — just in the last year or as far back as you can. What if you could align the pages of each of those books with something meaningful … something that matters? What if each time you cracked open a book, the voices inside could point you to places filled with hope and opportunity? According to Drini Cami — Open Library’s lead developer behind Author Stats , “These stats let readers determine where the voices they read are coming from.” Drini continues saying, “A book can be both like a conversation as well as a journey.” He also says, “Statistics related to the authors might help provide readers with feedback as to where the voices they are listening to are coming from, and hopefully encourage the reading of books from a wider variety of perspectives.” Take a moment to let that sink in. Data with the power to change While Open Library’s author stats can show author-related demographics, those same stats can do a lot more than that. Drini Cami went on to say that, “Author stats can help readers intelligently alter their  behavior (if they wish to).” A profound statement that Mark Twain — one of the best writers in American history — might even shout from the rooftop. Broad, wholesome, charitable views of [people] … cannot be acquired by vegetating in one little corner of the earth all one’s lifetime. — Mark Twain In the eyes of Drini Cami and Mark Twain, books are like miniature time machines that have the power to launch readers into new spaces while changing their behaviors at the same time. For it is only when a reader steps out of their corner of the earth that they can step forward towards becoming a better person — for the entire world. Connecting two worlds of data Open Library has gone far beyond the extra mile to provide data about author demographics that some readers may not realize. 
It started with Open Library’s commitment to providing its readers with what Drini Cami describes as “clean, organized, structured, queryable data.” Simply put, readers can trust that Open Library’s data can be used to provide its audiences with maximum value. Which raises the question: where is all that ‘value’ coming from? Drini Cami calls it “linked data”. In not-so-complex terms, you may think of linked data as being two or more storage sheds packed with data. When these storage sheds are connected, well… that’s when the magic happens. For Open Library, that magic starts at the link between the Wikidata and Open Library knowledge bases. Wikidata, a non-profit, community-powered project run by Wikimedia, the same team which brought us Wikipedia, is a “free and open knowledge base that can be read and edited by both humans and machines”. It’s like Wikipedia, except for storing bite-sized encyclopedic data and facts instead of articles. If you look closely, you may even find some of Wikidata’s data being leveraged within Wikipedia articles. Wikipedia’s Summary Info Box Source data in Wikidata Wikidata is where Open Library gets its author demographic data from. This is possible because the entries on Wikidata often include links to source material such as books, authors, learning materials, e-journals, and even to other knowledge bases like Open Library’s. Because of these links, Open Library is able to share its data with Wikidata and oftentimes get back detailed information and structured data in return, such as author demographics. Wrangling in the Data Linking up services like Wikidata and Open Library doesn’t happen automatically. It requires the hard work of “Metadata Wranglers”. That’s where Charles Horn comes in, the lead Data Engineer at Open Library — without his work, author stats would not be possible. Charles Horn works closely with Drini Cami and also the team at Wikidata to connect book and author resources on Open Library with the data kept inside Wikidata. By writing clever bots and scripts, Charles and Drini are able to make tens of thousands of connections at scale. To put it simply, as both Open Library and Wikidata grow, their resources and data will become better connected and more accurate. Thanks to the help of “Metadata Wranglers”, Open Library users will always have the smartest results — right at their fingertips. It’s in a book … Once upon a time, ten-time Grammy Award winner Chaka Khan greeted television viewers with her bright voice on the once-popular book reading program, Reading Rainbow. In her words, she sang … “Friends to know, and ways to grow, a Reading Rainbow. I can be anything. Take a look, it’s in a book …” Thanks to Open Library’s author stats, not only do readers have the power to “take a look” into books, they can see further, and truly change what they see. Try browsing your author stats and consider following Open Library on twitter. The “My Reading Stats” option may be found under the “My Books” drop-down menu within the main site’s top navigation. What did you learn about your favorite authors? Please share in the comments below. Posted in Community, Cultural Resources, Data | Comments closed Giacomo Cignoni: My Internship at the Internet Archive By Drini Cami | Published: August 29, 2020 This summer, Open Library and the Internet Archive took part in Google Summer of Code (GSoC), a Google initiative to help students gain coding experience by contributing to open source projects.
I was lucky enough to mentor Giacomo while he worked on improving our BookReader experience and infrastructure. We have invited Giacomo to write a blog post to share some of the wonderful work he has done and his learnings. It was a pleasure working with you, Giacomo, and we all wish you the best of luck with the rest of your studies! – Drini Hi, I am Giacomo Cignoni, a 2nd-year computer science student from Italy. I submitted my 2020 Google Summer of Code (GSoC) project proposal to work with the Internet Archive, and I was selected for it. In this blog post, I want to tell you about my experience and my accomplishments working this summer on BookReader, Internet Archive’s open source book reading web application. The BookReader features I enjoyed working on the most are page filters (which include “dark mode”) and the text selection layer for certain public domain books. They were both challenging, but mostly had a great impact on the user experience of BookReader. The first permits turning white-background, black-text pages into black-background, white-text ones, and the second allows text to be selected and copied directly from the page images (currently in internal testing). Short summary of implemented features: End-to-end testing (search, autoplay, right-to-left books) Generic book from Internet Archive demo Mobile BookReader table of contents Checkbox for filters on book pages (including dark mode) Text selection layer plugin for public domain books Bug fixes for page flipping Using high resolution book images bug fix First approach to GSoC experience Once I received the news that I had been selected for GSoC with the Internet Archive for my BookReader project, I was really excited, as it was the beginning of a new experience for me. For the same reason, I will not hide that I was a little bit nervous, because it was my first internship-like experience. Fortunately, even from the start, my mentor Drini and also Mek were supportive and ready to offer help. Moreover, the fact that I was already familiar with BookReader was helpful, as I had already used it (and even modified it a little bit) for a personal project. For most of the month of May, starting on the 6th, the day of the GSoC selection, I mainly focused on getting to know the other members of the UX team at the Internet Archive, whom I would be working with for the rest of the summer, and also on defining a more precise roadmap of my future work with my mentor, as my proposed project was open to any improvements for BookReader. End-to-end testing The first tasks I worked on, as stated in the project, were about end-to-end testing for BookReader. I learned about the Testcafe tool that was to be used, and my first real task was to explore and remove some old QUnit tests (#308). Then I started to write end-to-end tests for the search feature in BookReader, both for desktop (#314) and mobile (#322). Lastly, I fixed the existing autoplay end-to-end test (#344) that was causing problems, and I also prepared end-to-end tests for right-to-left books (#350), but they weren’t merged immediately because they needed a feature that I would implement later: a system to choose different books from the IA servers to be displayed by specifying the book id in the URL. This work on testing (which lasted until the ~20th of June) was really helpful at the beginning, as it allowed me to gain more confidence with the codebase without immediately trying harder tasks, and also to gain more confidence with JavaScript ES6.
The frequent meetings with my mentor and other members of the team made me really feel part of the workplace. Working on the source code The table of contents panel in BookReader mobile My first experience working on core BookReader source code was during the Internet Archive hackathon on May the 30th when, with the help of my mentor, I created the first draft for the table of content panel for mobile BookReader. I would then resume to work on this feature in July, refining it until it was released (#351). I then worked on a checkbox to apply different filters to the book page images, still on mobile BookReader (#342), which includes a sort of “dark mode”. This feature was probably the one I enjoyed the most working on, as it was challenging but not too difficult, it included some planning and was not purely technical and received great appreciation from users. Page filters for BookReader mobile let you read in a “dark mode” https://twitter.com/openlibrary/status/1280184861957828608 Then I worked on the generic demo feature; a particular demo for BookReader which allows you to choose a book  from the Internet Archive servers to be displayed, by simply adding the book id in the URL as a parameter (#356). This allowed the right to left e2e test to be merged and proved to be useful for manually testing the text selection plugin. In this period I also fixed two page flipping issues: one more critical (when flipping pages in quick succession the pages started turning back and forth randomly) (#386), and the other one less urgent, but it was an issue a user specifically pointed out (in an old BookReader demo it was impossible to turn pages at all) (#383). Another issue I solved was BookReader not correctly displaying high resolution images on high resolution displays (#378). Open source project experience One aspect I really enjoyed of my GSoC is the all-around experience of working on an open source project. This includes leaving more approachable tasks for the occasional member of the community to take on and helping them out. Also, I found it interesting working with other members of the team aside from my mentor, both for more technical reasons and for help in UI designing and feedback about the user experience: I always liked having more points of view about my work. Moreover, direct user feedback from the users, which showed appreciation for the new implemented features (such as BookReader “dark mode”), was very motivating and pushed me to do better in the following tasks. Text selection layer The normally invisible text layer shown red here for debugging The biggest feature of my GSoC was implementing the ability to select text directly on the page image from BookReader for public domain books, in order to copy and paste it elsewhere (#367). This was made possible because Internet Archive books have information about each word and its placement in the page, which is collected by doing OCR. To implement this feature we decided to use an invisible text layer placed on top of the page image, with words being correctly positioned and scaled. This made it possible to use the browser’s text selection system instead of creating a new one. The text layer on top of the page was implemented using an SVG element, with subelements for each paragraph and word in the page. The use of the SVG instead of normal html text elements made it a lot easier to overcome most of the problems we expected to find regarding the correct placement and scaling of words in the layer. 
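To make the approach more tangible, here is a rough sketch of the technique described above. It is not BookReader's actual code, and the OCR word-box format (text plus x, y, w, h in page pixels) is an assumption; the idea is simply to emit an SVG layer whose transparent text elements sit where the scanned words are, so the browser's native selection and copy behavior can be reused.

```python
# Not BookReader's implementation — a minimal illustration of an invisible
# SVG text layer positioned over a page image so native text selection works.
import xml.etree.ElementTree as ET

def build_text_layer(words, page_width, page_height):
    """Return an SVG string with one transparent <text> element per OCR word."""
    svg = ET.Element("svg", {
        "xmlns": "http://www.w3.org/2000/svg",
        "viewBox": f"0 0 {page_width} {page_height}",
        # The layer is absolutely positioned on top of the page image.
        "style": "position:absolute; top:0; left:0; width:100%; height:100%;",
    })
    for word in words:
        text = ET.SubElement(svg, "text", {
            "x": str(word["x"]),
            "y": str(word["y"] + word["h"]),   # baseline at the bottom of the word box
            "textLength": str(word["w"]),      # stretch glyphs to match the scanned word
            "lengthAdjust": "spacingAndGlyphs",
            "font-size": str(word["h"]),
            "fill-opacity": "0",               # invisible, but still selectable
        })
        text.text = word["text"]
    return ET.tostring(svg, encoding="unicode")

if __name__ == "__main__":
    sample = [{"text": "Open", "x": 100, "y": 200, "w": 120, "h": 40},
              {"text": "Library", "x": 230, "y": 200, "w": 160, "h": 40}]
    print(build_text_layer(sample, page_width=1600, page_height=2400))
```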
I started working sporadically on this feature at the start of July, and this led to having a workable demo by the first day of August. The rest of the month of August was spent refining this feature to make it production-ready. This included refining word placement in the layer, adding unit tests, adding support for more browsers, refactoring some functions, making the experience more fluid, and making the selected text accurate for newlines and spaces on copy. The most challenging part was probably integrating the text selection actions well into the two-page view of BookReader, without disrupting the click-to-flip-page and other functionalities related to mouse-click events. This feature is currently in internal testing, and scheduled for release in the next few weeks. The text selection experience Conclusions Overall, I was extremely satisfied with my GSoC at the Internet Archive. It was a great opportunity for me to learn new things. I got much more fluent in JavaScript and CSS, thanks to both my mentor and using these languages in practice while coding. I learnt a lot about working on an open source project, but a part that I found really interesting was attending and participating in the decision-making processes, even about projects I was not involved in. It was also interesting for me to apply concepts I had studied on a more theoretical level at university in a real workplace environment. To sum things up, the ability to work on something I liked that had an impact on users, and the ability to learn useful things for my personal development, really made this experience worthwhile for me. I would 100% recommend doing a GSoC at the Internet Archive! Posted in BookReader, Community, Google Summer of Code (GSoC), Open Source | Comments closed Open Library is an initiative of the Internet Archive, a 501(c)(3) non-profit, building a digital library of Internet sites and other cultural artifacts in digital form. Other projects include the Wayback Machine, archive.org and archive-it.org. Your use of the Open Library is subject to the Internet Archive's Terms of Use.
blog-reeset-net-1240 ---- MarcEdit 7.5 Update – Terry's Worklog MarcEdit 7.5 Update ChangeLog: https://marcedit.reeset.net/software/update75.txt Highlights Preview Changes One of the most requested features over the years has been the ability to preview changes prior to running them. As of 7.5.8, a new preview option has been added to many of the global editing tools in the MarcEditor. Currently, you will find the preview option attached to the following functions: Replace All Add New Field Delete Field Edit Subfield Edit Field Edit Indicator Copy Field Swap Field Functions that include a preview option will be denoted with the following button: When this button is pressed, the following option is made available. When Preview Results is selected, the program will execute the defined action and display the potential results in a preview screen. For example: To protect performance, only 500 results at a time will be loaded into the preview grid, though users can keep adding results to the grid and continue to review items. Additionally, users have the ability to search for items within the grid as well as jump to a specific record number (not row number). These new options will show up first in the Windows version of MarcEdit, but will be added to the MarcEdit Mac 3.5.x branch in the coming weeks. New JSON => XML Translation To better support the translation of data from JSON to MARC, I’ve included a JSON => MARC algorithm in the MARCEngine. This will allow JSON data to be serialized into XML. The benefit of including this option is that I’ve been able to update the XML Functions options to allow JSON to be a starting format. This will be specifically useful for users that want to make use of linked data vocabularies to generate MARC Authority records. Users can direct MarcEdit to facilitate the translation from JSON to XML, and then create XSLT translations that can then be used to complete the process to MARCXML and MARC. I’ve demonstrated how this process works using a vocabulary of interest to the #critcat community, the Homosaurus vocabulary (How do I generate MARC authority records from the Homosaurus vocabulary? – Terry’s Worklog (reeset.net)). OCLC API Interactions Working with the OCLC API is sometimes tricky. MarcEdit utilizes a specific authentication process that requires OCLC keys to be set up and configured to work a certain way. When issues come up, it is sometimes very difficult to debug them. I’ve updated the process and error handling to surface more information – so when problems occur and XML debugging information isn’t available, the actual exception and inner exception data will be surfaced instead. This can often provide information to help understand why the process isn’t able to complete. Wrap up As noted, there have been a number of updates. While many fall under the category of housekeeping (updating icons, UX improvements, actions, default values, etc.), this update does include a number of often-requested, significant updates that I hope will improve user workflows. –tr Published April 3, 2021 By reeset Categorized as MarcEdit
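To make the JSON => XML step more concrete, here is a rough sketch of the general idea; it is not MarcEdit's internal code, and the sample JSON-LD field names (@id, skos:prefLabel, skos:altLabel) are assumptions about the vocabulary's shape rather than anything MarcEdit defines. The point is only that a generic JSON-to-XML serialization gives an XSLT something to work with on the way to MARCXML and MARC.

```python
# Not MarcEdit's code — a sketch of serializing a JSON(-LD) record into plain
# XML that an XSLT could then transform into MARCXML.
import json
import xml.etree.ElementTree as ET

def json_to_xml(obj, tag="record"):
    """Serialize a nested dict/list structure into a simple element tree."""
    elem = ET.Element(tag)
    if isinstance(obj, dict):
        for key, value in obj.items():
            # Strip characters that are not legal in XML element names.
            safe = key.replace("@", "").replace(":", "_")
            elem.append(json_to_xml(value, safe))
    elif isinstance(obj, list):
        for item in obj:
            elem.append(json_to_xml(item, "item"))
    else:
        elem.text = str(obj)
    return elem

if __name__ == "__main__":
    # Hypothetical concept record; labels are placeholders, not real vocabulary data.
    concept = json.loads("""{
        "@id": "https://homosaurus.org/v3/example-id",
        "skos:prefLabel": "Example heading",
        "skos:altLabel": ["Example variant"]
    }""")
    print(ET.tostring(json_to_xml(concept), encoding="unicode"))
```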
Post navigation Previous post How do I generate MARC authority records from the Homosaurus vocabulary? Next post Thoughts on NACOs proposed process on updating CJK records Search… Terry's Worklog Proudly powered by WordPress. Dark Mode: blog-reeset-net-1241 ---- None blog-reeset-net-1510 ---- Terry's Worklog – On my work (programming, digital libraries, cataloging) and other stuff that perks my interest (family, cycling, etc) Skip to content Terry's Worklog On my work (programming, digital libraries, cataloging) and other stuff that perks my interest (family, cycling, etc) Menu Close Home About Me MarcEdit Homepage GitHub Page Privacy Policy Thoughts on NACOs proposed process on updating CJK records I would like to take a few minutes and share my thoughts about an updated best practice recently posted by the PCC and NACO related to an update on CJK records. The update is found here: https://www.loc.gov/aba/pcc/naco/CJK/CJK-Best-Practice-NCR.docx. I’m not certain if this is active or a simply a proposal, but I’ve been having a number… Continue reading Thoughts on NACOs proposed process on updating CJK records Published April 20, 2021Categorized as Cataloging, MarcEdit MarcEdit 7.5 Update ChangeLog: https://marcedit.reeset.net/software/update75.txt Highlights Preview Changes One of the most requested features over the years has been the ability to preview changes prior to running them.  As of 7.5.8 – a new preview option has been added to many of the global editing tools in the MarcEditor.  Currently, you will find the preview option attached to… Continue reading MarcEdit 7.5 Update Published April 3, 2021Categorized as MarcEdit How do I generate MARC authority records from the Homosaurus vocabulary? Step by step instructions here: https://youtu.be/FJsdQI3pZPQ Ok, so last week, I got an interesting question on the listserv where a user asked specifically about generating MARC records for use in one’s ILS system from a JSONLD vocabulary.  In this case, the vocabulary in question as Homosaurus (Homosaurus Vocabulary Site) – and the questioner was specifically… Continue reading How do I generate MARC authority records from the Homosaurus vocabulary? Published April 3, 2021Categorized as MarcEdit MarcEdit: State of the Community *2020-2021 * Sigh – original title said 2019-2020.  Obviously, this is for this past year (Jan. 2020-Dec. 31, 2020).   Per usual, I wanted to take a couple minutes and look at the state of the MarcEdit project. This is something that I try to do once a year to gauge the current health of the community,… Continue reading MarcEdit: State of the Community *2020-2021 Published March 24, 2021Categorized as MarcEdit, Uncategorized MarcEdit 7.3.x/7.5.x (beta) Updates Versions are available at: https://marcedit.reeset.net/downloads Information about the changes: 7.3.10 Change Log: https://marcedit.reeset.net/software/update7.txt 7.5.0 Change Log: https://marcedit.reeset.net/software/update75.txt If you are using 7.x – this will prompt as normal for update. 7.5.x is the beta build, please be aware I expect to be releasing updates to this build weekly and also expect to find some issues.… Continue reading MarcEdit 7.3.x/7.5.x (beta) Updates Published February 2, 2021Categorized as MarcEdit MarcEdit 7.5.x/MacOS 3.5.x Timelines I sent this to the MarcEdit Listserv to provide info about my thoughts around timelines related to the beta and release.  Here’s the info. Dear All, As we are getting close to Feb. 
1 (when I’ll make the 7.5 beta build available for testing) – I wanted to provide information about the update process going… Continue reading MarcEdit 7.5.x/MacOS 3.5.x Timelines Published January 26, 2021 Categorized as MarcEdit MarcEdit 7.5 Change/Bug Fix list * Updated; 1/20 Change: Allow OS to manage supported Security Protocol types. Change: Remove com.sun dependency related to dns and httpserver Change: Changed AppData Path Change: First install automatically imports settings from MarcEdit 7.0-2.x Change: Field Count – simplify UI (consolidate elements) Change: 008 Windows — update help urls to oclc Change: Generate FAST… Continue reading MarcEdit 7.5 Change/Bug Fix list Published January 20, 2021 Categorized as MarcEdit MarcEdit 7.5 Updates Current list of MarcEdit 7.5 general updates. I’ll be walking through many of these changes in a webinar 1/15. Significant Changes: Targeted Framework: .NET 5.0 (What’s new in .NET 5 | Microsoft Docs) XML Wizard Changes Support for Attribute-based mapping (extends previous entity-based mapping) Linked Data Components updated SPARQL Components Updated Linked Data Rules… Continue reading MarcEdit 7.5 Updates Published January 12, 2021 Categorized as MarcEdit MarcEdit 7.5 Update Status I’m planning to start making testing versions of the new MarcEdit instance available around the first of the year broadly, and to a handful of testers in mid-Dec. The translation from .NET 4.7.2 to .NET 5 was more significant than I would have thought – and includes a number of swapped default values – so hunting… Continue reading MarcEdit 7.5 Update Status Published November 30, 2020 Categorized as Uncategorized Changes to System.Diagnostics.Process in .NET Core In .NET Core, one of the changes that caught me by surprise is the change related to starting processes. In the .NET Framework, you can open a web site, file, etc. just by using the following: System.Diagnostics.Process.Start(path); However, in .NET Core, this won’t work. When trying to open a file, the process will… Continue reading Changes to System.Diagnostics.Process in .NET Core Published November 19, 2020 Categorized as Uncategorized blog-reeset-net-2049 ---- None blog-reeset-net-2210 ---- None blog-reeset-net-2983 ---- Thoughts on NACOs proposed process on updating CJK records – Terry's Worklog Thoughts on NACOs proposed process on updating CJK records I would like to take a few minutes and share my thoughts about an updated best practice recently posted by the PCC and NACO related to an update on CJK records. The update is found here: https://www.loc.gov/aba/pcc/naco/CJK/CJK-Best-Practice-NCR.docx. I’m not certain if this is active or simply a proposal, but I’ve been having a number of private discussions with members at the Library of Congress and the PCC as I’ve been trying to understand the genesis of this policy change. I personally believe that formally adopting a policy like this would be exceptionally problematic, and I wanted to flesh out my thoughts on why, and some potentially better options that could fix the issue this proposal is attempting to solve. But first, I owe some folks an apology.
In chatting with some folks at LC (because, let's be clear, this proposal was created specifically because there are local, limiting practices at LC that are artificially complicating this work), it came to my attention that the individuals who spent a good deal of time considering and creating this proposal have received some unfair criticism – and I think I bear a lot of responsibility for that. I have done work creating best practices and standards, and it's thankless, difficult work. Because of that, in cases where I disagree with a particular best practice, my preference has been to address those concerns privately and attempt to understand and share my issues with a set of practices. This is what I have been doing related to this work. However, on the MarcEdit list (a private list), when a request was made related to a feature request in MarcEdit to support this work, I was less thoughtful in my response, as the proposed change could fundamentally undo almost a decade of work: I have dealt with thousands of libraries stymied by these kinds of best practices, which have significant unintended consequences. My regret is that I've been told that my thoughts shared on the MarcEdit list have been used by others in more public spaces to take this committee's work to task. This is unfortunate and disappointing, and something I should have been more thoughtful about in my responses on the MarcEdit list, especially given that every member of that committee is doing this work as a service to the community. I know I forget that sometimes. So, to the folks who did this work: I've not followed (or seen) any feedback you may have received, but inasmuch as I'm sure I played a part in any pushback you may have received, I'm sorry.

What problem does this proposal seek to solve?

If you look at the proposal, I think that the writers do a good job identifying the issue. Essentially, this issue is unique to authority records. At present, NACO still requires that records created within the program only utilize UTF8 characters that fall within the MARC-8 repertoire. OCLC, the pipeline for creating these records, enforces this rule by invalidating records with UTF8 characters outside the MARC8 range. The proposal seeks to address this by encouraging the use of NCR (Numeric Character Reference) data in UTF8 records to work around this restriction. So, in a nutshell, that is the problem, and that is the proposed solution. But before we move on, let's talk a little bit about how we got here. This problem exists because of what I believe to be an extremely narrow and unproductive read of what the MARC8 repertoire actually means. For those not in libraries, MARC8 is essentially a made-up character encoding, used only in libraries, that has long outlived its usefulness. Modern systems have largely stopped supporting it outside of legacy ingest workflows. The issue is that for every academic library or national library that has transitioned to UTF8, hundreds of small libraries or organizations around the world have not. MARC8 continues to exist because the infrastructure that supports these smaller libraries is built around it. But again, I think it is worth asking what, today, the MARC8 repertoire actually is. Previously, this had been a hard set of defined values. But really, that changed around 2004, when LC updated its guidance and introduced the concept of NCRs to preserve lossless data transfer between systems that were fully UTF8 compliant and older MARC8 systems.
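To make the NCR idea concrete, here is a minimal sketch of the kind of lossless round trip NCRs enable when UTF8 data has to be pushed into a MARC8 environment. It is an illustration only, not LC's or MarcEdit's actual conversion code, and for simplicity it treats everything outside ASCII as falling outside the MARC-8 repertoire, which is narrower than the real repertoire:

import re

def to_ncr(text: str) -> str:
    """Replace characters outside a stand-in 'MARC-8 repertoire' (here: ASCII)
    with Numeric Character References of the form &#xXXXX;."""
    return "".join(ch if ord(ch) < 128 else f"&#x{ord(ch):04X};" for ch in text)

def from_ncr(text: str) -> str:
    """Restore the original characters from NCR notation (lossless)."""
    return re.sub(r"&#x([0-9A-Fa-f]+);", lambda m: chr(int(m.group(1), 16)), text)

heading = "東京 (Tōkyō)"
exported = to_ncr(heading)            # '&#x6771;&#x4EAC; (T&#x014D;ky&#x014D;)'
assert from_ncr(exported) == heading  # nothing is lost on the way back
print(exported)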
NCRs in MARC8 were workable because they left local systems the ability to handle (or not handle) the data as they saw fit, and they finally provided an avenue for the library community as a whole to move on from the limitations MARC8 was imposing on systems. They made it easier to move data into UTF8-compliant non-MARC formats and provided a pathway for reusing data from other metadata formats in MARC records. I would argue that today, the MARC8 repertoire includes NCR notation – and to assume or pretend otherwise is shortsighted and revisionist. But why is all of this important? Well, it is at the heart of the problem that we find ourselves in. For authority data, the Library of Congress appears to have adopted this very narrow view of what MARC8 means (against their own stated recommendations), and as a result, NACO and OCLC place artificial limits on the pipeline. There are lots of reasons why LC does this; I recognize they are moving slowly because any changes that they make are often met with some level of resistance from members of our community – but in this case, this paralysis is causing more harm to the community than good.

Why is this proposal problematic?

So, this is the environment that we are working in and the issue this proposal sought to solve. The issue, however, is that the proposal attempts to solve this problem by adopting a MARC8 solution and applying it within UTF8 data – essentially making the case that NCR values can be embedded in UTF8 records to ensure lossless data entry. And while I can see why someone might think that, that assumption is fundamentally incorrect. When LC developed its guidance on NCR notation, that guidance was specifically directed at the lossless translation of data to MARC8. UTF8 data has no need for NCR notation. This does not mean that it does not sometimes show up – and as a practical matter, I've spent thousands of hours working with libraries dealing with the issues this creates in local systems. Aside from the issues this creates in MARC systems around indexing and discovery, it makes data almost impossible to use outside of that system and at times of migration. In thinking about the implications of this change in the context of MarcEdit, I had the following specific concerns:
- NCR data in UTF8 records would break existing workflows for users with current-generation systems that would have no reason to expect this data to be present in UTF8 MARC records.
- It would make normalization virtually impossible and potentially re-introduce a problem I spent months solving for organizations related to how UTF8 data is normalized and introduced into local systems (see the sketch after this list).
- It would break many of the transformation options. MarcEdit allows for the flow of data to many different metadata formats – all are built on the concept that the first thing MarcEdit does is clean up character encodings to ensure the output data is in UTF8.
- MarcEdit is used by ~20k active users and ~60k annual users. Over 1/3 of those users do not use MARC21 and do not use MARC-8. Allowing the mixing of NCRs and UTF8 data potentially breaks functionality for broad groups of international users.
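A small sketch of why this matters in practice (illustrative code, not MarcEdit's internals): once NCR escape sequences sit inside otherwise valid UTF8 data, ordinary string comparison and Unicode normalization no longer treat the same heading as equal, so indexing, deduplication, and merge routines silently miss matches.

import unicodedata

clean = "Tōkyō"                 # fully decoded UTF8 heading
mixed = "T&#x014D;ky&#x014D;"   # the same heading with NCRs embedded in UTF8 data

# Unicode normalization cannot reconcile the two forms; the NCRs are just
# ASCII punctuation and digits as far as any UTF8-aware system is concerned.
print(clean == mixed)                                            # False
print(unicodedata.normalize("NFC", clean) ==
      unicodedata.normalize("NFC", mixed))                       # False

# An index or match-and-merge routine keyed on the heading therefore treats
# these as two different access points unless every consumer adds special
# NCR-decoding logic first.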
While I very much appreciate the issue that this is attempting to solve, I've spent years working with libraries where this kind of practice would introduce a long-term data issue that is very difficult to identify and fix and that often shows up unexpectedly when it comes time to migrate or share this information with other services, communities, or organizations.

So what is the solution?

I think that we can address this issue on two fronts. First, I would advise NACO and OCLC to stop limiting data entry to this very limited notion of the MARC8 repertoire. In all other contexts, OCLC provides the ability to enter any valid UTF8 data. This current limit within the authority process is artificial and unnecessary. OCLC could easily remove it, and NACO could amend their process to allow record entry to utilize any valid UTF8 character. This would address the problem that this group was attempting to solve for catalogers creating these records. The second step could take two forms. If LC continues to ignore their own guidance and cleave to an outdated concept of the MARC8 repertoire, OCLC could provide to LC, via their pipeline, a version of the records where the data includes NCR notation for use in LC's own systems. It would mean that I would not recommend using LC as a trusted system for downloading authorities if this were the practice, unless I had an internal local process to remove any NCR data found in valid UTF8 records. Essentially, we would treat LC's requirements as a disease and quarantine them and their influence in this process. Of course, what would be more ideal is for LC to make the decision to accept UTF8 data without restrictions and rely on applicable guidance and MARC21 best practice by supporting UTF8 data fully, and, for those still needing MARC8 data, providing that data using the lossless process of NCRs (per their own recommendations).

Conclusion

Ultimately, this proposal is a recognition that the current NACO rules and process are broken, and broken in a way that is actively undermining other work in the PCC around linked data development. And while I very much appreciate the thoughtful work that went into the consideration of a different approach, I think the unintended side effects would cause more long-term damage than any short-term gains. Ultimately, what we need is for the principals to rethink why these limitations are in place and, honestly, to really consider ways that we start to deemphasize the role LC plays as a standards holder if, in that role, LC's presence continues to be an impediment to moving libraries forward.
Published April 20, 2021. By reeset. Categorized as Cataloging, MarcEdit
blog-reeset-net-3028 ---- None
blog-reeset-net-3412 ---- Terry's Worklog – On my work (programming, digital libraries, cataloging) and other stuff that perks my interest (family, cycling, etc)
blog-reeset-net-6539 ---- None blog-reeset-net-7876 ---- None
blog-reeset-net-794 ---- How do I generate MARC authority records from the Homosaurus vocabulary? – Terry's Worklog
How do I generate MARC authority records from the Homosaurus vocabulary?
Step by step instructions here: https://youtu.be/FJsdQI3pZPQ
Ok, so last week, I got an interesting question on the listserv where a user asked specifically about generating MARC records for use in one's ILS system from a JSONLD vocabulary. In this case, the vocabulary in question was Homosaurus (Homosaurus Vocabulary Site) – and the questioner was specifically looking for a way to pull individual terms for generation into MARC Authority records to add to one's ILS to improve search and discovery. When the question was first asked, my immediate thought was that this could likely be accommodated using the XML/JSON profiling wizard in MarcEdit. This tool can review a sample XML or JSON file and allow a user to create a portable processing file based on the content in the file. However, there were two issues with this approach:
1. The profile wizard assumes that the data format is static – i.e., that the sample file is representative of other files. Unfortunately, for this vocabulary, that isn't the case.
2. The profile wizard was designed to work with JSON – JSON-LD is actually a different animal due to the inclusion of the @ symbol.
While I updated the Profiler to recognize and work better with JSON-LD, the first challenge is one that doesn't make this a good fit for a generic process. So, I looked at how this could be built into the normal processing options. To do this, I added a new default serialization, JSON=>XML, which MarcEdit now supports. This allows the tool to take a JSON file and deserialize the data so that it is output reliably as XML.
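As a rough illustration of what a JSON-to-XML deserialization step like this does (a minimal sketch, not MarcEdit's actual serializer — the element names and the key sanitizing are invented for the example), a generic walk over the parsed JSON might look like this:

import xml.etree.ElementTree as ET

def json_to_xml(value, tag="record"):
    # XML element names cannot contain '@' or ':' as-is, so JSON-LD keys such
    # as '@id' or 'skos:prefLabel' are sanitized for this sketch.
    elem = ET.Element(tag.replace("@", "").replace(":", "_") or "field")
    if isinstance(value, dict):
        for key, child in value.items():
            elem.append(json_to_xml(child, key))
    elif isinstance(value, list):
        for child in value:
            elem.append(json_to_xml(child, "item"))   # 'item' is an invented wrapper name
    else:
        elem.text = str(value)
    return elem

# A trimmed-down version of the Homosaurus term shown below.
sample = {
    "@id": "http://homosaurus.org/v2/adoptiveParents",
    "@type": "skos:Concept",
    "skos:prefLabel": "Adoptive parents",
    "skos:related": [{"@id": "http://homosaurus.org/v2/birthParents"}],
}
print(ET.tostring(json_to_xml(sample), encoding="unicode"))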
So, for example, here is a sample JSON-LD file (homosaurus.org/v2/adoptiveParents.jsonld):

{
  "@context": {
    "dc": "http://purl.org/dc/terms/",
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "xsd": "http://www.w3.org/2001/XMLSchema#"
  },
  "@id": "http://homosaurus.org/v2/adoptiveParents",
  "@type": "skos:Concept",
  "dc:identifier": "adoptiveParents",
  "dc:issued": { "@value": "2019-05-14", "@type": "xsd:date" },
  "dc:modified": { "@value": "2019-05-14", "@type": "xsd:date" },
  "skos:broader": { "@id": "http://homosaurus.org/v2/parentsLGBTQ" },
  "skos:hasTopConcept": [
    { "@id": "http://homosaurus.org/v2/familyMembers" },
    { "@id": "http://homosaurus.org/v2/familiesLGBTQ" }
  ],
  "skos:inScheme": { "@id": "http://homosaurus.org/terms" },
  "skos:prefLabel": "Adoptive parents",
  "skos:related": [
    { "@id": "http://homosaurus.org/v2/socialParenthood" },
    { "@id": "http://homosaurus.org/v2/LGBTQAdoption" },
    { "@id": "http://homosaurus.org/v2/LGBTQAdoptiveParents" },
    { "@id": "http://homosaurus.org/v2/birthParents" }
  ]
}

In MarcEdit, the new JSON=>XML process can take this file and output it in XML like this (element values shown): http://purl.org/dc/terms/ http://www.w3.org/2004/02/skos/core# http://www.w3.org/2001/XMLSchema# http://homosaurus.org/v2/adoptiveParents skos:Concept adoptiveParents 2019-05-14 xsd:date 2019-05-14 xsd:date http://homosaurus.org/v2/parentsLGBTQ http://homosaurus.org/v2/familyMembers http://homosaurus.org/v2/familiesLGBTQ http://homosaurus.org/terms Adoptive parents http://homosaurus.org/v2/socialParenthood http://homosaurus.org/v2/LGBTQAdoption http://homosaurus.org/v2/LGBTQAdoptiveParents http://homosaurus.org/v2/birthParents

The ability to reliably convert JSON/JSON-LD to XML means that I can now allow users to utilize the same XSLT/XQuery process MarcEdit utilizes for other library metadata format transformations. All that was left to make this happen was to add a new origin data format to the XML Function template – and we are off and running. The end result is that users could utilize this process with any JSON-LD vocabulary (assuming they created the XSLT) to facilitate the automation of MARC Authority data. In the case of this vocabulary, I've created an XSLT and added it to my github space: https://github.com/reeset/marcedit_xslt_files/blob/master/homosaurus_xml.xsl, and I have also included the XSLT in the MarcEdit XSLT directory in current downloads. In order to use this XSLT and allow your version of MarcEdit to generate MARC Authority records from this vocabulary, you would use the following steps:
1. Be using MarcEdit 7.5.8+ or MarcEdit Mac 3.5.8+ (the Mac version will be available around 4/8). I have not decided if I will backport to 7.3.
2. Open the XML Functions Editor in MarcEdit.
3. Add a new Transformation, using JSON as the original format and MARC as the final format. Make sure the XSLT path is pointed to the location where you saved the downloaded XSLT file.
4. Save.
That should be pretty much it. I've recorded the steps and placed them here: https://youtu.be/FJsdQI3pZPQ, including some information on values you may wish to edit should you want to localize the XSLT.
Published April 3, 2021. By reeset. Categorized as MarcEdit
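For readers who want to try the same idea outside of MarcEdit, the sketch below applies an XSLT to an intermediate XML file with lxml. The file names are placeholders; in MarcEdit itself the transformation is driven by the homosaurus_xml.xsl linked above through the XML Functions engine rather than by a script like this.

from lxml import etree

# Placeholder file names: an XML serialization of one vocabulary term and the
# stylesheet that maps it to a MARC-flavored authority record.
source = etree.parse("adoptiveParents.xml")
transform = etree.XSLT(etree.parse("homosaurus_xml.xsl"))

result = transform(source)
print(str(result))  # transformed output, ready for loading or conversion in MARC tools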
blog-twitter-com-3439 ---- Enabling the future of academic research with the Twitter API
Enabling the future of academic research with the Twitter API
By Adam Tornes and Leanne Trujillo, Tuesday, 26 January 2021

When we introduced the next generation of the Twitter API in July 2020, we also shared our plans to invest in the success of the academic research community with tailored solutions that better serve their goals. Today, we're excited to launch the Academic Research product track on the new Twitter API.

Why we're launching this & how we got here

Since the Twitter API was first introduced in 2006, academic researchers have used data from the public conversation to study topics as diverse as the conversation on Twitter itself - from state-backed efforts to disrupt the public conversation to floods and climate change, from attitudes and perceptions about COVID-19 to efforts to promote healthy conversation online. Today, academic researchers are one of the largest groups of people using the Twitter API. Our developer platform hasn't always made it easy for researchers to access the data they need, and many have had to rely on their own resourcefulness to find the right information. Despite this, for over a decade, academic researchers have used Twitter data for discoveries and innovations that help make the world a better place. Over the past couple of years, we've taken iterative steps to improve the experience for researchers, like when we launched a webpage dedicated to Academic Research, and updated our Twitter Developer Policy to make it easier to validate or reproduce others' research using Twitter data. We've also made improvements to help academic researchers use Twitter data to advance their disciplines, answer urgent questions during crises, and even help us improve Twitter. For example, in April 2020, we released the COVID-19 stream endpoint - the first free, topic-based stream built solely for researchers to use data from the global conversation for the public good. Researchers from around the world continue to use this endpoint for a number of projects. Over two years ago, we started our own extensive research to better understand the needs, constraints and challenges that researchers have when studying the public conversation. In October 2020, we tested this product track in a private beta program where we gathered additional feedback. This gave us a glimpse into some of the important work that the free Academic Research product track we're launching today can now enable. "The Academic Research product track gives researchers a window into understanding the use of Twitter and social media at large, and is an important step by Twitter to support the scientific community." - Dr. Sarah Shugars, Assistant Professor at New York University "Twitter's enhancements for academic research have the potential to eliminate many of the bottlenecks that scholars confront in working with Twitter's API, and allow us to better evaluate the impact and origin of trends we discover." - Dr.
David Lazer, Professor at Northeastern University

What's launching today

With the new Academic Research product track, qualified researchers will have access to all v2 endpoints released to date, as well as:
- Free access to the full history of public conversation via the full-archive search endpoint, which was previously limited to paid premium or enterprise customers (see the request sketch below)
- Higher levels of access to the Twitter developer platform for free, including a significantly higher monthly Tweet volume cap of 10 million (20x higher than what's available on the Standard product track today)
- More precise filtering capabilities across all v2 endpoints to limit data collection to what is relevant for your study and minimize data cleaning requirements
- New technical and methodological guides to maximize the success of your studies
The release of the Academic Research product track is just a starting point. This initial solution is intended to address the most requested, biggest challenges faced when conducting research on the platform. We are excited to enable even more research that can create a positive impact on the world, and on Twitter, in the future. For more in-depth details about what's available, see our post on the Twitter community forum.

Where do I start?

To use this track, new and existing Twitter developers will need to apply for access with the Academic Research application. An improved developer portal experience guides you to the product track that best fits your needs. We require this additional application step to help protect the security and privacy of people who use Twitter and our developer platform. Each application will go through a manual review process to determine whether the described use cases for accessing our Academic Research product track adhere to our Developer Policy, and that applicants meet these three requirements:
- You are either a master's student, doctoral candidate, post-doc, faculty, or research-focused employee at an academic institution or university.
- You have a clearly defined research objective, and you have specific plans for how you intend to use, analyze, and share Twitter data from your research. Learn more about the application.
- You will use this product track for non-commercial purposes. Learn about non-commercial use.
We understand that these requirements are not representative of everyone doing academic research with Twitter data (for example, if you are an undergraduate, independent researcher, or a non-profit). Our future goal is to serve the complete range of research use cases for public Twitter data. In the meantime, anyone can apply to start with our v2 endpoints on the Standard product track. The new application for the Academic Research track asks specific questions related to your academic profile and research project details. Learn more about the application here.

What's next for the Twitter API v2?

Today's launch marks the beginning of how we plan to support this community with unprecedented access to data that can advance research objectives for nearly any discipline. While we recognize what we're launching today may not address all needs of the community, this is a starting point and we are committed to continued support for academic researchers in the future. We'll continue to listen and learn from you all, and welcome your feedback on how we can continue to improve and best serve your needs.
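As a rough illustration of the kind of call this access enables (the request sketch referenced in the list above), here is a minimal example against the v2 full-archive search endpoint. The bearer token, query, and date range are placeholders, and pagination and error handling are omitted.

import os
import requests

# Placeholder credential: the bearer token of an approved Academic Research project.
headers = {"Authorization": f"Bearer {os.environ['TWITTER_BEARER_TOKEN']}"}

params = {
    "query": "(#covid19 OR coronavirus) lang:en -is:retweet",  # illustrative query
    "start_time": "2020-03-01T00:00:00Z",
    "end_time": "2020-03-02T00:00:00Z",
    "max_results": 100,
}

resp = requests.get("https://api.twitter.com/2/tweets/search/all",
                    headers=headers, params=params)
resp.raise_for_status()
for tweet in resp.json().get("data", []):
    print(tweet["id"], tweet["text"][:80])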
As we've seen over the last 15 years, the research topics that can be studied with Twitter data are vast, and the future possibilities are endless. We hope you are as excited as we are about the possibilities this new product track creates for your research. In coming months, we will introduce a specialized Business product track, as well as additional levels of access within our Academic Research, Standard, and Business product tracks. We are also exploring more flexible access terms, support for additional Projects with unique use cases within your product track, and other improvements intended to help researchers and developers to get started, grow, and scale their projects all within the same API. To follow our planned releases, check out the product roadmap. Eventually, the new Twitter API will fully replace the v1.1 standard, premium, and enterprise APIs. Though before that can happen, we have a lot more to build, which is why we are referring to today's launch as Early Access. Early access gives you a chance to get started and get ahead on using our new, v2 endpoints. Learn more about how we plan to roll out the new Twitter API here. Have questions or want to connect with other researchers using the Twitter API? Check out our academic research community forum. Have ideas about how we can improve the new Twitter API? Upvote ideas or add your own in the v2 API feedback channel.
Adam Tornes, @atornes, Staff Product Manager, Developer & Enterprise Solutions
Leanne Trujillo, @leanne_tru, Sr. Program Manager, Developer & Enterprise Solutions
blog-twitter-com-4317 ---- Introducing a new and improved Twitter API
Introducing a new and improved Twitter API
By Ian Cairns and Priyanka Shetty, Thursday, 16 July 2020

We planned to launch the new Twitter API on July 16, 2020. But given the security incident we discovered on July 15, 2020, the timing of our launch no longer made sense or felt right.
We updated this post on August 12, 2020 to include additional details below to support the official launch of the new Twitter API.

Today, we're introducing the new Twitter API. Rebuilt from the ground up to deliver new features faster, today's release includes the first set of new endpoints and features we're launching so developers can help the world connect to the public conversation happening on Twitter. If you can't wait to check it out, visit the new developer portal. If you can, then read on for more about what we're building, what's new about the Twitter API v2, what's launching first, and what's coming next.

Building in the open and what we've learned

Your feedback has been essential in helping us define our vision and roadmap for the new Twitter API. From Tweets to focus groups, you have shared a ton of feedback with us over the past few years about what you need out of the Twitter API and what we can do better. We also learned a lot through Twitter Developer Labs where you've been sharing real-time feedback on the new API features we've tested in the open. We've always known that our developer ecosystem is diverse, but our API has long taken a one-size-fits-all approach. Your feedback helped us see the importance of making the new Twitter API more flexible and scalable to fit your needs. With the new API, we are building new elevated access options and new product tracks, so more developers can find options to meet their needs. More on that below. We also know it's important to be able to plan ahead, and we want to do a better job of sharing our plans with you in advance. Going forward, we'll share more of what's coming next on our public roadmap (updates coming soon). We're also sharing a Guide to the future of the Twitter API for more about what to expect as we roll out the new API. We have a lot planned, and it will evolve and improve as we continue to hear from you.

Twitter API v2: What's New?

A new foundation - The new API is built on a completely new foundation — rebuilt for the first time since 2012 — and includes new features so you can get more out of the public conversation. That new foundation allows us to add new functionality faster and better than we've done in the past, so expect more new features from Twitter to show up in the API. With this new foundation, developers can expect to see:
- A cleaner API that's easier to use, with new developer features like the ability to specify which fields get returned, or retrieve more Tweets from a conversation within the same response
- Some of the most requested features that were missing from the API, including conversation threading, poll results in Tweets, pinned Tweets on profiles, spam filtering, and a more powerful stream filtering and search query language
New access levels - With the new Twitter API, we're building multiple access levels to make it easier for developers to get started and to grow what they build.
In the past, the Twitter API was separated into three different platforms and experiences: standard (free), premium (self-serve paid), and enterprise (custom paid). As a developer's needs expanded, it required tedious migration to each API. In the future, all developers — from academic researchers to makers to businesses — will have options to get elevated access and grow on the same API.
New product tracks - We love the incredible diversity of developers who use our API. Our plan is to introduce new, distinct product tracks to better serve different groups of developers and provide them with the right experience and support for their needs, along with a range of relevant access levels, and appropriate pricing (where applicable). To start, these product tracks will include:
- Standard: Available first, this will be the default product track for most developers, including those just getting started, building something for fun, for a good cause, and to learn or teach. We plan to add Elevated access to this track in the future.
- Academic Research: Academic researchers use the Twitter API to understand what's happening in the public conversation. In the future, qualified academic researchers will have a way to get Elevated or Custom access to relevant endpoints. We're also providing tools and guides to make it easier to conduct academic research with the Twitter API.
- Business: Developers build businesses on the Twitter API, including our Twitter Official Partners and enterprise data customers. We love that their products help other people and businesses better understand and engage with the conversation on Twitter. In the future, this track will include Elevated or Custom access to relevant endpoints.
A new developer portal - To help you get the most out of the new API, we've also designed and built a new developer portal. This is where you can get started with our new onboarding wizard, manage Apps, understand your API usage and limits, access our new support center, find documentation, and more to come in the future. With the new Twitter API, we hope to enable more:
- Academic research that helps the world better understand our shared perspectives on important topics such as: people's attitudes about COVID-19, the social impact of floods and climate change, or the prevalence of hateful speech and how to address it.
- Tools that help make Twitter better for the people who use it, like: BlockParty, TweetDelete, and Tokimeki Unfollow.
- Bots that share information and make conversations more fun, like the: HAM: Drawings bot, House of Lords Hansard bot, and Emoji Mashup bot.
- Businesses like Black Swan, Spiketrap, and Social Market Analytics who serve innovative use cases such as social prediction of future product trends, AI-powered consumer insights and FinTech market intelligence.
- Twitter Official Partners such as Brandwatch, Sprinklr and Sprout Social who help brands better understand and engage with their industry and customers.
- And much more, including new things we haven't thought of yet, but that we know you will...

So, what's launching first?

One of the most common reasons developers use the Twitter API is to listen to and analyze the conversation happening on Twitter.
So, soon we'll release Early Access to an initial set of new endpoints for developers to:
- Stream Tweets in real-time or analyze past conversations to help the world understand the public conversations happening on Twitter, or help businesses discover customer insights from the conversation
- Measure Tweet performance to help people and businesses get better at using Twitter
- Listen for important events to help people learn about new things that matter to them on Twitter
- And a whole lot more, with new options to explore Tweets from any account
All API features we're releasing first will be available in our new – always free – Basic access level. For most developers, Basic access will provide everything you need to get started and build something awesome. Eventually, the new API will fully replace the v1.1 standard, premium, and enterprise APIs. Before that can happen though, we have more to build, which is why we are referring to this phase as Early Access. It's a chance to get started now and get ahead. Unlike Twitter Developer Labs, which hosts our experiments, everything in the first release will be fully supported and ready for you to use in production. To see the full list of API functionality and endpoints that are included in today's release, check out our developer forum post. You can get started on the new API by creating a new Project and App today in the new developer portal. You can also connect your new Project to existing Apps, if you would like. To get started with Early Access to the new Twitter API, visit the new developer portal. If you don't yet have a developer account, apply to get started.

What's Next?

This is just the beginning. We're sharing our public roadmap to keep you updated on our vision for the API, along with options to share feedback so that we can continue to learn from you along the way and so you can plan for what's to come. On deck: full support to hide (and unhide) replies, and free Elevated access for academic researchers. Developers like you push us and inspire us every day. Your creativity and work with our API make Twitter better for people & businesses, and make the world a better place. Thanks for your partnership on the journey ahead.
Ian Cairns, @cairns, Head of Product, Twitter Developer Platform
Priyanka Shetty, @_priyankashetty, Product Manager, Twitter Developer Platform
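To make the "specify which fields get returned" capability described earlier in this post concrete, here is a small sketch against the v2 recent search endpoint. The bearer token and query are placeholders, and the field lists are merely examples of what can be requested.

import os
import requests

headers = {"Authorization": f"Bearer {os.environ['TWITTER_BEARER_TOKEN']}"}

params = {
    "query": "#DogsofTwitter has:images -is:retweet",   # illustrative filter
    "tweet.fields": "created_at,public_metrics,conversation_id",
    "expansions": "author_id",
    "user.fields": "username,verified",
    "max_results": 10,
}

resp = requests.get("https://api.twitter.com/2/tweets/search/recent",
                    headers=headers, params=params)
resp.raise_for_status()
payload = resp.json()

# Join expanded author objects back onto the Tweets they belong to.
users = {u["id"]: u["username"] for u in payload.get("includes", {}).get("users", [])}
for tweet in payload.get("data", []):
    print(users.get(tweet["author_id"]), tweet["created_at"], tweet["text"][:60])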
blog-twitter-com-6356 ---- Enabling the future of academic research with the Twitter API
blog-twitter-com-7472 ---- Rebuilding Twitter's public API
Rebuilding Twitter's public API
By Jenny Qiu Hylbert and Steve Cosenza, Wednesday, 12 August 2020

Today we launched the new Twitter API v2. Our first launch of a public API was in 2006 and shortly after, we began building API access to new features with the intention of opening our platform and inviting developers to build the future with us. Six years after the first launch, in 2012, we released the v1.1 API that introduced new requirements and stricter policies needed to curb abuse and protect the Twitter platform. Today's launch marks the most significant rebuild of our public API since 2012. It's built to deliver new features, faster, and to better serve the diverse set of developers who build on Twitter. It's also built to incorporate many of our experiences and lessons learned over the past fourteen years of operating public APIs. We'd like to show you how we thought about designing and building this from the ground up.

Establishing goals

The public Twitter API v1.1 endpoints are currently implemented by a large set of HTTP microservices, a decision we made as part of our re-architecture from a Ruby monolith. While the microservices approach enabled increased development speeds at first, it also resulted in a scattered and disjointed Twitter API as independent teams designed and built endpoints for their specific use cases with little coordination. For the new Twitter API v2, we knew we needed a new architecture that could more easily scale with the large number of API endpoints to serve our planned and new functionality going forward. As part of this design process, we drafted the following goals:
- Abstraction: Enable Twitter engineers building the Twitter API to focus on querying, mutating, or subscribing to only the data they care about, without needing to worry about the infrastructure and operations of running a production HTTP service.
- Ownership: Contain core and common API logic in a single place, owned by a single team.
- Consistency: Provide a consistent experience for external developers by relying on our API design principles to reinforce uniformity.
With the above goals in mind, we've built a common platform to host all of our new Twitter API endpoints. To operate this multi-tenant platform at scale, we had to minimize any endpoint-specific business logic, otherwise the system would quickly become unmaintainable. A powerful data access layer that emphasized declarative queries over imperative code was crucial to this strategy.

Unified data access layer

Around this same time, representatives from teams building Twitter for web, iOS, and Android began migrating from individual internal REST endpoints to a unified GraphQL service. Our team followed suit as we realized that the data querying needs of the public Twitter API are similar to the needs of our Twitter mobile and desktop clients. Put another way, Twitter clients query for data and render UIs, while the public Twitter APIs query for data and render JSON responses.
A bonus from consolidating our data querying through a single interface is that the Twitter API can now easily deliver new Twitter features by querying for GraphQL data already being directly used by our consumer apps. When considering exposing GraphQL directly to external developers, we opted for a design most familiar to a broad set of developers in the form of a REST API. This model also makes it easier to protect against unexpected query complexity so we can ensure a reliable service for all developers.

Componentizing the API platform

With the platform approach decided, we needed a way for different teams to build and contribute to the overall API. To facilitate this, we designed the following three components:
- Routes to represent the external HTTP endpoints, e.g. /2/tweets
- Selections to represent the ways to find resources, e.g. "Tweet lookup by id". To implement a selection, create a GraphQL query which returns one or more resources
- Resources to represent the core resources in our system, e.g. Tweets and users. To implement a resource, create a directory for every resource field which contains a GraphQL query to fetch the data for that specific field, e.g. Tweet/text
Using these three components to construct a directory structure, teams can independently own and contribute different parts of the overall Twitter API while still returning uniform representations in responses. For example, here's a subset of our selections and resources directories:

├── selections
│   └── tweet
│       ├── id
│       │   ├── Selection.scala
│       │   ├── selection.graphql
│       ├── multi_ids
│       │   ├── Selection.scala
│       │   ├── selection.graphql
│       ├── search
│       │   ├── Selection.scala
│       │   ├── selection.graphql
├── resources
│   ├── tweet
│   │   ├── id
│   │   │   ├── Field.scala
│   │   │   └── fragment.graphql
│   │   ├── author_id
│   │   │   ├── Field.scala
│   │   │   └── fragment.graphql
│   │   ├── text
│   │   │   ├── Field.scala
│   │   │   └── fragment.graphql

GraphQL plays a key role in this architecture. We can utilize GraphQL fragments as the unit of our rendering reuse (in a similar way to React Relay). For example, the GraphQL queries below all use a "platform_tweet" fragment, which is a fragment created by combining all the customer-requested fields in the /resources/tweet directory:

https://api.twitter.com/2/tweets/20
Selection: /selections/tweet/id/selection.graphql

query TweetById($id: String!) {
  tweet_by_rest_id(rest_id: $id) {
    ...platform_tweet
  }
}

https://api.twitter.com/2/tweets?ids=20,21
Selection: /selections/tweet/multi_ids/selection.graphql

query TweetsByIds($ids: [String!]!) {
  tweets_by_rest_ids(rest_ids: $ids) {
    ...platform_tweet
  }
}

https://api.twitter.com/2/tweets/search/recent?query=%23DogsofTwitter
Selection: /selections/tweet/search/selection.graphql

query TweetsBySearch($query: String!, $start_time: String, $end_time: String, ...) {
  search_query(query: $query) {
    matched_tweets(from_date: $start_time, to_date: $end_time, ...) {
      tweets {
        ...platform_tweet
      }
      next_token
    }
  }
}

Putting it all together

At this point in the story, you may be curious where endpoint-specific business logic actually lives.
We offer two options:

When an endpoint's business logic can be represented in StratoQL (the language used by Twitter's data catalog system, known as Strato, which powers the GraphQL schema), we only need to write a function in StratoQL, without requiring a separate service.
Otherwise, the business logic is contained in a Finatra Thrift microservice written in Scala, exposed by a Thrift Strato Column.

With the platform providing the common needs for all HTTP endpoints, new routes and resources can be released without spinning up any new HTTP services. We can ensure uniformity through the platform by standardizing how a Tweet is rendered or how a set of Tweets is paginated, regardless of the actual endpoint used for retrieval. Additionally, if an endpoint can be constructed from queries for data that already exists in the GraphQL schema, or if its logic can be implemented in StratoQL, then teams can not only bypass almost all "service owning" responsibilities but also deliver faster access to new Twitter features!

One aspect of the platform that has been top of mind since the beginning is the importance of serving the health of the public conversation and protecting the personal data of people using Twitter. The new platform takes a strong stance on where related business logic should live by pushing all security- and privacy-related logic to backend services. The result is that the API layer is agnostic to this logic, and privacy decisions are applied uniformly across all of the Twitter clients and the API. By isolating where these decisions are made, we can limit inconsistent data exposure, so that what you see in the iOS app will be the same as what you get from programmatic querying through the API.

This is the start of our journey and our work is far from done. We have many more existing v1.1 endpoints to migrate and improve, and entirely new public endpoints to build. We know developers want the ability to interact with all of the different features in the Twitter app, and we're excited for you to see how we've leveraged this platform approach to do just that. We can't wait to bring more features to the new Twitter API! To see more about our plans, check out our Guide to the future of the new API.

Jenny Qiu Hylbert (@jqiu), Senior Engineering Manager
Steve Cosenza (@scosenza), Senior Staff Engineer

Tags: API, microservices, infrastructure
blog-vlib-mpg-de-7011 ---- Max Planck vLib News

MPG/SFX server maintenance, Tuesday 01 December, 5-6 pm
The database of the MPG/SFX server will undergo scheduled maintenance. The downtime will start at 5 pm. Services are expected to be back after 30 minutes. We apologize for any inconvenience.

How to get Elsevier articles after December 31, 2018
The Max Planck Digital Library has been mandated to discontinue their Elsevier subscription when the current agreement expires on December 31, 2018. Read more about the background in the full press release. Nevertheless, most journal articles published until that date will remain available, due to the rights stipulated in the MPG contracts to date. To … Continue reading How to get Elsevier articles after December 31, 2018 →

Aleph multipool search: parallel searching of MPG library catalogs
Update, 07.12.2018: The multipool search is now also available as a web interface. The multipool expert mode in the Aleph cataloging client is used for fast searching across several databases at once. The databases can either reside directly on the Aleph server or be connected as external resources via the z39.50 protocol. In addition to the local libraries, the MPI library catalog in the GBV is … Continue reading Aleph multipool search: parallel searching of MPG library catalogs →

Goodbye vLib! Shutdown after October 31, 2018
In 2002 the Max Planck virtual Library (vLib) was launched, with the idea of making all information resources relevant for Max Planck users simultaneously searchable under a common user interface. Since then, the vLib project partners from the Max Planck libraries, information retrieval services groups, the GWDG and the MPDL invested much time and effort … Continue reading Goodbye vLib! Shutdown after October 31, 2018 →

HTTPS only for MPG/SFX and MPG.eBooks
As of next week, all http requests to the MPG/SFX link resolver will be redirected to a corresponding https request. The Max Planck Society electronic Book Index is scheduled to be switched to https-only access the week after, starting on November 27, 2017. Regular web browser use of the above services should not be … Continue reading HTTPS only for MPG/SFX and MPG.eBooks →

HTTPS enabled for MPG/SFX
The MPG/SFX link resolver is now alternatively accessible via the https protocol. The secure base URL of the productive MPG/SFX instance is: https://sfx.mpg.de/sfx_local. HTTPS support enables secure third-party sites to load or to embed content from MPG/SFX without causing mixed content errors.
Please feel free to update your applications or your links to the MPG/SFX … Continue reading HTTPS enabled for MPG/SFX →

Citation Trails in Primo Central Index (PCI)
The May 2016 release brought an interesting functionality to the Primo Central Index (PCI) … Continue reading Citation Trails in Primo Central Index (PCI) →

MPG/SFX server maintenance, Wednesday 20 April, 8-9 am
The MPG/SFX server updates to a new database (MariaDB) on Wednesday morning. The downtime will begin at 8 am and is scheduled to last until 9 am. We apologize for any inconvenience.

ProQuest Illustrata databases discontinued
Last year, the information provider ProQuest decided to discontinue its "Illustrata Technology" and "Illustrata Natural Science" databases. Unfortunately, this represents a preliminary end to ProQuest's long-standing investment in deep indexing content. In a corresponding support article ProQuest states that there "[…] will be no loss of full text and full text + graphics images because … Continue reading ProQuest Illustrata databases discontinued →

MPG.ReNa via https only
The MPG Resource Navigator MPG.ReNa is now accessible via https only. If in doubt, please double-check any routines and applications loading or embedding content via MPG.ReNa APIs. Please note that you may need to re-subscribe to resource feeds, or update URLs of RSS widgets in your Content Management System, etc. We apologize for any inconvenience.

blog-vlib-mpg-de-8647 ---- Max Planck vLib News

MPG/SFX server maintenance, Tuesday 01 December, 5-6 pm (30. November 2020, eia)
The database of the MPG/SFX server will undergo scheduled maintenance. The downtime will start at 5 pm. Services are expected to be back after 30 minutes. We apologize for any inconvenience.

How to get Elsevier articles after December 31, 2018 (20. December 2018, inga)
The Max Planck Digital Library has been mandated to discontinue their Elsevier subscription when the current agreement expires on December 31, 2018. Read more about the background in the full press release. Nevertheless, most journal articles published until that date will remain available, due to the rights stipulated in the MPG contracts to date. To fulfill the content needs of Max Planck researchers when Elsevier shuts off access to recent content at the beginning of January, the Max Planck libraries and MPDL have coordinated the setup of a common document order service. This will be integrated into the MPG/SFX interface and can be addressed as follows:

Step 1: Search in ScienceDirect, start in any other database, or enter the article details into the MPG/SFX citation linker.
Step 2: Click the MPG/SFX button. Note: In ScienceDirect, it appears in the "Get Access" section at the top of those article pages for which the full text is no longer available.
Step 3: Check the options in the service menu presented to you, e.g. freely available full text versions (if available).
Step 4: To order the article via your local library or the MPDL, select the corresponding link, e.g. "Request document via your local library". Please note that the wording might differ slightly according to your location.
Step 5: Add your personal details to the order form in the next screen and submit your document request.

The team in your local library or at the MPDL will get back to you as soon as possible. Please feel free to contact us if you face any problem or want to raise a question.
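For readers who want to reach the MPG/SFX citation linker mentioned in Step 1 programmatically rather than from a database, an OpenURL-style request can be assembled against the base URL given elsewhere on this blog. The Ruby sketch below is illustrative only: the citation values are made up, and the exact parameter set your SFX instance expects may differ.

require "uri"

# Illustrative article metadata (not a real citation from the post)
params = {
  "url_ver"     => "Z39.88-2004",                  # OpenURL 1.0 KEV format
  "rft_val_fmt" => "info:ofi/fmt:kev:mtx:journal", # journal article genre
  "rft.atitle"  => "An example article title",
  "rft.jtitle"  => "Journal of Examples",
  "rft.issn"    => "1234-5678",
  "rft.volume"  => "42",
  "rft.spage"   => "101",
  "rft.date"    => "2018"
}

base    = "https://sfx.mpg.de/sfx_local" # MPG/SFX base URL from this blog
openurl = "#{base}?#{URI.encode_www_form(params)}"
puts openurl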
Update, 06.06.2019: Check out our new flyer "How to deal with no subscription DEAL" prepared in cooperation with Max Planck's PhDnet.

Aleph multipool search: parallel searching of MPG library catalogs (2. November 2018, inga)
Update, 07.12.2018: The multipool search is now also available as a web interface.
The multipool expert mode in the Aleph cataloging client is used for fast searching across several databases at once. The databases can either reside directly on the Aleph server or be connected as external resources via the z39.50 protocol. In addition to the local libraries, the MPI library catalog in the GBV comes preconfigured on the Aleph server. The multipool function can be found in the search area of the Aleph cataloging client (2nd tab). Below the area for selecting the relevant databases, you can enter the search query. Notes on the command language used can be found in the Aleph help. After submitting the search query, the result list with the databases and the respective number of hits is displayed in the lower frame. A double click is enough to open an individual result set. For shared catalogs, such as the MPI library catalog in the GBV, the full record view indicates the holding library. To set up the multipool search, the configuration files used by the local Aleph client (library.ini and searbase.dat) have to be extended. On request, we are happy to share the files we use. Further information can also be found in the Aleph wiki: download and installation of the Aleph client; setting up additional Z39.50 connections.

Goodbye vLib! Shutdown after October 31, 2018 (24. October 2018, inga)
In 2002 the Max Planck virtual Library (vLib) was launched, with the idea of making all information resources relevant for Max Planck users simultaneously searchable under a common user interface. Since then, the vLib project partners from the Max Planck libraries, information retrieval services groups, the GWDG and the MPDL invested much time and effort to integrate various library catalogs, reference databases, full-text collections and other information resources into MetaLib, a federated search system developed by Ex Libris. With the rise of large search engines and discovery tools in recent years, usage slowly shifted away and the metasearch technology applied was no longer fulfilling users' expectations. Therefore, the termination of most vLib services was announced two years ago and now we are approaching the final shutdown: the vLib portal will cease to operate after the 31st of October 2018. As you know, there are many alternatives to the former vLib services: MPG.ReNa will remain available for browsing and discovering electronic resources available to Max Planck users. In addition, we'll post some information on how to cross search Max Planck library catalogs soon. Let us take the opportunity to send a big "Thank you!" to all vLib users and collaborators within and outside the Max Planck Society. It always was and will continue to be a pleasure to work with and for you. Goodbye!… and please feel free to contact us in case of any further questions.

HTTPS only for MPG/SFX and MPG.eBooks (17. November 2017, eia)
As of next week, all http requests to the MPG/SFX link resolver will be redirected to a corresponding https request.
The Max Planck Society electronic Book Index is scheduled to be switched to https-only access the week after, starting on November 27, 2017. Regular web browser use of the above services should not be affected. Please thoroughly test any solutions that integrate these services via their web APIs. Please consider re-subscribing to MPG.eBooks RSS feeds.

HTTPS enabled for MPG/SFX (27. June 2016, inga)
The MPG/SFX link resolver is now alternatively accessible via the https protocol. The secure base URL of the productive MPG/SFX instance is: https://sfx.mpg.de/sfx_local. HTTPS support enables secure third-party sites to load or to embed content from MPG/SFX without causing mixed content errors. Please feel free to update your applications or your links to the MPG/SFX server.

Citation Trails in Primo Central Index (PCI) (2. June 2016, inga)
The May 2016 release brought an interesting functionality to the Primo Central Index (PCI): The new "Citation Trail" capability enables PCI users to discover relevant materials by providing cited and citing publications for selected article records. At this time the only data source for the citation trail feature is CrossRef, thus the number of citing articles will be below the "Cited by" counts in other sources like Scopus and Web of Science. Further information: Short video demonstrating the citation trail feature (by Ex Libris). Detailed feature description (by Ex Libris).

MPG/SFX server maintenance, Wednesday 20 April, 8-9 am (20. April 2016, inga)
The MPG/SFX server updates to a new database (MariaDB) on Wednesday morning. The downtime will begin at 8 am and is scheduled to last until 9 am. We apologize for any inconvenience.

ProQuest Illustrata databases discontinued (15. April 2016, inga)
Last year, the information provider ProQuest decided to discontinue its "Illustrata Technology" and "Illustrata Natural Science" databases. Unfortunately, this represents a preliminary end to ProQuest's long-standing investment in deep indexing content. In a corresponding support article ProQuest states that there "[…] will be no loss of full text and full text + graphics images because of the removal of Deep Indexed content". In addition, they promise to "[…] develop an even better way for researchers to discover images, figures, tables, and other relevant visual materials related to their research tasks". The MPG.ReNa records for ProQuest Illustrata: Technology and ProQuest Illustrata: Natural Science have been marked as "terminating" and will be deactivated soon.

MPG.ReNa via https only (30. March 2016, eia)
The MPG Resource Navigator MPG.ReNa is now accessible via https only. If in doubt, please double-check any routines and applications loading or embedding content via MPG.ReNa APIs. Please note that you may need to re-subscribe to resource feeds, or update URLs of RSS widgets in your Content Management System, etc. We apologize for any inconvenience.

In short: In this blog you'll find updates on information resources, vendor platforms and access systems provided by the Max Planck Digital Library. Use MPG.ReNa to search and browse through the journal collections, eBook collections and databases available to MPG researchers.

New Resources in MPG.ReNa: Australian Education Index (ProQuest), 25. April 2021; RiffReporter, 25. March 2021; Journal on Excellence in College Teaching, 2.
March 2021; Persian E-Books Miras Maktoob (Brill), 16. February 2021; Translated CIA Documents with Global Perspectives (NewsBank), 14. February 2021.

Related Blogs: FHI library, MPIs Stuttgart Library, PubMan blog.

carpentries-org-9661 ---- The Carpentries

We teach foundational coding and data science skills to researchers worldwide.

What we do
The Carpentries teaches foundational coding and data science skills to researchers worldwide. Software Carpentry, Data Carpentry, and Library Carpentry workshops are based on our lessons. Workshop hosts, Instructors, and learners must be prepared to follow our Code of Conduct. More ›

Who we are
Our diverse, global community includes Instructors, helpers, Trainers, Maintainers, Mentors, community champions, member organisations, supporters, workshop organisers, staff and a whole lot more. More ›

Get involved
See all the ways you can engage with The Carpentries. Get information about upcoming events such as workshops, meetups, and discussions from our community calendar, or from our twice-monthly newsletter, Carpentry Clippings. Follow us on Twitter, Facebook, and Slack. More ›

Subscribe to our newsletter "Carpentry Clippings": events, community updates, and teaching tips in your inbox, twice a month.

New Blog Posts
Core Team 'Acc-athon' to Add Alt Text Across Carpentries Curricula: Carpentries Core Team gets a start on alt text updates for Carpentries lessons. Read More ›
More Posts: Incubator Lesson Spotlight: Python for Business; The Carpentries Strategic Plan: One Year Update; Foundations of Astronomical Data Science - Call for Beta Pilot Applications. More ›

Resources for Online Workshops
Official Carpentries' Recommendations: This page holds an official set of recommendations by The Carpentries to help you organise and run online Carpentries workshops. The page is updated periodically as we continue to receive input and feedback from our community. Go to Page.
Community-Created Resources: This resource is a section in our Handbook containing an evolving list of all community-created resources and conversations around teaching Carpentries workshops online. The section is updated periodically to include newer resources and emerging conversations on the subject. Go to Page.

Upcoming Carpentries Workshops
Click on an individual event to learn more about that event, including contact information and registration instructions.
Brac University (online) ** Instructors: Annajiat Alim Rasel, Benson Muite Feb 14 - May 21, 2021 Brac University (online) ** Instructors: Annajiat Alim Rasel Feb 20 - May 22, 2021 University of Edinburgh ** Instructors: Fran Baseby, Chris Wood, Charlotte Desvages, Alex Casper Cline Helpers: Jen Harris, Marco Crotti, Robert Smith, Graham Blyth, Matthew Fellion, Francine Millard Mar 4 - May 13, 2021 ENES unidad León, Licenciatura en Ciencias Agrogenómicas ** Instructors: Tania Vanessa Arellano Fernández, J Abraham Avelar-Rivas Helpers: Maria Cambero, Nelly Sélem, Aarón Jaime Mar 20 - May 8, 2021 King's College London Instructors: Rohit Goswami, Sanjay Fuloria, Annajiat Alim Rasel Helpers: Fursham Hamid, James Cain, Kai Lim Apr 14 - Apr 28, 2021 UCLA (online) Instructors: Jamie Jamison, Scott Gruber, Kristian Allen, Elizabeth McAulay Helpers: Tim Dennis, Geno Sanchez, Leigh Phan, Zhiyuan Yao, Dave George Apr 16 - May 7, 2021 Institute for Modeling Collaboration and Innovation @ The University of Idaho (online) Instructors: Erich Seamon Helpers: Travis Seaborn Apr 20 - Apr 29, 2021 Swansea University (online) Instructors: Ed Bennett, Vladimir Khodygo, Ben Thorpe, Michele Mesiti Helpers: Tom Pritchard Apr 26 - Apr 30, 2021 United States Department of Agriculture (USDA) Instructors: Meghan Sposato, Aditya Bandla, Adrienne Traxler, Kristina Riemer Apr 27 - May 4, 2021 Workshop on Programming using Python ** Instructors: Bezaye Tesfaye Belayneh, Christian Meeßen, Hannes Fuchs, Maximilian Dolling Helpers: Stefan Lüdtke Apr 27 - Apr 28, 2021 United States Geological Survey Instructors: Anthony Valente, Ainhoa Oliden Sanchez, Ian Carroll, Melissa Fabros Apr 27 - Apr 28, 2021 Emerging Public Leaders (online) Instructors: Benson Muite Helpers: Selorm Tamakloe May 1 - May 29, 2021 UW-Madison (online) ** Instructors: Trisha Adamus, Clare Michaud, Tobin Magle, Sailendharan Sudakaran Helpers: Karl Broman, Erin Jonaitis, Casey Schacher, Heather Shimon, Sarah Stevens May 3 - May 10, 2021 Queensland Cyber Infrastructure Foundation ** Instructors: David Green, Paula Andrea Martinez Helpers: Marlies Hankel, Betsy Alpert May 5 - May 5, 2021 University of California, Santa Barbara (online) Instructors: Torin White, Camila Vargas, Greg Janée Helpers: Kristi Liu, Renata Curty May 6 - May 7, 2021 National Oceanic and Atmospheric Administration Instructors: Callum Rollo, D. 
Sarah Stamps, Jonathan Guyer, Annajiat Alim Rasel May 10 - May 13, 2021 King's College London (online) ** Instructors: Stefania Marcotti, Flavia Flaviani, Alessia Visconti Helpers: Fursham Hamid, Alejandro Santana-Bonilla May 12 - May 26, 2021 UW-Madison (online) Instructors: Trisha Adamus, Clare Michaud, Sarah Stevens, Erwin Lares Helpers: Karl Broman, Casey Schacher, Sarah Stevens, Heather Shimon May 12 - May 19, 2021 Netherlands eScience Center (online) Instructors: Pablo Rodríguez-Sánchez, Alessio Sclocco Helpers: Barbara Vreede, Lieke de Boer May 17 - May 20, 2021 Openscapes Instructors: Jake Szamosi, Bia Villas Boas, Makhan Virdi, Negin Valizadegan May 17 - May 19, 2021 NHS Library and Health Services Instructors: Jez Cope, Fran Baseby, Annajiat Alim Rasel May 18 - May 19, 2021 Queensland Cyber Infrastructure Foundation ** Instructors: Jason Bell, Dag Evensberget, Kasia Koziara Helpers: David Green, Marlies Hankel, Betsy Alpert, Stéphane Guillou, Shern Tee May 19 - May 20, 2021 AUC Data Science Initiative Instructors: Monah Abou Alezz, Muhammad Zohaib Anwar, Yaqing Xu, Jason Williams May 20 - May 25, 2021 Joint Genome Institute/UC Merced ** Instructors: Rhondene Wint Jun 1 - Jun 4, 2021 King's College London (online) ** Instructors: Alessia Visconti, Stefania Marcotti, Flavia Flaviani Helpers: Fursham Hamid, Alejandro Santana-Bonilla Jun 9 - Jun 16, 2021 NWU, South Africa Instructors: Sebastian Mosidi, Martin Dreyer Aug 10 - Aug 13, 2021 ** Workshops marked with asterisks are based on curriculum from The Carpentries lesson programs but may not follow our standard workshop format. Workshops with a globe icon are being held online. The corresponding flag notes the country where the host organization is based. Click here to see our past workshops.  About The Carpentries The Carpentries is a fiscally sponsored project of Community Initiatives, a registered 501(c)3 non-profit organisation based in California, USA. We are a global community teaching foundational computational and data science skills to researchers in academia, industry and government. More › Services Contact RSS Atom sitemap.xml Links Our Code of Conduct Our Community Handbook Our Privacy Policy Our Annual Reports Software Carpentry website Data Carpentry website Library Carpentry website casrai-org-1267 ---- CRediT - Contributor Roles Taxonomy Skip to content CASRAI Less burden. More research. Menu About Blog Resources Supporters More CRediT – Contributor Roles Taxonomy CRediT (Contributor Roles Taxonomy) is high-level taxonomy, including 14 roles, that can be used to represent the roles typically played by contributors to scientific scholarly output. The roles describe each contributor’s specific contribution to the scholarly output. 14 Contributor Roles Conceptualization Data curation Formal Analysis Funding acquisition Investigation Methodology Project administration Resources Software Supervision Validation Visualization Writing – original draft Writing – review & editing Contributor Roles Defined Conceptualization – Ideas; formulation or evolution of overarching research goals and aims. Data curation – Management activities to annotate (produce metadata), scrub data and maintain research data (including software code, where it is necessary for interpreting the data itself) for initial use and later re-use. Formal analysis – Application of statistical, mathematical, computational, or other formal techniques to analyze or synthesize study data. 
Funding acquisition ​- Acquisition of the financial support for the project leading to this publication. Investigation – ​Conducting a research and investigation process, specifically performing the experiments, or data/evidence collection. Methodology – Development or design of methodology; creation of models. Project administration – Management and coordination responsibility for the research activity planning and execution. Resources – Provision of study materials, reagents, materials, patients, laboratory samples, animals, instrumentation, computing resources, or other analysis tools. Software – Programming, software development; designing computer programs; implementation of the computer code and supporting algorithms; testing of existing code components. Supervision – Oversight and leadership responsibility for the research activity planning and execution, including mentorship external to the core team. Validation – Verification, whether as a part of the activity or separate, of the overall replication/reproducibility of results/experiments and other research outputs. Visualization – Preparation, creation and/or presentation of the published work, specifically visualization/data presentation. Writing – original draft – ​Preparation, creation and/or presentation of the published work, specifically writing the initial draft (including substantive translation). Writing – review & editing – Preparation, creation and/or presentation of the published work by those from the original research group, specifically critical review, commentary or revision – including pre- or post-publication stages. Background CRediT grew from a practical realization that bibliographic conventions for describing and listing authors on scholarly outputs are increasingly outdated and fail to represent the range of contributions that researchers make to published output. Furthermore, there is growing interest among researchers, funding agencies, academic institutions, editors, and publishers in increasing both the transparency and accessibility of research contributions. Most publishers require author and contribution disclosure statements upon article submission – some in structured form, some in free-text form – at the same time that funders are developing more scientifically rigorous ways to track the outputs and impact of their research investments. In mid-2012 the Wellcome Trust and Harvard University co-hosted a workshop to bring together members of the academic, publishing, and funder communities interested in exploring alternative contributorship and attribution models. Following the workshop (see workshop report), and working initially with a group of mainly biomedical journal editors (and members of the ICMJE a pilot project was established to develop a controlled vocabulary of contributor roles (taxonomy) that could be used to describe the typical range of ‘contributions’ to scholarly published output for biomedical and science more broadly. The aim was to develop a taxonomy that was both practical and easy to understand while minimizing the potential for misuse. A draft taxonomy was tested with a sample of recent corresponding authors publishing across science and was relatively well received. The outcomes of the pilot test are described in Nature commentary (April 2014). 
Benefits Since 2014, the contributor taxonomy – otherwise known as CRediT (Contributor Roles Taxonomy) has been widely adopted across a range of publishers to improve accessibility and visibility of the range of contribution to published research outputs, bringing a number of important and practical benefits to the research ecosystem more broadly, including: Helping to reduce the potential for author disputes. Supporting adherence to authorship/contributorship processes and policies. Enabling visibility and recognition of the different contributions of researchers, particularly in multi-authored works – across all aspects of the research being reported (including data curation, statistical analysis, etc.) Support identification of peer reviewers and specific expertise. ​Support grant making by enabling funders to more easily identify those responsible for specific research products, developments or breakthroughs. Improving the ability to track the outputs and contributions of individual research specialists and grant recipients. Easy identification of potential collaborators and opportunities for research networking. Further developments in data management and nano-publication. ​Inform ‘science of science’ (‘meta-research) to help enhance scientific efficacy and effectiveness. ​Enable new indicators of research value, use and re-use, credit and attribution. Adopters This list is constantly evolving and will be frequently updated. To share information about a CRediT adoption, please email: [credit] at [casrai] dot [org] ​Publishers American Association of Petroleum Geologists BMJ British Psychological Society Cell Press “CPC” Business Perspectives Dartmouth Journal Services De Gruyter Open Duke University Press eLife Elsevier Evidence Based Communications F1000 Research Geological Society of London Health & Medical Publishing Group International Centre of Insect Physiology and Ecology The Journal of Bone & Joint Surgery KAMJE Press Lippincott Williams & Wilkins MA Healthcare MDPI MIT Press Oman Medical Specialty Board Oxford University Press Public Library of Science (Plos) SAE International SAGE Publishing ScholarOne SLACK Incorporated Springer Springer Publishing Company Virtus Interpress Wiley VCH Wolters Kluwer Institutions University of Glasgow Integrators Allen Press/ Peer Track Aries Systems/ Editorial Manager Clarivate Analytics/ ScholarOne Coko Foundation/ PubSweet OpenConf River Valley/ ReView eJournalPress Rescognito ​Worktribe Publishing Outlets Gates Open Research HRB Open Research Wellcome Open Research How to Implement CRediT For academics Just begin allocating the terms appropriately to your contributors within research outputs. Advocate that your institution and any publications you’re submitting to acknowledge and adopt the taxonomy. For Publishers CRediT adoption can be achieved via a manual workflow outside of Submission and Peer Review systems, or through using a system with an existing CRediT integration. The roles given in the above taxonomy include, but are not limited to, traditional authorship roles. The roles are not intended to define what constitutes authorship, but instead to capture all the work that allows scholarly publications to be produced. 
Recommendations for applying the CRediT taxonomy are: List all Contributions – All contributions should be listed, whether from those listed as authors or individuals named in acknowledgements; Multiple Roles Possible – Individual contributors can be assigned multiple roles, and a given role can be assigned to multiple contributors; Degree of Contribution Optional – Where multiple individuals serve in the same role, the degree of contribution can optionally be specified as ‘lead’, ‘equal’, or ‘supporting’; Shared Responsibility – Corresponding authors should assume responsibility for role assignment, and all contributors should be given the opportunity to review and confirm assigned roles; Make CRediT Machine Readable – CRediT tagged contributions should be coded in JATS xml v1.2 ​The taxonomy has been refined by Consortia Advancing Standards in Research Administration (CASRAI) and National Information Standards Organization (NISO). It is in adoption by Cell Press, PLOS and many other publishers, and has been integrated into some submission and peer review systems including Aries’ Editorial Manager, and River Valley’s ReView. It will be integrated into Coko Foundation’s xPub. For publishers to make CRediT machine readable and with full meta-data available, CRediT should be  coded in JATS xml v1.2, described via this link: https://jats4r.org/credit-taxonomy Links of Interest Resources PLOS & CRediT Cell Press Adoption Interview, Council for Science Editors’ Science Editor Aries Systems CRediT Integration FAQ Aries Systems CRediT Integration video​ Aries Systems/ JBJS CRediT Integration Case Study Articles & Publications How can we ensure visibility and diversity in research contributions? How the Contributor Role Taxonomy (CRediT) is helping the shift from authorship to contributorship Making research contributions more transparent: report of a FORCE workshop Farewell authors, hello contributors Contributorship, Not Authorship: Use CRediT to Indicate Who Did What Now is the time for a team-based approach to team science CRediT where credit is due Now is the time for a team-based approach to team science Increase transparency by adding CRediT to workflow with PubSweet Credit data generators for data reuse Report on the International Workshop on Contributorship and Scholarly Attribution (2012) Guglielmi, Giorgia, Who gets credit? Survey digs into the thorny question of authorship. Nature News. doi: 10.1038/d41586-018-05280-0 Brand, A.; Allen, L.; Altman, M.; Hlava, M.; Scott, J., Beyond Authorship: attribution, contribution, collaboration, and credit. Learned Publishing 2015, 28 (2), 151-155. Allen, L.; Brand, A.; Scott, J.; Altman, M.; Hlava, M., Credit where credit is due. Nature 2014, 508 (7496), 312-313. “Academic Recognition of Team Science: How to Optimize the Canadian Academic System,” (Canadian Academy of Health Sciences, Ottawa (ON), 2017). “Improving recognition of team science contributions in biomedical research careers,” (Academy of Medical Sciences 2016). V. Ilik, M. Conlon, G. Triggs, M. Haendel, K. L. Holmes, OpenVIVO: Transparency in Scholarship. Frontiers in Research Metrics and Analytics preprint (2018). Interview with @DKingsley, Cambridge University Meet the Chairs Liz Allen, Director of Strategic Initiatives, F1000 Research Alison McGonagle-O’Connell, Founder, O’Connell Consulting. Get Involved We have been overwhelmed by the interest in CRediT to date and are working to support adoption and encourage practical usage. 
We are also working to ensure that CRediT is tied to ORCID and included in the Crossref metadata capture. CRediT is currently managed as an informal standard at CASRAI and we are working towards formal standardisation of the taxonomy at NISO. But please do get involved by joining the community CRediT Interest Group, spreading the word, and providing feedback!

catalog-docnow-io-6355 ---- Home | DocNow Tweet Catalog

The DocNow Catalog is a collectively curated listing of Twitter datasets. Public datasets are shared as Tweet IDs, which can be hydrated back into full datasets using our Hydrator desktop application. Note: all metadata is shared under a CC0 license. Please read our Code of Conduct for more information about contributing datasets.

cbeer-info-3687 ---- blog.cbeer.info (Chris Beer, chris@cbeer.info, cbeer, _cb_)

May 25, 2016 Autoscaling AWS Elastic Beanstalk worker tier based on SQS queue length

We are deploying a Rails application (for the Hydra-in-a-Box project) to AWS Elastic Beanstalk. Elastic Beanstalk offers us easy deployment, monitoring, and simple auto-scaling with a built-in dashboard and management interface. Our application uses several potentially long-running background jobs to characterize, checksum, and create derivatives for uploaded content. Since we're deploying this application within AWS, we're also taking advantage of the Simple Queue Service (SQS), using the active-elastic-job gem to queue and run ActiveJob tasks.

Elastic Beanstalk provides settings for "Web server" and "Worker" tiers. Web servers are provisioned behind a load balancer and handle end-user requests, while Workers automatically handle background tasks (via SQS + active-elastic-job). Elastic Beanstalk provides basic autoscaling based on a variety of metrics collected from the underlying instances (CPU, network, I/O, etc). While that is sufficient for our "Web server" tier, we'd like to scale our "Worker" tier based on the number of tasks waiting to be run. Currently, though, the ability to auto-scale the worker tier based on the underlying queue depth isn't enabled through the Elastic Beanstalk interface.

However, as Beanstalk merely manages and aggregates other AWS resources, we have access to the underlying resources, including the autoscaling group for our environment. We should be able to attach a custom auto-scaling policy to that auto scaling group to scale based on additional alarms. For example, let's say we want to add additional worker nodes if there are more than 10 tasks for more than 5 minutes (and, to save money and resources, also remove worker nodes when there are no tasks available). To create the new policy, we'll need to: find the appropriate auto-scaling group by finding the auto-scaling group with the elasticbeanstalk:environment-id tag that matches the worker tier environment id; find the appropriate SQS queue for the worker tier; add auto-scaling policies that add (and remove) instances in the autoscaling group; create a new CloudWatch alarm that fires when the SQS queue depth exceeds our configured threshold (10 messages, for 5 minutes) and triggers the auto-scaling policy to add additional worker instances; and, conversely, create a new CloudWatch alarm that fires when the SQS queue depth hits 0 and triggers the auto-scaling policy to remove worker instances.
and, similarly for scaling back down. Even though there are several manual steps, they aren’t too difficult (other than discovering the various resources we’re trying to orchestrate), and using Elastic Beanstalk is still valuable for the rest of its functionality. But, we’re in the cloud, and really want to automate everything. With a little CloudFormation trickery, we can even automate creating the worker tier with the appropriate autoscaling policies. First, knowing that the CloudFormation API allows us to pass in an existing SQS queue for the worker tier, let’s create an explicit SQS queue resource for the workers: "DefaultQueue" : { "Type" : "AWS::SQS::Queue", } And wire it up to the Beanstalk application by setting the aws:elasticbeanstalk:sqsd:WorkerQueueURL (not shown: sending the worker queue to the web server tier): "WorkersConfigurationTemplate" : { "Type" : "AWS::ElasticBeanstalk::ConfigurationTemplate", "Properties" : { "ApplicationName" : { "Ref" : "AWS::StackName" }, "OptionSettings" : [ ..., { "Namespace": "aws:elasticbeanstalk:sqsd", "OptionName": "WorkerQueueURL", "Value": { "Ref" : "DefaultQueue"} } } } }, "WorkerEnvironment": { "Type": "AWS::ElasticBeanstalk::Environment", "Properties": { "ApplicationName": { "Ref" : "AWS::StackName" }, "Description": "Worker Environment", "EnvironmentName": { "Fn::Join": ["-", [{ "Ref" : "AWS::StackName"}, "workers"]] }, "TemplateName": { "Ref": "WorkersConfigurationTemplate" }, "Tier": { "Name": "Worker", "Type": "SQS/HTTP" }, "SolutionStackName" : "64bit Amazon Linux 2016.03 v2.1.2 running Ruby 2.3 (Puma)" ... } } Using our queue we can describe one of the CloudWatch::Alarm resources and start describing a scaling policy: "ScaleOutAlarm" : { "Type": "AWS::CloudWatch::Alarm", "Properties": { "MetricName": "ApproximateNumberOfMessagesVisible", "Namespace": "AWS/SQS", "Statistic": "Average", "Period": "60", "Threshold": "10", "ComparisonOperator": "GreaterThanOrEqualToThreshold", "Dimensions": [ { "Name": "QueueName", "Value": { "Fn::GetAtt" : ["DefaultQueue", "QueueName"] } } ], "EvaluationPeriods": "5", "AlarmActions": [{ "Ref" : "ScaleOutPolicy" }] } }, "ScaleOutPolicy" : { "Type": "AWS::AutoScaling::ScalingPolicy", "Properties": { "AdjustmentType": "ChangeInCapacity", "AutoScalingGroupName": ????, "ScalingAdjustment": "1", "Cooldown": "60" } }, However, to connect the policy to the auto-scaling group, we need to know the name for the autoscaling group. Unfortunately, the autoscaling group is abstracted behind the Beanstalk environment. 
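Outside of CloudFormation, you can see exactly what we are after by asking the Elastic Beanstalk API for the environment's resources. A rough Ruby sketch using the aws-sdk gem is shown here; the region and environment name are placeholders, and it simply mirrors the describeEnvironmentResources call used in the Lambda function below.

require "aws-sdk-elasticbeanstalk" # or the all-in-one aws-sdk gem

# Illustrative sketch: find the auto-scaling group behind a Beanstalk worker
# environment. "my-app-workers" is a placeholder environment name.
eb = Aws::ElasticBeanstalk::Client.new(region: "us-east-1")

resources = eb.describe_environment_resources(environment_name: "my-app-workers")
              .environment_resources

# Beanstalk worker environments are backed by a single auto-scaling group.
puts resources.auto_scaling_groups.first.name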
To gain access to it, we’ll need to create a custom resource backed by a Lambda function to extract the information from the AWS APIs: "BeanstalkStack": { "Type": "Custom::BeanstalkStack", "Properties": { "ServiceToken": { "Fn::GetAtt" : ["BeanstalkStackOutputs", "Arn"] }, "EnvironmentName": { "Ref": "WorkerEnvironment" } } }, "BeanstalkStackOutputs": { "Type": "AWS::Lambda::Function", "Properties": { "Code": { "ZipFile": { "Fn::Join": ["\n", [ "var response = require('cfn-response');", "exports.handler = function(event, context) {", " console.log('REQUEST RECEIVED:\\n', JSON.stringify(event));", " if (event.RequestType == 'Delete') {", " response.send(event, context, response.SUCCESS);", " return;", " }", " var environmentName = event.ResourceProperties.EnvironmentName;", " var responseData = {};", " if (environmentName) {", " var aws = require('aws-sdk');", " var eb = new aws.ElasticBeanstalk();", " eb.describeEnvironmentResources({EnvironmentName: environmentName}, function(err, data) {", " if (err) {", " responseData = { Error: 'describeEnvironmentResources call failed' };", " console.log(responseData.Error + ':\\n', err);", " response.send(event, context, resource.FAILED, responseData);", " } else {", " responseData = { AutoScalingGroupName: data.EnvironmentResources.AutoScalingGroups[0].Name };", " response.send(event, context, response.SUCCESS, responseData);", " }", " });", " } else {", " responseData = {Error: 'Environment name not specified'};", " console.log(responseData.Error);", " response.send(event, context, response.FAILED, responseData);", " }", "};" ]]} }, "Handler": "index.handler", "Runtime": "nodejs", "Timeout": "10", "Role": { "Fn::GetAtt" : ["LambdaExecutionRole", "Arn"] } } } With the custom resource, we can finally get access the autoscaling group name and complete the scaling policy: "ScaleOutPolicy" : { "Type": "AWS::AutoScaling::ScalingPolicy", "Properties": { "AdjustmentType": "ChangeInCapacity", "AutoScalingGroupName": { "Fn::GetAtt": [ "BeanstalkStack", "AutoScalingGroupName" ] }, "ScalingAdjustment": "1", "Cooldown": "60" } }, The complete worker tier is part of our CloudFormation stack: https://github.com/hybox/aws/blob/master/templates/worker.json Mar 8, 2015 LDPath in 3 examples At Code4Lib 2015, I gave a quick lightning talk on LDPath, a declarative domain-specific language for flatting linked data resources to a hash (e.g. for indexing to Solr). LDPath can traverse the Linked Data Cloud as easily as working with local resources and can cache remote resources for future access. The LDPath language is also (generally) implementation independent (java, ruby) and relatively easy to implement. The language also lends itself to integration within development environments (e.g. ldpath-angular-demo-app, with context-aware autocompletion and real-time responses). For me, working with the LDPath language and implementation was the first time that linked data moved from being a good idea to being a practical solution to some problems. Here is a selection from the VIAF record [1]: <> void:inDataset <../data> ; a genont:InformationResource, foaf:Document ; foaf:primaryTopic <../65687612> . <../65687612> schema:alternateName "Bittman, Mark" ; schema:birthDate "1950-02-17" ; schema:familyName "Bittman" ; schema:givenName "Mark" ; schema:name "Bittman, Mark" ; schema:sameAs , ; a schema:Person ; rdfs:seeAlso <../182434519>, <../310263569>, <../314261350>, <../314497377>, <../314513297>, <../314718264> ; foaf:isPrimaryTopicOf . 
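Before walking through the examples, it may help to see how a program in the LDPath language is actually run. The sketch below uses the Ruby implementation linked above; both the one-line field definition and the gem interface shown here are my own illustrative assumptions rather than one of the post's original examples.

require "ldpath"

# Illustrative sketch: a one-field LDPath program evaluated against a VIAF
# resource. The field name ("name") and the path are assumptions.
program = Ldpath::Program.parse <<~LDPATH
  @prefix foaf : <http://xmlns.com/foaf/0.1/> ;
  @prefix schema : <http://schema.org/> ;

  name = foaf:primaryTopic / schema:name :: xsd:string ;
LDPATH

# Evaluating against a remote URI dereferences it (and any resources the
# paths traverse) over the Linked Data Cloud, returning a field => values hash.
p program.evaluate(RDF::URI.new("http://viaf.org/viaf/152427175/"))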
We can use LDPath to extract the person’s name: So far, this is not so different from traditional approaches. But, if we look deeper in the response, we can see other resources, including books by the author. <../310263569> schema:creator <../65687612> ; schema:name "How to Cook Everything : Simple Recipes for Great Food" ; a schema:CreativeWork . We can traverse the links to include the titles in our record: LDPath also gives us the ability to write this query using a reverse property selector, e.g: books = foaf:primaryTopic / ^schema:creator[rdf:type is schema:CreativeWork] / schema:name :: xsd:string ; The resource links out to some external resources, including a link to dbpedia. Here is a selection from record in dbpedia: dbpedia-owl:abstract "Mark Bittman (born c. 1950) is an American food journalist, author, and columnist for The New York Times."@en, "Mark Bittman est un auteur et chroniqueur culinaire américain. Il a tenu une chronique hebdomadaire pour le The New York Times, appelée The Minimalist (« le minimaliste »), parue entre le 17 septembre 1997 et le 26 janvier 2011. Bittman continue d'écrire pour le New York Times Magazine, et participe à la section Opinion du journal. Il tient également un blog."@fr ; dbpedia-owl:birthDate "1950+02:00"^^ ; dbpprop:name "Bittman, Mark"@en ; dbpprop:shortDescription "American journalist, food writer"@en ; dc:description "American journalist, food writer", "American journalist, food writer"@en ; dcterms:subject , , , , , , ; LDPath allows us to transparently traverse that link, allowing us to extract the subjects for VIAF record: [1] If you’re playing along at home, note that, as of this writing, VIAF.org fails to correctly implement content negotiation and returns HTML if it appears anywhere in the Accept header, e.g.: curl -H "Accept: application/rdf+xml, text/html; q=0.1" -v http://viaf.org/viaf/152427175/ will return a text/html response. This may cause trouble for your linked data clients. Mar 13, 2013 Building a Pivotal Tracker IRC bot with Sinatra and Cinch We're using Pivotal Tracker on the Fedora Futures project. We also have an IRC channel where the tech team hangs out most of the day, and let each other know what we're working on, which tickets we're taking, and give each other feedback on those tickets. In order to document this, we try to put most of our the discussion in the tickets for future reference (although we are logging the IRC channel, it's not nearly as easy to look up decisions there). Because we're (lazy) developers, we wanted updates in Pivotal to get surfaced in the IRC channel. There was a (neglected) IRC bot, Pivotal-Tracker-IRC-bot, but it was designed to push and pull data from Pivotal based on commands in IRC (and, seems fairly abandoned). So, naturally, we built our own integration: Pivotal-IRC. This was my first time using Cinch to build a bot, and it was a surprisingly pleasant and straightforward experience: bot = Cinch::Bot.new do configure do |c| c.nick = $nick c.server = $irc_server c.channels = [$channel] end end # launch the bot in a separate thread, because we're using this one for the webapp. 
Thread.new { bot.start }

And we have a really tiny Sinatra app that can parse the Pivotal Webhooks payload and funnel it into the channel:

post '/' do
  message = Pivotal::WebhookMessage.new request.body.read
  bot.channel_list.first.msg("#{message.description} #{message.story_url}")
end

It turns out we also send links to Pivotal tickets not infrequently, and building two-way communication (using the Pivotal REST API, and the handy pivotal-tracker gem) was also easy. Cinch exposes a handy DSL that parses messages using regular expressions and capturing groups:

bot.on :message, /story\/show\/([0-9]+)/ do |m, ticket_id|
  story = project.stories.find(ticket_id)
  m.reply "#{story.story_type}: #{story.name} (#{story.current_state}) / owner: #{story.owned_by}"
end

Mar 9, 2013 Real-time statistics with Graphite, Statsd, and GDash

We have a Graphite-based stack of real-time visualization tools, including the data aggregator Statsd. These tools let us easily record real-time data from arbitrary services with minimal fuss. We present some curated graphs through GDash, a simple Sinatra front-end. For example, we record the time it takes for Solr to respond to queries from our SearchWorks catalog, using this simple bash script:

tail -f /var/log/tomcat6/catalina.out | ruby solr_stats.rb

(We rotate these logs through truncation; you can also use `tail -f --retry` for logs that are moved away when rotated.)

And the ruby script that does the actual parsing:

require 'statsd.rb'
STATSD = Statsd.new(..., 8125)

# Listen to stdin
while str = gets
  if str =~ /QTime=([^ ]+)/
    # extract the QTime
    ms = $1.to_i
    # record it, based on our hostname
    STATSD.timing("#{ENV['HOSTNAME'].gsub('.', '-')}.solr.qtime", ms)
  end
end

From this data, we can start asking questions like:

Is our load-balancer configured optimally? (hint: not quite; for a variety of reasons, we've sacrificed some marginal performance benefit for this non-invasive, simpler load-balance configuration.)

Why are our 90th-percentile query times creeping up? (time in ms)

(Answers to these questions and more in a future post, I'm sure.)

We also use this setup to monitor other services, e.g.: What's happening in our Fedora instance (and, which services are using the repository)? Note the red line ("warn_0") in the top graph. It marks the point where our (asynchronous) indexing system is unable to keep up with demand, and updates may appear at a delay.

Given time (and sufficient data, of course), this also gives us the ability to forecast and plan for issues:

Is our Solr query time getting worse? (Ganglia can perform some basic manipulation, including taking integrals and derivatives.)

What is the rate of growth of our indexing backlog, and, can we process it in a reasonable timeframe, or should we scale the indexer service?

Given our rate of disk usage, are we on track to run out of disk space this month? this week?

If we build graphs to monitor those conditions, we can add Nagios alerts to trigger service alerts. GDash helpfully exposes a REST endpoint that lets us know if a service has those WARN or CRITICAL thresholds. We currently have a home-grown system monitoring system that we're tempted to fold into here as well. I've been evaluating Diamond, which seems to do a pretty good job of collecting granular system statistics (CPU, RAM, IO, disk space, etc).

Mar 8, 2013 Icemelt: A stand-in for integration tests against AWS Glacier

One of the threads we've been pursuing as part of the Fedora Futures project is integration with asynchronous and/or very slow storage.
We've taken on AWS Glacier as a prime, generally accessable example. Uploading content is slow, but can be done synchronously in one API request: POST /:account_id/vaults/:vault_id/archives x-amz-archive-description: Description ...Request body (aka your content)... Where things get radically different is when requesting content back. First, you let Glacier know you'd like to retrieve your content: POST /:account_id/vaults/:vault_id/jobs HTTP/1.1 { "Type": "archive-retrieval", "ArchiveId": String, [...] } Then, you wait. and wait. and wait some more; from the documentation: Most Amazon Glacier jobs take about four hours to complete. You must wait until the job output is ready for you to download. If you have either set a notification configuration on the vault identifying an Amazon Simple Notification Service (Amazon SNS) topic or specified an Amazon SNS topic when you initiated a job, Amazon Glacier sends a message to that topic after it completes the job. [emphasis added] Icemelt If you're iterating on some code, waiting hours to get your content back isn't realistic. So, we wrote a quick Sinatra app called Icemelt in order to mock the Glacier REST API (and, perhaps taking less time to code than retrieving content from Glacier ). We've tested it using the Ruby Fog client, as well as the official AWS Java SDK, and it actually works! Your content gets stored locally, and the delay for retrieving content is configurable (default: 5 seconds). Configuring the official SDK looks something like this: PropertiesCredentials credentials = new PropertiesCredentials( TestIcemeltGlacierMock.class .getResourceAsStream("AwsCredentials.properties")); AmazonGlacierClient client = new AmazonGlacierClient(credentials); client.setEndpoint("http://localhost:3000/"); And for Fog, something like: Fog::AWS::Glacier.new :aws_access_key_id => '', :aws_secret_access_key => '', :scheme => 'http', :host => 'localhost', :port => '3000' Right now, Icemelt skips a lot of unnecessary work (e.g. checking HMAC digests for authentication, validating hashes, etc), but, as always, patches are very welcome. 
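To give a feel for how a client exercises Icemelt, here is a rough Ruby sketch that uploads an archive and then initiates a retrieval job against a local instance, using the same REST paths described above. It deliberately skips request signing (which Icemelt does not check), and the vault name, the "-" account-id placeholder, and the headers read from the responses are illustrative assumptions rather than a statement of exactly what Icemelt returns.

require "net/http"
require "json"
require "uri"

ICEMELT = URI("http://localhost:3000") # default Icemelt address from the post
VAULT   = "test-vault"                 # assumption: any vault name

http = Net::HTTP.new(ICEMELT.host, ICEMELT.port)

# Upload an archive (mirrors POST /:account_id/vaults/:vault_id/archives)
upload = Net::HTTP::Post.new("/-/vaults/#{VAULT}/archives")
upload["x-amz-archive-description"] = "Icemelt smoke test"
upload.body = "...your content..."
archive_id = http.request(upload)["x-amz-archive-id"]

# Ask for it back (mirrors POST /:account_id/vaults/:vault_id/jobs); with
# Icemelt the job completes after the configured delay instead of ~4 hours.
job = Net::HTTP::Post.new("/-/vaults/#{VAULT}/jobs")
job.body = { "Type" => "archive-retrieval", "ArchiveId" => archive_id }.to_json
puts http.request(job)["Location"]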
cedat-mak-ac-ug-494 ---- iLabs Project | The College of Engineering, Design, Art and Technology, Makerere University

iLabs@MAK
iLabs@MAK is a CEDAT-based research project that develops remote laboratories on the iLabs platform to supplement the conventional laboratories under the Electrical Engineering Department. An iLab system links three computer stations: a Lab Server, which runs the laboratory hardware; a Client, the graphical user interface customized to remotely access the laboratory hardware; and a Service Broker, which manages access to the laboratory and mediates information flow between the Lab Server and the Client. iLabs@MAK is carried out in collaboration with the Massachusetts Institute of Technology (MIT), Obafemi Awolowo University (OAU) and the University of Dar-es-salaam. The fundamental hardware and software used is provided mainly by National Instruments (NI).

Mission
Advancing knowledge and skills beneficial to Uganda in particular and the world at large through collaboration with pre-eminent research institutions spearheaded by MIT.

Vision
To facilitate the improvement of the student learning experience by contributing meaningfully to the movement within higher education leading to global sharing of lab experiments over the Internet.

Research
iLabs@MAK comprises student developers (both graduate and undergraduate) and members of staff from the Faculty of Technology. The student developers carry out research in the development of new labs to support the curricula of the BSc. Electrical, Telecommunications and Computer Engineering programmes. To date, laboratories have been developed supporting experimentation in the fields of digital circuit analysis, amplitude/frequency modulation, pulse code modulation and digital data transmission. These experiments are used in the courses of Introduction to Digital Electronics (1st year), Applied Digital Electronics, Basic Telephony and Communication Theory I (3rd year).
The ongoing research seeks to develop laboratories supporting the fields of Digital Signal Processing, Embedded Systems, Fiber Optic Systems, and Control Systems Engineering. For more information about the iLabs project, please visit our website: cedat.mak.ac.ug/ilabs iLabs Events 5th Annual iLabs-National Instruments Conference iLabs@MAK Project Wins the Best Exhibitor Award iLabs Science and Technology Innovations Challenge 2013 iLabs successfully holds Central Robotics challenge iLabs@Mak Project concludes search for best innovators iLabs Robotics Final 2013   One thought on “iLabs Project” Pingback:Meet the organisations receiving Open Data Day 2021 mini-grants – Open Knowledge Foundation blog Comments are closed. Quick Access Message from the Principal Academic Staff Administrative Staff Undergraduate Programs Graduate Programs Ceremonies and Events Partnerships and Collaborations Publications CEDAT Events << Apr 2021 >> M T W T F S S 29 30 31 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 1 2 Twitter Feed Two Mechanical Engineering @MakCEDAT @MakerereU students are part of a team that is now a FINALIST for @WegePrize 2021—a global design competition. They belong to a team Musana, which created a stove using solar power and water to fuel cooking, eliminating the need for wood fuel pic.twitter.com/gBZSz1fsC3 About 3 weeks ago from Makerere University CEDAT's Twitter via Twitter Web App 4. Bsc. Construction Management will be still offered at Graduate level as MSc. Construction Management. This will allow for specialization at this level from a wide base of undergraduate programmes and also offer an opportunity for advanced research in Construction Management pic.twitter.com/QwnoW9k4HK Last month from Makerere University CEDAT's Twitter via Twitter Web App 3. BSc. Telecommunications Engineering with BSc. Computer Engineering are being merged to form BSc. Computer and Communications Engineering which will provide the students with a wider scope of employment opportunities and more options for specialization pic.twitter.com/Z8LQC5Flya Last month from Makerere University CEDAT's Twitter via Twitter Web App 2. Those already enrolled on these programmes will not be affected in anyway. Students on the programmes will continue to have normal classes until graduation. pic.twitter.com/RcgsNoE91C Last month from Makerere University CEDAT's Twitter via Twitter Web App Communication about phased out programmes at CEDAT... 1. The phased out programmes are BSc. Telecommunications Engineering, BSc. Computer Engineering and BSc. Construction Management... cedat.mak.ac.ug/news/communic… pic.twitter.com/LxGYOb32yC Last month from Makerere University CEDAT's Twitter via Twitter Web App 50 years of Technology https://www.youtube.com/watch?v=-qtuxc7oHLw Important Links MAK-Home | Webmail Policies | Intranet Presidential Initiative International Collaborations Tag Cloud analysis. 
architecture CEDAT Charles Niwagaba Civil Engineering communication computer engineering concepts Conferences Construction course Department of Electrical and Computer Engineering Department of Geomatics and Land Management design Development drawing engineering Engineering Mathematics Entrepreneurship Environment Exhibition fundamentals Independent Study Innovation Innovations Makerere Art Gallery Makerere University management materials media methods MTSIFA planning principles process project projects Rationale Research skills student students techniques technology Uganda Connect With Us Contact Us THE COLLEGE OF ENGINEERING, DESIGN, ART AND TECHNOLOGY Makerere University P.O.Box 7062, Kampala-Uganda Email: pr@cedat.mak.ac.ug Web: www.cedat.mak.ac.ug Copyright © 2021 The College of Engineering, Design, Art and Technology. climadeeleicao-com-br-3043 ---- Análise de planos – Clima de Eleição Home Sobre Nosso Time Lideranças pelo clima Análise de planos Publicações Contato Brasil +55 41 998997470 climadeeleicao@gmail.com joaohencer@gmail.com Entre em contato Nosso e-mail → climadeeleicao@gmail.com Entrar em contato Projeto Clima de Eleição capacita centenas de candidaturas sobre a crise climática durante as eleições municipais de 2020. Links úteis Sobre nós Candidatos Nosso time Contato ©2020, Clima de Eleição. Todos direitos reservados. clir-informz-net-471 ---- Template Name: Characters remaining: Template Description: Characters remaining: Name: Filename: Select the destination folder: Select a folder Report Edit Test Activate Deactivate Save As Template Copy Delete Undelete Archive Set as In Progress Form Content Only Unsubscribe Form Custom Some non-required fields on this form are blank. Do you wish to save with blank values? Loading... Join the DLF Forum Newsletter mailing list! Email: * First Name * Last Name * Institution Title Opt-in to join the list DLF Forum News Please verify that you are not a robot. Sign Up DLF never shares your information. But we do like to share information with you! code4lib-org-861 ---- Code4Lib | We are developers and technologists for libraries, museums, and archives who are dedicated to being a diverse and inclusive community, seeking to share ideas and build collaboration. About Chat Conference Jobs Journal Local Mailing List Planet Wiki Code4Lib.org was migrated from Drupal to Jekyll in June 2018. Some links may still be broken. To report issues or help fix see: https://github.com/code4lib/code4lib.github.io Posts Nov 25, 2020 Code4Lib 2021 Sep 5, 2019 Code4Lib 2020 Aug 27, 2018 Code4Lib 2019 Apr 17, 2018 Code4Lib Journal Issue 41 Call for Papers Oct 18, 2017 Issue 38 of the Code4Lib Journal Aug 8, 2017 Code4Lib 2018 Jul 18, 2017 Issue 37 of the Code4Lib Journal Jun 12, 2017 Code4Lib Journal Issue 38 Call for Papers Oct 28, 2016 Code4Lib Journal #34 Oct 14, 2016 C4L17: Call for Presentation/Panel proposals Oct 13, 2016 Code4Lib 2017 Jul 19, 2016 Code4Lib Journal #33 Apr 26, 2016 Code4Lib Journal #32 Sep 17, 2015 jobs.code4lib.org studied Aug 10, 2015 Code4Lib 2016 Jul 27, 2015 Code4Lib Northern California: Stanford, CA Jul 15, 2015 Code4Lib Journal #29 Apr 15, 2015 Code4Lib Journal #28: Special Issue on Diversity in Library Technology Mar 7, 2015 Code4Lib 2016 will be in Philadelphia Mar 7, 2015 Code4Lib 2016 Conference Proposals Feb 21, 2015 Code4Lib North 2015: St. 
Catharines, ON Feb 21, 2015 Code4Lib 2015 videos Jan 31, 2015 2015 Code of Conduct Dec 12, 2014 Code4Lib 2015 Diversity Scholarships Dec 5, 2014 Your code does not exist in a vacuum Dec 5, 2014 Your Chocolate is in My Peanut Butter! Mixing up Content and Presentation Layers to Build Smarter Books in Browsers with RDFa, Schema.org, and Linked Data Topics Dec 5, 2014 You Gotta Keep 'em Separated: The Case for "Bento Box" Discovery Interfaces Dec 5, 2014 Refinery — An open source locally deployable web platform for the analysis of large document collections Dec 5, 2014 Programmers are not projects: lessons learned from managing humans Dec 5, 2014 Our $50,000 Problem: Why Library School? Dec 5, 2014 Making your digital objects embeddable around the web Dec 5, 2014 Leveling Up Your Git Workflow Dec 5, 2014 Level Up Your Coding with Code Club (yes, you can talk about it) Dec 5, 2014 How to Hack it as a Working Parent: or, Should Your Face be Bathed in the Blue Glow of a Phone at 2 AM? Dec 5, 2014 Helping Google (and scholars, researchers, educators, & the public) find archival audio Dec 5, 2014 Heiðrún: DPLA's Metadata Harvesting, Mapping and Enhancement System Dec 5, 2014 Got Git? Getting More Out of Your GitHub Repositories Dec 5, 2014 Feminist Human Computer Interaction (HCI) in Library Software Dec 5, 2014 Dynamic Indexing: a Tragic Solr Story Dec 5, 2014 Docker? VMs? EC2? Yes! With Packer.io Dec 5, 2014 Digital Content Integrated with ILS Data for User Discovery: Lessons Learned Dec 5, 2014 Designing and Leading a Kick A** Tech Team Dec 5, 2014 Consuming Big Linked Open Data in Practice: Authority Shifts and Identifier Drift Dec 5, 2014 BYOB: Build Your Own Bootstrap Dec 5, 2014 Book Reader Bingo: Which Page-Turner Should I Use? Dec 5, 2014 Beyond Open Source Dec 5, 2014 Awesome Pi, LOL! Dec 5, 2014 Annotations as Linked Data with Fedora4 and Triannon (a Real Use Case for RDF!) Dec 5, 2014 American (Archives) Horror Story: LTO Failure and Data Loss Dec 5, 2014 A Semantic Makeover for CMS Data Dec 4, 2014 Code4lib 2007 Lighting Talks Nov 16, 2014 Store Nov 11, 2014 Voting for Code4Lib 2015 Prepared Talks is now open. Nov 10, 2014 Keynote voting for the 2015 conference is now open! Sep 23, 2014 Code4Lib 2015: Call for Proposals Sep 21, 2014 Code4Lib North (Ottawa): Tuesday October 7th, 2014 Sep 10, 2014 code4libBC: November 27 and 28, 2014 Sep 6, 2014 2015 Conference Schedule Jul 22, 2014 Code4Lib Journal issue 25 Jul 15, 2014 Code4Lib NorCal 28 July in San Mateo Jul 2, 2014 Code4Lib 2015 Apr 18, 2014 Code4Lib 2014 Trip Report - Zahra Ashktorab Apr 18, 2014 Code4Lib 2014 Trip Report- Nabil Kashyap Apr 18, 2014 Code4Lib 2014 Trip Report - Junior Tidal Apr 18, 2014 Code4Lib 2014 Trip Report - Jennifer Maiko Kishi Apr 18, 2014 Code4Lib 2014 Trip Report - J. (Jenny) Gubernick Apr 18, 2014 Code4Lib 2014 Trip Report - Emily Reynolds Apr 18, 2014 Code4Lib 2014 Trip Report - Coral Sheldon Hess Apr 18, 2014 Code4Lib 2014 Trip Report - Christina Harlow Apr 18, 2014 CODE4LIB 2014 Trip Report - Arie Nugraha Mar 10, 2014 Call for proposals: Code4Lib Journal, issue 25 Feb 3, 2014 2014 Code of Conduct Jan 30, 2014 Code4Lib 2015 Call for Host Proposals Jan 24, 2014 Code4Lib 2014 Sponsors Jan 21, 2014 WebSockets for Real-Time and Interactive Interfaces Jan 21, 2014 We Are All Disabled! 
Universal Web Design Making Web Services Accessible for Everyone Jan 21, 2014 Visualizing Solr Search Results with D3.js for User-Friendly Navigation of Large Results Sets Jan 21, 2014 Visualizing Library Resources as Networks Jan 21, 2014 Under the Hood of Hadoop Processing at OCLC Research Jan 21, 2014 Towards Pasta Code Nirvana: Using JavaScript MVC to Fill Your Programming Ravioli Jan 21, 2014 Sustaining your Open Source project through training Jan 21, 2014 Structured data NOW: seeding schema.org in library systems Jan 21, 2014 Quick and Easy Data Visualization with Google Visualization API and Google Chart Libraries Jan 21, 2014 Queue Programming -- how using job queues can make the Library coding world a better place Jan 21, 2014 PhantomJS+Selenium: Easy Automated Testing of AJAX-y UIs Jan 21, 2014 Personalize your Google Analytics Data with Custom Events and Variables Jan 21, 2014 Organic Free-Range API Development - Making Web Services That You Will Actually Want to Consume Jan 21, 2014 Next Generation Catalogue - RDF as a Basis for New Services Jan 21, 2014 More Like This: Approaches to Recommending Related Items using Subject Headings Jan 21, 2014 Lucene's Latest (for Libraries) Jan 21, 2014 Discovering your Discovery System in Real Time Jan 21, 2014 Dead-simple Video Content Management: Let Your Filesystem Do The Work Jan 21, 2014 Building for others (and ourselves): the Avalon Media System Jan 21, 2014 Behold Fedora 4: The Incredible Shrinking Repository! Jan 21, 2014 All Tiled Up Jan 21, 2014 A reusable application to enable self deposit of complex objects into a digital preservation environment Jan 21, 2014 A Book, a Web Browser and a Tablet: How Bibliotheca Alexandrina's Book Viewer Framework Makes It Possible Jan 21, 2014 2014 Conference Schedule Jan 17, 2014 Code4Lib 2014 Conference Diversity Scholarship Recipients Nov 19, 2013 Code4lib 2014 Diversity Scholarships (Application Deadline: Dec. 13, 2013, 5pm EST) Nov 12, 2013 Code4Lib 2014 Keynote Speakers Sep 30, 2013 Code4Lib 2014 Jun 10, 2013 Code4Lib 2014 Conference Prospectus for Sponsors Mar 28, 2013 Code4Lib 2014 Conference Proposals Jan 31, 2013 Ask Anything! Dec 5, 2012 Code4Lib 2014 Call for Host Proposals Dec 4, 2012 The Care and Feeding of a Crowd Dec 4, 2012 The Avalon Media System: A Next Generation Hydra Head For Audio and Video Delivery Dec 4, 2012 Solr Update Dec 4, 2012 REST IS Your Mobile Strategy Dec 4, 2012 Practical Relevance Ranking for 10 million books. Dec 4, 2012 Pitfall! Working with Legacy Born Digital Materials in Special Collections Dec 4, 2012 n Characters in Search of an Author Dec 4, 2012 Linked Open Communism: Better discovery through data dis- and re- aggregation Dec 4, 2012 Hybrid Archival Collections Using Blacklight and Hydra Dec 4, 2012 HTML5 Video Now! Dec 4, 2012 Hands off! 
Best Practices and Top Ten Lists for Code Handoffs Dec 4, 2012 Hacking the DPLA Dec 4, 2012 Google Analytics, Event Tracking and Discovery Tools Dec 4, 2012 Evolving Towards a Consortium MARCR Redis Datastore Dec 4, 2012 EAD without XSLT: A Practical New Approach to Web-Based Finding Aids Dec 4, 2012 De-sucking the Library User Experience Dec 4, 2012 Data-Driven Documents: Visualizing library data with D3.js Dec 4, 2012 Creating a Commons Dec 4, 2012 Citation search in SOLR and second-order operators Dec 4, 2012 Browser/Javascript Integration Testing with Ruby Dec 4, 2012 ARCHITECTING ScholarSphere: How We Built a Repository App That Doesn't Feel Like Yet Another Janky Old Repository App Dec 4, 2012 All Teh Metadatas Re-Revisited Dec 4, 2012 Actions speak louder than words: Analyzing large-scale query logs to improve the research experience Nov 30, 2012 Code4Lib 2013 Scholarship (deadline: December 14, 2012) Nov 2, 2012 Code4Lib 2013 Nov 2, 2012 Code4Lib 2013 Schedule Oct 2, 2012 Code4Lib Conference 2013 Call for Propoosals Sep 5, 2012 Keynote voting for the 2013 conference is now open! Jul 11, 2012 Dates Set for Code4Lib 2013 in Chicago May 29, 2012 Code4Lib Journal - Call for Proposals May 7, 2012 ruby-marc 0.5.0 released Apr 10, 2012 Code4Lib Journal: Editors Wanted Feb 3, 2012 Code4Lib Journal Issue 16 is published! Feb 3, 2012 Ask Anything! – Facilitated by Carmen Mitchell- Code4Lib 2012 Jan 26, 2012 Relevance Ranking in the Scholarly Domain - Tamar Sadeh, PhD Jan 26, 2012 Kill the search button II - the handheld devices are coming - Jørn Thøgersen, Michael Poltorak Nielsen Jan 25, 2012 Stack View: A Library Browsing Tool - Annie Cain Jan 25, 2012 Search Engine Relevancy Tuning - A Static Rank Framework for Solr/Lucene - Mike Schultz Jan 25, 2012 Practical Agile: What's Working for Stanford, Blacklight, and Hydra - Naomi Dushay Jan 25, 2012 NoSQL Bibliographic Records: Implementing a Native FRBR Datastore with Redis - Jeremy Nelson Jan 25, 2012 Lies, Damned Lies, and Lines of Code Per Day - James Stuart Jan 25, 2012 Indexing big data with Tika, Solr & map-reduce - Scott Fisher, Erik Hetzner Jan 25, 2012 In-browser data storage and me - Jason Casden Jan 25, 2012 How people search the library from a single search box - Cory Lown Jan 25, 2012 Discovering Digital Library User Behavior with Google Analytics - Kirk Hess Jan 25, 2012 Building research applications with Mendeley - William Gunn Jan 23, 2012 Your UI can make or break the application (to the user, anyway) - Robin Schaaf Jan 23, 2012 Your Catalog in Linked Data - Tom Johnson Jan 23, 2012 The Golden Road (To Unlimited Devotion): Building a Socially Constructed Archive of Grateful Dead Artifacts - Robin Chandler Jan 23, 2012 Quick and Dirty Clean Usability: Rapid Prototyping with Bootstrap - Shaun Ellis Jan 23, 2012 “Linked-Data-Ready” Software for Libraries - Jennifer Bowen Jan 23, 2012 HTML5 Microdata and Schema.org - Jason Ronallo Jan 23, 2012 HathiTrust Large Scale Search: Scalability meets Usability - Tom Burton-West Jan 23, 2012 Design for Developers - Lisa Kurt Jan 23, 2012 Beyond code: Versioning data with Git and Mercurial - Charlie Collett, Martin Haye Jan 23, 2012 ALL TEH METADATAS! 
or How we use RDF to keep all of the digital object metadata formats thrown at us - Declan Fleming Dec 29, 2011 Discussion for Elsevier App Challenge during Code4Lib 2012 Dec 14, 2011 So you want to start a Kindle lending program Dec 1, 2011 Code4Lib 2013 Call for Host Proposals Nov 29, 2011 Code4Lib 2012 Scholarship (deadline: December 9, 2011) Oct 21, 2011 code4lib 2012 Sponsor Listing Oct 19, 2011 Code4Lib 2012 Schedule Jul 28, 2011 Code4Lib 2012 Feb 11, 2011 Code4Lib 2012 Sponsorship Jan 26, 2011 VuFind Beyond MARC: Discovering Everything Else - Demian Katz Jan 26, 2011 One Week | One Tool: Ultra-Rapid Open Source Development Among Strangers - Scott Hanrath Jan 26, 2011 Letting In the Light: Using Solr as an External Search Component - Jay Luker and Benoit Thiell Jan 26, 2011 Kuali OLE: Architecture for Diverse and Linked Data - Tim McGeary and Brad Skiles Jan 26, 2011 Keynote Address - Diane Hillmann Jan 26, 2011 Hey, Dilbert. Where's My Data?! - Thomas Barker Jan 26, 2011 Enhancing the Mobile Experience: Mobile Library Services at Illinois - Josh Bishoff - Josh Bishoff Jan 26, 2011 Drupal 7 as Rapid Application Development Tool - Cary Gordon Jan 26, 2011 Code4Lib 2012 in Seattle Jan 26, 2011 2011 Lightning Talks Jan 26, 2011 2011 Breakout Sessions Jan 25, 2011 (Yet Another) Home-Grown Digital Library System, Built Upon Open Source XML Technologies and Metadata Standards - David Lacy Jan 25, 2011 Why (Code4) Libraries Exist - Eric Hellman Jan 25, 2011 Visualizing Library Data - Karen Coombs Jan 25, 2011 Sharing Between Data Repositories - Kevin S. Clarke Jan 25, 2011 Practical Relevancy Testing - Naomi Dushay Jan 25, 2011 Opinionated Metadata (OM): Bringing a Bit of Sanity to the World of XML Metadata - Matt Zumwalt Jan 25, 2011 Mendeley's API and University Libraries: Three Examples to Create Value - Ian Mulvany Jan 25, 2011 Let's Get Small: A Microservices Approach to Library Websites - Sean Hannan Jan 25, 2011 GIS on the Cheap - Mike Graves Jan 25, 2011 fiwalk With Me: Building Emergent Pre-Ingest Workflows for Digital Archival Records using Open Source Forensic Software - Mark M Jan 25, 2011 Enhancing the Performance and Extensibility of the XC’s MetadataServicesToolkit - Ben Anderson Jan 25, 2011 Chicago Underground Library’s Community-Based Cataloging System - Margaret Heller and Nell Taylor Jan 25, 2011 Building an Open Source Staff-Facing Tablet App for Library Assessment - Jason Casden and Joyce Chapman Jan 25, 2011 Beyond Sacrilege: A CouchApp Catalog - Gabriel Farrell Jan 25, 2011 Ask Anything! – Facilitated by Dan Chudnov Jan 25, 2011 A Community-Based Approach to Developing a Digital Exhibit at Notre Dame Using the Hydra Framework - Rick Johnson and Dan Brubak Dec 12, 2010 Code4Lib 2011 schedule Dec 10, 2010 Code4Lib 2012 Call for Host Proposals Nov 17, 2010 Scholarships to Attend the 2011 Code4Lib Conference (Deadline Dec. 6, 2010) Sep 23, 2010 Code4Lib 2011 Sponsorship Jun 28, 2010 Issue 10 of the Code4Lib Journal Mar 23, 2010 Location of code4lib 2011 Mar 23, 2010 Code4Lib 2011: Get Ready for the Best Code4lib Conference Yet! Mar 22, 2010 Issue 9 of the Code4Lib Journal Mar 12, 2010 Vote on Code4Lib 2011 hosting proposals Feb 24, 2010 You Either Surf or You Fight: Integrating Library Services With Google Wave - Sean Hannan - Code4Lib 2010 Feb 24, 2010 Vampires vs. 
Werewolves: Ending the War Between Developers and Sysadmins with Puppet - Bess Sadler - Code4Lib 2010 Feb 24, 2010 The Linked Library Data Cloud: Stop talking and start doing - Ross Singer - Code4Lib 2010 Feb 24, 2010 Taking Control of Library Metadata and Websites Using the eXtensible Catalog - Jennifer Bowen - Code4Lib 2010 Feb 24, 2010 Public Datasets in the Cloud - Rosalyn Metz and Michael B. Klein - Code4Lib 2010 Feb 24, 2010 Mobile Web App Design: Getting Started - Michael Doran - Code4Lib 2010 Feb 24, 2010 Metadata Editing – A Truly Extensible Solution - David Kennedy and David Chandek-Stark - Code4Lib 2010 Feb 24, 2010 Media, Blacklight, and Viewers Like You (pdf, 2.61MB) - Chris Beer - Code4Lib 2010 Feb 24, 2010 Matching Dirty Data – Yet Another Wheel - Anjanette Young and Jeff Sherwood - Code4Lib 2010 Feb 24, 2010 library/mobile: Developing a Mobile Catalog - Kim Griggs - Code4Lib 2010 Feb 24, 2010 Keynote #2: catfish, cthulhu, code, clouds and Levenshtein distance - Paul Jones - Code4Lib 2010 Feb 24, 2010 Keynote #1: Cathy Marshall - Code4Lib 2010 Feb 24, 2010 Iterative Development Done Simply - Emily Lynema - Code4Lib 2010 Feb 24, 2010 I Am Not Your Mother: Write Your Test Code - Naomi Dushay, Willy Mene, and Jessie Keck - Code4Lib 2010 Feb 24, 2010 How to Implement A Virtual Bookshelf With Solr - Naomi Dushay and Jessie Keck - Code4Lib 2010 Feb 24, 2010 HIVE: A New Tool for Working With Vocabularies - Ryan Scherle and Jose Aguera - Code4Lib 2010 Feb 24, 2010 Enhancing Discoverability With Virtual Shelf Browse - Andreas Orphanides, Cory Lown, and Emily Lynema - Code4Lib 2010 Feb 24, 2010 Drupal 7: A more powerful platform for building library applications - Cary Gordon - Code4Lib 2010 Feb 24, 2010 Do It Yourself Cloud Computing with Apache and R - Harrison Dekker - Code4Lib 2010 Feb 24, 2010 Cloud4Lib - Jeremy Frumkin and Terry Reese - Code4Lib 2010 Feb 24, 2010 Becoming Truly Innovative: Migrating from Millennium to Koha - Ian Walls - Code4Lib 2010 Feb 24, 2010 Ask Anything! – Facilitated by Dan Chudnov - Code4Lib 2010 Feb 24, 2010 A Better Advanced Search - Naomi Dushay and Jessie Keck - Code4Lib 2010 Feb 24, 2010 7 Ways to Enhance Library Interfaces with OCLC Web Services - Karen Coombs - Code4Lib 2010 Feb 22, 2010 Code4Lib 2010 Lightning Talks Feb 22, 2010 Code4Lib 2010 Breakout Sessions Feb 21, 2010 Code4Lib 2010 Participant Release Form Feb 5, 2010 Code4Lib 2011 Hosting Proposals Solicited Jan 16, 2010 2010 Code4lib Scholarship Recipients Jan 12, 2010 Code4Lib North Dec 21, 2009 Scholarships to Attend the 2010 Code4Lib Conference Dec 16, 2009 Code4Lib 2010 Registration Dec 14, 2009 2010 Conference info Dec 10, 2009 Code4Lib 2010 Schedule Dec 4, 2009 Code4Lib 2010 Sponsorship Nov 16, 2009 2010 Code4Lib Conference Prepared Talks Voting Now Open! Oct 30, 2009 Code4Lib 2010 Call for Prepared Talk Proposals Sep 21, 2009 Vote for code4lib 2010 keynotes! Jul 10, 2009 Code4Lib 2010 Jun 26, 2009 Code4Lib Journal: new issue 7 now available May 15, 2009 Visualizing Media Archives: A Case Study May 15, 2009 The Open Platform Strategy: what it means for library developers May 15, 2009 If You Love Something...Set it Free May 14, 2009 What We Talk About When We Talk About FRBR May 14, 2009 The Rising Sun: Making the most of Solr power May 14, 2009 Great facets, like your relevance, but can I have links to Amazon and Google Book Search? 
May 14, 2009 FreeCite - An Open Source Free-Text Citation Parser May 14, 2009 Freebasing for Fun and Enhancement May 14, 2009 Extending biblios, the open source web based metadata editor May 14, 2009 Complete faceting May 14, 2009 A New Platform for Open Data - Introducing ‡biblios.net Web Services May 13, 2009 Sebastian Hammer, Keynote Address May 13, 2009 Blacklight as a unified discovery platform May 13, 2009 A new frontier - the Open Library Environment (OLE) May 8, 2009 The Dashboard Initiative May 8, 2009 RESTafarian-ism at the NLA May 8, 2009 Open Up Your Repository With a SWORD! May 8, 2009 LuSql: (Quickly and easily) Getting your data from your DBMS into Lucene May 8, 2009 Like a can opener for your data silo: simple access through AtomPub and Jangle May 8, 2009 LibX 2.0 May 8, 2009 How I Failed To Present on Using DVCS for Managing Archival Metadata May 8, 2009 djatoka for djummies May 8, 2009 A Bookless Future for Libraries: A Comedy in 3 Acts May 1, 2009 Why libraries should embrace Linked Data Mar 31, 2009 Code4Lib Journal: new issue 6 now available Feb 28, 2009 See you next year in Asheville Feb 20, 2009 Code4Lib 2009 Lightning Talks Feb 19, 2009 code4lib2010 venue voting Feb 17, 2009 OCLC Grid Services Boot Camp (2009 Preconference) Feb 16, 2009 Code4Lib 2010 Hosting Proposals Jan 29, 2009 Code4Lib Logo Jan 29, 2009 Code4Lib Logo Debuts Jan 28, 2009 Code4Lib 2009 Breakout Sessions Jan 16, 2009 Call for Code4Lib 2010 Hosting Proposals Jan 11, 2009 2009 Code4lib Scholarship Recipients Jan 5, 2009 Code4lib 2009 T-shirt Design Contest Dec 17, 2008 code4lib2009 registration open! Dec 15, 2008 Code4Lib Journal Issue 5 Published Dec 5, 2008 Code4lib 2009 Gender Diversity and Minority Scholarships Dec 5, 2008 Calling all Code4Libers Attending Midwinter Dec 3, 2008 Logo Design Process Launched Dec 3, 2008 Code4Lib 2009 Schedule Dec 2, 2008 2009 Pre-Conferences Nov 25, 2008 Voting On Presentations for code4lib 2009 Open until December 3 Nov 18, 2008 drupal4lib unconference (02/27/2009 Darien, CT) Oct 24, 2008 Call for Proposals, Code4Lib 2009 Conference Oct 10, 2008 ne.code4lib.org Sep 30, 2008 code4lib2009 keynote voting Sep 23, 2008 Logo? You Decide Sep 17, 2008 solrpy google code project Sep 3, 2008 Code4Lib 2009 Sep 3, 2008 Code4Lib 2009 Sponsorship Aug 27, 2008 Code4LibNYC Aug 22, 2008 Update from LinkedIn Jul 15, 2008 LinkedIn Group Growing Fast Jul 3, 2008 code4lib group on LInkedIn Apr 17, 2008 ELPUB 2008 Open Scholarship: Authority, Community and Sustainability in the Age of Web 2.0 Mar 4, 2008 Code4libcon 2008 Lightning Talks Mar 3, 2008 Brown University to Host Code4Lib 2009 Feb 26, 2008 Desktop Presenter software Feb 25, 2008 Presentations from LibraryFind pre-conference Feb 21, 2008 Vote for Code4Lib 2009 Host! Feb 19, 2008 Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data? Feb 6, 2008 Code4libcon 2008 Breakout Sessions Feb 1, 2008 Call for Code4Lib 2009 Hosting Proposals Jan 30, 2008 Code4lib 2008 Conference T-Shirt Design Jan 7, 2008 Code4lib 2008 Registration now open! 
Dec 27, 2007 Zotero and You, or Bibliography on the Semantic Web Dec 27, 2007 XForms for Metadata creation Dec 27, 2007 Working with the WorldCat API Dec 27, 2007 Using a CSS Framework Dec 27, 2007 The Wayback Machine Dec 27, 2007 The Making of The Code4Lib Journal Dec 27, 2007 The Code4Lib Future Dec 27, 2007 Show Your Stuff, using Omeka Dec 27, 2007 Second Life Web Interoperability - Moodle and Merlot.org Dec 27, 2007 RDF and RDA: declaring and modeling library metadata Dec 27, 2007 ÖpënÜRL Dec 27, 2007 OSS Web-based cataloging tool Dec 27, 2007 MARCThing Dec 27, 2007 Losing sleep over REST? Dec 27, 2007 From Idea to Open Source Dec 27, 2007 Finding Relationships in MARC Data Dec 27, 2007 DLF ILS Discovery Interface Task Force API recommendation Dec 27, 2007 Delivering Library Services in the Web 2.0 environment: OSU Libraries Publishing System for and by Librarians Dec 27, 2007 CouchDB is sacrilege... mmm, delicious sacrilege Dec 27, 2007 Building the Open Library Dec 27, 2007 Building Mountains Out of Molehills Dec 27, 2007 A Metadata Registry Dec 17, 2007 Code4lib 2008 Gender Diversity and Minority Scholarships Dec 12, 2007 Conference Schedule Nov 20, 2007 Code4lib 2008 Keynote Survey Oct 31, 2007 Code4lib 2008 Call for Proposals Oct 16, 2007 Code4Lib 2008 Schedule Jul 18, 2007 code4lib 2008 conference Jul 6, 2007 Random #code4lib Quotes Jun 13, 2007 Request for Proposals: Innovative Uses of CrossRef Metadata May 16, 2007 Library Camp NYC, August 14, 2007 Apr 3, 2007 Code4Lib 2007 - Video, Audio and Podcast Available Mar 14, 2007 Code4Lib 2007 - Day 1 Video Available Mar 13, 2007 Erik Hatcher Keynote Mar 12, 2007 My Adventures in Getting Data into the ArchivistsToolkit Mar 9, 2007 Karen Schneider Keynote "Hurry up please it's time" Mar 9, 2007 Code4Lib Conference Feedback Available Mar 9, 2007 Code4Lib 2007 Video Trickling In Mar 1, 2007 Code4Lib.org Restored Feb 24, 2007 Code4Lib 2008 will be in Portland, OR Feb 13, 2007 Code4Lib Blog Anthology Feb 9, 2007 The Intellectual Property Disclosure Process: Releasing Open Source Software in Academia Feb 6, 2007 Polling for interest in a European code4lib Feb 5, 2007 Call for Proposals to Host Code4Lib 2008 Feb 5, 2007 2007 Code4lib Scholarship Recipients Feb 3, 2007 Delicious! Flare + SIMILE Exhibit Jan 30, 2007 Open Access Self-Archiving Mandate Jan 17, 2007 Evergreen Keynote Jan 17, 2007 Code4Lib 2007 T-Shirt Contest Jan 16, 2007 Stone Soup Jan 10, 2007 #code4lib logging Jan 2, 2007 Two scholarships to attend the 2007 code4lib conference Dec 20, 2006 2007 Conference Schedule Now Available Dec 19, 2006 code4lib 2007 pre-conference workshop: Lucene, Solr, and your data Dec 18, 2006 Traversing the Last Mile Dec 18, 2006 The XQuery Exposé: Practical Experiences from a Digital Library Dec 18, 2006 The BibApp Dec 18, 2006 Smart Subjects - Application Independent Subject Recommendations Dec 18, 2006 Open-Source Endeca in 250 Lines or Less Dec 18, 2006 On the Herding of Cats Dec 18, 2006 Obstacles to Agility Dec 18, 2006 MyResearch Portal: An XML based Catalog-Independent OPAC Dec 18, 2006 LibraryFind Dec 18, 2006 Library-in-a-Box Dec 18, 2006 Library Data APIs Abound! Dec 18, 2006 Get Groovy at Your Public Library Dec 18, 2006 Fun with ZeroConfMetaOpenSearch Dec 18, 2006 Free the Data: Creating a Web Services Interface to the Online Catalog Dec 18, 2006 Forget the Lipstick. This Pig Just Needs Social Skills. 
Dec 18, 2006 Atom Publishing Protocol Primer Nov 27, 2006 barton data Nov 21, 2006 MIT Catalog Data Oct 29, 2006 Code4Lib Downtime Oct 16, 2006 Call for Proposals Aug 24, 2006 Code4Lib2006 Audio Aug 15, 2006 book club Jul 4, 2006 Code4LibCon Site Proposals Jul 1, 2006 Improving Code4LibCon 200* Jun 28, 2006 Code4Lib Conference Hosting Jun 22, 2006 Learning to Scratch Our Own Itches Jun 15, 2006 2007 Code4Lib Conference Jun 15, 2006 2007 Code4Lib Conference Schedule Jun 15, 2006 2007 Code4Lib Conference Lightning Talks Jun 15, 2006 2007 Code4Lib Conference Breakouts Mar 31, 2006 Results of the journal name vote Mar 22, 2006 #dspace Mar 20, 2006 #code4lib logging Mar 14, 2006 regulars on the #code4lib irc channel Mar 14, 2006 Code4lib Journal Name Vote Mar 14, 2006 code4lib journal: mission, format, guidelines Mar 14, 2006 #code4lib irc channel faq Feb 27, 2006 CUFTS2 AIM/AOL/ICQ bot Feb 24, 2006 code4lib journal: draft purpose, format, and guidelines Feb 21, 2006 2006 code4lib Breakout Sessions Feb 17, 2006 unapi revision 1 Feb 15, 2006 code4lib 2006 presentations will be available Feb 14, 2006 planet update Feb 13, 2006 Weather in Corvallis for Code4lib Feb 13, 2006 Holiday Inn Express Feb 9, 2006 conference wiki Jan 31, 2006 Portland Hostel Jan 27, 2006 Lightning Talks Jan 23, 2006 Code4lib 2006 T-Shirt design vote! Jan 19, 2006 Portland Jazz Festival Jan 13, 2006 unAPI version 0 Jan 13, 2006 conference schedule in hCalendar Jan 12, 2006 code4lib 2006 T-shirt design contest Jan 11, 2006 Conference Schedule Set Jan 11, 2006 code4lib 2006 registration count pool Jan 10, 2006 WikiD Jan 10, 2006 The Case for Code4Lib 501c(3) Jan 10, 2006 Teaching the Library and Information Community How to Remix Information Jan 10, 2006 Practical Aspects of Implementing Open Source in Armenia Jan 10, 2006 Lipstick on a Pig: 7 Ways to Improve the Sex Life of Your OPAC Jan 10, 2006 Generating Recommendations in OPACS: Initial Results and Open Areas for Exploration Jan 10, 2006 ERP Options in an OSS World Jan 10, 2006 AHAH: When Good is Better than Best Jan 10, 2006 1,000 Lines of Code, and other topics from OCLC Research Jan 9, 2006 What Blog Applications Can Teach Us About Library Software Architecture Jan 9, 2006 Standards, Reusability, and the Mating Habits of Learning Content Jan 9, 2006 Quality Metrics Jan 9, 2006 Library Text Mining Jan 9, 2006 Connecting Everything with unAPI and OPA Jan 9, 2006 Chasing Babel Jan 9, 2006 Anatomy of aDORe Jan 6, 2006 Voting on Code4Lib 2006 Presentation Proposals Jan 3, 2006 one more week for proposals Dec 19, 2005 code4lib card Dec 15, 2005 planet facelift Dec 6, 2005 Registration is Open Dec 3, 2005 planet code4lib & blogs Dec 1, 2005 Code4lib 2006 Call For Proposals Nov 29, 2005 code4lib Conference 2006: Schedule Nov 21, 2005 panizzi Nov 21, 2005 drupal installed Nov 21, 2005 code4lib 2006 subscribe via RSS Code4Lib Code4Lib code4lib code4lib.social code4lib code4lib We are developers and technologists for libraries, museums, and archives who are dedicated to being a diverse and inclusive community, seeking to share ideas and build collaboration. codeforpakistan-org-1574 ---- Code for Pakistan - Civic Innovation in Pakistan Login | Register Username or Email Address Password Lost your password? 
Home About Us Programs Civic Innovation Labs Civic Hackathons Civic Hackathon 2020 SDG Hackathon 2019 Pakistan @100 Innovation Hackathon Previous Hackathons Fellowship Batch 1: KP Fellowship Batch 2: KP Fellowship Batch 3: KP Fellowship Batch 4: KP Fellowship Batch 5: KP Fellowship Batch 6: KP Fellowship Women And Tech Events Civic Apps Annual Reports Impact Report 2019 Blog Contact Careers Code for Pakistan Our goal is to bring together civic-minded software developers to use technology to innovate in public services, by creating open source solutions to address the needs of citizens. This is an opportunity for citizens and the private sector to give back to Pakistan by engendering civic innovation. Read More CfP Founder Sheba Najmi at #021Disrupt19 Play Video Sheba Najmi shared her thoughts on how companies design products and user experiences today, and why it is essential for us to develop a more human approach when it comes to both. Civic Innovation Labs CfP runs Civic Innovation Labs in major cities. Learn more about joining or starting a Lab. Civic Hackathons Civic Hackathons are events that spark civic engagement by bringing designers, developers, and community organizers together to prototype solutions to civic problems. Upcoming Events There is always something interesting going on at CfP. Join our events. CfP is part of a global movement. Watch this video message from Code for America. What We've Done So Far 0 Github Repositories 0 Civic Hackathons 0 Civic Innovation Labs 0 Fellows Graduated Why It Matters Civic innovation starts to reframe the relationship between local government and citizens, which is essential if the two are to live together smartly. Toward a progressive Pakistan! Collaborative Model Through the creation of open source technology to address civic needs, we aim to transform civic life by increasing civic engagement, encouraging the opening of government data, and supporting innovation in the public domain. Our Labs meet regularly to collaborate with local stakeholders (including Government, partner Non-profit Organizations, and Media Organizations) on projects that focus on how to use 21st century web and data tools to improve civic interfaces. Learn more about our Programs Latest From Our Blog April 9, 2021 Applications Open: KP Government Innovation Fellowship Program 2021 (7th Cycle) Read more January 12, 2021 Job Opening: Country Director Read more January 11, 2021 Civic Hackathon 2020 Concludes Read more If you are interested in learning more about Code for Pakistan, Contact Us! We would love to hear from you. Join our Discord community . License CC BY-SA 4.0 coding-confessions-github-io-9008 ---- Coding Confessions | Normalising failure in research software. CodingConfessions About Read Confessions MAKE A CONFESSION Normalising failure. Normalising failure in research software creates an inclusive space for sharing experiences, and generates opportunity to learn. What is Coding Confessions? Simply put: "Where somebody admits to mistakes or bad practice in code they've developed." What's the problem? Everybody who develops software has at some point written some software badly, quickly, cut corners or simply made a mistake that made it function incorrectly. Due to imposter syndrome many people feel like this makes them less worthy developers. Often the root cause is time pressure to make something that "just works" (or at least appeared to). 
These little short cuts often end up becoming core pieces of software upon which research conclusions and publications are based. People don't like to admit to making mistakes, cutting corners or not following best practice, sometimes hiding these problems away. Why do this? We want to: Change the culture of research so that mistakes can be disclosed without fear. Document mistakes and allow the entire community to benefit from the lessons learned. These will be published on our blog. How to submit a confession Please only submit a confession about something you did yourself, don't submit confessions about the work of others. Send us one paragraph about each of the following: The background to the problem, what were you trying to do? The mistake you made. What steps can be taken to avoid this mistake in the future. You can do this publicly (with atribution) or privately (anonymously). We will then publish them on our blog. See this example blog post. See the submit a confession page for more information. Submit a confession to us How to run a Confessions Workshop at your own event Read confessions in our blog. Confessions Below are the latest confessions from our blog. Confession 1 Dave 1 April 2021 The typo that nearly broke my first paper Eirini 9 February 2021 Confession 2 Dave 9 February 2021 Coding Confessions. Normalising failure in research software. Software Sustainability Institute This project and website was created as part of the Hack Day in the Collaborations Workshop 2021. The Software Sustainability Institute cultivates better, more sustainable, research software to enable world-class research. They help people build better software, and we work with researchers, developers, funders and infrastructure providers to identify key issues and best practice in scientific software. Privacy Thanks Github Pages. Menu commonplace-net-5233 ---- commonplace.net – Data. The final frontier. Skip to content commonplace.net Data. The final frontier. Publications A Common Place All Posts About Contact Infrastructure for heritage institutions – ARK PID’s November 3, 2020November 11, 2020 Lukas KosterData, Infrastructure, Library In the Digital Infrastructure program at the Library of the University of Amsterdam we have reached a first milestone. In my previous post in the Infrastructure for heritage institutions series, “Change of course“, I mentioned the coming implementation of ARK persistent identifiers for our collection objects. Since November 3, 2020, ARK PID’s are available for our university library Alma catalogue through the Primo user interface. Implementation of ARK PID’s for the other collection description systems […] Read more Infrastructure for heritage institutions – change of course June 23, 2020 Lukas KosterData, Infrastructure, Library In July 2019 I published the first post about our planning to realise a “coherent and future proof digital infrastructure” for the Library of the University of Amsterdam. In February I reported on the first results. As frequently happens, since then the conditions have changed, and naturally we had to adapt the direction we are following to achieve our goals. In other words: a change of course, of course.  
Projects  I will leave aside the […] Read more Infrastructure for heritage institutions – first results February 24, 2020February 25, 2020 Lukas KosterData, Infrastructure, Library In July 2019 I published the post Infrastructure for heritage institutions in which I described our planning to realise a “coherent and future proof digital infrastructure” for the Library of the University of Amsterdam. Time to look back: how far have we come? And time to look forward: what’s in store for the near future? Ongoing activities I mentioned three “currently ongoing activities”:  Monitoring and advising on infrastructural aspects of new projects Maintaining a structured dynamic overview […] Read more Infrastructure for heritage institutions July 11, 2019January 11, 2020 Lukas KosterData, Infrastructure, Library During my vacation I saw this tweet by LIBER about topics to address, as suggested by the participants of the LIBER 2019 conference in Dublin: It shows a word cloud (yes, a word cloud) containing a large number of terms. I list the ones I can read without zooming in (so the most suggested ones, I guess), more or less grouped thematically: Open scienceOpen dataOpen accessLicensingCopyrightsLinked open dataOpen educationCitizen science Scholarly communicationDigital humanities/DHDigital scholarshipResearch assessmentResearch […] Read more Ten years linked open data June 4, 2016February 13, 2020 Lukas KosterData, Library This post is the English translation of my original article in Dutch, published in META (2016-3), the Flemish journal for information professionals. Ten years after the term “linked data” was introduced by Tim Berners-Lee it appears to be time to take stock of the impact of linked data for libraries and other heritage institutions in the past and in the future. I will do this from a personal historical perspective, as a library technology professional, […] Read more Maps, dictionaries and guidebooks August 3, 2015February 3, 2020 Lukas KosterData Interoperability in heterogeneous library data landscapes Libraries have to deal with a highly opaque landscape of heterogeneous data sources, data types, data formats, data flows, data transformations and data redundancies, which I have earlier characterized as a “data maze”. The level and magnitude of this opacity and heterogeneity varies with the amount of content types and the number of services that the library is responsible for. Academic and national libraries are possibly dealing with more […] Read more Standard deviations in data modeling, mapping and manipulation June 16, 2015February 3, 2020 Lukas KosterData Or: Anything goes. What are we thinking? An impression of ELAG 2015 This year’s ELAG conference in Stockholm was one of many questions. Not only the usual questions following each presentation (always elicited in the form of yet another question: “Any questions?”). But also philosophical ones (Why? What?). And practical ones (What time? Where? How? How much?). And there were some answers too, fortunately. This is my rather personal impression of the event. 
For a […] Read more Analysing library data flows for efficient innovation November 27, 2014February 14, 2020 Lukas KosterLibrary In my work at the Library of the University of Amsterdam I am currently taking a step forward by actually taking a step back from a number of forefront activities in discovery, linked open data and integrated research information towards a more hidden, but also more fundamental enterprise in the area of data infrastructure and information architecture. All for a good cause, for in the end a good data infrastructure is essential for delivering high […] Read more Looking for data tricks in Libraryland September 5, 2014January 12, 2020 Lukas KosterLibrary IFLA 2014 Annual World Library and Information Congress Lyon – Libraries, Citizens, Societies: Confluence for Knowledge After attending the IFLA 2014 Library Linked Data Satellite Meeting in Paris I travelled to Lyon for the first three days (August 17-19) of the IFLA 2014 Annual World Library and Information Congress. This year’s theme “Libraries, Citizens, Societies: Confluence for Knowledge” was named after the confluence or convergence of the rivers Rhône and Saône where the city of […] Read more Library Linked Data Happening August 26, 2014January 12, 2020 Lukas KosterLibrary On August 14 the IFLA 2014 Satellite Meeting ‘Linked Data in Libraries: Let’s make it happen!’ took place at the National Library of France in Paris. Rurik Greenall (who also wrote a very readable conference report) and I had the opportunity to present our paper ‘An unbroken chain: approaches to implementing Linked Open Data in libraries; comparing local, open-source, collaborative and commercial systems’. In this paper we do not go into reasons for libraries to […] Read more Posts navigation Older posts Profiles and social @lukask on Twitter @lukask on Mastodon My ORCID My Impactstory My Zotero My UvA profile Recent Posts Infrastructure for heritage institutions – ARK PID’s Infrastructure for heritage institutions – change of course Infrastructure for heritage institutions – first results Infrastructure for heritage institutions Ten years linked open data Maps, dictionaries and guidebooks Most Popular Posts Is an e-book a book? (8,462 views) Who needs MARC? (5,839 views) Linked Data for Libraries (5,058 views) Mobile app or mobile web? (4,561 views) User experience in public and academic libraries (4,279 views) Mainframe to mobile (3,484 views) (Discover AND deliver) OR else (3,260 views) Recent Comments Maarten Brinkerink on Infrastructure for heritage institutions Gittaca on Infrastructure for heritage institutions Libraries & the Future of Scholarly Communication at #BTPDF2 – UC3 Portal on Beyond The Library Tatiana Bryant (@BibliotecariaT) on Analysing library data flows for efficient innovation @BibliotecariaT on Analysing library data flows for efficient innovation @LizWoolcott on Analysing library data flows for efficient innovation Tags apps authority files catalog collection conferences cultural heritage data data management developer platforms discovery tools elag exlibris foaf frbr hardware identifiers igelu infrastructure innovation integration interoperability libraries library Library2.0 library systems linked data linked open data marc meetings metadata mobile next generation open data open source open stack open systems people persistent identifiers rda rdf semantic web social networking technology uri web2.0 This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. 
community-esri-com-4132 ---- Participatory Mapping with Google Forms, Google Sheets, and ArcGIS Online - Esri Community. Education Blog post by JosephKerski, Esri Frequent Contributor, 07-21-2017 06:03 AM. Labels: Higher Education, Schools (K - 12). I have been receiving questions from schools that have become "Google Schools" as well as universities and individual researchers who want to use Google Sheets in ArcGIS Online. What are the advantages of using Google Sheets (spreadsheets, really, is what they are) over using an Excel spreadsheet on your own computer? Google Sheets live in the cloud, just like ArcGIS Online, so they can be edited from any device, anywhere, and the author of the Sheet can invite others to add data to it, so they can accept input from multiple collaborators, students, and faculty. Some educators want to map data that they have input into Google Sheets. Others want to go to the next level, where multiple students or researchers edit Google Sheets in a participatory mapping or citizen science environment, and the resulting data is mapped and automatically refreshes as the data continues to be added. Both of these scenarios are possible with ArcGIS Online.
To illustrate, I created a form where students are asked, "What country have you visited?", shown below. After students fill out the form, I go to the "responses" zone in Google Forms, and access the spreadsheet that is created from the data.  Now that my data is in my Google Sheet, I access > File > Publish to the Web > and change "Web Page" to "Comma Separated Values (.csv)" file > Publish.   Then, I copy the resulting URL: Then, I access my ArcGIS Online account, open a new or existing map > Add > Add Layer from Web - CSV file > paste your URL for my Google Sheet here.   Next, I > Add Layer > I indicate which fields contain my location information (address, latitude-longitude, city/state/country combination).   That's really all there is to it!  My results are in this map linked here, and shown below: Note that I used one of the fun new basemaps in ArcGIS Online that I wrote about here. In another example, this time using cities instead of countries, see this map of the 10 most polluted and 10 least polluted large cities of the world.  Students examine spatial patterns and reasons for the pollution (or lack of it) in each city using the map and the metadata here.  I created this map by populating this Google Sheet, below.  My students could add 10 or 20 more to this sheet and their changes would be reflected in my ArcGIS Online map. Here is the map from the data, below.  For those explanatory labels, I used this custom label expression:   $feature.City + " is the #" + " " + $feature.Rank + " " + $feature.Variable and set the text color to match the point symbol color for clarity.  For more about expressions, see my blog post here. In another example, my colleague created this google sheet of some schools in India by latitude-longitude. Then she added the published content from Google to her map.  Let's explore a bit deeper.  Let's say that I wanted to visualize the most commonly visited countries among my students.  I can certainly examine the statistics from my Google form, as seen below: However, my goal is really to see this data on a map.  With the analysis tools in ArcGIS Online, this too is quickly done. The Aggregate Points tool will summarize points in polygons.  For my polygons, I added a generalized world countries map layer, and then used Aggregate Points to summarize my point data within those countries.  The result is shown below and is visible as a layer in the map I referenced above.  Another point worth noting is that you can adjust the settings of how your map interacts with your Google Sheet.  Go to the layer's metadata page, and under “Published content & settings”, select "Automatically republish when changes are made." You can set the refresh interval to, for example, 1 minute, but the actual refresh on your map may take somewhat longer because Google’s “Auto re-publish” isn’t quite "real-time".  Then do the following for the layer: Note that if you are geocoding by address (such as city/country, as I did above, or street address), the automatic refresh option is not available: To get around this challenge, I manually added the latitude-longitude values to my cities spreadsheet.  Thanks to the Measure tool in ArcGIS Online, this took less than 1 minute per city.  I simply typed in the city name in ArcGIS Online, and used the Location button under the Measure tools, clicked on the map where the city was located, and entered the resulting coordinates into my spreadsheet. For more information, see this blog essay.   
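One small, optional check that can save some head-scratching in this workflow: the "Publish to the web" link serves plain CSV, so you can read it with any scripting language before pointing ArcGIS Online at it, to confirm the header row and location fields look right (especially if you are adding latitude-longitude columns by hand for the auto-refresh case). Here is a minimal sketch in Python, assuming pandas is available; the URL is a placeholder for the link Google gives you, and the column names are only examples, not part of the original post.

import pandas as pd

# Placeholder: the URL from Google Sheets > File > Publish to the web > CSV
PUBLISHED_CSV_URL = "https://docs.google.com/spreadsheets/d/e/XXXX/pub?output=csv"

# The published link returns plain CSV, so it reads like any other CSV file.
df = pd.read_csv(PUBLISHED_CSV_URL)

print(df.head())              # eyeball the first few form responses
print(df.columns.tolist())    # confirm the header row came through

# Example: if you geocode by coordinates (needed for automatic refresh),
# check that both columns exist and hold plausible values.
for col in ("Latitude", "Longitude"):   # example column names
    if col in df.columns:
        print(col, "ranges from", df[col].min(), "to", df[col].max())
    else:
        print("Missing expected column:", col)

The same published URL is what goes into Add > Add Layer from Web > CSV file in the map viewer.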
Labels: Higher Education, Schools (K - 12). Tags: citizen science, crowdsourcing, google forms, google sheets, participatory mapping. 3 Kudos. 4 Comments. by JosephKerski, Esri Frequent Contributor, 07-26-2017 05:52 PM: Important update! Because of my experience with not being able to flip the ramp in the top 10 polluted cities map, our awesome development team added the Invert button in smart mapping. Now you don't need to write an equation and have a legend from 0 to 1. See below. Very useful indeed! --Joseph Kerski (1 Kudo) by HaleyNelson, New Contributor II, 06-12-2018 01:48 PM: This is great! Will this process work in reverse? For example, will (or can) the google sheets be automatically updated if new points are added to the map, or attributes are updated in the web map? Is this a possible workflow? For example, can I connect a feature layer to a google sheet, collect data on that feature layer in Survey123, and have this data populate in a connected Google Sheet based on the web map refresh interval? (1 Kudo) by deleted-user-0eS87ljx3Rcy, New Contributor II, 01-30-2019 09:49 AM: Does anybody know how to secure the published google sheets data? We want to bring the google sheet data to AGOL but google clearly states that data is not secured. (0 Kudos) by FlorentBigirimana, New Contributor III, 05-27-2020 07:48 AM: I have created a google sheet with some records and managed to have the data from it on my web map as a web layer. One thing I was expecting is that when values are updated from the google sheet, the value is also automatically updated on my layer in the web map. However this is not happening. What am I missing? (0 Kudos) About the Author I believe that spatial thinking can transform education and society through the application of Geographic Information Systems for instruction, research, administration, and policy. I hold 3 degrees in Geography, have served at NOAA, the US Census Bureau, and USGS as a cartographer and geographer, and teach a variety of F2F (Face to Face) (including T3G) and online courses. I have authored a variety of books and textbooks about the environment, STEM, GIS, and education. These include "Interpreting Our World", "Essentials of the Environment", "Tribal GIS", "The GIS Guide to Public Domain Data", "International Perspectives on Teaching and Learning with GIS In Secondary Education", "Spatial Mathematics" and others. I write for 2 blogs, 2 monthly podcasts, and a variety of journals, and have created over 5,000 videos on the Our Earth YouTube channel. Yet, as time passes, the more I realize my own limitations and that this is a lifelong learning endeavor and thus I actively seek mentors and collaborators.
Labels Curriculum-Learning Resources 4 Education Facilities 28 GeoInquiries 1 Higher Education 324 Informal Education 191 Licensing Best Practices 1 Pedagogy and Education Theory 107 Schools (K - 12) 294 Schools (K-12) 4 STEM 1 Students - Higher Education 148 Students - K-12 Schools 1 Success Stories 1 TeacherDesk 1 Tech Tips 3 Terms of Use Community Guidelines Community Basics Privacy Trust Center Legal Contact Esri costhonduras-hn-6426 ---- Inicio - CostHonduras Menú Empleo y Consultorías Sitios de Interés Preguntas Frecuentes Mapa del Sitio Close top bar 112 Your Adress 23 Washington DC 1234 Call us anytime 415 555 1234 Send us a mail mail@domain.com Inicio Acerca de CoST La Iniciativa Historia Estatutos Financiamiento Plan Estratégico Grupo Multisectorial CoST International Historias de Éxito Procesos CoST Divulgación SISOCS Monitoreo de Proyectos Divulgados Aseguramiento Aseguramientos Realizados Auditoría Social EASI Diplomado para Periodistas Diplomado Virtual Inscripciones Recursos Sala de Prensa Noticias Boletines Contáctenos Search Toggle navigation Infraestructura mejor valorada NOTICIAS RECIENTES Proyectos de infraestructura divulgan información bajo nuevo estándar de datos 25 enero, 2021 Las instituciones y entidades ejecutoras que publican información en los portales Sisocs.org y Siscos APP, ahora deberán divulgar información de sus proyectos bajo los lineamientos de las Contrataciones Abiertas para el Estándar de Datos sobre Infraestructura (OC4IDS). La Iniciativa de… Presentan Guía de Respuesta Rápida para No miembros de CoST, para fomentar transparencia en infraestructura en tiempo de crisis 11 enero, 2021 La Iniciativa de Transparencia en Infraestructura (CoST Internacional) presentó, en los primeros días de 2021, una Guía de Respuesta Rápida para impulsar la transparencia y la rendición de cuentas en los proyectos de infraestructira pública ejecutados en tiempos de crisis,… En el Día Internacional Contra la Corrupción, CoST presenta Manual del Índice de Transparencia en Infraestructura 16 diciembre, 2020 Honduras fue, en 2017, el país piloto donde se implementó esta herramienta El ITI no sólo considera el acceso a la información, también la calidad de la misma Para 2030 se estima se podrían ahorrar cerca de 6,000 millones de… Ver más noticias 1237 Proyectos divulgados red vial y APP 500 Personas capacitadas 5 Porcentaje divulgación información proyectos 1 Proyectos en Estudios de Aseguramiento NUESTRAS REDES SOCIALES Tweets by CostHonduras GRUPO MULTISECTORIAL Procesos CoST Divulgación Aseguramiento Auditoría Social CosT Honduras © 2019 | Todos los Derechos Reservados cyber-fsi-stanford-edu-1164 ---- FSI | Cyber | Internet Observatory - David Thiel Skip to: Skip to content Skip to navigation A program of the Cyber Policy Center, part of the Freeman Spogli Institute for International Studies. Search form Home Opportunities Projects End-to-End Encryption Takedowns Trust and Safety Virality Project attribution.news (external link) Election Integrity Project (external link) About Search form Home Opportunities Projects End-to-End Encryption Takedowns Trust and Safety Virality Project attribution.news (external link) Election Integrity Project (external link) About David Thiel David Thiel Chief Technical Officer, Stanford Internet Observatory Big Data Architect Download image Bio David is the Big Data Architect and Chief Technology Officer of the Stanford Internet Observatory. 
Prior to Stanford, David worked at Facebook, primarily focusing on security and safety for Facebook Connectivity, a collection of projects aimed at providing faster and less expensive internet connectivity to unconnected or underconnected communities. Projects included the Terragraph mesh networking system, the Magma open source mobile network platform, Express Wi-Fi and Facebook Lite. Before Facebook, David was a VP at iSEC Partners and later NCC Group, managing the North American security consulting and research team, as well as producing original security research, coordinating vulnerability disclosure and performing security assessments and penetration testing for companies across a wide range of business sectors. David has spoken at various industry conferences, including Black Hat, DEFCON, PacSec and SOURCE Boston. He is also the author of iOS Application Security (No Starch Press) and coauthor of Mobile Application Security (McGraw-Hill). Publications Combine fields filter All White Paper Contours and Controversies of Parler Stoking Conflicts by Keystroke: An Operation Run by IRA-Linked Individuals Targeting Libya, Sudan, and Syria #ZakzakyLifeMatters: An Investigation into a Facebook Operation Linked to the Islamic Movement in Nigeria An Investigation into a Female-Focused Online Campaign in Iran and Afghanistan targeting Afghans More Publications Topics Security Our Address Encina Hall 616 Jane Stanford Way Stanford University Stanford, CA 94305-6055 Navigate Research Education People Centers News Events About Follow Us General inquiries 650-723-4581 Mail Twitter Facebook Youtube Instagram   Support Us Learn more about how your support makes a difference or make a gift now Make a gift   Top Stanford Home Maps & Directions Search Stanford Emergency Info Terms of Use Privacy Copyright Trademarks Non-Discrimination Accessibility © Stanford University, Stanford, California 94305. Copyright Complaints cynthiang-ca-6115 ---- Learning (Lib)Tech – Stories from my Life as a Technologist Skip to content Learning (Lib)Tech Stories from my Life as a Technologist Menu About Me About this Blog Contact Me Twitter GitHub LinkedIn Flickr RSS UBC iSchool Career Talk Series: Journey from LibTech to Tech The UBC iSchool reached out to me recently asking me to talk about my path from getting my library degree to ending up working in a tech company. Below is the script for my portion of the talk, along with a transcription of the questions I answered. Continue reading “UBC iSchool Career Talk Series: Journey from LibTech to Tech” Author CynthiaPosted on March 5, 2021March 5, 2021Categories Events, LibrarianshipTags career growth, reflectionLeave a comment on UBC iSchool Career Talk Series: Journey from LibTech to Tech Choosing not to go into management (again) Often, to move up and get a higher pay, you have to become a manager, but not everyone is suited to become a manager, and sometimes given the preference, it’s not what someone wants to do. Thankfully at GitLab, in every engineering team including Support, we have two tracks: technical (individual contributor), and management. 
Continue reading “Choosing not to go into management (again)” Author CynthiaPosted on February 2, 2021March 5, 2021Categories Work cultureTags career growth, management, reflectionLeave a comment on Choosing not to go into management (again) Prioritization in Support: Tickets, Slack, issues, and more I mentioned in my GitLab reflection that prioritization has been quite different working in Support compared to other previous work I’ve done. In most of my previous work, I’ve had to take “desk shifts” but those are discreet where you’re focused on providing customer service during that period of time and you can focus on other things the rest of the time. In Support, we have to constantly balance all the different work that we have, especially in helping to ensure that tickets are responded to within the Service Level Agreement (SLA). It doesn’t always happen, but I ultimately try to reach inbox 0 (with read-only items possibly left), and GitLab to-do 0 by the end of the every week. People often ask me how I manage to do that, so hopefully this provides a bit of insight. Continue reading “Prioritization in Support: Tickets, Slack, issues, and more” Author CynthiaPosted on December 11, 2020December 24, 2020Categories MethodologyTags productivityLeave a comment on Prioritization in Support: Tickets, Slack, issues, and more Reflection Part 2: My second year at GitLab and on becoming Senior again This reflection is a direct continuation of part 1 of my time at GitLab so far. If you haven’t, please read the first part before beginning this one. Continue reading “Reflection Part 2: My second year at GitLab and on becoming Senior again” Author CynthiaPosted on June 17, 2020January 31, 2021Categories Update, Work cultureTags GitLab, organizational culture, reflectionLeave a comment on Reflection Part 2: My second year at GitLab and on becoming Senior again Reflection Part 1: My first year at GitLab and becoming Senior About a year ago, I wrote a reflection on Summit and Contribute, our all staff events, and later that year, wrote a series of posts on the GitLab values and culture from my own perspective. There is a lot that I mention in the blog post series and I’ll try not to repeat myself (too much), but I realize I never wrote a general reflection at year 1, so I’ve decided to write about both years now but split into 2 parts. Continue reading “Reflection Part 1: My first year at GitLab and becoming Senior” Author CynthiaPosted on June 16, 2020January 31, 2021Categories Update, Work cultureTags GitLab, organizational culture, reflectionLeave a comment on Reflection Part 1: My first year at GitLab and becoming Senior Is blog reading dead? There was a bit more context to the question, but a friend recently asked me: What you do think? Is Blogging dead? Continue reading “Is blog reading dead?” Author CynthiaPosted on May 8, 2020May 7, 2020Categories UpdateTags reflectionLeave a comment on Is blog reading dead? Working remotely at home as a remote worker during a pandemic I’m glad that I still have a job, that my life isn’t wholly impacted by the pandemic we’re in, but to say that nothing is different just because I was already a remote worker would be wrong. The effect the pandemic is having on everyone around you has affects your life. It seems obvious to me, but apparently that fact is lost on a lot of people. I’d expect that’s not the case for those who read my blog, but I thought it’d be worth reflecting on anyway. 
Continue reading “Working remotely at home as a remote worker during a pandemic” Author CynthiaPosted on May 4, 2020May 2, 2020Categories Work cultureTags remoteLeave a comment on Working remotely at home as a remote worker during a pandemic Code4libBC Lightning Talk Notes: Day 2 Code4libBC Day 2 lightning talk notes! Continue reading “Code4libBC Lightning Talk Notes: Day 2” Author CynthiaPosted on November 29, 2019Categories EventsTags authentication, big data, c4lbc, code, code4lib, digital collections, privacy, reference, teachingLeave a comment on Code4libBC Lightning Talk Notes: Day 2 Code4libBC Lightning Talk Notes: Day 1 Code4libBC Day 1 lightning talk notes! Continue reading “Code4libBC Lightning Talk Notes: Day 1” Author CynthiaPosted on November 28, 2019Categories EventsTags c4lbc, digital collections, intranet, MARC, metadata, teachingLeave a comment on Code4libBC Lightning Talk Notes: Day 1 Presentation: Implementing Values in Practical Ways This was presented at Code4libBC 2019. Continue reading “Presentation: Implementing Values in Practical Ways” Author CynthiaPosted on November 28, 2019November 28, 2019Categories Events, Work cultureTags c4lbc, organizational culture, presentation, valuesLeave a comment on Presentation: Implementing Values in Practical Ways Posts navigation Page 1 Page 2 … Page 47 Next page Cynthia Technologist, Librarian, Metadata and Technical Services expert, Educator, Mentor, Web Developer, UXer, Accessibility Advocate, Documentarian View Full Profile → Follow Us Twitter LinkedIn GitHub Telegram Search for: Search Categories Events Librarianship Library Academic Public Special Tours Methodology Project work Technology Tools Update Web design Work culture Follow via Email Enter your email address to receive notifications of new posts by email. Email Address: Follow About Me About this Blog Contact Me Twitter GitHub LinkedIn Flickr RSS Learning (Lib)Tech You must be logged in to post a comment. Loading Comments... Comment × cynthiang-ca-7810 ---- Learning (Lib)Tech Learning (Lib)Tech Stories from my Life as a Technologist UBC iSchool Career Talk Series: Journey from LibTech to Tech The UBC iSchool reached out to me recently asking me to talk about my path from getting my library degree to ending up working in a tech company. Below is the script for my portion of the talk, along with a transcription of the questions I answered. Context To provide a bit of context (and … Continue reading "UBC iSchool Career Talk Series: Journey from LibTech to Tech" Choosing not to go into management (again) Often, to move up and get a higher pay, you have to become a manager, but not everyone is suited to become a manager, and sometimes given the preference, it’s not what someone wants to do. Thankfully at GitLab, in every engineering team including Support, we have two tracks: technical (individual contributor), and management. Progression … Continue reading "Choosing not to go into management (again)" Prioritization in Support: Tickets, Slack, issues, and more I mentioned in my GitLab reflection that prioritization has been quite different working in Support compared to other previous work I’ve done. 
In most of my previous work, I’ve had to take “desk shifts” but those are discreet where you’re focused on providing customer service during that period of time and you can focus on … Continue reading "Prioritization in Support: Tickets, Slack, issues, and more" Reflection Part 2: My second year at GitLab and on becoming Senior again This reflection is a direct continuation of part 1 of my time at GitLab so far. If you haven’t, please read the first part before beginning this one. Becoming an Engineer (18 months) The more time I spent working in Support, the more I realized that the job was much more technical than I originally … Continue reading "Reflection Part 2: My second year at GitLab and on becoming Senior again" Reflection Part 1: My first year at GitLab and becoming Senior About a year ago, I wrote a reflection on Summit and Contribute, our all staff events, and later that year, wrote a series of posts on the GitLab values and culture from my own perspective. There is a lot that I mention in the blog post series and I’ll try not to repeat myself (too … Continue reading "Reflection Part 1: My first year at GitLab and becoming Senior" Is blog reading dead? There was a bit more context to the question, but a friend recently asked me: What you do think? Is Blogging dead? I think blogging the way it used to work is (mostly) dead. Back in the day, we had a bunch of blogs and people who subscribe to them via email and RSS feeds. … Continue reading "Is blog reading dead?" Working remotely at home as a remote worker during a pandemic I’m glad that I still have a job, that my life isn’t wholly impacted by the pandemic we’re in, but to say that nothing is different just because I was already a remote worker would be wrong. The effect the pandemic is having on everyone around you has affects your life. It seems obvious to … Continue reading "Working remotely at home as a remote worker during a pandemic" Code4libBC Lightning Talk Notes: Day 2 Code4libBC Day 2 lightning talk notes! Code club for adults/seniors – Dethe Elza Richmond Public Library, Digital Services Technician started code clubs, about 2 years ago used to call code and coffee, chain event, got little attendance had code codes for kids, teens, so started one for adults and seniors for people who have done … Continue reading "Code4libBC Lightning Talk Notes: Day 2" Code4libBC Lightning Talk Notes: Day 1 Code4libBC Day 1 lightning talk notes! Scraping index pages and VuFind implementation – Louise Brittain Boisvert Systems Librarian at Legislative collection development policy: support legislators and staff, receive or collect publications, many of them digital but also some digitized (mostly PDF, but others) accessible via link in MARC record previously, would create an index page … Continue reading "Code4libBC Lightning Talk Notes: Day 1" Presentation: Implementing Values in Practical Ways This was presented at Code4libBC 2019. Slides Slides on GitHub Hi everyone, hope you’re enjoying Code4libBC so far. While I’m up here, I just want to take a quick moment to thank the organizers past and present. We’re on our 7th one and still going strong. 
I hope to continue attending and see this event … Continue reading "Presentation: Implementing Values in Practical Ways" dancohen-org-6268 ---- Dan Cohen – Vice Provost, Dean, and Professor at Northeastern University Skip to the content Search Dan Cohen Vice Provost, Dean, and Professor at Northeastern University Menu About Blog Newsletter Podcast Publications Social Media CV RSS Search Search for: Close search Close Menu About Blog Newsletter Podcast Publications Social Media CV RSS What’s New Podcast Humane Ingenuity Newsletter Blog Publications © 2021 Dan Cohen Powered by WordPress To the top ↑ Up ↑ datafest-ge-59 ---- DataFest 2020 Speakers Agenda Partners About Passes Past Editions X Speakers Agenda Partners About Passes Past Editions 15-17 December Watch the recordings! Follow us: #DataFestTbilisi Online celebration for data lovers Online celebration for data lovers DataFest Tbilisi 2020 is the 4th edition of an annual international data conference happening in the vibrant capital of Georgia. This time, it will take place online and, traditionally, will bring together hundreds of data professionals from all around the world, to inspire and encourage, and to create meaningful connections.    Journalism Human Rights & Democracy Design Analytics Technology Business Speakers All Speakers Gert Franke Co-founder / Managing Director @ CLEVER°FRANKE | The Netherlands Nasser Oudjidane Co-Founder & CEO @ Intrro | UK Devendra Vyavahare Senior data engineer @ Delivery Hero | Germany Rocío Joo Statistician, Researcher, Data scientist @ University of Florida | USA Gev Sogomonian Co-founder @ AimHub | Armenia Tetyana Bohdanova Fellow @ Prague Civil Society Centre | Ukraine Anahit Karapetyan Compliance Investigator / AML Trainer @ Revolut | Poland Wael Eskandar Analyst @ Tactical Tech | Germany Erekle Magradze Director of Engineering @ MaxinAI, Associate Professor @ Ilia State University | Georgia Lasha Pertakhia Machine Learning Engineer @ MaxinAI | Georgia Dr. Divya Seernani Psychologist, Researcher, Co-organizer @ R-Ladies Freiburg | Germany Luca Borella Business Development @ TESOBE | Germany Yulia Kim Business Intelligence Manager @ GoCardless | UK Stefanie Posavec Designer, Artist, Author | UK Henrietta Ross Course leader on MA Data Visualisation @ London College of Communication | UK Varlam Ebanoidze Co-founder @ RiskTech 4 FinTech | UK Gianluigi Davassi CEO @ faire.ai | Germany Miriam Quick Data Journalist, Researcher, Author | UK Rodrigo Menegat Data Journalist | Brazil Mara Pometti Data Strategist @ IBM | Italy / UK Charles Frye Deep Learning Educator @ Weights & Biases | USA Viktor Nestulia Senior Manager @ Open Contracting Partnership | Ukraine Ana Brandusescu McConnell Foundation Professor of Practice | Canada All Speakers Duncan Geere Generative Artist & Information Designer | Sweden Lauren Klein Associate Professor @ Emory University | USA Denise Ajiri Adjunct Assistant Professor @ Columbia University | USA Pedro Ecija Serrano Head of Actuarial and Analytics @ Grant Thornton Ireland | Ireland Uli Köppen Head of AI + Automation Lab @ German Public Broadcaster | Germany Irakli Gogatishvili Head of Data Research Lab @ Bank of Georgia | Georgia Natalia Voutova Head @ Council of Europe Office in Georgia | Georgia Omar Ferwati Researcher @ Forensic Architecture | Canada Caroline Lair Founder @ The Good AI | France Shabnam Mojtahedi Sr. 
Program Manager @ Benetech | USA Bilal Mateen Clinical Technology Lead @ Wellcome Trust | UK David Mark Human Rights Adviser @ ODIHR | Poland Michela Graziani Co-founder & Product designer @ Symbolikon | Italy Sandra Rendgen Author, Visualization Strategist | Germany Adina Renner Visual Data Journalist @ Neue Zürcher Zeitung | Switzerland Evelina Judeikyte Data Analyst @ iziwork | France Evelyn Münster Data Visualization Designer | Germany Barnaby Skinner Head of Visuals @ Neue Zürcher Zeitung | Switzerland Kathy Rowell Co-Founder & Principal @ HealthDataViz | USA Jane Zhang Data Visualization Designer | Canada Frederico Pires Senior Customer Growth Manager @ UTRUST | Portugal Carlotta Dotto Senior Data Journalist @ First Draft | UK Ashish Singh Co-founder @ ScatterPie Analytics | India Don't miss the event Get your pass Organizers & Partners Questions? Contact us: hello@datafest.ge Follow us: #DataFestTbilisi Subscribe here for news: Send Made by ForSet, Wandio with datamish-com-5819 ---- Bitcoin shorts vs Longs - Click for BTC margin charts - Datamish About Bitcoin Bitcoin is the revolutionary P2P digital cash envisioned by Satoshi Nakamoto. Many attempts have been made to dethrone Bitcoin, but real connoisseurs accept no imitations. Bitcoin is referred to as digital gold with good reason. It is borderless, decentralized, censorship resistant, and open source. Trade Bitcoin The Bitcoin market is not the most volatile crypto market, but is by far the most liquid and most traded. Futures: Spot: Bitcoin development The network has been running for 10 years, but development is in no way stagnant. Bitcoin developers are some of the best in the space, and they are constantly looking for safe ways to improve and upgrade the system. Recent highlights: Segwit Lightning Network Smart contracts (via RSK) Read the Bitcoin white paper or see this page if you want to learn more about Bitcoin. 360D 180D 90D 30D 14D 7D 2D 24H 12H 6H 4H 2H 1H Dashboards Bitcoin BTC Litecoin LTC Ethereum ETH Cardano ADA Monero XMR Zcash ZEC IOTA IOT EOS Ripple XRP Help & Contact Help page Contact Share this page: Bitcoin margin data - BTC 24H Bitcoin BTCUSD 24 hour timeframe ADAGE - A low fee Cardano stake pool Price {{ price }} {{ daily_change_pct }}% From ATH {{ ath_change_pct }}% Year to date {{ ytd_change_pct }}% Longs {{ total_longs }} {{ symbol_longs_amount_pct_change }}% USD lending rate {{ usd_rate }}% USD available {{ usd_funding_available }} Shorts {{ total_shorts }} {{ symbol_shorts_amount_pct_change }}% BTC lending rate {{ symbol_rate }}% BTC available {{ symbol_funding_available }} 00:00 London 00:00 Berlin 00:00 Athens 00:00 Moscow 00:00 Dubai 00:00 Hong Kong 00:00 Beijing 00:00 Seoul 00:00 Tokyo 00:00 Melbourne 00:00 Los Angeles 00:00 Mexico City 00:00 New York Bitcoin price & total long and short interest Left Y: BTC total longs & shorts Right Y: Price USD {{ price }} Mana vs. Pain 24h health score Longs Shorts Pain {{ long_pain_24h }} {{ short_pain_24h }} Mana {{ long_mana_24h }} {{ short_mana_24h }} 7d health score Longs Shorts Pain {{ long_pain_7d }} {{ short_pain_7d }} Mana {{ long_mana_7d }} {{ short_mana_7d }} 14d health score Longs Shorts Pain {{ long_pain_14d }} {{ short_pain_14d }} Mana {{ long_mana_14d }} {{ short_mana_14d }} Longs & USD interest rate Left Y: USD daily interest rate Right Y: BTC longs Shorts & BTC interest rate Left Y: BTC daily interest rate Right Y: BTC shorts Percent longs and shorts Left Y: Percent short vs. 
long. Hedged and unhedged shorts (Left Y: BTC shorts). Today's sentiment changes, past 24h (Left Y: BTC sentiment change). Bitfinex long & short liquidations past 14d (Left Y: BTC volume liquidated). Bitfinex total long & short liquidations, timeframe (Left Y: BTC volume liquidated). Bitmex long & short liquidations past 14d (Left Y: USD/contract volume liquidated). Bitmex total long & short liquidations, timeframe (Left Y: USD/contract volume liquidated).
Longs vs. shorts (live data, Bitfinex): On the Bitcoin price chart you can see the Bitcoin price in USD (white line), total Bitcoin longs (green line), and total Bitcoin shorts (red line). Both longs and shorts are measured in BTC. On shorter timeframes (say, below one week) longs and shorts are typically almost straight lines, because they don't fluctuate much and because of Y-axis scaling. The two charts below the price chart show the same values for total longs and shorts, but capture the short-term fluctuations much better.
Pain & Mana score (live data): Pain and Mana is a health score calculated by Datamish. Both longs and shorts have a pain and a mana score. Pain is bad and mana is good. The pain score increases when traders are adding to Bitcoin positions while the market is moving against them, so they could be in for a squeeze. The mana score increases when traders are closing their Bitcoin positions while price is moving with them; they are regaining energy. A positive mana score can sometimes happen after the other side has been squeezed successfully. The pain and mana score depends on timeframe, so Datamish calculates the scores for three different timeframes: 24h, 7d, and 14d. The pain and mana score does not tell you anything you couldn't figure out for yourself by looking at the price chart, the longs chart, and the shorts chart. If you want to learn more about how the pain and mana score works, go to one of the three timeframes and consider how price, shorts, and longs have developed within that timeframe.
Longs and USD interest rate (live data, Bitfinex): On the chart you can see the daily interest rate for USD (grey line, left Y) and total longs measured in Bitcoin (green line, right Y). Changes in long positions are important to consider. Increasing longs express a bullish sentiment, and decreasing longs express a bearish sentiment. If the USD interest rate is high, traders are less likely to borrow USD to go long Bitcoin. The interest rate can be pushed up if there is little funding available, so it is a good idea to keep an eye on both interest rates and available funding. At the top of the page there is a section where you can see how much USD funding is available. The risk of liquidation means that margin traders are "weak hands" who can easily be shaken out of their positions. If there are too many longs, this can result in a long squeeze.
Shorts and Bitcoin interest rate (live data, Bitfinex): On the chart you can see the daily interest rate for Bitcoin (grey line, left Y) and total shorts measured in BTC (red line, right Y). If the Bitcoin interest rate is high, traders are less likely to borrow Bitcoin to go short. The interest rate can be pushed up if there is little Bitcoin funding available, so that is worth considering.
At the top of the page there is a section where you can see how much Bitcoin funding is available. Changes in short positions are important to consider. If shorts increase then sentiment is bearish, and if shorts decrease then bearish sentiment is decreasing. The risk of liquidation means that margin traders are "weak hands" that can easily be shaken out of their positions. If there are too many shorters then that can lead to a short squeeze. Esc to close Percent longs and shorts × Live data BFX This chart shows the distribution of longs and shorts as a percentage of the total margin interest, and tracks how this distribution has changed over time. Esc to close Hedged and unhedged shorts × Live data BFX On this chart you can see: Yellow line: The amount of BTC shorts that are known to be hedged. Red line: The amount of BTC shorts that are unhedged (or rather not known to be hedged). Adding hedged and unhedged shorts gives you the total amount of shorts. Sometimes you will see a sudden and substantial drop in the total amount of shorts that has no effect on price. This can seem surprising because you usually complete a trade when you close a short position. The explanation is that the closed short position was hedged. In other words the trader that closed his position did not need to go into the market to buy cover when the position was closed. Esc to close Todays sentiment changes × Live data BFX On this chart you can see sentiment changes for the past 24 hours (the timeframe is fixed). Essentially the chart is reflecting how much BTC has been added or removed on the short side (red line) and how much BTC that has been added or removed on the long side (green line). If the green line is above the red line then sentiment can be said to be more bullish than bearish. Likewise sentiment can be said to be more bearish than bullish if the red line is above the green line. Esc to close Bitcoin liquidations on Bitfinex past 14d × Live data BFX This chart shows the volume liquidated each day for the past two weeks (timeframe is fixed). Short liquidations are green, and long liquidations are red. Bitcoin Liquidations on Bitfinex are measured in BTC. For Bitcoin and Ethereum the charts include liquidation data from both spot AND futures exchanges. Esc to close Total long & short Bitcoin liquidations on Bitfinex (timeframe) × Live data BFX This chart shows the total BTC volume liquidated for the selected timeframe. Short liquidations are green, and long liquidations are red. Liquidations on Bitfinex are measured in BTC so this is what we have on the Y-Axis. Above each bar you can see how many positions has been liquidated in total. For Bitcoin and Ethereum the charts include liquidation data from both spot AND futures exchanges. Esc to close Bitcoin liquidations on Bitmex for the past 14d × Live data BITMEX This chart shows the volume liquidated for the Bitcoin-USD trading pair each day for the past two weeks (timeframe is fixed). Short liquidations are green, and long liquidations are red. Liquidations on Bitmex are measured in contracts. Each contract is 1 USD. Esc to close Total Bitcoin liquidations on Bitmex (timeframe) × Live data BITMEX This chart shows the total volume liquidated for the selected timeframe. Short liquidations are green, and long liquidations are red. Bitcoin liquidations on Bitmex are measured in contracts (1 USD), so this is what we have on the Y-Axis. Above each bar you can see how many positions have been liquidated in total. 
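Datamish does not publish the pain and mana formula, so the description above is qualitative. Purely as a toy illustration of the idea (my own simplification, not the site's actual calculation), "adding to a position while price moves against you" and "closing a position while price moves with you" can be turned into numbers like this, with the 24h, 7d and 14d scores simply using different slices of the series:

```python
# Toy pain/mana score for one side of the market (my own simplification; Datamish's
# actual formula is not published). "Pain" accrues when traders add to positions while
# price moves against them; "mana" accrues when they close positions while price moves
# with them.

def pain_and_mana(prices, positions, side="long"):
    """Return (pain, mana) for a series of prices and open positions (in BTC)."""
    pain = mana = 0.0
    for i in range(1, len(prices)):
        price_change = prices[i] - prices[i - 1]
        pos_change = positions[i] - positions[i - 1]
        # For longs, a falling price is adverse; for shorts, a rising price is adverse.
        adverse = price_change < 0 if side == "long" else price_change > 0
        if pos_change > 0 and adverse:
            pain += pos_change * abs(price_change)
        elif pos_change < 0 and not adverse:
            mana += -pos_change * abs(price_change)
    return pain, mana

# Example: longs added into a falling market accrue pain; longs closed into a rally accrue mana.
prices = [58000, 57500, 57000, 57800, 58500]
longs  = [30000, 30400, 30900, 30700, 30500]
print(pain_and_mana(prices, longs, side="long"))
```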
Esc to close datosabiertospj-eastus-cloudapp-azure-com-7111 ---- Estándar de Datos de Contrataciones Abiertas (OCDS) - Conjuntos de datos - Datos Abiertos del Poder Judicial de Costa Rica Ir al contenido Iniciar Sesión Registro Conjuntos de datos Organizaciones Grupos Acerca de Buscar conjuntos de datos Inicio Organizaciones Poder Judicial de Costa Rica Estándar de Datos de ... Estándar de Datos de Contrataciones Abiertas (OCDS) Seguidores 0 Organización Poder Judicial de Costa Rica Poder Judicial de Costa Rica leer más Social Twitter Facebook Licencia Creative Commons Attribution Conjunto de datos Grupos Flujo de Actividad Estándar de Datos de Contrataciones Abiertas (OCDS) Estándar de Datos de Contrataciones Abiertas (OCDS) Datos y Recursos Estándar de Datos de Contrataciones Abiertas ...ZIP Estándar de Datos de Contrataciones Abiertas (OCDS) - Masivo Explorar Más información Ir al recurso Estándar de Datos de Contrataciones Abiertas ...JSON Estándar de Datos de Contrataciones Abiertas (OCDS) - 2018 Explorar Más información Ir al recurso Estándar de Datos de Contrataciones Abiertas ...JSON Estándar de Datos de Contrataciones Abiertas (OCDS) - 2019 Explorar Más información Ir al recurso Estándar de Datos de Contrataciones Abiertas ...JSON Estándar de Datos de Contrataciones Abiertas (OCDS) - 2020 Explorar Más información Ir al recurso Estándar de Datos de Contrataciones Abiertas ...JSON Estándar de Datos de Contrataciones Abiertas (OCDS) - 2021 Explorar Más información Ir al recurso Información Adicional Campo Valor Autor Poder Judicial Mantenedor Poder Judicial Versión 1.0 Última actualización 28 Enero, 2021, 23:03 (UTC) Creado 3 Agosto, 2020, 23:21 (UTC) Lineamiento de publicación de datos abiertos del Poder Judicial de Costa RicasegúnOpen Contracting Data Standard (OCDS) https://proveeduria.poder-judicial.go.cr/images/Documentos/Lineamientos_Open_Contracting_PJCRC_version_final_REV_JU_Innovaapv1.pdf Acerca de Datos Abiertos del Poder Judicial de Costa Rica API CKAN CKAN Association Gestionado con CKAN Idioma español English português (Brasil) 日本語 italiano čeština (Česká republika) català français Ελληνικά svenska српски norsk bokmål (Norge) slovenčina suomi русский Deutsch polski Nederlands български 한국어 (대한민국) magyar slovenščina latviešu Tiếng Việt srpski (latinica) 中文 (简体, 中国) فارسی (ایران) ខ្មែរ English (Australia) українська (Україна) नेपाली galego shqip עברית македонски ไทย українська Indonesia Türkçe español (Argentina) hrvatski íslenska dansk (Danmark) монгол (Монгол) العربية română português (Portugal) lietuvių 中文 (繁體, 台灣) Filipino (Pilipinas) Ir davidgerard-co-uk-1719 ---- News: vanishing NFTs, Free Keene not so free, Coinbase wash-trading Litecoin – Attack of the 50 Foot Blockchain Skip to content Attack of the 50 Foot Blockchain Blockchain and cryptocurrency news and analysis by David Gerard About the author Attack of the 50 Foot Blockchain: The Book Book extras Business bafflegab, but on the Blockchain Buterin’s quantum quest Dogecoin Ethereum smart contracts in practice ICOs: magic beans and bubble machines Imogen Heap: “Tiny Human”. 
Total sales: $133.20 Index Libra Shrugged: How Facebook Tried to Take Over the Money My cryptocurrency and blockchain commentary and writing for others Press coverage: Attack of the 50 Foot Blockchain Press coverage: Libra Shrugged Table of Contents The conspiracy theory economics of Bitcoin The DAO: the steadfast iron will of unstoppable code Search for: Main Menu News: vanishing NFTs, Free Keene not so free, Coinbase wash-trading Litecoin 29th March 202111th April 2021 - by David Gerard - Leave a Comment I have printed copies of Libra Shrugged and Attack of the 50 Foot Blockchain here — if you’d like to get yourself copies of the books signed by the author, go to this post and see how much to PayPal me. You can support my work by signing up for the Patreon — a few dollars every month ensures the continuing flow of delights. It really does help. [Patreon] I added a $100/month Corporate tier to the Patreon — you get early access to stories I’m working on, and the opportunity to ask your blockchain questions and have me answer! You get that on the other tiers too — but the number is bigger on this tier, and will look more impressive on your analyst newsletter expense account. [Patreon] And tell your friends and colleagues to sign up for this newsletter by email! [scroll down, or click here] Prole art threat I’m an “is it art?” maximalist. NFTs can be used for creative artistic value — just as anything can. The creator and the buyers are playing a game together; there can be genuine appreciation and participation there. I’m not gonna tell ’em they’re wrong. Of course, it may be art, but it can also be a reprehensible scam. The serious problems with the wider NFT market remain. And when the KLF burnt a million quid, they only set it on fire once. If you think of the most absolutely inept and trash-tier way of performing any real-world function, then crypto will reliably not meet even that bar. The pictures for NFTs are often stored on the Interplanetary File System, or IPFS. Blockchain promoters talk like IPFS is some sort of bulletproof cloud storage that works by magic and unicorns. But functionally, IPFS works the same way as BitTorrent with magnet links — if nobody bothers seeding your file, there’s no file there. Nifty Gateway turn out not to bother to seed literally the files they sold, a few weeks later. [Twitter; Twitter] How does the OpenSea NFT platform deal with copyright violations? They keep the unfortunate buyer’s money — and tell them they should have done their own research. (Story by Ben Munster.) [Vice] Beeple has made the wisest play in the NFT game — he got the $60 million in ether for his JPEG, and sold it for dollars immediately. [New Yorker]   woopsie pic.twitter.com/sAQBee0YJ5 — Kim Parker (@thatkimparker) March 28, 2021   This is Radio Freedom Activists from the Free Keene movement, who seek to turn Keene, New Hampshire into a Libertarian paradise, are being ground under the statist jackboot — just for using sound money on a website! Well, running a money transmission business — specifically, exchanging cryptocurrency for actual money — that wasn’t “licensed” by the bureaucratic oppressors who hate freedom. And something about opening bank accounts in the names of churches — “The Shire Free Church”, “The Crypto Church of NH”, “The Church of the Invisible Hand”, and “The Reformed Satanic Church” — and pretending that the money coming in was tax-deductible religious donations. The usual governmental overreach. 
[Justice Department; Patch; indictment, PDF; case docket] Ian Freeman (On The Land?) had $1.6 million worth of bitcoins, and $178,000 in a safe, when the FBI raided the house where the arrestees lived — which was owned by “Shire Free Church Monadnock.” The same house was raided by the FBI in 2016 — on an investigation into child pornography. Must be one of those coincidences. [Keene Sentinel, archive; Union Leader, archive] For those whose day isn’t complete without some cheering Cantwell News — you know who you are — this particular bunch are all ex-friends of Chris “The Crying Nazi” Cantwell, who moved to Keene specifically to join Free Keene. The recently-arrested activists now claim Cantwell was never part of Free Keene, but that’s completely false — they showed up as moral support to Cantwell’s recent trial on threats of rape, only to throw him under the bus when he was convicted. [Manchester Ink Link] In my 2019 Foreign Policy piece on the ways neo-Nazis used Bitcoin, this bit at the end was about Cantwell: [Foreign Policy, 2019] One neo-Nazi podcaster found a credit card processor that was fine with the content of his show but said he was untouchable for another reason: He was considered a money laundering risk because he dealt in cryptocurrency. One story that didn’t get into that piece is how Cantwell got out of jail after the Unite The Right neo-Nazi rally in Charlottesville, North Carolina in 2017 and bought up big into Bitcoin! … right at the December peak of the 2017 bubble. He lost so much money on Bitcoin that he had to sell his guns to pay his lawyer.   every crypto vision of the future is trying to take a technology developed for hyperadversarial contexts and being like Let's build a society on this. like saying all transit should take place in armored tanks, or all interpersonal disputes should go through full legal discovery — stephanie (@isosteph) March 10, 2021   Lie dream of a casino soul Coinbase has had to pay a $6.5 million fine to the CFTC for allowing an unnamed employee to wash-trade Litecoin on the platform. On some days, the employee’s wash-trading was 99% of the Litecoin/Bitcoin trading pair’s volume. Coinbase also operated two trading bots, “Hedger and Replicator,” which often matched each others’ orders, and reported these matches to the market. [press release; order, PDF] CFTC commissioner Dawn Stump issued an opinion that concurred with the stated facts, but disputes that the issue was within CFTC’s jurisdiction, and says that the reporting didn’t affect the market. This appears not to be the case — it did affect the markets that depended on Coinbase’s numbers. [CFTC; New Money Review] Coinbase’s direct listing public offering has been pushed back at least to April — no reason given, but doubtless coincidental with Coinbase getting caught letting an employee run wild wash-trading on the exchange. [Bloomberg Quint] If Coinbase — one of the more regulated exchanges — did this, just think what the unregulated exchanges get up to. Bloomberg reports a CFTC probe into Binance, and whether the non-US exchange had US customers — attributed to unnamed “people familiar with the matter.” There doesn’t seem to be further news on this one as yet. [Bloomberg] Ben Delo and Arthur Hayes from BitMEX will be surrendering to US authorities to face the Department of Justice charges against them. [Bloomberg; Twitter] Bennett Tomlin summarises what Bitfinex/Tether executives did before Bitfinex or Tether. 
[blog post]   Said differently – unfortunately Coinbase requires its customers to retain counsel to get customer service… — David Silver (SILVER MILLER) (@dcsilver) March 29, 2021   Baby’s on fire Alex de Vries (Digiconomist) has a study published in Joule on what the rising Bitcoin price means for the Bitcoin network’s energy consumption. He thinks the Bitcoin network could already ues as much energy as every other data centre in the world — with a carbon footprint the size of London. [Joule] “Coin miners have basically added a province’s worth of electricity consumption without adding a province’s worth of economic output, so Bitcoin mining is actually a net drag on the economy as a whole,” Tim Swanson told Al Jazeera. [Al Jazeera] In late 2017, Benjamin Reynolds of Control-Finance Ltd ran a Bitcoin investment scam in the UK. The CFTC, in association with the FCA, now have a $571 million default judgement against him. The hard part: finding him. [press release] New Bitcoin use case found! Selling fake insider trading tips on the dark web. [SEC; complaint, PDF]   An entire generation (or maybe just a cargo cult on twitter/reddit) read the inflation chapter of an econ textbook, panicked & stopped before they read the rest. Maybe the fed should do some PSAs or something. Pay @cullenroche or @TheStalwart to do a youtube series. — Adam Singer (@AdamSinger) March 29, 2021   Be less Brenda The Advertising Standards Authority (UK) has finally acted against an ad for Bitcoin — in this case, a Coinfloor ad running in local papers, featuring a woman buying bitcoins with a third of her pension. The complainant said the ad was: misleading, because it failed to make clear the risks associated with Bitcoin investments, including loss of capital, and that neither Coinfloor Ltd nor the general Bitcoin market were regulated in the UK; and socially irresponsible, because it suggested that purchasing Bitcoin was a good or secure way to invest one’s savings or pension. The ASA upheld both objections. [ASA]   In 1955, a McDonald's hamburger cost $0.15. Today, they're worth $2.50 each. If you had bought 400,000 of them for just $60,000 and never sold, those burgers would be worth $1,000,000 today.#investing #CFA #compounders — abstractify 📚 (@abstractify) March 20, 2021   Carpe Diem Facebook’s Diem applied for a money transmitter licence to FINMA, the Swiss regulator, in April 2020 — back when it was still called Libra. The application is still pending, nearly a year later. FINMA apparently has internal disagreements on whether to let Diem go forward — and they know they absolutely need this to be okay with regulators in the US and EU before they proceed. [SRF, audio in German; Twitter] Kevin Weil, one of Libra/Diem’s four founders, and co-author of the Libra white paper, has quit Facebook Finance. He’s moving to satellite surveillance startup Planet.com. “I’m beyond excited to be working on a non-zombie project,” Weil didn’t quite say. [Twitter; Planet] I’m wondering how long before David Marcus gets bored running WhatsApp Pay and wanders off too. There’s still active contributions to the Diem GitHub repo, if only from Facebook staff. [GitHub] The East Caribbean Central Bank is launching its DCash CBDC pilot on 31 March. [ECCB, archive] The European Central bank has blogged on their plans for a digital euro! That is: no specific plans whatsoever, and repeated reassurances that they’re not about to replace cash, impose negative interest rates, or push out the commercial banks. 
And they don’t have a consumer use case as yet. [ECB]   Facebook's strategy for protecting their crypto projects from regulators is to rename the project and cycle out all the executives every 6 months so that no regulator can possibly remember if "that Libra or Diem thing" is still around — Kyle S. Gibson (@KyleSGibson) March 18, 2021   ICO, ICO Telegram’s ICO failed so hard that founder Pavel Durov ended up owing $500 million to investors — specifically, the sort of investors who have robust ideas on how to deal with perceived shenanigans. “Pavel’s got a smart team, I’m sure they’ll come up with something,” said one creditor. Durov announced in December that Telegram would start running advertising in public channels. [Telegram] Now Durov has announced a $1 billion bond issue. [Telegram] He is delighted to share that he can finally pay back the guys who put money into the ICO, and that he will continue to enjoy the use of his limbs. SEC’s action against Ripple Labs, claiming XRP is a security, continues — and so far, they’re still sniping over what the case will cover: The SEC asks to strike Ripple’s Fourth Affirmative Defense, “Lack of Due Process and Fair Notice”; Ripple complains that the SEC won’t submit documents in discovery on what it thinks of Bitcoin and Ethereum; Ripple executives Brad Garlinghouse and Christian Larsen ask to quash the SEC subpoenas to look into their personal bank accounts; and John Deaton, representing a group calling itself the XRP Holders, wishes to join the case on the grounds that the SEC has damaged the value of their XRP. Much of this will be dealt with in pleadings to be filed over April, May and June. [Case docket, with linked PDFs] Trailofbits has been fuzz-testing the compiler for Solidity — the language most blockchain smart contracts are written in — for bugs and vulnerabilities. [Trailofbits]   We do have proof that the FTC did, in fact, say “Buttcoin”https://t.co/5eywXuXsO2 https://t.co/QaxYL9OfYg — Buttcoin (@ButtCoin) March 24, 2021   Things happen The crypto ban in India looks set to go ahead, penalising miners and traders — “Officials are confident of getting the bill enacted into law as Prime Minister Narendra Modi’s government holds a comfortable majority in parliament.” You’ll have six months to liquidate your holding. [Reuters] In the meantime, Indian companies will have to disclose their crypto holdings in their profit-and-loss and balance sheets. [Ministry of Corporate Affairs, PDF; Finance Magnates] How’s Reddit’s subforum crypto token experiment going? Well, /r/cryptocurrency is now pay-to-post — 1000 MOON tokens a month, or $5. You can imagine my surprise at seeing the scheme end up being run as a scam to enrich local forum moderators. [Reddit] Visa moves to allow payment settlements using dollar-substitute stablecoin USDC, in a pilot programme with Anchorage and Crypto.com: “Visa has launched a pilot that allows Crypto.com to send USDC to Visa to settle a portion of its obligations for the Crypto.com Visa card program.” The size of the “portion” is not specified. Visa also tweeted some non-detail details. [press release; Reuters; Twitter] Former SEC chair Jay Clayton has his first post-SEC crypto consulting gig — as an advisor to One River Digital Asset Management. [press release]   I have digitized your plums and sold them although, strictly speaking, I sold a hash of a URL to a JSON file describing your plums in perpetuity or, for as long as https://t.co/3jkcTOHQBo stays in business the plums themselves? 
i burned them forgive me they made a lot of smoke — Ian Holmes (@ianholmes) March 17, 2021   Living on video I did a ton of media on NFTs in the past month, including the BBC’s explainer: What are NFTs and why are some worth millions? “The same guys who’ve always been at it, trying to come up with a new form of worthless magic bean that they can sell for money.” [BBC] Business Insider writes on NFTs, quoting me — and the Independent quotes Business Insider quoting me. [Business Insider, Independent] I went on the Coingeek Conversations podcast again, to talk about NFTs with Josh Petty, a.k.a. Elon Moist of Twetch. We ended up agreeing on most stuff — that you can definitely do good and fun things with NFTs, but the present mainstream market is awful. [Coingeek] I don’t yet know of anyone busted for money-laundering through NFTs, but it’s the obvious use case for objects of purely subjective value being traded in an art market at the speed of crypto. Crypto News has an article, with quotes from me. [Crypto News] I was interviewed on NTD about NFTs: Expert Warns About NFT Digital Crypto Art. [NTD] Kenny Schachter from Artnet writes about NFTs. He’s an art professor, and very much into the potential of NFTs, but he was great to talk to about this stuff. [Artnet] I can’t name it until it airs — they’re worried about their competition sniping them — but I recorded a segment this evening on NFTs for a TV show with quite a large and important audience. Should be out tomorrow, or maybe the day after. Someone sold a house for $3.3 million in bitcoins. I went on a TV segment about it, to explain what the heck a bitcoin is. [video; transcript] Sky News Arabia has a 28-minute bitcoin documentary, with me in — my bits are 15:19–15:34, 16:42–17:13 (holding up one book backwards) and 17:42–18:12. It’s all in Arabic, so I have no idea of its quality, but they’re part of the sane Sky News (UK), not the crazy one (Australia). I’m told the voiceover translations of my bits are accurate. [YouTube] I talked about celebrity crypto scams on NTD — the Elon Musk scams on Twitter, and the Instagram influencer who conned his followers out of bitcoins. Had to use the laptop camera, but ehh, it gave usable results. My segment starts 17:34. [YouTube] Not cryptocurrency related — that’s coming later, when we do the “Bitcoin Nazis” episode — but I’m on the podcast I Don’t Speak German, talking to a couple of antifa commies about Scott Alexander, author of the Intellectual Dark Web rationalist blog Slate Star Codex. I Don’t Speak German is mostly about neo-Nazis and white nationalists, and Slate Star Codex isn’t really that — but Scott Alexander is a massive and explicit fan of eugenics, “human biodiversity” (scientific racism), sterilising those he sees as unfit, and the neoreactionary movement, so that was close enough for our purposes. (For cites on all those claims, listen to the podcast.) It was a fun episode. Also appearing is Elizabeth Sandifer, author of Neoreaction a Basilisk (UK, US), and the person responsible for me starting Attack of the 50 Foot Blockchain. [I Don’t Speak German] Hint for crypto video media: when sending a query, say who the hosts and all the guests are, and what the format is. The media arm of one crypto news site that’s definitely large enough to know better nearly (inadvertently) sprang an ambush live debate on me, until I questioned more closely. Don’t be the outlet that your prospective subjects warn each other about.   
Rare that a single tweet so perfectly encapsulates everything that makes my skin crawl about SF Bay Area's moneyed, whitebread techie monoculture. Truly the most cursed thing I have seen in recent memory. https://t.co/8eCWPGlu1O — KC 🏴 (@KdotCdot) March 18, 2021   Check out my technical analysis on the stuck boat, big breakout incoming. Should be unstuck any time now. Very bullish pic.twitter.com/u2BknxUyqT — G. Kennedy Fuld Jr., CFA, MBA, ChEA, FRM (@MemberSee) March 25, 2021   Your subscriptions keep this site going. Sign up today! Share this: Click to share on Twitter (Opens in new window) Click to share on Facebook (Opens in new window) Click to share on LinkedIn (Opens in new window) Click to share on Reddit (Opens in new window) Click to share on Telegram (Opens in new window) Click to share on Hacker News (Opens in new window) Click to email this to a friend (Opens in new window) Taggedanchoragearthur hayesasabeepleben delobenjamin reynoldsbinancebitfinexbitmexcftcchristopher cantwellcoinbasecoinfloorcontrol-financecrypto.comdawn stumpdcashdiemdigiconomistecbeccbfacebook financefinmafree keeneian freemanicoindiaipfsjay claytonkevin weillinkslitecoinnftnifty gatewayone riveropenseapavel durovredditripplesecsolidityswitzerlandtelegramtethertim swansontrailofbitsusdcvisaxrp Post navigation Previous Article Quadriga documentary ‘Dead Man’s Switch’ — the trailer is out Next Article Tether produces a new attestation — it says nothing useful Leave a Reply Cancel reply Your email address will not be published. Required fields are marked * Comment Name * Email * Website Notify me of follow-up comments by email. Notify me of new posts by email. This site uses Akismet to reduce spam. Learn how your comment data is processed. Search for: Click here to get signed copies of the books!   Get blog posts by email! Email Address Subscribe Support this site on Patreon! Hack through the blockchain bafflegab: $5/month for early access to works in progress! $20/month for early access and even greater support! $100/month corporate rate, for your analyst newsletter budget! Buy the books! Libra Shrugged US Paperback UK/Europe Paperback ISBN-13: 9798693053977 Kindle: UK, US, Australia, Canada (and all other Kindle stores) — no DRM Google Play Books (PDF) Apple Books Kobo Smashwords Other e-book stores Attack of the 50 Foot Blockchain US Paperback UK/Europe Paperback ISBN-13: 9781974000067 Kindle: UK, US, Australia, Canada (and all other Kindle stores) — no DRM Google Play Books (PDF) Apple Books Kobo Smashwords Other e-book stores Available worldwide  RSS - Posts  RSS - Comments Recent blog posts News: Coinbase goes public, Bitcoin hashrate goes down, NFTs go down, proof-of-space trashes hard disk market Stilgherrian: The 9pm Dumb Anarcho-Capitalist Blockchain Scams with David Gerard Podcast: I Don’t Speak German #85: Crypto Fascists, with David Gerard Desperate investors, neoliberalism and Keynes: how to increase returns New York’s Excelsior Pass for COVID-19, on IBM Blockchain: doing the wrong thing, badly Excerpts from the book Table of Contents The conspiracy theory economics of Bitcoin Dogecoin Buterin’s quantum quest ICOs: magic beans and bubble machines Ethereum smart contracts in practice The DAO: the steadfast iron will of unstoppable code Business bafflegab, but on the Blockchain Imogen Heap: “Tiny Human”. 
Total sales: $133.20 Index
davidgerard-co-uk-2274 ---- NFTs: crypto grifters try to scam artists, again – Attack of the 50 Foot Blockchain 11th March 2021 - by David Gerard - 15 Comments. Non-fungible tokens, or NFTs, are the crypto hype for 2021 — since DeFi ran out of steam in 2020, and Bitcoin's pumped bubble seems to be deflating. The scam is to sell NFTs to artists as a get-rich-quick scheme, to make life-changing money. There's a gusher of money out there! You just create a token! And any number of crypto grifters would be delighted to assist you. For a small consideration. It's con men with a new variety of magic beans to feed the bubble machine — and artists are their excuse this time. The NFT grift works like this: Tell artists there's a gusher of free money! They need to buy into crypto to get the gusher of free money. They become crypto advocates, and make excuses for proof-of-work and so on. A few artists really are making life-changing money from this! You probably won't be one of them. In a nicer, happier world, NFTs would be fun little things you could make and collect and trade, and it'd be great. It's a pity this is crypto. What is an NFT? An NFT is a crypto-token on a blockchain. The token is virtual — the thing you own is a cryptographic key to a particular address on the blockchain — but legally, it's property that you can buy, own or sell like any other property. Most crypto-tokens, such as bitcoins, are "fungible" — e.g., you mostly don't care which particular bitcoins you have, only how much Bitcoin you have. Non-fungible tokens are a bit different. Each one is unique — and can be used as an identifier for an individual object. The NFT can contain a web address, or maybe just a number, that points somewhere else.
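As a toy illustration of that pointer idea (a deliberate simplification; a real NFT is an entry in an Ethereum contract, not a Python dictionary), the whole data structure amounts to a mapping from a token ID to an owner address and a URL:

```python
# Deliberately simplified model of an NFT ledger: a token is just an ID that maps to
# an owner address and a URL. Nothing about the artwork itself lives in the ledger.
from dataclasses import dataclass

@dataclass
class Token:
    owner: str         # address of whoever holds the key
    metadata_url: str  # pointer to a JSON file or image on some website

class ToyNFTLedger:
    def __init__(self):
        self.tokens: dict[int, Token] = {}
        self.next_id = 1

    def mint(self, owner: str, metadata_url: str) -> int:
        """Create a new token that merely records a pointer; return its ID."""
        token_id = self.next_id
        self.tokens[token_id] = Token(owner, metadata_url)
        self.next_id += 1
        return token_id

    def transfer(self, token_id: int, new_owner: str) -> None:
        """Selling the NFT changes the owner field, and nothing else."""
        self.tokens[token_id].owner = new_owner

ledger = ToyNFTLedger()
tid = ledger.mint("0xARTIST", "https://example.com/art/123.json")  # hypothetical addresses and URL
ledger.transfer(tid, "0xCOLLECTOR")
print(ledger.tokens[tid])  # the buyer owns a pointer; the file behind the URL can change or vanish
```

If the website behind metadata_url changes or disappears, the token still exists, but it points at nothing, which is the "easily changeable" problem noted in the next paragraph.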
An NFT is just a pointer. If the place the NFT points to is a site that claims to sell NFTs that represent artworks — then you have what’s being called crypto-art! Note that it’s only the token that’s non-fungible — the art it points to is on a website, under centralised control, and easily changeable. When I buy an NFT, what do I get? The art itself is not in the blockchain — the NFT is just a pointer to a piece of art on a website. You’re buying the key to a crypto-token. You’re not buying anything else. An NFT doesn’t convey copyright, usage rights, moral rights, or any other rights, unless there’s an explicit licence saying so. It’s like a “Certificate of Authenticity” that’s in Comic Sans, and misspelt. At absolute best, you’re buying a piece of official merchandise — one that’s just a number pointing to a website. Why is an NFT? NFTs exist so that the crypto grifters can have a new kind of magic bean to sell for actual money, and pretend they’re not selling magic beans. The purpose of NFTs is to get you to give your money to crypto grifters. When the grifter has your money, the NFT has done its job, and none of the fabulous claims about NFTs need to work or be true past that point. NFTs are entirely for the benefit of the crypto grifters. The only purpose the artists serve is as aspiring suckers to pump the concept of crypto — and, of course, to buy cryptocurrency to pay for “minting” NFTs. Sometimes the artist gets some crumbs to keep them pumping the concept of crypto. CryptoKitties, in late 2017, was the first popular NFT. CryptoKitties was largely fueled by bored holders of ether — the cryptocurrency for Ethereum — spending their ether, that they had too much of to cash out easily, on some silly toys that they traded amongst themselves. Since then, various marketers have tried to push the idea along. People pay real money for hats in video games, don’t they? Then surely they’ll buy crypto tokens that allegedly represent their favourite commercial IP! These mostly haven’t taken off. The first real success is NBA Top Shots, where you buy an official NBA-marketed token that gives you a website trading card of a video snippet. This has taken off hugely. NBA Top Shots has its own issues, which I’ll probably deal with in a later post. DeFi pumpers tried pushing NFTs in October last year, but they couldn’t get the idea to stick. The recent Bitcoin bubble feels like it’s running out of steam — so they’re pushing the NFT idea again, and pumping it hard. With NBA Top Shots and some heavily promoted big-money alleged sales, crypto art NFTs are hitting the headlines. How do I make an NFT? If you aren’t a technically-minded blockchain enthusiast, there are websites where you can “mint” an NFT. First, you need to buy some ether. This covers the transaction fee to make your NFT. You’ll need Ethereum wallet software, probably Metamask, which is a browser extension. How much do you need? Well, guess and hope you’re lucky. Ethereum transaction fees peaked at $40 per transaction in February. Lots of poor artists have tried making NFTs and lost over $100 they really couldn’t spare — so guess high! You might notice that this looks a lot like a vanity gallery scam, or pay-to-play. You’d be correct — the purpose is to suck your precious actual-money into the crypto economy. Connect your Ethereum wallet to one of the NFT marketplaces. Upload your file and its description. You have created a token! Now you need to hope a bored crypto holder will buy it. What is “digital ownership”? 
Without a specific contract saying otherwise, an NFT does not grant ownership of the artwork it points to in any meaningful sense. All implications otherwise are lies to get your money. This is the “registration scam” — like selling your name on a star, or a square foot of land on the moon. Musicians will know the “band name registry” scam, where the scammer sells something that they imply will work like a trademark on your name — but, of course, it doesn’t. (There have been multiple “register your band name on a blockchain” scams.) Crypto grifters will talk about “digital ownership.” This is meaningless. The more detail you ask for what actual usable rights this “ownership” conveys, the vaguer the claims will get. The whole idea of Bitcoin was property unconfiscatable by the government, that they could use as money. Instead of a framework of laws and rights, they’d use … a blockchain! This notion is incoherent and stupid on multiple levels — money is a construct agreed upon in a society, property rights are a construct of law and social expectations — but it’s also what the bitcoiners believe and what they wanted. NFTs try to justify themselves with variations on this claim as the marketing pitch. Christie’s auction of an NFT is a fabulous worked example. There’s a 33-page terms and conditions document, and if you wade through the circuitous verbiage, it finally admits that … you’re just buying the crypto-token itself: [Christie’s, PDF, archive] You acknowledge that ownership of an NFT carries no rights, express or implied, other than property rights for the lot (specifically, digital artwork tokenized by the NFT). … You acknowledge and represent that there is substantial uncertainty as to the characterization of NFTs and other digital assets under applicable law. The magic bean in question is bidding at $13 million as I write this, which means Christie’s stands to make about $2 million commission. Pretty good payday for a cryptographic hash. [Christie’s] I don’t understand any of this. Please explain it like I’m five. “Would you like to watch your favourite CBeebies show — or would you like me to write on a piece of paper that you own the show? All you get is the piece of paper.” The trouble with explaining NFTs to a five-year-old is that you’ll have a hard time convincing a five-year-old that this nonsense isn’t the nonsense it obviously is. It sounds unfathomably stupid because it’s unfathomably stupid. The K Foundation Burn A Million NFTs: Crypto art’s ghastly CO2 production Proof-of-work is the reprehensible, planet-destroying mechanism that the Ethereum and Bitcoin blockchains use to decide who gets fresh ether or bitcoins. Proof-of-work is inexcusable nonsense, and every single person making money in anything linked to Ethereum or Bitcoin should feel personal shame. (Crypto grifters don’t possess a shame organ.) Like Bitcoin, Ethereum uses an whole country’s worth of electricity just to keep running — and generates a country’s worth of CO2. The Ethereum developers claim they’re totally moving off proof-of-work any day now — but they’ve been saying that since 2014. Crypto grifters making bad excuses for proof-of-work will often object to calculating their favourite magic bean’s per-transaction energy use, at all. The excuse is that adding more transactions doesn’t directly increase Bitcoin or Ethereum’s energy consumption. The actual reason is that the numbers for Bitcoin and Ethereum are bloody awful. 
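For what that arithmetic actually looks like, here is a worked sketch in Python. The energy and transaction figures are illustrative round numbers of the sort Digiconomist publishes, not exact current estimates.

# Per-transaction energy: total energy used, divided by total work done.
# The figures below are illustrative round numbers, not current estimates.
KWH_PER_TWH = 1e9  # 1 terawatt-hour = one billion kilowatt-hours

chains = {
    # name: (assumed annual energy use in TWh, assumed annual transactions)
    "Bitcoin":  (100, 110_000_000),
    "Ethereum": (40, 440_000_000),
}

for name, (annual_twh, annual_txs) in chains.items():
    kwh_per_tx = annual_twh * KWH_PER_TWH / annual_txs
    print(f"{name}: roughly {kwh_per_tx:,.0f} kWh per transaction")
# Bitcoin: roughly 909 kWh per transaction
# Ethereum: roughly 91 kWh per transaction

The same division applied to any payment network's published energy use and transaction counts is how every other per-transaction efficiency figure gets worked out.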
[Digiconomist; Digiconomist] The grifters will routinely pretend it’s somehow impossible to do arithmetic, and divide the energy use by the work achieved with it — in the precise same manner we do for literally every other enterprise or industry that uses energy. But if you’re calculating energy efficiency — of Bitcoin, Ethereum, Visa, Twitter or banks — then taking the total energy used and dividing it by the total work done is the standard way to work that out. Sites have sprung up to calculate the share of energy that crypto art spends. The site cryptoart.wtf picks a random piece of crypto art and calculates that transaction’s energy use. “These figures do not include the production or storage of the works, or even web hosting, but is simply for the act of using the PoW Ethereum blockchain to keep track of sales and activity.” The creator also has a blog post to explain the site, and address common bad excuses for proof-of-work. [cryptoart.wtf; Medium] You may tell yourself “but my personal marginal effect is minimal” — but in that case, don’t pretend you’re not just another aspiring crypto grifter. There are other blockchains that don’t use proof-of-work. Hardly anybody does NFTs on these chains — almost nobody uses them, and the local cryptocurrency for your fees is a lot more work to get hold of. And even if you did use one of these other blockchains, all the other ways that NFTs are a scam would still hold. But what about artists? They need money too Artist pay is terrible. Even quite successful artists whose names you know wonder if they could tap into the rich people status-and-vanity art market, and get life-changing money. (I’ve already seen one artist bedazzled by the prospect of NFT money say that anyone who objects to crypto art must be a shill for Big Tech.) Artists don’t know technology any more than anyone else does, so a lot of artists who tentatively essayed an NFT were completely unaware of the ghastly CO2 production involved in anything that touches cryptocurrency. Several were shocked at the backlash over an issue they’d had no idea existed. Famous artists are getting into NFTs. Grimes did an NFT, and it’d be fair to say that Elon Musk’s partner isn’t going to be doing an NFT for the money. Even if it’s a bit at odds with her album about ecological collapse. But famous musicians have long had a habit of adopting some awful headline-friendly technology that’s utterly unready for prime time consumer use, in order to show that they are hep and up to speed with the astounding future. Then they never speak of it again. Remember Björk’s cryptocurrency album in 2017? Kings of Leon are doing an NFT of their new album — sort of. Their page on NFT site Opensea suggests that you buy a digital download (not an NFT), limited edition vinyl (not an NFT), or a collectible artwork (a wallpaper). So what you’re actually buying is a vinyl record with a download, and in return, you not only give the band money, but hasten ecological collapse. Some small artists have done very well indeed from NFTs — and that’s excellent news! If you’ve made life-changing money from an NFT, then that’s good for the world as well as for you — ‘cos now the money’s out of the hands of the crypto grifters. (For goodness’ sake, cash out now.) An important rule of crypto is: every number that can be faked is faked. NFTs are the sort of con where a shill appears to make a ton of money, so you’ll think you can too. 
Put a large price tag on your NFT by buying it from yourself — then write a press release talking about your $100,000 sale, and you’re only out the transaction fee. Journalists who can’t be bothered checking things will write this up without verifying that the buyer is a separate person who exists. Just like the high-end art world! Another thing that the high-end art world shares with crypto is money laundering. Press coverage tends to focus on cultural value, and assume this stuff must be of artistic weight because someone spent a fortune on it. The part that functions as a money-laundering scam is only starting to get comment recently. [National Law Review, 2019; Art & Object, 2020] NFTs will almost certainly be used for money laundering as well, because crypto has always been a favourite for that use case. Banksying the unbanksied: fraudulent NFTs There is no mechanism to ensure that an NFT for an artwork is created by the artist. A lot of NFTs are just straight-up fraud. If NFTs weren’t a scam, there would be legal and technical safeguards to help ensure the NFT was being created by someone who owned the work in question, to fend off scammers. But there aren’t any — the sites all work on the basis “we’ll clean it up later, maybe.” This is because NFTs only exist to further the crypto grift. There are multiple NFT sites — you could create an unlimited number of NFTs that all claimed to be of a single particular work. There are a number of Twitter bots that will make an NFT of any tweet you point them at. The point is for the bot owner to make a commission from the sale of the NFTs, before the suckers catch on. Don’t expect Twitter to do anything about these people — Twitter CEO Jack Dorsey has a $2.5 million offer for an NFT of his first tweet. The offer is from Dorsey’s fellow crypto grifter Justin Sun. Now, you might think these two massive crypto holders were just trying to get headlines for the NFT market. [Rolling Stone] Someone NFTed all of dinosaur artist Corbin Rainbolt’s tweeted illustrations — and he took down the lot and put up watermarked versions. “I am not pleased that I have to take this sort of scorched earth policy with my artwork, frankly I am livid.” [Twitter] You could go through and block and report all the Twitter bots, though more will just spring up. [Twitter] But think of all the good things you could do with NFTs, you luddite When you point out that cryptocurrencies are terrible and NFTs are a scam, crypto grifters will start talking about all the things that you could potentially do if NFTs worked like they claim they do. This is a standard crypto grifter move — any clear miserable failure in the present will be answered with talking about the fabulous future! e.g., claiming Bitcoin or blockchain promises will surely come true, because it’s just like the early Internet. Which, of course, it isn’t. What can artists and buyers do about fraudulent NFTs? If the NFT site has a copy of your artwork up, you can send a DMCA notice to them, and to their upstream network provider. If the NFT site is just claiming or implying that you created this NFT when you did not, this is clearly fraudulent (misrepresentation, passing off) — but may be harder to get immediate action on. If you bought an NFT thinking it was put up by the artist, and it wasn’t, then you’ve been defrauded, and should ask for a refund. If the NFT site won’t refund you, then bring to bear absolutely everything you can on them. 
If the site is unresponsive to notices of fraud — which is quite common, because crypto grifters think “digital ownership” is a thing, and don’t care that other rights might exist in law or society — it is absolutely in order to shout from the rooftops that they are frauds, and blacken their name as best you can. Contact their financial backers too. Then talk about that as well. Ask around to see if you have a lawyer friend, or a friend of a friend, who might be in a position to assist pro bono just because these grifters are that terrible. The most important thing for artists to do about NFT fraud is to work to make NFTs widely considered to be worthless, fraudulent magic beans, with massive CO2 generation per transaction. This shouldn’t be terribly difficult, given that NFTs are in fact worthless, fraudulent magic beans, with massive CO2 generation per transaction. But is it art? You can tell that crypto art is definitely art, because so many proponents of it are insufferable manifesto bros. Just the manifestos could cause runaway global warming from sheer volume of hot air. (“Banksying the unbanksied” courtesy Etienne Beureux.)   Pleased to offer a NFT version of Neoreaction a Basilisk, which you can obtain at https://t.co/SPZnjZIgOI — El Sandifer, Rationality Expert to the Stars (@ElSandifer) March 1, 2021   you claim to place such moral stock in "artists getting paid" yet do not subscribe to my patreon, curious — Dr Samantha Keeper MD (@SamFateKeeper) March 5, 2021   We have a unique opportunity to help the planet and make culture better for future generations, and everyone can contribute simply by not giving a toss about NFTs. — Dan Davies (@dsquareddigest) March 7, 2021   Your subscriptions keep this site going. Sign up today! Share this: Click to share on Twitter (Opens in new window) Click to share on Facebook (Opens in new window) Click to share on LinkedIn (Opens in new window) Click to share on Reddit (Opens in new window) Click to share on Telegram (Opens in new window) Click to share on Hacker News (Opens in new window) Click to email this to a friend (Opens in new window) Taggedchristie'scorbin rainboltcryptokittiesethereumgrimesjack dorseyjustin sunkings of leonnba top shotsnftopenseaproof of work Post navigation Previous Article News: India crypto ban, North Korea, BitMEX execs to appear, IBM Blockchain dead, more McAfee charges Next Article Foreign Policy: It’s a $69 Million JPEG, but Is It Art? 15 Comments on “NFTs: crypto grifters try to scam artists, again” Adam Achen says: 12th March 2021 at 12:24 am Wait, so, NFT don’t even typically include a license for use of the underlying?! Reply David Gerard says: 12th March 2021 at 10:24 am nope! Note how even the Christie’s contract basically says “we dunno wtf this thing is, have fun” Reply K. Paul says: 12th March 2021 at 4:40 am Isn’t it curious that on the same day that 1 billion Tethers get minted on the TRON blockchain, Beeple’s NFT gets sold for USD 69 million worth of ETH? Apparently Justin Sun (founder of the TRON blockchain) was the leading bidder until losing it to another crypto bro at the last bid. Super curious, no? Money laundering? Reply David Gerard says: 12th March 2021 at 10:25 am I’m sure it’s just coincidence, and that Sun definitely didn’t snipe his own bid under another name for press release purposes. Reply K. Paul says: 13th March 2021 at 10:16 am LOL Reply David Gerard says: 16th March 2021 at 11:50 pm It turns out it was bought by … a guy Beeple was already in the crypto business with! 
https://amycastor.com/2021/03/14/metakovan-the-mystery-beeple-art-buyer-and-his-nft-defi-scheme/ so the $9m (in ETH) to Christie’s is correctly viewed as a marketing expense Reply K. Paul says: 17th March 2021 at 2:40 am I think Sun, Beeple, Christie’s, Vignesh, Musk, etc. are all working together to push NFTs. It all just seems so… planned and organized in advance. Look at what Musk is doing now. Meanwhile, Tether printer goes BRRRRRRRRR!!! WK says: 13th March 2021 at 2:19 pm Thanks; this was an interesting read. I’ve been reading about “crypto” on and off for a while now, trying to understand what it’s all about because it seems like nonsense. My initial skepticism has so far been reinforced and I completely fail to see how Bitcoin or any other digital currency is independent of actual existing hard currencies. This NFT business ($2.5m for a Tweet?) is headscratchingly ridiculous. Reply John S says: 21st March 2021 at 9:01 pm Crypto is a perfect way to take money from “midwits.” Lower IQ people instinctively know it’s dumb and the barrier of entry keeps them out. Genuinely smart people (I’m not a genius but I would place myself in this category) read all the claims and conclude that there is no intrinsic value, regardless of limits on supply etc. People in the middle read the claims and convince themselves they understand this stuff and the marketing (better than FIAT, banks, libertarian utopia etc) are true and get burned. The people who make money in crypto are either insiders or they know it’s crap and sell it during bubble periods instead of holding with the expectation that the value will perpetually increase due to magical properties. Reply Ingvar says: 16th March 2021 at 1:49 pm JWZ on NFTs. Worth a read, including the comments (which, frankly, is not something I am used to saying). Reply Blaise says: 17th March 2021 at 8:26 pm Great work: I now have a much better understanding of NFT and your “contrarian” view makes perfect sense. Reply Adam Burns says: 20th March 2021 at 4:17 pm > It’s [NFTs are] like a “Certificate of Authenticity” that’s in Comic Sans, and misspelt. written in crypto crayons. for the love of humanity! oh … but wait. check out these ‘humanitarians’ https://www.proofofhumanity.id/ Reply JetBlack says: 23rd March 2021 at 12:18 am Just more proof of the unmitigated stupidity of the world we live in. Wow. And Grimes just sold a bunch of NTFs for a tidy sum. Hmm… I wonder who bought those? Is she connected to anyone with a lot of disposable income with an vested interest in Bitcoin and crypto? Reply Alex says: 24th March 2021 at 12:15 am Hello! I have a question, help me out. Let’s say an artist put up his/her work in a conditional NFT market, an auction started and he/she successfully sold it. After this event – what rights does the artist have towards the auctioned work? Or the work is still the intellectual property of the artist? Reply David Gerard says: 24th March 2021 at 12:51 am All the rights, unless explicitly stated otherwise in the sale of the NFT. The purchaser might try to claim implied rights – e.g. a limited right to reproduce the work for the purpose of saying “this is what I bought an NFT of” – but not major rights like copyright or reproduction without an explicit license. Though I am not your lawyer, so ask one if it’s important. Reply Leave a Reply Cancel reply Your email address will not be published. Required fields are marked * Comment Name * Email * Website Notify me of follow-up comments by email. Notify me of new posts by email. 
davidgerard-co-uk-4192 ---- News: Coinbase goes public, Bitcoin hashrate goes down, NFTs go down, proof-of-space trashes hard disk market – Attack of the 50 Foot Blockchain
Total sales: $133.20 Index Libra Shrugged: How Facebook Tried to Take Over the Money My cryptocurrency and blockchain commentary and writing for others Press coverage: Attack of the 50 Foot Blockchain Press coverage: Libra Shrugged Table of Contents The conspiracy theory economics of Bitcoin The DAO: the steadfast iron will of unstoppable code Search for: Main Menu News: Coinbase goes public, Bitcoin hashrate goes down, NFTs go down, proof-of-space trashes hard disk market 20th April 2021 - by David Gerard - 1 Comment If you’d like to get yourself copies of the books signed by the author, go to this post and see how much to PayPal me! You can support my work by signing up for the Patreon — $5 or $20 a month is like a few drinks down the pub while we rant about cryptos once a month. It really does help. [Patreon] The Patreon also has a $100/month Corporate tier — the number is bigger on this tier, and will look more impressive on your analyst newsletter expense account. [Patreon] And tell your friends and colleagues to sign up for this newsletter by email! [scroll down, or click here] The Bernard L. Madoff Memorial Coinbase Listing On 14 April 2021, Coinbase listed on NASDAQ as a public company! On the same day the Bitcoin price peaked, and Bernie Madoff — the patron saint of Bitcoin — died. Being a public company brings much closer attention to just what’s going on here, without the sort of dumb excuses that crypto bros will accept. Coinbase’s stock price is unsustainable — the starting price was at 79 times revenue, let alone earnings. For comparison, Palantir’s direct listing was at 19 times revenue. [FT Alphaville, free with registration] The stock price has behaved accordingly — and went from $409.62 on launch day, to $319.00 as I write this. [MarketWatch, archive] The other nice thing about a public listing is that the Coinbase stock price is a proxy for the price of Bitcoin — and you can’t short Bitcoin reliably, but you can certainly short stocks reliably. The Coinbase listing was thoroughly in the spirit of crypto offerings — insiders dumped a pile of their shares immediately, including the chief financial officer selling 100% of hers. Apologists swooped in to say that they only sold vested shares — which means, the shares they actually had, and not the shares they didn’t have. [Twitter; OpenInsider, archive] A lawyer — specifically, a professor of contracts — looks at the Coinbase terms of service, and specifically the requirement to take disputes to arbitration. She’s unconvinced the terms are even enforceable. [ContractsProf Blog] Martin Walker and Winnie Mosioma: “Many cryptocurrency exchanges are now making proud claims about their regulated status, but does ‘regulated’ really mean what investors think?” A review of sixteen crypto exchanges.  [LSE Business Review] Not so much a revolving door as a recirculating sewer — Brian Brooks, formerly of Coinbase, and then of the Office of the Comptroller of the Currency, becomes the CEO of Binance US. [CoinDesk]   Bitcoin is just Avon for men in their late 20s don’t at me — CryptoCharles (@CryptoCharles__) April 12, 2021   Hashrate go down, number follows Bitcoin is so robustly decentralised that a power outage in a single area — or, by some reports, in a single data centre — in Xinjiang took half of Bitcoin’s hashpower offline, across multiple “independent” mining pools. Decentralised! 
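As a rough sketch of why a hashpower drop slows everything down: Bitcoin only re-targets its difficulty every 2016 blocks, so until that happens, blocks simply arrive late in proportion to the lost hashrate. The figures below are the ones quoted in the next paragraph.

# Rough sketch: expected block interval scales inversely with hashrate until
# the next difficulty retarget (every 2016 blocks). Figures as quoted below.
TARGET_MINUTES_PER_BLOCK = 10   # Bitcoin's design target
hashrate_before_ehs = 220       # exahashes per second
hashrate_after_ehs = 165

interval = TARGET_MINUTES_PER_BLOCK * hashrate_before_ehs / hashrate_after_ehs
print(f"roughly {interval:.1f} minutes per block until difficulty adjusts")
# roughly 13.3 minutes per block -- fewer blocks means less transaction
# capacity, so the mempool backs up and fees rise, as described below.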
[NASDAQ News] An accident in a coal mine on 10 April didn’t directly stop the flow of electricity — but it did lead to widespread safety inspections in various industries. This included Bitcoin mining data centres being shut down. [Crypto Briefing] The Bitcoin hash rate dropped from 220 exahashes per second to 165 EH/s. The rate of new blocks slowed. The Bitcoin mempool — the backlog of transactions waiting to be processed — has filled. Transaction fees peaked at just over $50 average on 18 April. [Johoe’s Bitcoin Mempool Statistics, archive of 20 April 2021; Ycharts, archive of 20 April 2021] This turned a slight dip in the BTC price over the weekend into a crash — from $64,000 down to $51,000. It’s hard to pump the market if you can’t move your coins. Though that hasn’t stopped Tether doing two-billion-USDT pumps. I’m sure this is all 100% backed with something that won’t crash if you look at it funny. Binance finds itself suddenly unable to fulfil withdrawals of crypto — direct from them to you on the blockchain, without even being able to blame the legacy financial system. Affected tokens: BNB (BEP2 and BEP20), USDT (TRC20 and BEP20) BTC, XRP, DOGE, BUSD (BEP20). But I’m sure it’ll all be fine, and Binance definitely have all the cryptos they claimed to. [Twitter, Twitter] You can cash out any time you like! As long as nobody else is trying to.   who decided to call them NFTs instead of GIF Certificates??? — adam j. sontag (@ajpiano) April 18, 2021   Q. What do you call unsmokeable mushrooms? A. Non-Tokeable Fungi NFTs have a problem: number go … not up. It turns out there isn’t a secondary market for NFTs — nobody buys them after the pumpers have had their turn. [Bloomberg] “It’s not meaningful to characterize a concept as a financial bubble,” said Chris Wilmer, a University of Pittsburgh academic who co-edits a blockchain research journal, and thinks playing with words obscures that NFTs were a month-long bubble. Some news stories called NFTs a “stimulus-led fad”. Now, you might think that was a remarkable euphemism for a blatant pump by crypto bros to fake the appearance of a market. Popular NFT marketplace Rarible has been targeted by … scammers and malware! Unheard of in crypto. [Bleeping Computer] Brian Livingston’s newsletter Muscular Portfolios traces a bit more of the follow-the-money on Metakovan’s purchase of a $69 million NFT. [Muscular Portfolios] Kim Parker: Most artists are not making money off NFTs — and here are some graphs to prove it. [Medium]   Minty Bingo for when NFTs die and everyone comes back crying https://t.co/1aCPplzdui pic.twitter.com/j725JsoIzq — 🕯️synthwave void gremlin 🕯️ (@Lokinne) April 6, 2021   He is genius in allocation of space Proof-of-space crypto may do to hard disks and SSDs what proof-of-work altcoins did to video cards. Bram Cohen’s Chia network seems to already be leading to local shortages of large hard drives — prices in Hong Kong for the 4TB and above range are up to triple the usual price.[HKEPC, in Chinese; WCCFTech] How wonderfully energy-efficient is proof-of-space? Not so great — Shokunin tried out the client: “I tested this Chia thing overnight. Gave it 200GB plot and two CPU threads. After 10 hours it consumed 400GB temp space, didn’t sync yet, CPU usage is always 80%+. Estimated reward time is 5 months. This isn’t green, already being centralised on large waste producing servers.” [Twitter] David S. H. 
Rosenthal noted precisely this in 2018: “One aspect of the talk that concerned me was that Cohen didn’t seem well-informed about the landscape of storage … If the cloud companies chose to burn-in their new drives by using them for Proof of Space they would easily dominate the network at almost zero cost.” [blog post, 2018] Baby’s on fire CoinHive used to host crypto-miners on web pages — scraps of JavaScript that would use your electricity to mine for Monero. The service was also popular with web malware vendors. CoinHive shut down in 2019. The coinhive.com domain name is now owned by security expert Troy Hunt — if you go to a page that’s still trying to load the CoinHive script, you get a page that warns you about cryptos, web-based malware and cross-site scripting.  [Troy Hunt] There’s enough Bitcoin mining in China that the Bitcoin mining alone is a serious problem for the country to meet its CO2 targets. [Nature; The Economist] David S. H. Rosenthal on how Bitcoin mining can never be green — because the carbon footprint is the point. [blog post] Gothamist: Andrew Yang Wants To Turn NYC Into A Bitcoin Megahub. That Would Be Terrible For Climate Change. “Bitcoin advocates never talk about displacement because it makes the numbers sound bad,” I was quoted as saying. [Gothamist] The Times: The idea of bitcoin going green is laughable — hey Bitcoin, this is what attention from the mainstream looks like. [Times, paywalled, archive]   while y'all are over here getting excited over NFTs I'm making the original NFT pic.twitter.com/Jcf01LB0BZ — live tucker reaction (@vogon) April 5, 2021   ICO, ICO The SEC has sued LBRY over their 2016 ICO — and their still-ongoing offerings of tokens in a manner that, on the face of it, appears to be a ridiculously obvious unregistered offering of securities. The SEC investigation has been going on three years. LBRY decided to market more tokens last year, which may have been the last straw for the SEC. [SEC press release; complaint, PDF] LBRY has struck back! With a site called HELP LBRY SAVE CRYPTO. The FAQ on the site makes a string of assertions which are best answered “read the complaint”. [HELP LBRY SAVE CRYPTO] Paragon was an ICO for “blockchain technology in the cannabis industry”. It was, as usual, an illegal offering of unregistered securities. Paragon settled with the SEC in 2018 — they had to return everyone’s money, and pay a $250,000 fine. Shockingly, the pot coin guys turned out to be flakes — Paragon defaulted on its settlement. [WSJ, 2019, paywalled] Paragon’s founders have disappeared. Aggrieved investors tried to mount a class action last year. [CoinDesk, 2020] Only $175,000 of the SEC penalty was paid, and this will be distributed to Paragon’s investors. [Order, PDF] In SEC v. Ripple, the SEC has been denied access to eight years of personal financial information of Ripple executives Brad Garlinghouse and Christian Larsen. [Order, PDF] And Ripple has gained partial access to SEC discussions on whether XRP was a security, as compared to BTC or ETH. [CoinTelegraph] The independent Telegram messaging service, beloved of crypto pumpers, will be a thing of the past — Pavel Durov was so screwed by paying back the investors in Telegram’s disastrous ICO that he’s now planning to take the company public. According to a claimed leak from the investment bankers preparing the offering, Telegram plans to sell 10% to 25% of the company in a direct US listing, in the hope of $30 to 50 billion, likely in 2023. 
[CoinDesk; Vedomosti, in Russian] The SEC has published a “Framework for ‘Investment Contract’ Analysis of Digital Assets.” None of this should be news to anyone here, though that won’t stop the crypto bros yelling like stuck pigs. [SEC]   Economists may sometimes say that the sky is green. The average crypto person will fight you on a 67 tweet thread arguing the colour of the sky is wet and in any case inflation is making the Nash equilibrium Llama. — 𝖤𝖽𝗆𝗎𝗇𝖽 𝖲𝖼𝗁𝗎𝗌𝗍𝖾𝗋 (@Edmund_Schuster) March 9, 2021   My beautiful launderette The Bank for International Settlements has a new report: “Supervising cryptoassets for anti-money laundering.” BIS concludes: “the first priority should be implementing the FATF standards wherever that has not taken place yet. This is the absolute minimum needed to mitigate the risks posed by cryptoassets at a global level.” This isn’t saying anything controversial, or advocating anything that isn’t happening — but crypto bros wishfully thinking the FATF ratchet will stop tightening on crypto are incorrect. [BIS, PDF] More on Signal and MobileCoin — Dan Davies (author of Lying for Money, a book that everyone reading this blog should read — UK, US) points out that the FCA already considers doing financial business over WhatsApp, Telegram or Signal “self-evidently suspicious.” In real finance, the traders’ chat channels are logged for compliance — because, without that, traders reliably dive headlong into illegal market shenanigans. And often, even with compliance logging. [Financial News, paywalled; Twitter] Dan correctly describes the innovation of MobileCoin: “pass on illegal inside information, receive payment and launder the proceeds, all in the same app!” [Twitter] The IRS wants information on Kraken crypto exchange customers, and on Circle customers — the latter may include when they owned Poloniex. [Forbes; Justice Department] Turkey gives cryptocurrencies official legal recognition as a payments mechanism, regulating their use either directly or indirectly! All use of cryptos in payments is banned. [Reuters; Resmi Gazete, in Turkish]   Welcome to finance Twitter. Please select your Guy: -Programmer trading in IRA -Leftist sympathizer, detests coworkers -Mysterious furry rumored to hav $500M AUM, 40% returns every year somehow -PhD high energy theory retired at 34 -Guy with tinder name John-MBA,CFA like LinkedIn — diet divorced guy (@neoliberal_dad) November 7, 2019   Central banking, not on the blockchain The Bank of England and the UK Treasury are forming a task force on central bank digital currencies (CBDCs). One of the task force’s vague and ill-specified jobs will be to look into whether they can find a use case for this in the UK — where most cash-like spending is actually a card anyway. [Bank of England] The Bank has been terribly excited about the fabulous possibilities of blockchain since they first noticed Bitcoin in 2013 — they’ve put out a pile of speculative papers, but none with an actual use case. That’s fine — speculating on weird possibilities is one of the things a central bank research unit does. (See Libra Shrugged, chapter 15.) But starting at an idea without a use case is the problem with blockchains in general. 
The Wall Street Journal has a pretty generic article on China’s DC/EP, but it includes the detail that the latest trial includes e-CNY that expires — “Beijing has tested expiration dates to encourage users to spend it quickly, for times when the economy needs a jump-start.” So even if DC/EP turns into Alipay-but-it’s-PBOC, being run by the PBOC means they can do interesting things with it if they need to. [WSJ, paywalled] The New Republic: Cryptocurrencies Are the Next Frontier for the Surveillance State — on the surveillance potential of CBDCs. With quotes and ideas from Libra Shrugged. [The New Republic]   So far in 2021 #Bitcoin has lost 97% of its value verses #Dogecoin. The market has spoken. Dogecoin is eating Bitcoin. All the Bitcoin pumpers who claim Bitcoin is better than gold because its price has risen more than gold's must now concede that Dogecoin is better than Bitcoin. — Peter Schiff (@PeterSchiff) April 16, 2021   Things happen Dogecoin is having another price pump, firmly establishing DOGE as the true crypto store of value and BTC as a deprecated altcoin. The big pump coincided with 400 million Tethers being deployed. Everything I said in February in my Foreign Policy piece on Dogecoin applies twice as hard. [Reddit] Australian plans to put disability payments on a … blockchain! It’ll work great! Right? With a quote from me. This particular bad idea somewhat resembles the plan to put welfare spending onto a blockchain that the UK government put into its 2016 paper “Distributed Ledger Technology: Beyond Blockchain” [gov.uk, 2016], which I wrote up in chapter 11 of Attack of the 50 Foot Blockchain. [ZDNet] The Marvelous Money Machine! A children’s book for grown-ups. This is great. Pay what you want for the PDF. [Gumroad] Facebook’s WhatsApp Pay Brazil has still not been allowed to go live, in the version where it hooks into the national PIX retail real-time settlement system. [Reuters] Der Spiegel: the German COVID vaccine tracker was going to use five blockchains! It will now use none. Nice try, IBM. [Der Spiegel, archive] Crypto guy loses a bet, and tries to pay the bet using the Lightning Network. Hilarity ensues. [Twitter thread, archive] PayPal lets you make payments with crypto! If it’s crypto you already had in your PayPal crypto holdings — which you can’t top up by depositing crypto from outside, only by buying crypto on PayPal with money. [Reuters] Why do this? The CEO of PayPal is a massive coiner, but he also has to worry about things like “the law.” So this gets crypto into news headlines on the company dime. Living on video Here’s the third pocast I did last week: Dunc Tank with Duncan Gammie! Talking about Attack of the 50 Foot Blockchain and the crypto skeptic view. [Podbean] I went on NTD again to talk about crypto “market cap” and how it’s a meaningless number, starting 11:35. [YouTube] And to talk about the Coinbase listing, starts 13:43. [YouTube] My laptop webcam is still mediocre, but it was better than the other Zoom experts’ webcams. The Naked Scientists podcast has done an episode on “Bitcoin Decrypted: Cash, Code, Crime & Power”. This is going out through BBC Radio 5 Live in the UK, and Radio National in Australia. [my segment; whole podcast] Byline Times: “So who is behind the onward march of the crypto, nearly 13 years on from the credit crunch and the arrival of Bitcoin and the thousands of digital currencies in its slipstream? The short answer is: idealists, ideologues and opportunists.” With a quote from me. 
[Byline Times] Sydney Morning Herald: ‘Financial weapon’: Bitcoin becomes another factor in China-US contest — with quotes from me. [SMH] I spoke to CNet about altcoins. [CNet] Investor’s Business Daily: Bitcoin Hits Tipping Point After Skyrocketing On Investment Mania — with quotes from me. [Investor’s Business Daily]   learning how to regurgitate on demand like a frightened vulture for the next time a man tries to explain cryptocurrencies to me — Kat Maddox (@ctrlshifti) April 8, 2021   Your subscriptions keep this site going. Sign up today! Share this: Click to share on Twitter (Opens in new window) Click to share on Facebook (Opens in new window) Click to share on LinkedIn (Opens in new window) Click to share on Reddit (Opens in new window) Click to share on Telegram (Opens in new window) Click to share on Hacker News (Opens in new window) Click to email this to a friend (Opens in new window) Taggedaustraliabank of englandbernie madoffbinancebisbitcoinblockchainbrad garlinghousebrazilbrian brooksbrian livingstoncbdcchiachinachristian larsencirclecoinbasecoinhivedcepdogecoindunc tankibmicoirskim parkerkrakenlbrylightning networklinksmarvelous money machineminingmobilecoinnftparagonpaypalpixpodcastpoloniexproof of spaceraribleripplesecsignaltelegramtethertroy huntturkeyunited kingdomwhatsapp payxinjiang Post navigation Previous Article Stilgherrian: The 9pm Dumb Anarcho-Capitalist Blockchain Scams with David Gerard One Comment on “News: Coinbase goes public, Bitcoin hashrate goes down, NFTs go down, proof-of-space trashes hard disk market” D says: 21st April 2021 at 2:21 am Fred Flintstone And The Marvelous Money Machine https://www.amazon.com/dp/B002UZQ0ZC Reply Leave a Reply Cancel reply Your email address will not be published. Required fields are marked * Comment Name * Email * Website Notify me of follow-up comments by email. Notify me of new posts by email. This site uses Akismet to reduce spam. Learn how your comment data is processed. Search for: Click here to get signed copies of the books!   Get blog posts by email! Email Address Subscribe Support this site on Patreon! Hack through the blockchain bafflegab: $5/month for early access to works in progress! $20/month for early access and even greater support! $100/month corporate rate, for your analyst newsletter budget! Buy the books! 
developer-twitter-com-2745 ---- Expansions | Twitter Developer Expansions Overview With expansions, developers can expand objects referenced in the payload. Objects available for expansion are referenced by ID. For example, the referenced_tweets.id and author_id fields returned in the Tweets lookup payload can be expanded into complete objects. If you would like to request fields related to the user that posted that Tweet, or the media, poll, or place that was included in that Tweet, you will need to pass the related expansion query parameter in your request to receive that data in your response. When including an expansion in your request, we will include that expanded object's default fields within the same response. It helps return additional data in the same response without the need for separate requests. If you would like to request additional fields related to the expanded object, you can include the field parameter associated with that expanded object, along with a comma-separated list of fields that you would like to receive in your response. Please note fields are not always returned in the same order they were requested in the query.
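The same request can be made from code. Here is a sketch using Python's requests library against the Tweet lookup endpoint used in the samples that follow; BEARER_TOKEN is a placeholder for your own app's Bearer Token.

import requests

BEARER_TOKEN = "..."  # placeholder: your app's Bearer Token
tweet_id = "1212092628029698048"  # the Tweet used in the samples below

response = requests.get(
    f"https://api.twitter.com/2/tweets/{tweet_id}",
    headers={"Authorization": f"Bearer {BEARER_TOKEN}"},
    params={"expansions": "attachments.media_keys,referenced_tweets.id,author_id"},
)
payload = response.json()

# Expanded objects arrive alongside the requested Tweet, nested under "includes".
print(payload["data"]["text"])
print(list(payload.get("includes", {}).keys()))  # e.g. ['media', 'users', 'tweets']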
{ "data": { "attachments": { "media_keys": [ "16_1211797899316740096" ] }, "author_id": "2244994945", "id": "1212092628029698048", "referenced_tweets": [ { "type": "replied_to", "id": "1212092627178287104" } ], "text": "We believe the best future version of our API will come from building it with YOU. Here’s to another great year with everyone who builds on the Twitter platform. We can’t wait to continue working with you in the new year. https://t.co/yvxdK6aOo2" } } The Tweet payload above contains some reference IDs for complementary objects we can expand on. We can expand on attachments.media_keys to view the media object, author_id to view the user object, and referenced_tweets.id to view the Tweet object the originally requested Tweet was referencing. Expanded objects will be nested in the "includes" object, as can be seen in the sample response below.   Available expansions in a Tweet payload Expansion Description author_id Returns a user object representing the Tweet’s author referenced_tweets.id Returns a Tweet object that this Tweet is referencing (either as a Retweet, Quoted Tweet, or reply) in_reply_to_user_id Returns a user object representing the Tweet author this requested Tweet is a reply of attachments.media_keys Returns a media object representing the images, videos, GIFs included in the Tweet attachments.poll_ids Returns a poll object containing metadata for the poll included in the Tweet geo.place_id Returns a place object containing metadata for the location tagged in the Tweet entities.mentions.username Returns a user object for the user mentioned in the Tweet referenced_tweets.id.author_id Returns a user object for the author of the referenced Tweet   Available expansion in a user payload Expansion Description pinned_tweet_id Returns a Tweet object representing the Tweet pinned to the top of the user’s profile   Expanding the media, Tweet, and user objects In the following request, we are requesting the following expansions to include alongside the default Tweet fields.  Be sure to replace $BEARER_TOKEN with your own generated bearer token. attachments.media_keys referenced_tweets.id author_id   Sample Request   curl 'https://api.twitter.com/2/tweets/1212092628029698048?expansions=attachments.media_keys,referenced_tweets.id,author_id' --header 'Authorization: Bearer $BEARER_TOKEN' Code copied to clipboard   Sample Response { "data": { "attachments": { "media_keys": [ "16_1211797899316740096" ] }, "author_id": "2244994945", "id": "1212092628029698048", "referenced_tweets": [ { "type": "replied_to", "id": "1212092627178287104" } ], "text": "We believe the best future version of our API will come from building it with YOU. Here’s to another great year with everyone who builds on the Twitter platform. We can’t wait to continue working with you in the new year. https://t.co/yvxdK6aOo2" }, "includes": { "media": [ { "media_key": "16_1211797899316740096", "type": "animated_gif" } ], "users": [ { "id": "2244994945", "name": "Twitter Dev", "username": "TwitterDev" } ], "tweets": [ { "author_id": "2244994945", "id": "1212092627178287104", "referenced_tweets": [ { "type": "replied_to", "id": "1212092626247110657" } ], "text": "These launches would not be possible without the feedback you provided along the way, so THANK YOU to everyone who has contributed your time and ideas. Have more feedback? 
Let us know ⬇️ https://t.co/Vxp4UKnuJ9" } ] } } Expanding the poll object In the following request, we are requesting the following expansions to include alongside the default Tweet fields: attachments.poll_ids   Sample Request curl 'https://api.twitter.com/2/tweets/1199786642791452673?expansions=attachments.poll_ids' --header 'Authorization: Bearer $BEARER_TOKEN' Code copied to clipboard Sample Response { "data": { "attachments": { "poll_ids": [ "1199786642468413448" ] }, "id": "1199786642791452673", "text": "C#" }, "includes": { "polls": [ { "id": "1199786642468413448", "options": [ { "position": 1, "label": "“C Sharp”", "votes": 795 }, { "position": 2, "label": "“C Hashtag”", "votes": 156 } ] } ] } } Expanding the place object In the following request, we are requesting the following expansions to include alongside the default Tweet fields: geo.place_id   Sample Request curl 'https://api.twitter.com/2/tweets/:ID?expansions=geo.place_id’ --header 'Authorization: Bearer $BEARER_TOKEN' Code copied to clipboard Sample Response { "data": { "geo": { "place_id": "01a9a39529b27f36" }, "id": "ID", "text": "Test" }, "includes": { "places": [ { "full_name": "Manhattan, NY", "id": "01a9a39529b27f36" } ] } } Next step Learn how to use Fields with Expansions Review the different data objects available with Twitter API v2 Was this document helpful? Thank you for the feedback. We’re really glad we could help! Thank you for the feedback. How could we improve this document? This page is missing information. The information was hard to follow or confusing. There is inaccurate information. There is a broken link or typo. Specific Feedback Submit feedback Skip Thank you for the feedback. Your comments will help us improve our documents in the future. Developer agreement, policy & terms Follow @twitterdev Subscribe to developer news Twitter platform Twitter.com Status Card validator Privacy Center Transparency Center Twitter, Inc. About the company Twitter for Good Company news Brand toolkit Jobs and internships Investors Help Help Center Using Twitter Twitter Media Ads Help Center Managing your account Safety and security Rules and policies Contact us Developer resources Developer home Documentation Forums Communities Developer blog Engineering blog Developer terms Business resources Advertise Twitter for business Resources and guides Twitter for marketers Marketing insights Brand inspiration Twitter Data Twitter Flight School © 2021 Twitter, Inc. Cookies Privacy Terms and conditions Language Developer By using Twitter’s services you agree to our Cookies Use. We use cookies for purposes including analytics, personalisation, and ads. OK This page and certain other Twitter sites place and read third party cookies on your browser that are used for non-essential purposes including targeting of ads. Through these cookies, Google, LinkedIn and Demandbase collect personal data about you for their own purposes. Learn more. Accept Decline developer-twitter-com-2901 ---- Fields | Twitter Developer Fields Introduction The Twitter API v2 endpoints are equipped with a new set of parameters called fields, which allows you to select just the data that you want from each of our objects in your endpoint response. For example, if you only need to retrieve a Tweet’s created date, or a user’s bio description, you can specifically request that data to return with a set of other default fields without the full set of fields that associate with that data object. 
This provides a higher degree of customization by enabling you to only request the fields you require depending on your use case. Default fields will always be returned in the response. With the fields query parameters, you can request additional fields of the object to include in the response. This is done by specifying one of the below parameters, including a comma-separated list of fields that you would like to return. Each object has its own parameter which is used to specifically request the fields that are associated with that object. Here are the different fields parameters that are currently available: Tweet → tweet.fields User → user.fields Media → media.fields Poll → poll.fields Place → place.fields When using an endpoint that primarily returns a particular object, simply use the matching field parameter and specify the field(s) desired in a comma-separated list as the value to that parameter to retrieve those fields in the response.   For example, if you are using the GET /tweets/search/recent endpoint, you will primarily receive Tweet objects in that response. Without specifying any fields parameters, you will just receive the default values, id and text. If you are interested in receiving the public metrics of the Tweets that are returned in the response, you will want to include the tweet.fields parameter in your request, with public_metrics set as the value.  This request would look like the following. If you would like to use this request, make sure to replace $BEARER_TOKEN with your Bearer Token and send it using your command line tool. curl --request GET \ --url 'https://api.twitter.com/2/tweets/search/recent?query=from%3Atwitterdev&tweet.fields=public_metrics' \ --header 'Authorization: Bearer $BEARER_TOKEN' Code copied to clipboard If you send this request in your terminal, then each of the Tweets that return will include the following fields: { "data": { "id": "1263150595717730305", "public_metrics": { "retweet_count": 12, "reply_count": 14, "like_count": 49, "quote_count": 7 }, "text": "Do you 👀our new Tweet settings?\n\nWe want to know how and why you’d use a feature like this in the API. Get the details and let us know what you think👇\nhttps://t.co/RtMhhfAcIB https://t.co/8wxeZ9fJER" } } If you would like to retrieve a set of fields from a secondary object that is associated with the primary object returned by an endpoint, you will need to include an additional expansions parameter.  For example, if you were using the same GET search/tweets/recent endpoint as earlier, and you wanted to retrieve the author's profile description, you will have to pass the expansions=author_id and user.fields=description with your request. Here is an example of what this might look like. If you would like to try this request, make sure to replace the $BEARER_TOKEN with your Bearer Token before pasting it into your command line tool. curl --request GET \ --url 'https://api.twitter.com/2/tweets/search/recent?query=from%3Atwitterdev&tweet.fields=public_metrics&expansions=author_id&user.fields=description' \ --header 'Authorization: Bearer $BEARER_TOKEN' Code copied to clipboard If you specify this in the request, then each of the Tweets that deliver will have the following fields, and the related user object's default and specified fields will return within includes. The user object can be mapped back to the corresponding Tweet(s) by matching the tweet.author_id and users.id fields.   
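To show that mapping in practice, here is a sketch in Python using the requests library. It issues the same recent search request as the curl example above (BEARER_TOKEN is a placeholder) and joins each Tweet to its expanded author object via author_id.

import requests

BEARER_TOKEN = "..."  # placeholder: your app's Bearer Token

response = requests.get(
    "https://api.twitter.com/2/tweets/search/recent",
    headers={"Authorization": f"Bearer {BEARER_TOKEN}"},
    params={
        "query": "from:twitterdev",
        "tweet.fields": "public_metrics",
        "expansions": "author_id",
        "user.fields": "description",
    },
)
payload = response.json()

# Index the expanded user objects by id, then look up each Tweet's author_id.
users_by_id = {u["id"]: u for u in payload.get("includes", {}).get("users", [])}
for tweet in payload.get("data", []):
    author = users_by_id.get(tweet["author_id"], {})
    print(author.get("username"), tweet["public_metrics"]["like_count"], tweet["text"][:60])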
{ "data": [ { "id": "1263150595717730305", "author_id": "2244994945", "text": "Do you 👀our new Tweet settings?\n\nWe want to know how and why you’d use a feature like this in the API. Get the details and let us know what you think👇\nhttps://t.co/RtMhhfAcIB https://t.co/8wxeZ9fJER", "public_metrics": { "retweet_count": 12, "reply_count": 13, "like_count": 51, "quote_count": 7 } } ], "includes": { "users": [ { "id": "2244994945", "username": "TwitterDev", "description": "The voice of the #TwitterDev team and your official source for updates, news, and events, related to the #TwitterAPI.", "name": "Twitter Dev" } ] } } Bear in mind that you cannot request specific subfields (for example, public_metrics.retweet_count). All subfields will be returned when the top-level field (public_metrics) is specified. We have listed all possible fields that you can request in each endpoints' API reference page's parameters table.  A full list of fields are listed in the object model. To expand and request fields on an object that is not that endpoint’s primary resource, use the expansions parameter with fields. Next step Learn how to use Fields with Expansions Review the different data objects available with Twitter API v2 Make your first request with Fields and Expansions Was this document helpful? Thank you for the feedback. We’re really glad we could help! Thank you for the feedback. How could we improve this document? This page is missing information. The information was hard to follow or confusing. There is inaccurate information. There is a broken link or typo. Specific Feedback Submit feedback Skip Thank you for the feedback. Your comments will help us improve our documents in the future. Developer agreement, policy & terms Follow @twitterdev Subscribe to developer news Twitter platform Twitter.com Status Card validator Privacy Center Transparency Center Twitter, Inc. About the company Twitter for Good Company news Brand toolkit Jobs and internships Investors Help Help Center Using Twitter Twitter Media Ads Help Center Managing your account Safety and security Rules and policies Contact us Developer resources Developer home Documentation Forums Communities Developer blog Engineering blog Developer terms Business resources Advertise Twitter for business Resources and guides Twitter for marketers Marketing insights Brand inspiration Twitter Data Twitter Flight School © 2021 Twitter, Inc. Cookies Privacy Terms and conditions Language Developer By using Twitter’s services you agree to our Cookies Use. We use cookies for purposes including analytics, personalisation, and ads. OK This page and certain other Twitter sites place and read third party cookies on your browser that are used for non-essential purposes including targeting of ads. Through these cookies, Google, LinkedIn and Demandbase collect personal data about you for their own purposes. Learn more. Accept Decline developer-twitter-com-5357 ---- Guide to the future of the Twitter API | Twitter Developer Early Access Guide to the future of the Twitter API Overview Support for diverse use cases New access options to get started and grow Bringing it all together Evolution of the developer portal Support for OAuth 2 and new permissions Versioning Rolling out the new Twitter API At Twitter, our purpose is to serve the public conversation and we believe developers play a critical role in achieving this. Twitter wouldn’t be what it is today if it weren’t for you. Your creativity and work with our API make Twitter, and the world, a better place.  
We’re building the next generation of the Twitter API to better serve our diverse community of developers. Our API gives you the ability to learn from and engage with the conversation on Twitter, and we want to give you tools to further uncover, build on, and share the value of this conversation with the world.

To serve our diverse ecosystem, we plan to introduce a few new concepts and components to the platform. Consider this page your guide to the future of the Twitter API. As we build, we’ll update our product roadmap and will keep you updated here about the details of our plans. And with our new API foundation, we'll be better able to incorporate your feedback and make improvements along the way.

If you missed our recent announcements, make sure to read our blog posts introducing the new and improved Twitter API and the Academic Research product track. If you have questions, feedback, or suggestions about any of the following, let us know. Share your feedback. Last updated: January 26, 2021

Support for diverse use cases
Understanding the global conversation · Engaging with people on Twitter · Improving Twitter

We’ve always known that our developer ecosystem is diverse, but our API has long taken a one-size-fits-all approach. You helped us understand the use cases you have when you work with the Twitter API, and we're building the new API to help support these use cases—releasing new functionality in phases, each supporting the core use cases we’ve heard from you.

Understanding the global conversation
Our first few releases will be focused on making it easier to understand the public conversation. One of the most common reasons developers use the Twitter API is to listen to and analyze the conversation happening on Twitter. We’re not done yet: in the coming months, we will continue to release additional endpoints to help you understand the conversation, discover insights, and make informed decisions. Explore the Listen & Analyze use case.

Engaging with people on Twitter
People come to Twitter to connect and interact with each other, with their favorite teams, celebrities, or musicians, with world leaders, with their communities, with brands, with fun bots, and more. Developers play a critical role in creating content and engaging in various ways on the platform. In the coming months, we’ll release a number of endpoints (including new versions of endpoints for creating Tweets) to Early Access to support these use cases.

Improving Twitter
Developers have played a key part in making Twitter healthier and more engaging since the beginning. Your love for Twitter shows through your work, and we want to make it easy for you to channel your passion for Twitter into actively making it better. We want to empower you to give people more control over their experience on Twitter. The new Academic Research product track represents a crucial beginning to this process, as researchers' work and discoveries can help make the world a better place, and even help improve experiences on Twitter. Although we started with academic researchers, keep an eye open for new endpoints, guidance, and tools to fuel this kind of work across our Standard, Academic Research, and Business product tracks.
New access options to get started and grow
Access levels · Product tracks · New license terms · Supporting the health of the public conversation

Your feedback helped us see the importance of making the new Twitter API more flexible and scalable to fit your needs. With the new API, we are building new access options and product tracks so more developers can find options to support their use cases.

Access levels
Within the new Twitter API, we intend to introduce three core access levels that make it easy to grow and scale:
Basic access: Free, default access to endpoints for developers with an approved developer account. Based on research over the past few years, we expect that the large majority of developers (>80%) will find the access they need within this tier to get started and build something awesome.
Elevated access: Increased access to collections of relevant endpoints, including access to more Tweets, increased rate limits, and more advanced reliability features.
Custom access: While the majority of developers’ goals will be met by Basic and Elevated access, for those who need more, we can help get you what you need.

Product tracks
We love the incredible diversity of developers who use our API. And we want to provide a platform that serves many types of developers with access and tools that fit their use cases, continues to offer free and open access for developers, and provides a dedicated and supported path for both commercial and non-commercial services built with the API. To accomplish these goals, we're introducing new, distinct product tracks to better serve different groups of developers and provide them with a tailored experience and support, a range of relevant access levels, and appropriate pricing (where applicable). Developers who already have a developer account will start in the Standard product track and will be able to apply for others. New developers will be able to apply for the tracks that are relevant to them.
Standard: The default product track for most developers, including those building something for fun, for a good cause, or to learn or teach.
Academic Research: Academic researchers are one of the largest groups looking to understand what’s happening in the public conversation. Within this track, qualified academic researchers will get increased levels of access to a relevant collection of endpoints, including a new full-archive search endpoint. We’re also providing resources for researchers to make it easier to conduct academic research with the Twitter API.
Business: Developers build businesses on the Twitter API. And we love that their products help other people and businesses better understand and engage with the conversation on Twitter. This track will include the option for Elevated access to relevant collections of endpoints, or Custom access.
A key part of our strategy is our commitment to working with a diverse set of developers to enable their success. Some developers, including those building client-like applications, deserve more clarity in how to operate with the new Twitter API. Though it is too early to share any specifics, reaching this clarity may require a fresh look at policy and product access details that affect them. We’re looking ahead and seek to determine how best to work with these groups to serve the public conversation together.
New license terms
We’re designing these tracks with products, pricing, and access level options to better serve the unique needs of different types of developers. To support this, certain product tracks are reserved for non-commercial purposes only. We’ve therefore introduced new commercial use terms to the Developer Agreement that govern how the API can be used in product tracks designated as non-commercial. The Academic Research product track is the first product track we’ve released that is reserved for non-commercial purposes. As we continue introducing additional product tracks, namely the Business product track, we will provide more information about serving commercial use of the Twitter API. For now, commercial use cases are supported through Standard Basic access or the v1.1 Twitter API. Know that if you are using the API for commercial purposes, this does not necessarily mean that you are required to pay for access (for example, Basic access on any of our product tracks will be available for free). We want to continue learning more from you about this approach. If you’re interested, let us know your thoughts on these plans with this short survey.

Supporting the health of the public conversation
As with the introduction of our developer application a few years ago, we are committed to a developer platform that works in service of the overall health of the conversation on Twitter. Simultaneously, we are committed to a developer platform that is open and serves diverse needs. The introduction of these new access levels and product tracks allows us to offer more options and access with increased trust, as well as more controls to help address platform abuse and to keep the Twitter service safe and secure for everyone. Our hope is that these paths provide even more clarity about how to adhere to our Developer Terms and make it easier to scale your use of the Twitter API for years to come.

Bringing it all together
With work underway and several new access levels and product tracks planned, we want to share an illustration of how they may all come together (Overview · Standard · Academic Research · Business). This is an evolving vision, and it will take some time before all of these access options are available. We hope this will help you understand which path may eventually make sense for you. We want to continue learning more from you to be sure our approach is right. If you’re interested, please share your feedback with us about these plans.

Evolution of the developer portal
Over the last few months, all developers saw a new interface when they logged in to their developer accounts. This new developer portal is the home base for managing your use of the new Twitter API, with continual improvements and new features to help you build. We’re planning to create new ways to manage access for multiple development environments, to help you rethink how you manage a team of collaborators, track and understand your API usage, move up and down between access levels, and find resources to help you be successful. If you have other ideas you’d like to see, let us know and share your feedback! We’ve also introduced "Projects" within the developer portal as a way to organize your work and manage your access to the Twitter API for each use case you’re building with it.
We’re starting with just one Project per developer account for the first Early Access release, so you can begin using Basic access to the new Twitter API. With the recent release of the Academic Research product track, eligible researchers can now add a Project in the Academic Research product track. They may also create or maintain another Project in the Standard product track for a distinct and separate use case. As we roll out further access levels and product tracks, you’ll be able to create multiple Projects for different use cases. We plan to support separate production, staging, and development Apps within a Project as distinct environments to help you better manage your integration and to make it easier for a team to manage a Project and its Apps. For now, you can still use your existing, standalone Apps and create new ones if you need to; eventually, all API access will be through Projects.

Support for OAuth 2 and new permissions
We are working to add support for OAuth 2. In doing so, we intend to improve the developer experience with more granular permissions, giving you more control and serving the expectations of people authorizing your application. It will be some time before we make this available; however, this is a path we are actively pursuing. We'll share more in the future about how to test this. Share your feedback and suggestions as we build.

Versioning
We expect to launch new major API versions more often than we have in the past (the last one was 8 years ago!), but we'll still make it a goal to avoid doing so unless there's a compelling reason. We don't expect to make major version updates more often than once every 12 months, and when we do, it will be our goal to support the previous version for at least 1 year until retirement. Between major version changes, you’ll continue to see us add non-breaking improvements as they’re ready. Our goal is that you will only need to update your integration if you’d like to take advantage of new functionality.

Rolling out the new Twitter API
Early Access · Deprecations and migrations · Expected sequence of events

Early Access
In August 2020, we released Early Access to the new Twitter API v2. Eventually, the new API will fully replace the v1.1 standard, premium, and enterprise APIs. Before that can happen, we have more to build. Since our initial release, we’ve added a handful of new features, including the new hide replies endpoint, the user Tweet timeline and user mention timeline endpoints, and the new follows lookup endpoints. Additionally, we launched the Academic Research product track on the new Twitter API. This specialized track for researchers offers higher levels of access, free access to full-archive search, and other v2 endpoints for approved developers, as well as enhanced features and functionality to get more precise and complete data for analyzing the public conversation. Please note that this product track does have increased eligibility requirements. Academics with a specific research use case for Twitter data can now apply for the Academic Research product track. For all other developers, we continue to encourage use of Early Access. Everything we’ve released and will continue releasing into Early Access is fully supported and ready for you to build in production.
Once we've completed releasing new versions of core functionality, we’ll move the new API version (v2) into the General Availability (GA) phase and make it the new default version of the Twitter API. To learn more, visit the Early Access overview. For a preview of what’s to come, and what we have planned, check out our expected sequence of events below! Get started with Early Access: if you don't yet have a developer account, apply to get started.

Deprecations and migrations
We know migrations can be challenging, and we’re committed to doing our part to make migrating to our new API as easy as we can. Whether you use the current standard v1.1, premium, or enterprise endpoints — or a combination — you likely won’t need to migrate for some time. Our intent is to provide plenty of migration time (along with resources to help) when we deprecate existing endpoints. Our goal is to wait until we have completed releasing new versions of core functionality, but there may be exceptions where we need to turn off some legacy services sooner, including:
Standard v1.1 statuses/sample and statuses/filter endpoints. Later this year we plan to announce a shorter deprecation window for these two endpoints. The replacements for these endpoints are available in Early Access: the filtered stream and sampled stream endpoints. We're giving you this heads-up so you can begin exploring these replacements now. For specific requests or to provide your thoughts on this update, please share your feedback.
For those who want to get ahead and migrate early, check out our migration resources for the Twitter API v2.

Expected sequence of updates
The effort to replace the v1.1, premium, and enterprise APIs will take some time. To help you plan, we want to share a rough outline of the order in which we hope to roll out changes (Timeline · Endpoints · Product tracks · Deprecation). Should our plans evolve, we will do our best to keep it updated here. To receive notification about the progress of specific items, sign up to "watch" any cards within our product roadmap.

Stay tuned! We will continue to evolve and improve our plans as we learn. Have specific thoughts you'd like to share? We're always listening, so please share your feedback. We'd love to hear from you!
digitallibrarian-org-4082 ---- The Digital Librarian – Information. Organization. Access.

Recent posts:
• Libraries and the state of the Internet (June 27, 2016) - digital libraries
• Meaningful Web Metrics (January 3, 2016) - Web metrics
• Site migrated (October 1, 2012) - blog
• The new iPad (March 18, 2012) - Apple, Hardware, iPad
• 3rd SITS Meeting – Geneva (August 3, 2011) - Conferences, digital libraries, workshops; tagged: digital libraries, DLF, SITS
• David Lewis’ presentation on Collections Futures (March 2, 2011) - eBooks, Librarianship; tagged: collections, future, provisioning
• Librarians are *the* search experts… (August 19, 2010) - Librarianship
• What do we want from Discovery? Maybe it’s to save the time of the user…. (August 18, 2010)
• Putting a library in Starbucks (August 12, 2010) - digital libraries, Librarianship; tagged: coffee, digital library, library, monopsony, starbucks, upsell
• 1 week of iPad (April 14, 2010) - Apple, eBooks, Hardware, iPad; tagged: Apple, digital lifestyle, iPad, mobile, tablet

digitallibrarian-org-6938 ---- The Digital Librarian (http://digitallibrarian.org) - Information. Organization. Access. - feed last updated Mon, 27 Jun 2016 19:04:01 +0000

Libraries and the state of the Internet (Mon, 27 Jun 2016 12:04:01 +0000) - http://digitallibrarian.org/?p=229

Mary Meeker presented her 2016 Internet Trends report earlier this month. If you want a better understanding of how tech and the tech industry is evolving, you should watch her talk and read her slides.

This year’s talk was fairly time constrained, and she did not go into as much detail as she has in years past. That being said, there is still an enormous amount of value in the data she presents and the trends she identifies via that data.

Some interesting takeaways:

  • The growth in total number of internet users worldwide is slowing (the year-to-year growth rate is flat; overall growth is around 7% new users per year)
  • However, growth in India is still accelerating, and India is now the #2 global user market (behind China; USA is 3rd)
  • Similarly, there is a slowdown in the growth of the number of smartphone users and number of smartphones being shipped worldwide (still growing, but at a slower rate)
  • Android continues to gain market share; Android devices continue to be significantly less costly than Apple devices.
  • Overall, there are opportunities for businesses that innovate / increase efficiency / lower prices / create jobs
  • Advertising continues to demonstrate strong growth; advertising efficacy still has a ways to go (internet advertising is effective and can be even more so)
  • Internet as distribution channel continues to grow in use and importance
  •  Brand recognition is increasingly important
  • Visual communication channel usage is increasing – Generation Z relies more on communicating with images than with text
  • Messaging is becoming a core communication channel for business interactions in addition to social interactions
  • Voice on mobile rapidly rising as important user interface – lots of activity around this
  • Data as platform – important!

So, what kind of take-aways might be most useful to consider in the library context? Some top-of-head thoughts:

  • In the larger context of the Internet, Libraries need to be more aggressive in marketing their brand and brand value. We are, by nature, fairly passive, especially compared to our commercial competition, and a failure to better leverage the opportunity for brand exposure leaves the door open to commercial competitors.
  • Integration of library services and content through messaging channels will become more important, especially with younger users. (Integration may actually be too weak a term; understanding how to use messaging inherently within the digital lifestyles of our users is critical)
  • Voice – are any libraries doing anything with voice? Integration with Amazon’s Alexa voice search? How do we fit into the voice as platform paradigm?

One parting thought, that I’ll try to tease out in a follow-up post: Libraries need to look very seriously at the importance of personalized, customized curation of collections for users, something that might actually be antithetical to the way we currently approach collection development. Think Apple Music, but for books, articles, and other content provided by libraries. It feels like we are doing this in slices and pieces, but that we have not yet established a unifying platform that integrates with the larger Internet ecosystem.

Meaningful Web Metrics (Sun, 03 Jan 2016 20:10:52 +0000) - http://digitallibrarian.org/?p=207

This article from Wired magazine is a must-read if you are interested in more impactful metrics for your library’s web site. At MPOE, we are scaling up our need for in-house web product expertise, but regardless of how much we invest in terms of staffing, it is likely that the amount of requested web support will always exceed the amount of resourcing we have for that support. Leveraging meaningful impact metrics can help us understand the value we get from the investment we make in our web presence, and more importantly help us define what types of impact we want to achieve through that investment. This is no easy feat, but it is good to see that others in the information ecosystem are looking at the same challenges.

Site migrated (Mon, 01 Oct 2012 20:25:53 +0000) - http://digitallibrarian.org/?p=154

Just a quick note – digitallibrarian.org has been migrated to a new server. You may see a few quirks here and there, but things should be mostly in good shape. If you notice anything major, send me a Challah. Really. A nice bread. Or just an email. Your choice. 🙂

The new iPad (Sun, 18 Mar 2012 16:20:55 +0000) - http://digitallibrarian.org/?p=141

I decided that it was time to upgrade my original iPad, so I pre-ordered a new iPad, which arrived this past Friday. After a few days, here are my initial thoughts / observations:

  • Compared to the original iPad, the new iPad is a huge improvement. Much zippier, feels lighter, and of course the display is fantastic.
  • I’ve just briefly tried the dictation feature, and though I haven’t used it extensively yet, the accuracy seems pretty darned good. I wonder if a future update will support Siri?
  • The beauty of the display cannot be overstated – crisp, clear (especially for someone with aging eyes)
  • I purchased a 32 GB model with LTE, but I have not tried the cell network yet. I did see 4G show up, so I’m hoping that Tucson indeed has the newer network.
  • Not really new, but going from the original iPad to the new iPad, I really like the smart cover approach. Ditto with the form factor.
  • Again, not specific to the new model, the ability to access my music, videos, and apps via iCloud means that I can utilize the storage on the iPad more effectively.
  • All-in-all, I can see myself using the new iPad consistently for a variety of tasks, not just for consuming information. Point-in-fact, this post was written with the new iPad.

    3rd SITS Meeting – Geneva (Wed, 03 Aug 2011 09:38:19 +0000) - http://digitallibrarian.org/?p=130

    Back in June I attended the 3rd SITS (Scholarly Infrastructure Technical Summit) meeting, held in conjunction with the OAI7 workshop and sponsored by JISC and the Digital Library Federation. This meeting, held in lovely Geneva, Switzerland, brought together library technologists and technology leaders from North America, Europe, Australia, and Asia for the purpose of exploring common technology and technology-related issues that crossed our geographic boundaries.

    This is the first SITS meeting that I attended – prior to this meeting, there were two other SITS meetings (one in London and one in California). As this SITS meeting was attached to the OAI7 conference, it brought together a group of stakeholders whose roles in their organizations spanned from technology implementors to technology strategists and decision makers. From having chatted with some of the folks who had attended previous SITS meetings, the attendees at those meetings tended to weigh heavily on the technology implementer / developer side, while this particular instance of SITS had a broader range of discussion that, while centered on technology, also incorporated much of the context to which technology was being applied. For me, that actually made this a more intriguing and productive discussion, as I think that while there are certainly a great variety of strictly technical issues with which we grapple, what often gets lost when talking semantic web, linked data, digital preservation, etc. is the context and focus of the purpose of deploying said technology. So, with that particular piece of context, I’ll describe some of the conversation that occurred at this particular SITS event.

    Due to the schedule of OAI7, this SITS meeting was held in two parts – the afternoon of 24 June, and the morning of 25 June. For the first session, the group met in one of the lecture rooms at the conference venue, and this worked out quite nicely. SITS uses an open agenda / open meeting format, which allows the attendees to basically nominate and elect the topics of discussion for the meeting. After initial introductions, we began proposing topics. I tried to capture as best I could all of the topics that were proposed, though I might have missed one or two:

    * stable links for linked data vs. stable bitstreams for preservation
    * authority hubs / clustered IDs / researcher IDs / ORCID in DSpace
    * effective synchronization of digital resources
    * consistency and usage of usage data
    * digital preservation architecture – integration of tape-based storage and other storage environments (external to the library)
    * integration between repositories and media delivery (i.e. streaming) – particularly to access control enforcement
    * nano publications and object granularity
    * pairing storage with different types of applications
    * linking research data to scholarly publications to faculty assessment
    * well-behaved document
    * research impacts and outputs
    * linked open data: from vision to deployment
    * Relationship between open linked data and open research data
    * Name disambiguation

    Following the process, we took the brainstormed list above and voted on which topic to discuss first. The first topic chosen was researcher identities, which began with discussion around ORCID, a project that currently has reasonable mindshare behind it. While ORCID has a lot of backers, it is not clear whether a singular researcher ID is a feasible approach, though I believe we’ll discover the answer based on the success (or not) of the project. In general, I think most of the attendees will be paying attention to ORCID, but that a wait-and-see approach is also likely, as there are many, many issues around researcher IDs that still need to be worked through.

    The next topic was the assessment of research impacts and outputs. This particular topic was not particularly technically focused, but did bring about some interesting discussion about the impact of assessment activities, both positive and negative.

    The next topic, linking research data to scholarly publications to faculty assessment, was a natural progression from the previous topic, and much of the discussion revolved around how to support such relationships. I must admit that while I think this topic is important, I didn’t feel that the discussion really resolved any of the potential issues with supporting researchers in linking data to publications (and then capturing this data for assessment purposes). What is clear is that the concept of publishing data, especially open data, is not as straightforward as one would hope when you get into the details, such as where to publish data, how to credit such publication, how the data is maintained, etc. There is a lot of work to be done here.

    Next to be discussed was the preservation of data and software. It was brought up that the sustainability and preservation of data, especially open data, was somewhat analogous to the sustainability and preservation of software, in that both required a certain amount of ongoing, active work in order to ensure that both data and software were continually usable. It is also clear that much data requires the proper software in order to be usable, and therefore the issues of software and data sustainability and preservation are, to my mind, interwoven.

    The group then moved to a brief discussion of the harvesting and use of usage data. Efforts such as COUNTER and PIRUS2 were mentioned. The ability to track data in a way that balances anonymity and privacy against added value back to the user was discussed; the fact that usage data can be leveraged to provide better services back to users was a key consideration.

    The next discussion topic was influenced by the OAI7 workshop. The issue of the synchronisation of resources was discussed; during OAI7, there was a breakout session that looked at the future of OAI-PMH, both in terms of sustaining 1.x and in terms of work that might eventually produce an OAI-PMH 2.0. Interestingly, there was even some discussion of whether data synchronization is still needed with the advent of linked data; I can see why this would come up, but I personally believe that linked data isn’t at the point where other methods for keeping data synchronized are unnecessary (nor may it ever be).

    Speaking of linked data, the concept arose in many of the SITS discussions, though the group did not officially address it until late in the agenda. I must admit that I’ve yet to drink the linked data lemonade, in the sense that I really don’t see it being the silver bullet that many of its proponents make it out to be, but I do see it as one approach for enabling extended use of data and resources. One challenge of the linked data approach that came up in the discussion was the need to map between ontologies.

    At this point, it was getting a bit late into the meeting, but we did talk about two more topics: One was very pragmatic, while the other was a bit more future-thinking (though there might be some disagreement on that). The first was a discussion about how organizationally digital preservation architectures were being supported – were they being supported by central IT, by the Library IT, or otherwise? It seemed that (not surprisingly) a lot depended upon the specific organization, and that perhaps more coordination could be undertaken through efforts such as PASIG. The second discussion was on the topic of “nano-publications”, which the group defined as “things that simply tell you what is being asserted (e.g. Europe is a continent)”. I must admit I got a bit lost about the importance and purpose of nano-publications, but again, it was close to the end of the meeting.

    BTW, as I’m finishing this an email just came through with the official notes from the SITS meeting, which can be accessed at http://eprints.ecs.soton.ac.uk/22546/

    David Lewis’ presentation on Collections Futures (Wed, 02 Mar 2011 21:05:12 +0000) - http://digitallibrarian.org/?p=126

    Peter Murray (aka the Disruptive Library Technology Jester) has provided an audio-overlay of David Lewis’ slideshare of his plenary at last June’s RLG Annual Partners meeting. If you are at all interested in understanding the future of academic libraries, you should take an hour of your time and listen to this presentation. Of particular note, because David says it almost in passing, is that academic libraries are moving away from being collectors of information to being provisioners of information – the difference being that instead of purchasing everything that might be used, academic libraries are moving to ensuring that there is a path for provisioning access to materials that are actually requested for use by their users. Again, well worth an hour of your time.

    Librarians are *the* search experts… (Thu, 19 Aug 2010 14:22:46 +0000) - http://digitallibrarian.org/?p=121

    …so I wonder how many librarians know all of the tips and tricks for using Google that are mentioned here?

    What do we want from Discovery? Maybe it’s to save the time of the user…. (Wed, 18 Aug 2010 13:14:04 +0000) - http://digitallibrarian.org/?p=119

    Just a quick thought on discovery tools – the major newish discovery services being vended to libraries (WorldCat local, Summon, Ebsco Discovery Service, etc.) all have their strengths, their complexity, their middle-of-the-road politician trying to be everything to everybody features. One question I have asked and not yet had a good answer to is “How does your tool save the time of the user?”. For me, that’s the most important feature of any discovery tool.

    Show me data or study results that prove your tool saves the time of the user as compared to other vended tools (and Google and Google Scholar), and you have a clear advantage, at least in what I am considering when choosing to implement a discovery tool.

    Putting a library in Starbucks (Thu, 12 Aug 2010 09:40:58 +0000) - http://digitallibrarian.org/?p=114

    It is not uncommon to find a coffee shop in a library these days. Turn that concept around, though – would you expect a library inside a Starbucks? Or maybe that’s the wrong question – how would you react to having a library inside a Starbucks? Well, that concept is shuffling its way towards reality, as Starbucks is now experimenting with offering premium (i.e. non-free) content to users while they are on the free wireless that Starbucks provides. In fact, Starbucks actually has a collection development policy for their content – they are providing content in the following areas, which they call channels: News, Entertainment, Wellness, Business & Careers and My Neighborhood. They even call their offerings “curated content”.

    Obviously, this isn’t the equivalent of putting the full contents of a library into a coffee shop, but it is worth our time to pay attention to how this new service approach from Starbucks evolves. Starbucks isn’t giving away content for free just to get customers in the door; they are looking at how they might monetize this service through upsell techniques. The business models and agreements are going to have impact on how libraries do business, and we need to pay attention to how Starbucks brokers agreements with content providers. Eric Hellman’s current favorite term, monopsony, comes to mind here – though in reality Starbucks isn’t buying anything, as no money is actually changing hands, at least to start. Content providers are happy to allow Starbucks to provide limited access (i.e. limited by geographic location / network access) to content for free in order to promote their content and provide a discovery to delivery path that will allow users to extend their use of the content for a price.

    This raises the question: should libraries look at upsell opportunities, especially if it means we can reduce our licensing costs? At the very least, the idea is worth exploring.

    Source: Yahoo News

    1 week of iPad (Wed, 14 Apr 2010 11:10:36 +0000) - http://digitallibrarian.org/?p=101

    It has been a little over a week since my iPad was delivered, and in that time I have had the opportunity to try it out at home, at work, and on the road. In fact, I’m currently typing this entry on it from the hotel restaurant at the CNI Spring task force meeting. I feel that I have used it enough now to provide some of my insights and thoughts about the iPad, how I am using it, and what I think of it.

    So, how best to describe the iPad? Fun. Convenient. Fun again. The iPad is more than the sum of its parts; much like the iPhone, it provides an overall experience, one that is enjoyable and, yes, efficient. Browsing is great fun; I have only run into one site that was completely inaccessible because of the lack of Flash support (a local restaurant site). A number of sites that I regularly peruse have some Flash aspect that is not available via the iPad, but typically this isn’t a big loss. For example, if there is an Engadget article that contains video, I won’t get the video. However, the NY Times, ESPN, and other major sites are already supporting HTML5 embedded video, and I expect to see a strong push towards HTML5 and away from Flash. In the grand scheme of things, most of the sites I browse are text and image based, and have no issues.

    Likewise for email and calendaring – both work like a charm. Email on the iPad is easy, fun, and much better than on the iPhone. The keyboard, when in landscape mode, is actually much better than I expected, and very suitable for email replies (not to mention blog posts). I’d go as far as to say that the usability of the onscreen keyboard (when the iPad is in landscape mode) is as good as or better than a typical netbook keyboard. Also, an unintended bonus is that typing on the keyboard is pretty much silent; this is somewhat noticeable during conference sessions where a dozen or so attendees are typing their notes and the clack of their keyboards starts to add up.

    So, how am I using my iPad? Well, on this trip, I have used it to read (one novel and a bunch of work-related articles), do email, listen to music, watch videos, stream some netflix, browse the web, draft a policy document for my place of employment, diagram a repository architecture, and take notes during conference sessions. Could I do all of this on a laptop? Sure. Could I do all of this on a laptop without plugging in at any point in the day? Possibly, with the right laptop or net book. But here’s the thing – at the conference, instead of lugging my laptop bag around with me, my iPad replaced the laptop, my notepad, and everything else I would have dragged around in my bag. I literally only took my iPad, which is actually smaller than a standard paper notebook, and honestly I didn’t miss a beat. Quickly jot down a note? Easy. Sketch out an idea? Ditto. It’s all just right there, all the functionality, in a so-much-more convenient form factor.

    Is the iPad perfect? By no means – the desktop interface is optimized for the iPhone / iPod touch, and feels a bit inefficient for the larger iPad. Because of the current lack of multitasking (something that Apple has already announced will be available in the next version of the OS), I can’t keep an IM client running in the background. There is no inherent folder system, so saving files outside of applications is more complex than it should be. Fingerprints show up much more than I expected, though they wipe away fairly easily with a cloth. The weight (1.5 lbs) is just enough to make you need to shift how you hold the iPad after a period of time.

    Again, here’s the thing: the iPad doesn’t need to be perfect, it needs to be niche. Is it niche? Ask my laptop bag.

dihslovenia-si-2585 ---- Digitalno inovacijsko stičišče Slovenije (Digital Innovation Hub Slovenia)

Highlights: during Slovenia's presidency of the Council of the EU, present your work at the "Technology for People" digital exhibition; a call for companies to take part in the "Online Marketplaces" call; SPS is again supporting digitalization with vouchers. The investment is co-financed by the Republic of Slovenia and the European Union from the European Regional Development Fund.

Browse the catalog of experts · Obtain a voucher for co-financing · Register in the catalog of experts

News:
• 22 Apr 2021 - Opportunities for the digitalization of the Slovenian economy within the new 2021-2027 financial perspective
• 01 Apr 2021 - Shaping proposed content for study programmes
• 23 Mar 2021 - A new initiative makes it easier to gain digital skills for the jobs of the future

Achieve similar results yourself. Work with us! Discover the advantages of connecting partners within DIH Slovenia. Events: take part in meetings on digital transformation. Vouchers: up to 60% co-financing in the area of digitalization.

We enable digital transformation. We build cross-sectoral and multidisciplinary partnerships: universities, research and business institutions, companies, ICT providers, and business support organizations, which together form an ecosystem for sustainable short- and long-term support of this vision.
Networking: DIH Slovenia provides connections with investors, eases access to financing for digital transformation, connects users and providers of digital innovations, and enables synergies between digital and other key technologies.
Competences: developing the digital competences and talent of the future.
Support for digital transformation: joint development of services to support the management of digital transformation in companies.
Innovation and prototypes: promoting open innovation, designing new business models, and building experimental and pilot environments.
Internationalization: transferring good practices and cooperating with other Digital Innovation Hubs in the EU.

Strategic partners help us build Slovenia's digital future. Stay up to date: subscribe to the eNewsletter!
Contact: Dimičeva 13, 1503 Ljubljana, Slovenija · 040 606 710 (Mon-Fri, 10:00-13:00) · info@dihslovenia.si

dlfteach-pubpub-org-6389 ---- None
dlfteach-pubpub-org-6525 ---- None
dlfteach-pubpub-org-9995 ---- None
dltj-org-1250 ---- Publishers going-it-alone (for now?)
with GetFTR | Disruptive Library Technology Jester

Publishers going-it-alone (for now?) with GetFTR
Posted on December 03, 2019 and updated on April 03, 2021 · 5 minute read

In early December 2019, a group of publishers announced Get-Full-Text-Research, or GetFTR for short. I read about this first in Roger Schonfeld’s “Publishers Announce a Major New Service to Plug Leakage” piece in The Scholarly Kitchen via Jeff Pooley’s Twitter thread and blog post. Details about how this works are thin, so I’m leaning heavily on Roger’s description. I’m not as negative about this as Jeff, and I’m probably a little more opinionated than Roger. This is an interesting move by publishers, and—as the title of this post suggests—I am critical of the publisher’s “go-it-alone” approach.

First, some disclosure might be in order. My background has me thinking of this in the context of how it impacts libraries and library consortia. For the past four years, I’ve been co-chair of the NISO Information Discovery and Interchange topic committee (and its predecessor, the “Discovery to Delivery” topic committee), so this is squarely in what I’ve been thinking about in the broader library-publisher professional space. I also traced the early development of RA21 and more recently am volunteering on the SeamlessAccess Entity Category and Attribute Bundles Working Group; that’ll become more important a little further down this post.

I was nodding along with Roger’s narrative until I stopped short here:

The five major publishing houses that are the driving forces behind GetFTR are not pursuing this initiative through one of the major industry collaborative bodies. All five are leading members of the STM Association, NISO, ORCID, Crossref, and CHORUS, to name several major industry groups. But rather than working through one of these existing groups, the houses plan instead to launch a new legal entity. While [Vice President of Product Strategy & Partnerships for Wiley Todd] Toler and [Senior Director, Technology Strategy & Partnerships for the American Chemical Society Ralph] Youngen were too politic to go deeply into the details of why this might be, it is clear that the leadership of the large houses have felt a major sense of mismatch between their business priorities on the one hand and the capabilities of these existing industry bodies. At recent industry events, publishing house CEOs have voiced extensive concerns about the lack of cooperation-driven innovation in the sector. For example, Judy Verses from Wiley spoke to this issue in spring 2018, and several executives did so at Frankfurt this fall. In both cases, long standing members of the scholarly publishing sector questioned if these executives perhaps did not realize the extensive collaborations driven through Crossref and ORCID, among others. It is now clear to me that the issue is not a lack of knowledge but rather a concern at the executive level about the perceived inability of existing collaborative vehicles to enable the new strategic directions that publishers feel they must pursue.

This is the publishers going-it-alone.
As Roger describes it, they are going to create this web service that allows publishers to determine the appropriate copy for a patron and do it without input from the libraries. Librarians will just be expected to put this web service widget into their discovery services to get “colored buttons indicating that the link will take [patrons] to the version of record, an alternative pathway, or (presumably in rare cases) no access at all.” (Let’s set aside for the moment the privacy implications of having a fourth-party web service recording all of the individual articles that come up in a patron’s search results.) Librarians will not get to decide the “alternative pathway” that is appropriate for the patron: “Some publishers might choose to provide access to a preprint or a read-only version, perhaps in some cases on some kind of metered basis.” (Roger goes on to say that he “expect[s] publishers will typically enable some alternative version for their content, in which case the vast majority of scholarly content will be freely available through publishers even if it is not open access in terms of licensing.” I’m not so confident.)

No, thank you. If publishers want to engage in technical work to enable libraries and others to build web services that determine the direct link to an article based on a DOI, then great. Libraries can build a tool that consumes that information as well as takes into account information about preprint services, open access versions, interlibrary loan, and other methods of access. But to ask libraries to accept this publisher-controlled access button in their discovery layers, their learning management systems, their scholarly profile services, and their other tools? That sounds destined for disappointment.

I am only somewhat encouraged by the fact that RA21 started out as a small, isolated collaboration of publishers before they brought in NISO and invited libraries to join the discussion. Did it mean that it slowed down deployment of RA21? Undoubtedly yes. Did persnickety librarians demand transparent discussions and decisions about privacy-related concerns, like what attributes the publisher would get about the patron in the Shibboleth-powered backchannel? Yes, but because the patrons weren’t there to advocate for themselves. Will it likely mean wider adoption? I’d like to think so. Have publishers learned that forcing these kinds of technologies onto users without consultation is a bad idea? At the moment it would appear not.

Some of what publishers are seeking with GetFTR can be implemented with straight-up OpenURL or—at the very least—limited-scope additions to OpenURL (the Z39.88 open standard!). That they didn’t start with OpenURL, a robust existing standard, is both concerning and annoying. I’ll be watching and listening for points of engagement, so I remain hopeful.
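To make the OpenURL point concrete, here is a minimal sketch (my own illustration, not something from the post) of the kind of Z39.88 link a discovery layer can already hand to a library's link resolver. The resolver hostname, DOI, and referrer identifier are placeholders.

from urllib.parse import urlencode

# Hypothetical institutional link resolver; the DOI and referrer below are placeholders.
RESOLVER_BASE = "https://resolver.example.edu/openurl"

context = {
    "url_ver": "Z39.88-2004",                       # OpenURL 1.0 version
    "url_ctx_fmt": "info:ofi/fmt:kev:mtx:ctx",      # KEV context object format
    "rft_val_fmt": "info:ofi/fmt:kev:mtx:journal",  # the referent is a journal article
    "rft_id": "info:doi/10.1000/example.doi",       # the article's DOI
    "rfr_id": "info:sid/discovery.example.org",     # who is asking (the referring service)
}

openurl = RESOLVER_BASE + "?" + urlencode(context)
print(openurl)
# The library's resolver, not the publisher, then decides the appropriate copy for
# this patron: the licensed version of record, an open access copy, or interlibrary loan.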
A few words about Jeff Pooley’s five-step “laughably creaky and friction-filled effort” that is SeamlessAccess. Many of the steps Jeff describes are invisible and well-established technical protocols. What Jeff fails to take into account is the very visible and friction-filled effect of patrons accessing content beyond the boundaries of campus-recognized internet network addresses. Those patrons get stopped at step two with a “pay $35 please” message. I’m all for removing that barrier entirely by making all published content “open access”. It is folly to think, though, that researchers and readers can enforce an open access business model on all publishers, so solutions like SeamlessAccess will have a place. (Which is to say nothing of the benefit of inter-institutional resource collaboration opened up by a more widely deployed Shibboleth infrastructure powered by SeamlessAccess.)

Tags: discovery, GetFTR, niso, openurl, ra21, SeamlessAccess
Categories: Linking Technologies

dltj-org-165 ---- Managing Remote Conference Presenters with Zoom | Disruptive Library Technology Jester

Managing Remote Conference Presenters with Zoom
Posted on March 14, 2020 and updated on April 03, 2021 · 8 minute read

Bringing remote presenters into a face-to-face conference is challenging and fraught with peril. In this post, I describe a scheme using Zoom that had in-person attendees forgetting that the presenter was remote!

The Code4Lib conference was this week, and with the COVID-19 pandemic breaking out, many individuals and institutions decided not to travel to Pittsburgh for the meeting. We had an unprecedented nine presentations that were brought into the conference via Zoom. I was chairing the livestream committee for the conference (as I have done for several years—skipping last year), so it made the most sense for me to arrange a scheme for remote presenters. With the help of the on-site A/V contractor, we were able to pull this off with minimal requirements for the remote presenter.

List of Requirements
• 2 Zoom Pro accounts
• 1 PC/Mac with video output, as if you were connecting an external monitor (the “Receiving Zoom” computer)
• 1 PC/Mac (the “Coordinator Zoom” computer)
• 1 USB audio interface
• Hardwired network connection for the Receiving Zoom computer (recommended)

The Pro-level Zoom accounts were required because we needed to run a group call for longer than 40 minutes (to include setup time). And two were needed: one for the Coordinator Zoom machine and one for the dedicated Receiving Zoom machine.
It would have been possible to consolidate the two Zoom Pro accounts and the two PC/Mac machines into one, but we had back-to-back presenters at Code4Lib, and I wanted to be able to help one remote presenter get ready while another was presenting. In addition to this equipment, the A/V contractor was indispensable in making the connection work. We fed the remote presenter’s video and audio from the Receiving Zoom computer to the contractor’s A/V switch through HDMI, and the contractor put the video on the ballroom projectors and the audio through the ballroom speakers. The contractor gave us a selective audio feed of the program audio minus the remote presenter’s audio (so they wouldn’t hear themselves come back through the Zoom meeting). This becomes a little clearer in the connection description below.

Physical Connections and Setup
The diagram in the original post (not reproduced here) showed the physical connections between machines. The Audio Mixer and Video Switch were provided and run by the A/V contractor. The Receiving Zoom machine was the one connected to the A/V contractor’s Video Switch via an HDMI cable coming off the computer’s external monitor connection. In the Receiving Zoom computer’s control panel, we set the external monitor to mirror what was on the main monitor. The audio and video from the computer (i.e., the Zoom call) went out the HDMI cable to the A/V contractor’s Video Switch. The A/V contractor took the audio from the Receiving Zoom computer through the Video Switch and added it to the Audio Mixer as an input channel. From there, the audio was sent out to the ballroom speakers the same way audio from the podium microphone was amplified to the audience. We asked the A/V contractor to create an audio mix that included all of the audio sources except the Receiving Zoom computer (e.g., in-room microphones) and plugged that into the USB Audio interface. That way, the remote presenter could hear the sounds from the ballroom—ambient laughter, questions from the audience, etc.—in their Zoom call. (Note that it was important to remove the remote presenter’s own speaking voice from this audio mix; there was a significant, distracting delay between the time the presenter spoke and the time the audio was returned to them through the Zoom call.) We used a hardwired network connection to the internet, and I would recommend that—particularly with tech-heavy conferences that might overflow the venue wi-fi. (You don’t want your remote presenter’s Zoom to have to compete with what attendees are doing.) Be aware that the hardwired network connection will cost more from the venue and may take some time to get functioning, since this doesn’t seem to be something that hotels often do. In the Zoom meeting, we unmuted the microphone and selected the USB Audio interface as the microphone input. As the Zoom meeting was connected, we made the meeting window full-screen so the remote presenter’s face and/or presentation were at the maximum size on the ballroom projectors.

Setting Up the Zoom Meetings
The two Zoom accounts came from the Open Library Foundation. (Thank you!) As mentioned in the requirements section above, these were Pro-level accounts. The two accounts were olf_host2@openlibraryfoundation.org and olf_host3@openlibraryfoundation.org. The olf_host2 account was used for the Receiving Zoom computer, and the olf_host3 account was used for the Coordinator Zoom computer. The Zoom meeting edit page (screenshot not reproduced here) configured the “Code4Lib 2020 Remote Presenter A” meeting with the primary host as olf_host2@openlibraryfoundation.org; a scripted sketch of roughly the same configuration follows.
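The meetings themselves were configured through the Zoom web interface. For reference, roughly the same setup could be scripted against the Zoom REST API (v2); the sketch below is illustrative only, with a placeholder token and dates, and only the two account addresses taken from the post.

```python
# A rough sketch (not how the meetings were actually created, which was done
# in the Zoom web interface) of the same configuration against the Zoom REST
# API v2. The bearer token, dates, and times are placeholders; only the two
# account addresses come from the post.
import requests

ZOOM_API = "https://api.zoom.us/v2"
TOKEN = "..."  # placeholder: an OAuth/JWT token allowed to create meetings


def create_presenter_meeting(host_email: str, alt_host: str, topic: str) -> dict:
    body = {
        "topic": topic,
        "type": 8,                                # recurring, fixed time
        "start_time": "2020-03-09T08:00:00",      # placeholder start
        "duration": 600,                          # 8:00am to 6:00pm, in minutes
        "timezone": "America/New_York",
        "recurrence": {
            "type": 1,                            # daily
            "repeat_interval": 1,
            "end_date_time": "2020-03-11T22:00:00Z",  # placeholder end
        },
        "settings": {
            "join_before_host": True,             # presenter may arrive first
            "auto_recording": "cloud",            # backup recording
            "alternative_hosts": alt_host,        # the other OLF account
        },
    }
    resp = requests.post(f"{ZOOM_API}/users/{host_email}/meetings",
                         json=body,
                         headers={"Authorization": f"Bearer {TOKEN}"},
                         timeout=10)
    resp.raise_for_status()
    return resp.json()  # includes the join_url to send to the presenter


# Meeting A hosted by olf_host2 with olf_host3 as alternative host;
# Meeting B is the mirror image of the same settings.
create_presenter_meeting("olf_host2@openlibraryfoundation.org",
                         "olf_host3@openlibraryfoundation.org",
                         "Code4Lib 2020 Remote Presenter A")
create_presenter_meeting("olf_host3@openlibraryfoundation.org",
                         "olf_host2@openlibraryfoundation.org",
                         "Code4Lib 2020 Remote Presenter B")
```

Scripting it this way would mainly matter if the same paired-meeting arrangement were recreated for every conference day or for future events.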
Note these settings:
- A recurring meeting that ran from 8:00am to 6:00pm each day of the conference.
- Enable join before host is checked in case the remote presenter got on the meeting before I did.
- Record the meeting automatically in the cloud to use as a backup in case something goes wrong.
- Alternative Hosts is olf_host3@openlibraryfoundation.org.

The “Code4Lib 2020 Remote Presenter B” meeting was exactly the same except the primary host was olf_host3, and olf_host2 was added as an alternative host. The meetings were set up with each other as the alternative host so that the Coordinator Zoom computer could start the meeting, seamlessly hand it off to the Receiving Zoom computer, then disconnect.

Preparing the Remote Presenter
Remote presenters were given this information:

Code4Lib will be using Zoom for remote presenters. In addition to the software, having the proper audio setup is vital for a successful presentation.
- Microphone: The best option is a headset or earbuds so a microphone is close to your mouth. Built-in laptop microphones are okay, but using them will make it harder for the audience to hear you.
- Speaker: A headset or earbuds are required. Do not use your computer’s built-in speakers. The echo cancellation software is designed for small rooms and cannot handle the delay caused by large ballrooms.
You can test your setup with a test Zoom call. Be sure your microphone and speakers are set correctly in Zoom. Also, try sharing your screen on the test call so you understand how to start and stop screen sharing. The audience will see everything on your screen, so quit/disable/turn-off notifications that come from chat programs, email clients, and similar tools. Plan to connect to the Zoom meeting 30 minutes before your talk to work out any connection or setup issues.

At the 30-minute mark before the remote presentation, I went to the ballroom lobby and connected to the designated Zoom meeting for the remote presenter using the Coordinator Zoom computer. I used this checklist with each presenter:
- Check the presenter’s microphone level and sound quality (make sure the headset/earbud microphone is being used!)
- Check the presenter’s speakers and ensure there is no echo
- Test screen-sharing (start and stop) with the presenter
- Remind the presenter to turn off notifications from chat programs, email clients, etc.
- Remind the presenter that they need to keep track of their own time; there is no way for us to give them cues about timing other than interrupting them when their time is up

The critical item was making sure the audio worked (that their computer was set to use the headset/earbud microphone and audio output). The result was excellent sound quality for the audience. When the remote presenter was set on the Zoom meeting, I returned to the A/V table and asked a livestream helper to connect the Receiving Zoom to the remote presenter’s Zoom meeting. At this point, the remote presenter could hear the ballroom audio of the speaker before them coming through the Receiving Zoom computer. Now I would lock the Zoom meeting to prevent others from joining and interrupting the presenter (from the Zoom Participants panel, select More then Lock Meeting). I hung out on the remote presenter’s meeting on the Coordinator Zoom computer in case they had any last-minute questions. As the speaker in the ballroom was finishing up, I wished the remote presenter well and disconnected the Coordinator Zoom computer from the meeting.
(I always selected Leave Meeting rather than End Meeting for All so that the Zoom meeting continued with the remote presenter and the Receiving Zoom computer.) As the remote presenter was being introduced—and the speaker would know because they could hear it in their Zoom meeting—the A/V contractor switched the video source for the ballroom projectors to the Receiving Zoom computer and unmuted the Receiving Zoom computer’s channel on the Audio Mixer. At this point, the remote speaker was off and running!

Last Thoughts
This worked really well. Surprisingly well. So well that I had a few people comment that they were taken aback when they realized that there was no one standing at the podium during the presentation. I’m glad I had set up the two Zoom meetings. We had two cases where remote presenters were back-to-back. I was able to get the first remote presenter set up and ready on one Zoom meeting while preparing the second remote presenter on the other Zoom meeting. The most stressful part was the point when we disconnected the first presenter’s Zoom meeting and quickly connected to the second presenter’s Zoom meeting. This was slightly awkward for the second remote presenter because they didn’t hear their full introduction as it happened and had to jump right into their presentation. This could be solved by setting up a second Receiving Zoom computer, but that added complexity seemed to be too much for the benefit gained. I would definitely recommend making this setup a part of the typical A/V preparations for future Code4Lib conferences. We don’t know when an individual’s circumstances (much less a worldwide pandemic) might cause a last-minute request for a remote presentation capability, and the overhead of the setup is pretty minimal.

Tags: code4lib, howto, zoom Categories: Raw Technology
dltj-org-2962 ---- More Thoughts on Pre-recording Conference Talks | Disruptive Library Technology Jester

More Thoughts on Pre-recording Conference Talks
Posted on April 08, 2021. 7 minute read.

Over the weekend, I posted an article here about pre-recording conference talks and sent a tweet about the idea on Monday. I hoped to generate discussion about recording talks to fill in gaps—positive and negative—about the concept, and I was not disappointed. I’m particularly thankful to Lisa Janicke Hinchliffe and Andromeda Yelton along with Jason Griffey, Junior Tidal, and Edward Lim Junhao for generously sharing their thoughts. Daniel S and Kate Deibel also commented on the Code4Lib Slack team. I added to the previous article’s bullet points and am expanding on some of the issues here. I’m inviting everyone mentioned to let me know if I’m mischaracterizing their thoughts, and I will correct this post if I hear from them. (I haven’t found a good comments system to hook into this static site blog.)

Pre-recorded Talks Limit Presentation Format
Lisa Janicke Hinchliffe made this point early in the feedback:

@DataG For me downside is it forces every session into being a lecture. For two decades CfPs have emphasized how will this season be engaging/not just a talking head? I was required to turn workshops into talks this year. Even tho tech can do more. Not at all best pedagogy for learning — Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021

Jason described the “flipped classroom” model that he had in mind as the NISOplus2021 program was being developed. The flipped classroom model is one where students do the work of reading material and watching lectures, then come to the interactive time with the instructors ready with questions and comments about the material. Rather than the instructor lecturing during class time, the class time becomes a discussion about the material. For NISOplus, “the recording is the material the speaker and attendees are discussing” during the live Zoom meetings. In the previous post, I described how having the speaker able to respond in text chat while the recording plays back is beneficial. Lisa went on to say:

@DataG Q+A is useful but isn't an interactive session. To me, interactive = participants are co-creating the session, not watching then commenting on it. — Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021

She described an example: the SSP preconference she ran at CHS. (I’m paraphrasing her tweets in this paragraph.) The preconference had a short keynote and an “Oprah-style” panel discussion (not pre-prepared talks). This was done live; nothing was recorded. After the panel, people worked in small groups using Zoom and a set of Google Slides to guide the group work. The small groups reported their discussions back to all participants. Andromeda points out (paraphrasing twitter-speak): “Presenters will need much more—and more specialized—skills to pull it off, and it takes a lot more work.” And Lisa adds: “Just so there is no confusion … I don’t think being online makes it harder to do interactive. It’s the pre-recording. Interactive means participants co-create the session.
A pause to chat isn’t going to shape what comes next on the recording.”

Increased Technical Burden on Speakers and Organizers

@ThatAndromeda @DataG Totally agree on this. I had to pre-record a conference presentation recently and it was a terrible experience, logistically. I feel like it forces presenters to become video/sound editors, which is obviously another thing to worry about on top of content and accessibility. — Junior Tidal (@JuniorTidal) April 5, 2021

Andromeda also agreed with this: “I will say one of the things I appreciated about NISO is that @griffey did ALL the video editing, so I was not forced to learn how that works.” She continued, “everyone has different requirements for prerecording, and in [Code4Lib’s] case they were extensive and kept changing.” And later added: “Part of the challenge is that every conference has its own tech stack/requirements. If as a presenter I have to learn that for every conference, it’s not reducing my workload.” It is hard not to agree with this; a high-quality (stylistically and technically) recording is not easy to do with today’s tools. This is also a technical burden for meeting organizers. The presenters will put a lot of work into talks—including making sure the recordings look good; whatever playback mechanism is used has to honor the fidelity of that recording. For instance, presenters who have gone through the effort to ensure the accessibility of the presentation color scheme want the conference platform to display the talk “as I created it.” The previous post noted that recorded talks also allow for the creation of better, non-real-time transcriptions. Lisa points out that presenters will want to review that transcription for accuracy, which Jason noted adds to the length of time needed before the start of a conference to complete the preparations.

Increased Logistical Burden on Presenters

@ThatAndromeda @DataG @griffey Even if prep is no more than the time it would take to deliver live (which has yet to be case for me and I'm good at this stuff), it is still double the time if you are expected to also show up live to watch along with everyone else. — Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021

This is a consideration I hadn’t thought through—that presenters have to devote more clock time to the presentation because first they have to record it and then they have to watch it. (Or, as Andromeda added, “significantly more than twice the time for some people, if they are recording a bunch in order to get it right and/or doing editing.”)

No. Audience. Reaction.

@DataG @griffey 3) No. Audience. Reaction. I give a joke and no one laughs. Was it funny? Was it not funny? Talks are a *performance* and a *relationship*; I'm getting energy off the audience, I'm switching stuff on the fly to meet their vibe. Prerecorded/webinar is dead. Feels like I'm bombing. — Andromeda Yelton (@ThatAndromeda) April 5, 2021

Wow, yes. I imagine it would take a bit of imagination to get in the right mindset to give a talk to a small camera instead of an audience. I wonder how stand-up comedians are dealing with this as they try to put on virtual shows. Andromeda summed this up:

@DataG @griffey oh and I mean 5) I don't get tenure or anything for speaking at conferences and goodness knows I don't get paid. So the ENTIRE benefit to me is that I enjoy doing the talk and connect to people around it. prerecorded talk + f2f conf removes one of these; online removes both.
— Andromeda Yelton (@ThatAndromeda) April 5, 2021

Also in this heading could be “No Speaker Reaction”—or the inability for subsequent speakers at a conference to build on something that someone said earlier. In the Code4Lib Slack team, Daniel S noted: “One thing comes to mind on the pre-recording [is] the issue that prerecorded talks lose the ‘conversation’ aspect where some later talks at a conference will address or comment on earlier talks.” Kate Deibel added: “Exactly. Talks don’t get to spontaneously build off of each other or from other conversations that happen at the conference.”

Currency of information

Lisa points out that pre-recording talks before an event means there is a delay between the recording and the playback. In the example she pointed out, a pre-recorded talk at RLUK would have been about the University of California working on an Open Access deal with Elsevier; live, it was able to be about “the deal we announced earlier this week”.

Conclusions?

Near the end of the discussion, Lisa added:

@DataG @griffey @ThatAndromeda I also recommend going forward that the details re what is required of presenters be in the CfP. It was one thing for conferences that pivoted (huge effort!) but if you write the CfP since the pivot it should say if pre-record, platform used, etc. — Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021

…and Andromeda added: “Strong agree here. I understand that this year everyone was making it up as they went along, but going forward it’d be great to know that in advance.” That means conferences will need to take these needs into account well before the Call for Proposals (CfP) is published. A conference that is thinking now about pre-recording its talks must work through these issues and set expectations with presenters early.

As I hoped, the Twitter replies tempered my eagerness for the all-recorded style with some real-world experience. There could be possibilities here, but adapting face-to-face meetings to a world with less travel won’t be simple and will take significant thought beyond the issues of technology platforms. Edward Lim Junhao summarized this nicely: “I favor unpacking what makes up our prof conferences. I’m interested in recreating that shared experience, the networking, & the serendipity of learning sth you didn’t know. I feel in-person conferences now have to offer more in order to justify people traveling to attend them.” Related, Andromeda said: “Also, for a conf that ultimately puts its talks online, it’s critical that it have SOMEthing beyond content delivery during the actual conference to make it worth registering rather than just waiting for youtube. realtime interaction with the speaker is a pretty solid option.” If you have something to add, reach out to me on Twitter. Given enough responses, I’ll create another summary. Let’s keep talking about what that looks like and sharing discoveries with each other.

The Tree of Tweets

It was a great discussion, and I think I pulled in the major ideas in the summary above. With some guidance from Ed Summers, I’m going to embed the Twitter threads below using Treeverse by Paul Butler. We might be stretching the boundaries of what is possible, so no guarantees that this will be viewable for the long term.

Tags: code4lib, covid19, meeting planning, NISOplus Categories: L/IS Profession
dltj-org-3521 ---- Should All Conference Talks be Pre-recorded? | Disruptive Library Technology Jester

Should All Conference Talks be Pre-recorded?
Posted on April 03, 2021 and updated on April 08, 2021. 6 minute read.

The Code4Lib conference was last week. That meeting used all pre-recorded talks, and we saw the benefits of pre-recording for attendees, presenters, and conference organizers. Should all talks be pre-recorded, even when we are back face-to-face?

Note! After I posted a link to this article on Twitter, there was a great response of thoughtful comments. I've included new bullet points below and summarized the responses in another blog post.

As an entirely virtual conference, I think we can call Code4Lib 2021 a success. Success ≠ Perfect, of course, and last week the conference coordinating team got together on a Zoom call for a debriefing session. We had a lengthy discussion about what we learned and what we wanted to take forward to the 2022 conference, which we’re anticipating will be something with a face-to-face component. That last sentence was tough to compose: “…will be face-to-face”? “…will be both face-to-face and virtual”? (Or another fully virtual event?) Truth be told, I don’t think we know yet. I think we know with some certainty that the COVID pandemic will become much more manageable by this time next year—at least in North America and Europe. (Code4Lib draws primarily from North American library technologists with a few guests from other parts of the world.) I’m hearing from higher education institutions, though, that travel is going to be severely curtailed…if not for health risk reasons, then because budgets have been slashed. So one has to wonder what a conference will look like next year.

I’ve been to two online conferences this year: NISOplus21 and Code4Lib. Both meetings recorded talks in advance and started playback of the recordings at a fixed point in time. This was beneficial for a couple of reasons. For organizers and presenters, pre-recording allowed technical glitches to be worked through without the pressure of a live event happening. Technology is not nearly perfect enough or ubiquitously spread to count on it working in real-time. [1]
NISOplus21 also used the recordings to get transcribed text for the videos. (Code4Lib used live transcriptions on the synchronous playback.) Attendees and presenters benefited from pre-recording because the presenters could be in the text chat channel to answer questions and provide insights. Having the presenter free during the playback offers new possibilities for making talks more engaging: responding in real-time to polls, getting advance knowledge of topics for subsequent real-time question/answer sessions, and so forth. The synchronous playback time meant that there was a point when (almost) everyone was together watching the same talk—just as in face-to-face sessions.

During the Code4Lib conference coordinating debrief call, I asked the question: “If we saw so many benefits to pre-recording talks, do we want to pre-record them all next year?” In addition to the reasons above, pre-recorded talks benefit those who are not comfortable speaking English or are first-time presenters. (They have a chance to re-do their talk as many times as they need in a much less stressful environment.) “Live” demos are much smoother because a recording can be restarted if something goes wrong. Each year, at least one presenter needs to use their own machine (custom software, local development environment, etc.), and swapping out presenter computers in real-time is risky. And it is undoubtedly easier to impose time requirements with recorded sessions. So why not pre-record all of the talks?

I get it—it would be different to sit in a ballroom watching a recording play on big screens at the front of the room while the podium is empty. But is it so different as to dramatically change the experience of watching a speaker at a podium? In many respects, we had a dry run of this during Code4Lib 2020. It was at the early stages of the coming lockdowns when institutions started barring employee travel, and we had to bring in many presenters remotely. I wrote a blog post describing the setup we used for remote presenters, and at the end, I said:

I had a few people comment that they were taken aback when they realized that there was no one standing at the podium during the presentation.

Some attendees, at least, quickly adjusted to this format. For those with the means and privilege of traveling, there can still be face-to-face discussions in the hall, over meals, and social activities. For those that can’t travel (due to risks of traveling, family/personal responsibilities, or budget cuts), the attendee experience is a little more level—everyone is watching the same playback and in the same text backchannels during the talk. I can imagine a conference tool capable of segmenting chat sessions during the talk playback into “tables” where you and close colleagues can exchange ideas and then promote the best ones to a conference-wide chat room. Something like that would be beneficial as attendance grows for events with an online component, and it would be a new form of engagement that isn’t practical now.

There are undoubtedly reasons not to pre-record all session talks (beyond the feels-weird-to-stare-at-an-unoccupied-ballroom-podium reasons). During the debriefing session, one person brought up that having all pre-recorded talks erodes the justification for in-person attendance. I can see a manager saying, “All of the talks are online…just watch it from your desk. Even your own presentation is pre-recorded, so there is no need for you to fly to the meeting.” That’s legitimate.
So if you like bullet points, here’s how it lays out.

Pre-recording all talks is better for:
- Accessibility: better transcriptions for recorded audio versus real-time transcription (and probably at a lower cost, too)
- Engagement: the speaker can be in the text chat during playback, and there could be new options for backchannel discussions
- Better quality: speakers can re-record their talk as many times as needed
- Closer equality: in-person attendees are having much the same experience during the talk as remote attendees

Downsides for pre-recording all talks:
- Feels weird: yeah, it would be different
- Erodes justification: indeed a problem, especially for those for whom giving a speech is the only path to getting the networking benefits of face-to-face interaction
- Limits presentation format: it forces every session into being a lecture. For two decades CfPs have emphasized how the session will be engaging/not just a talking head. (Lisa Janicke Hinchliffe)
- Increased Technical Burden on Speaker and Organizers: conference organizers asking presenters to do their own pre-recording is a barrier (Junior Tidal), and organizers have added new requirements for themselves
- No Audience Feedback: pre-recording forces the presenter into an unnatural state relative to the audience (Andromeda Yelton)
- Currency of information: pre-recording talks before an event naturally introduces a delay between the recording and the playback (Lisa Janicke Hinchliffe)

I’m curious to hear of other reasons, for and against. Reach out to me on Twitter if you have some. The COVID-19 pandemic has changed our society and will undoubtedly transform it in ways that we can’t even anticipate. Is the way that we hold professional conferences one of them?

[1] Can we just pause for a moment and consider the decades of work and layers of technology that make a modern teleconference call happen? For you younger folks, there was a time when one couldn’t assume the network to be there. As in: the operating system on your computer couldn’t be counted on to have a network stack built into it. In the earliest years of my career, we were tickled pink to have Macintoshes at the forefront of connectivity through GatorBoxes. Go read the first paragraph of that Wikipedia article on GatorBoxes…TCP/IP was tunneled through LocalTalk running over PhoneNet on unshielded twisted pairs no faster than about 200 kbit/second. (And we loved it!) Now the network is expected; needing to know about TCP/IP is pushed so far down the stack as to be forgotten…assumed. Sure, the software on top now is buggy and bloated—is my Zoom client working? has Zoom’s service gone down?—but the network…we take that for granted.

Tags: code4lib, covid19, meeting planning, NISOplus Categories: L/IS Profession
dltj-org-4212 ---- Should All Conference Talks be Pre-recorded? | Disruptive Library Technology Jester
dltj-org-6401 ---- Publishers going-it-alone (for now?) with GetFTR | Disruptive Library Technology Jester

Publishers going-it-alone (for now?) with GetFTR
Posted on December 03, 2019 and updated on April 03, 2021. 5 minute read.

In early December 2019, a group of publishers announced Get-Full-Text-Research, or GetFTR for short. I read about this first in Roger Schonfeld’s “Publishers Announce a Major New Service to Plug Leakage” piece in The Scholarly Kitchen via Jeff Pooley’s Twitter thread and blog post. Details about how this works are thin, so I’m leaning heavily on Roger’s description. I’m not as negative about this as Jeff, and I’m probably a little more opinionated than Roger. This is an interesting move by publishers, and—as the title of this post suggests—I am critical of the publishers’ “go-it-alone” approach.

First, some disclosure might be in order. My background has me thinking of this in the context of how it impacts libraries and library consortia. For the past four years, I’ve been co-chair of the NISO Information Discovery and Interchange topic committee (and its predecessor, the “Discovery to Delivery” topic committee), so this is squarely in what I’ve been thinking about in the broader library-publisher professional space. I also traced the early development of RA21 and more recently am volunteering on the SeamlessAccess Entity Category and Attribute Bundles Working Group; that’ll become more important a little further down this post.

I was nodding along with Roger’s narrative until I stopped short here:

The five major publishing houses that are the driving forces behind GetFTR are not pursuing this initiative through one of the major industry collaborative bodies. All five are leading members of the STM Association, NISO, ORCID, Crossref, and CHORUS, to name several major industry groups. But rather than working through one of these existing groups, the houses plan instead to launch a new legal entity. While [Vice President of Product Strategy & Partnerships for Wiley Todd] Toler and [Senior Director, Technology Strategy & Partnerships for the American Chemical Society Ralph] Youngen were too politic to go deeply into the details of why this might be, it is clear that the leadership of the large houses have felt a major sense of mismatch between their business priorities on the one hand and the capabilities of these existing industry bodies. At recent industry events, publishing house CEOs have voiced extensive concerns about the lack of cooperation-driven innovation in the sector. For example, Judy Verses from Wiley spoke to this issue in spring 2018, and several executives did so at Frankfurt this fall.
In both cases, long standing members of the scholarly publishing sector questioned if these executives perhaps did not realize the extensive collaborations driven through Crossref and ORCID, among others. It is now clear to me that the issue is not a lack of knowledge but rather a concern at the executive level about the perceived inability of existing collaborative vehicles to enable the new strategic directions that publishers feel they must pursue.

This is the publishers going-it-alone.
Tags: discovery, GetFTR, niso, openurl, ra21, SeamlessAccess Categories: Linking Technologies

dltj-org-7097 ---- Should All Conference Talks be Pre-recorded? | Disruptive Library Technology Jester
dltj-org-7931 ---- Disruptive Library Technology Jester. Peter Murray: library technologist, open source advocate, striving to think globally while acting locally. Columbus, Ohio. Recent posts: More Thoughts on Pre-recording Conference Talks; Should All Conference Talks be Pre-recorded?; User Behavior Access Controls at a Library Proxy Server are Okay; As a Cog in the Election System: Reflections on My Role as a Precinct Election Official; Running an All-Online Conference with Zoom [post removed].
dltj-org-8735 ---- Should All Conference Talks be Pre-recorded?
Posted on April 03, 2021 and updated on April 08, 2021. 6 minute read. The Code4Lib conference was last week. That meeting used all pre-recorded talks, and we saw the benefits of pre-recording for attendees, presenters, and conference organizers. Should all talks be pre-recorded, even when we are back face-to-face? Note! After I posted a link to this article on Twitter, there was a great response of thoughtful comments. I’ve included new bullet points below and summarized the responses in another blog post. As an entirely virtual conference, I think we can call Code4Lib 2021 a success. Success ≠ Perfect, of course, and last week the conference coordinating team got together on a Zoom call for a debriefing session. We had a lengthy discussion about what we learned and what we wanted to take forward to the 2022 conference, which we’re anticipating will be something with a face-to-face component. That last sentence was tough to compose: “…will be face-to-face”? “…will be both face-to-face and virtual”? (Or another fully virtual event?) Truth be told, I don’t think we know yet. I think we know with some certainty that the COVID pandemic will become much more manageable by this time next year—at least in North America and Europe. (Code4Lib draws primarily from North American library technologists, with a few guests from other parts of the world.) I’m hearing from higher education institutions, though, that travel is going to be severely curtailed…if not for health risk reasons, then because budgets have been slashed. So one has to wonder what a conference will look like next year. I’ve been to two online conferences this year: NISOplus21 and Code4Lib. Both meetings recorded talks in advance and started playback of the recordings at a fixed point in time. This was beneficial for a couple of reasons. For organizers and presenters, pre-recording allowed technical glitches to be worked through without the pressure of a live event happening. Technology is not nearly perfect enough or ubiquitously spread to count on it working in real-time. [1] NISOplus21 also used the recordings to get transcribed text for the videos. (Code4Lib used live transcriptions on the synchronous playback.) Attendees and presenters benefited from pre-recording because the presenters could be in the text chat channel to answer questions and provide insights. Having the presenter free during the playback offers new possibilities for making talks more engaging: responding in real-time to polls, getting advance knowledge of topics for subsequent real-time question/answer sessions, and so forth. The synchronous playback time meant that there was a point when (almost) everyone was together watching the same talk—just as in face-to-face sessions. During the Code4Lib conference coordinating debrief call, I asked the question: “If we saw so many benefits to pre-recording talks, do we want to pre-record them all next year?” In addition to the reasons above, pre-recorded talks benefit those who are not comfortable speaking English or are first-time presenters.
(They have a chance to re-do their talk as many times as they need in a much less stressful environment.) “Live” demos are much smoother because a recording can be restarted if something goes wrong. Each year, at least one presenter needs to use their own machine (custom software, local development environment, etc.), and swapping out presenter computers in real-time is risky. And it is undoubtedly easier to impose time requirements with recorded sessions. So why not pre-record all of the talks? I get it—it would be different to sit in a ballroom watching a recording play on big screens at the front of the room while the podium is empty. But is it so different as to dramatically change the experience of watching a speaker at a podium? In many respects, we had a dry-run of this during Code4Lib 2020. It was at the early stages of the coming lockdowns when institutions started barring employee travel, and we had to bring in many presenters remotely. I wrote a blog post describing the setup we used for remote presenters, and at the end, I said: I had a few people comment that they were taken aback when they realized that there was no one standing at the podium during the presentation. Some attendees, at least, quickly adjusted to this format. For those with the means and privilege of traveling, there can still be face-to-face discussions in the hall, over meals, and social activities. For those that can’t travel (due to risks of traveling, family/personal responsibilities, or budget cuts), the attendee experience is a little more level—everyone is watching the same playback and in the same text backchannels during the talk. I can imagine a conference tool capable of segmenting chat sessions during the talk playback to “tables” where you and close colleagues can exchange ideas and then promote the best ones to a conference-wide chat room. Something like that would be beneficial as attendance grows for events with an online component, and it would be a new form of engagement that isn’t practical now. There are undoubtedly reasons not to pre-record all session talks (beyond the feels-weird-to-stare-at-an-unoccupied-ballroom-podium reasons). During the debriefing session, one person brought up that having all pre-recorded talks erodes the justification for in-person attendance. I can see a manager saying, “All of the talks are online…just watch it from your desk. Even your own presentation is pre-recorded, so there is no need for you to fly to the meeting.” That’s legitimate. So if you like bullet points, here’s how it lays out. Pre-recording all talks is better for: Accessibility: better transcriptions for recorded audio versus real-time transcription (and probably at a lower cost, too) Engagement: the speaker can be in the text chat during playback, and there could be new options for backchannel discussions Better quality: speakers can re-record their talk as many times as needed Closer equality: in-person attendees are having much the same experience during the talk as remote attendees Downsides for pre-recording all talks: Feels weird: yeah, it would be different Erodes justification: indeed a problem, especially for those for whom giving a speech is the only path to getting the networking benefits of face-to-face interaction Limits presentation format: it forces every session into being a lecture. For two decades CfPs have emphasized how will this season be engaging/not just a talking head? 
(Lisa Janicke Hinchliffe) Increased Technical Burden on Speaker and Organizers: conference organizers asking presenters to do their own pre-recording is a barrier (Junior Tidal), and organizers have added new requirements for themselves No Audience Feedback: pre-recording forces the presenter into an unnatural state relative to the audience (Andromeda Yelton) Currency of information: pre-recording talks before an event naturally introduces a delay between the recording and the playback. (Lisa Janicke Hinchliffe) I’m curious to hear of other reasons, for and against. Reach out to me on Twitter if you have some. The COVID-19 pandemic has changed our society and will undoubtedly transform it in ways that we can’t even anticipate. Is the way that we hold professional conferences one of them? [1] Can we just pause for a moment and consider the decades of work and layers of technology that make a modern teleconference call happen? For you younger folks, there was a time when one couldn’t assume the network to be there. As in: the operating system on your computer couldn’t be counted on to have a network stack built into it. In the earliest years of my career, we were tickled pink to have Macintoshes at the forefront of connectivity through GatorBoxes. Go read the first paragraph of that Wikipedia article on GatorBoxes…TCP/IP was tunneled through LocalTalk running over PhoneNet on unshielded twisted pairs no faster than about 200 kbit/second. (And we loved it!) Now the network is expected; needing to know about TCP/IP is pushed so far down the stack as to be forgotten…assumed. Sure, the software on top now is buggy and bloated—is my Zoom client working? has Zoom’s service gone down?—but the network…we take that for granted. ↩ Tags: code4lib, covid19, meeting planning, NISOplus Categories: L/IS Profession
dltj-org-9010 ---- What is known about GetFTR at the end of 2019 | Disruptive Library Technology Jester. Posted on December 28, 2019 and updated on April 03, 2021. 14 minute read. In early December 2019, a group of publishers announced Get-Full-Text-Research, or GetFTR for short. There was a heck of a response on social media, and it was—on the whole—not positive from my librarian-dominated corner of Twitter. For my early take on GetFTR, see my December 3rd blog post “Publishers going-it-alone (for now?) with GetFTR.” As that post title suggests, I took the five founding GetFTR publishers to task on their take-it-or-leave-it approach. I think that is still a problem. To get you caught up, here is a list of other commentary. Roger Schonfeld’s December 3rd “Publishers Announce a Major New Service to Plug Leakage” piece in The Scholarly Kitchen Tweet from Herbert Van de Sompel, the lead author of the OpenURL spec, on solving the appropriate copy problem December 5th post “Get To Fulltext Ourselves, Not GetFTR.” on the Open Access Button blog Twitter thread on December 7th between @cshillum and @lisalibrarian on the positioning of GetFTR in relation to link resolvers and an unanswered question about how GetFTR aligns with library interests Twitter thread started by @TAC_NISO on December 9th looking for more information with a link to an STM Association presentation added by @aarontay A tree of tweets starting from @mrgunn’s [I don’t trust publishers to decide] is the crux of the whole thing. In particular, threads of that tweet that include Jason Griffey of NISO saying he knew nothing about GetFTR and Bernhard Mittermaier’s point about hidden motivations behind GetFTR Twitter thread started by @aarontay on December 7th saying “GetFTR is bad for researchers/readers and librarians. It only benefits publishers, change my mind.” Lisa Janicke Hinchliffe’s December 10th “Why are Librarians Concerned about GetFTR?” in The Scholarly Kitchen and take note of the follow-up discussion in the comments Twitter thread between @alison_mudditt and @lisalibrarian clarifying PLOS is not on the Advisory Board, with some comments from @TAC_NISO as well. Ian Mulvany’s December 11th “thoughts on GetFTR” on ScholCommsProd GetFTR’s December 11th “Updating the community” post on their website The Spanish Federation of Associations of Archivists, Librarians, Archaeologists, Museologists and Documentalists (ANABAD)’s December 12th “GetFTR: new publishers service to speed up access to research articles” (original in Spanish, Google Translate to English) December 20th news entry from eContent Pro with the title “What GetFTR Means for Journal Article Access”; my only quarrel is with this sentence: “Thus, GetFTR is a service where Academic articles are found and provided to you at absolutely no cost.” No—if you are in academia the cost is borne by your library even if you don’t see it. But this seems like a third party service that isn’t directly related to publishers or libraries, so perhaps they can be forgiven for not getting that nuance.
Wiley’s Chemistry Views news post on December 26th titled simply “Get Full Text Research (GetFTR)” is perhaps only notable for the sentence “Growing leakage has steadily eroded the ability of the publishers to monetize the value they create.” If you are looking for a short list of what to look at, I recommend these posts. GetFTR’s Community Update On December 11—after the two posts I list below—an “Updating the Community” web page was posted to the GetFTR website. From a public relations perspective, it was…interesting. We are committed to being open and transparent This section goes on to say, “If the community feels we need to add librarians to our advisory group we will certainly do so and we will explore ways to ensure we engage with as many of our librarian stakeholders as possible.” If the GetFTR leadership didn’t get the indication between December 3 and December 12 that librarians feel strongly about being at the table, then I don’t know what more it would take. And it isn’t about being on the advisory group; it is about being seen and appreciated as important stakeholders in the research discovery process. I’m not sure who the “community” is in this section, but it is clear that librarians are—at best—an afterthought. That is not the kind of “open and transparent” that is welcoming. Later on in the Questions about library link resolvers section is this sentence: We have, or are planning to, consult with existing library advisory boards that participating publishers have, as this enables us to gather views from a significant number of librarians from all over the globe, at a range of different institutions. As I said in my previous post, I don’t know why GetFTR is not engaging in existing cross-community (publisher/technology-supplier/library) organizations to have this discussion. It feels intentional, which colors the perception of what the publishers are trying to accomplish. To be honest, I don’t think the publishers are using GetFTR to drive a wedge between library technology service providers (who are needed to make GetFTR a reality for libraries) and libraries themselves. But I can see how that interpretation could be made. Understandably, we have been asked about privacy. I punted on privacy in my previous post, so let’s talk about it here. It remains to be seen what is included in the GetFTR API request between the browser and the publisher site. Sure, it needs to include the DOI and a token that identifies the patron’s institution. We can inspect that API request to ensure nothing else is included. But the fact that the design of GetFTR has the browser making the call to the publisher site means that the publisher site knows the IP address of the patron’s browser, and the IP address can be considered personally identifiable information. This issue could be fixed by having the link resolver or the discovery layer software make the API request, and according to the Questions about library link resolvers section of the community update, this may be under consideration. So, yes, an auditable privacy policy and implementation is key for GetFTR. GetFTR is fully committed to supporting third-party aggregators This is good to hear. I would love to see more information published about this, including how discipline-specific repositories and institutional repositories can have their holdings represented in GetFTR responses.
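To make the privacy point above concrete, here is a minimal, purely hypothetical sketch in TypeScript of a browser-side entitlement lookup versus the same lookup proxied through the library's link resolver. The endpoint (publisher.example.org), the query parameter, the bearer token, and the EntitlementResponse shape are illustrative assumptions only; the actual GetFTR request and response formats had not been published in this level of detail.

```typescript
// Hypothetical sketch only: the endpoint, parameters, and response shape are
// assumptions for illustration, not the actual GetFTR API.

interface EntitlementResponse {
  doi: string;
  entitled: boolean;
  fullTextUrl?: string; // direct link to the article when entitled
}

// Option A: the patron's browser calls the publisher endpoint directly.
// Because the HTTP connection originates on the patron's machine, the
// publisher necessarily sees the patron's IP address along with the DOI
// and the institution token.
async function checkEntitlementFromBrowser(
  doi: string,
  institutionToken: string
): Promise<EntitlementResponse> {
  const url =
    "https://publisher.example.org/entitlements?doi=" + encodeURIComponent(doi);
  const resp = await fetch(url, {
    headers: { Authorization: "Bearer " + institutionToken }, // names the institution, not the person
  });
  return (await resp.json()) as EntitlementResponse;
}

// Option B: the library's link resolver or discovery layer makes the same
// call server-side on the patron's behalf. The publisher sees the library
// server's IP address rather than the patron's, and the library stays in
// the loop to offer other options (green OA copy, aggregator, ILL) next to
// the publisher link.
async function checkEntitlementFromLinkResolver(
  doi: string,
  institutionToken: string
): Promise<EntitlementResponse> {
  // Identical wire format in this sketch; only the origin of the request differs.
  return checkEntitlementFromBrowser(doi, institutionToken);
}
```

Either way the request itself can be inspected to confirm that it carries nothing more than the DOI and the institution token; the difference is whose network address the publisher sees, which is exactly the distinction the community update leaves open.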
My Take-a-ways In the second to last paragraph: “Researchers should have easy, seamless pathways to research, on whatever platform they are using, wherever they are.” That is a statement that I think every library could sign onto. This Updating the Community post is a good start, but the project has dug a deep hole of trust and it hasn’t reached level ground yet. Lisa Janicke Hinchliffe’s “Why are Librarians Concerned about GetFTR?” Posted on December 10th in The Scholarly Kitchen, Lisa outlines a series of concerns from a librarian perspective. I agree with some of these; others are not an issue in my opinion. Librarian Concern: The Connection to Seamless Access Many librarians have expressed a concern about how patron information can leak to the publisher through ill-considered settings at an institution’s identity provider. Seamless Access can ease access control because it leverages a campus’ single sign-on solution—something that a library patron is likely to be familiar with. If the institution’s identity provider is overly permissive in the attributes about a patron that get transmitted to the publisher, then there is a serious risk of tying a user’s research activity to their identity and the bad things that come from that (patrons self-censoring their research paths, commoditization of patron activity, etc.). I’m serving on a Seamless Access task force that is addressing this issue, and I think there are technical, policy, and education solutions to this concern. In particular, I think some sort of intermediate display of the attributes being transmitted to the publisher is most appropriate. Librarian Concern: The Limited User Base Enabled As Lisa points out, the population of institutions that can take advantage of Seamless Access, a prerequisite for GetFTR, is very small and weighted heavily towards well-resourced institutions. To the extent that projects like Seamless Access (spurred on by a desire to have GetFTR-like functionality) help with the adoption of SAML-based infrastructure like Shibboleth, the whole academic community benefits from a shared authentication/identity layer that can be assumed to exist. Librarian Concern: The Insertion of New Stumbling Blocks Of the issues Lisa mentioned here, I’m not concerned about users being redirected to their campus single sign-on system in multiple browsers on multiple machines. This is something we should be training users about—there is a single website to put your username/password into for whatever you are accessing at the institution. That a user might already be logged into the institution single sign-on system in the course of doing other school work and never see a logon screen is an attractive benefit to this system. That said, it would be useful for an API call from a library’s discovery layer to a publisher’s GetFTR endpoint to be able to say, “This is my user. Trust me when I say that they are from this institution.” If that were possible, then the Seamless Access Where-Are-You-From service could be bypassed for the GetFTR purpose of determining whether a user’s institution has access to an article on the publisher’s site. It would sure be nice if librarians were involved in the specification of the underlying protocols early on so these use cases could be offered. Update Lisa reached out on Twitter to say (in part): “Issue is GetFTR doesn’t redirect and SA doesnt when you are IPauthenticated.
Hence user ends up w mishmash of experience.” I went back to read her Scholarly Kitchen post and realized I did not fully understand her point. If GetFTR is relying on a Seamless Access token to know which institution a user is coming from, then that token must get into the user’s browser. The details we have seen about GetFTR don’t address how that Seamless Access institution token is put in the user’s browser if the user has not been to the Seamless Access select-your-institution portal. One such case is when the user is coming from an IP-address-authenticated computer on a campus network. Do the GetFTR indicators appear even when the Seamless Access institution token is not stored in the browser? If at the publisher site the GetFTR response also uses the institution IP address table to determine entitlements, what does a user see when they have neither the Seamless Access institution token nor the institution IP address? And, to Lisa’s point, how does one explain this disparity to users? Is the situation better if the GetFTR determination is made in the link resolver rather than in the user browser? Librarian Concern: Exclusion from Advisory Committee See previous paragraph. That librarians are not at the table offering use cases and technical advice means that the developers are likely closing off options that meet library needs. Addressing those needs would ease the acceptance of the GetFTR project as mutually beneficial. So an emphatic “AGREE!” with Lisa on her points in this section. Publishers—what were you thinking? Librarian Concern: GetFTR Replacing the Library Link Resolver Libraries and library technology companies are making significant investments in tools that ease the path from discovery to delivery. Would the library’s link resolver benefit from a real-time API call to a publisher’s service that determines the direct URL to a specific DOI? Oh, yes—that would be mighty beneficial. The library could put that link right at the top of a series of options that include a link to a version of the article in a Green Open Access repository, redirection to a content aggregator, one-click access to an interlibrary-loan form, or even an option where the library purchases a copy of the article on behalf of the patron. (More likely, the link resolver would take the patron right to the article URL supplied by GetFTR, but the library link resolver needs to be in the loop to be able to offer the other options.) My Take-a-ways The patron is affiliated with the institution, and the institution (through the library) is subscribing to services from the publisher. The institution’s library knows best what options are available to the patron (see above section). Want to know why librarians are concerned? Because the publishers are inserting themselves as the arbiter of access to content, whether it is in the patron’s best interest or not. It is also useful to reinforce Lisa’s closing paragraph: Whether GetFTR will act to remediate these concerns remains to be seen. In some cases, I would expect that they will. In others, they may not. Publishers’ interests are not always aligned with library interests and they may accept a fraying relationship with the library community as the price to pay to pursue their strategic goals. Ian Mulvany’s “thoughts on GetFTR” Ian’s entire post from December 11th in ScholCommsProd is worth reading. I think it is an insightful look at the technology and its implications.
Here are some specific comments: Clarifying the relation between SeamlessAccess and GetFTR There are a couple of things that I disagree with: OK, so what is the difference, for the user, between seamlessaccess and GetFTR? I think that the difference is the following - with seamless access you the user have to log in to the publisher site. With GetFTR if you are providing pages that contain DOIs (like on a discovery service) to your researchers, you can give them links they can click on that have been setup to get those users direct access to the content. That means as a researcher, so long as the discovery service has you as an authenticated user, you don’t need to even think about logins, or publisher access credentials. To the best of my understanding, this is incorrect. With SeamlessAccess, the user is not “logging into the publisher site.” If the publisher site doesn’t know who a user is, the user is bounced back to their institution’s single sign-on service to authenticate. If the publisher site doesn’t know where a user is from, it invokes the SeamlessAccess Where-Are-You-From service to learn which institution’s single sign-on service is appropriate for the user. If a user follows a GetFTR-supplied link to a publisher site but the user doesn’t have the necessary authentication token from the institution’s single sign-on service, then they will be bounced back for the username/password and redirected to the publisher’s site. GetFTR signaling that an institution is entitled to view an article does not mean the user can get it without proving that they are a member of the institution. What does this mean for Green Open Access A key point that Ian raises is this: One example of how this could suck, lets imagine that there is a very usable green OA version of an article, but the publisher wants to push me to using some “e-reader limited functionality version” that requires an account registration, or god forbid a browser exertion, or desktop app. If the publisher shows only this limited utility version, and not the green version, well that sucks. Oh, yeah…that does suck, and it is because the library—not the publisher of record—is better positioned to know what is best for a particular user. Will GetFTR be adopted? Ian asks, “Will google scholar implement this, will other discovery services do so?” I do wonder if GetFTR is big enough to attract the attention of Google Scholar and Microsoft Research. My gut tells me “no”: I don’t think Google and Microsoft are going to add GetFTR buttons to their search results screens unless they are paid a lot. As for Google Scholar, it is more likely that Google would build something like GetFTR to get the analytics rather than rely on a publisher’s version. I’m even more doubtful that the companies pushing GetFTR can convince discovery layers makers to embed GetFTR into their software. Since the two widely adopted discovery layers (in North America, at least) are also aggregators of journal content, I don’t see the discovery-layer/aggregator companies devaluing their product by actively pushing users off their site. My Take-a-ways It is also useful to reinforce Ian’s closing paragraph: I have two other recommendations for the GetFTR team. Both relate to building trust. First up, don’t list orgs as being on an advisory board, when they are not. Secondly it would be great to learn about the team behind the creation of the Service. At the moment its all very anonymous. Where Do We Stand? Wow, I didn’t set out to write 2,500 words on this topic. 
At the start I was just taking some time to review everything that happened since this was announced at the start of December and see what sense I could make of it. It turned into a literature review of sorts. While GetFTR has some powerful backers, it also has some pretty big blockers: Can GetFTR help spur adoption of Seamless Access enough to convince big and small institutions to invest in identity provider infrastructure and single sign-on systems? Will GetFTR grab the interest of Google, Google Scholar, and Microsoft Research (where admittedly a lot of article discovery is already happening)? Will developers of discovery layers and link resolvers prioritize GetFTR implementation in their services? Will libraries find enough value in GetFTR to enable it in their discovery layers and link resolvers? Would libraries argue against GetFTR in learning management systems, faculty profile systems, and other campus systems if their own services cannot be included in GetFTR displays? I don’t know, but I think it is up to the principals behind GetFTR to make more inclusive decisions. The next step is theirs. Tags: discovery, GetFTR, niso, openurl, ra21, SeamlessAccess Categories: Linking Technologies
dltj-org-9420 ---- More Thoughts on Pre-recording Conference Talks | Disruptive Library Technology Jester. Posted on April 08, 2021. 7 minute read. Over the weekend, I posted an article here about pre-recording conference talks and sent a tweet about the idea on Monday. I hoped to generate discussion about recording talks to fill in gaps—positive and negative—about the concept, and I was not disappointed. I’m particularly thankful to Lisa Janicke Hinchliffe and Andromeda Yelton along with Jason Griffey, Junior Tidal, and Edward Lim Junhao for generously sharing their thoughts. Daniel S and Kate Deibel also commented on the Code4Lib Slack team. I added to the previous article’s bullet points and am expanding on some of the issues here.
I’m inviting everyone mentioned to let me know if I’m mischaracterizing their thoughts, and I will correct this post if I hear from them. (I haven’t found a good comments system to hook into this static site blog.) Pre-recorded Talks Limit Presentation Format Lisa Janicke Hinchliffe made this point early in the feedback: @DataG For me downside is it forces every session into being a lecture. For two decades CfPs have emphasized how will this season be engaging/not just a talking head? I was required to turn workshops into talks this year. Even tho tech can do more. Not at all best pedagogy for learning — Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021 Jason described the “flipped classroom” model that he had in mind as the NISOplus2021 program was being developed. The flipped classroom model is one where students do the work of reading material and watching lectures, then come to the interactive time with the instructors ready with questions and comments about the material. Rather than the instructor lecturing during class time, the class time becomes a discussion about the material. For NISOplus, “the recording is the material the speaker and attendees are discussing” during the live Zoom meetings. In the previous post, I described how it is beneficial to have the speaker free to respond in the text chat while the recording plays. Lisa went on to say: @DataG Q+A is useful but isn't an interactive session. To me, interactive = participants are co-creating the session, not watching then commenting on it. — Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021 She described an example: the SSP preconference she ran at CHS. I’m paraphrasing her tweets in this paragraph. The preconference had a short keynote and an “Oprah-style” panel discussion (not pre-prepared talks). This was done live; nothing was recorded. After the panel, people worked in small groups using Zoom and a set of Google Slides to guide the group work. The small groups reported their discussions back to all participants. Andromeda points out (paraphrasing twitter-speak): “Presenters will need much more—and more specialized—skills to pull it off, and it takes a lot more work.” And Lisa adds: “Just so there is no confusion … I don’t think being online makes it harder to do interactive. It’s the pre-recording. Interactive means participants co-create the session. A pause to chat isn’t going to shape what comes next on the recording.” Increased Technical Burden on Speakers and Organizers @ThatAndromeda @DataG Totally agree on this. I had to pre-record a conference presentation recently and it was a terrible experience, logistically. I feel like it forces presenters to become video/sound editors, which is obviously another thing to worry about on top of content and accessibility. — Junior Tidal (@JuniorTidal) April 5, 2021 Andromeda also agreed with this: “I will say one of the things I appreciated about NISO is that @griffey did ALL the video editing, so I was not forced to learn how that works.” She continued, “everyone has different requirements for prerecording, and in [Code4Lib’s] case they were extensive and kept changing.” And later added: “Part of the challenge is that every conference has its own tech stack/requirements. If as a presenter I have to learn that for every conference, it’s not reducing my workload.” It is hard not to agree with this; a high-quality (stylistically and technically) recording is not easy to do with today’s tools. This is also a technical burden for meeting organizers.
The presenters will put a lot of work into talks—including making sure the recordings look good; whatever playback mechanism is used has to honor the fidelity of that recording. For instance, presenters who have gone through the effort to ensure the accessibility of the presentation color scheme want the conference platform to display the talk “as I created it.” The previous post noted that recorded talks also allow for the creation of better, non-real-time transcriptions. Lisa points out that presenters will want to review that transcription for accuracy, which Jason noted adds to the length of time needed before the start of a conference to complete the preparations. Increased Logistical Burden on Presenters @ThatAndromeda @DataG @griffey Even if prep is no more than the time it would take to deliver live (which has yet to be case for me and I'm good at this stuff), it is still double the time if you are expected to also show up live to watch along with everyone else. — Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021 This is a consideration I hadn’t thought through—that presenters have to devote more clock time to the presentation because first they have to record it and then they have to watch it. (Or, as Andromeda added, “significantly more than twice the time for some people, if they are recording a bunch in order to get it right and/or doing editing.”) No. Audience. Reaction. @DataG @griffey 3) No. Audience. Reaction. I give a joke and no one laughs. Was it funny? Was it not funny? Talks are a *performance* and a *relationship*; I'm getting energy off the audience, I'm switching stuff on the fly to meet their vibe. Prerecorded/webinar is dead. Feels like I'm bombing. — Andromeda Yelton (@ThatAndromeda) April 5, 2021 Wow, yes. I imagine it would take some effort to get in the right mindset to give a talk to a small camera instead of an audience. I wonder how stand-up comedians are dealing with this as they try to put on virtual shows. Andromeda summed this up: @DataG @griffey oh and I mean 5) I don't get tenure or anything for speaking at conferences and goodness knows I don't get paid. So the ENTIRE benefit to me is that I enjoy doing the talk and connect to people around it. prerecorded talk + f2f conf removes one of these; online removes both. — Andromeda Yelton (@ThatAndromeda) April 5, 2021 Also under this heading could be “No Speaker Reaction”—or the inability for subsequent speakers at a conference to build on something that someone said earlier. In the Code4Lib Slack team, Daniel S noted: “One thing comes to mind on the pre-recording [is] the issue that prerecorded talks lose the ‘conversation’ aspect where some later talks at a conference will address or comment on earlier talks.” Kate Deibel added: “Exactly. Talks don’t get to spontaneously build off of each other or from other conversations that happen at the conference.” Currency of information Lisa points out that pre-recording talks before an event means there is a delay between the recording and the playback. In the example she pointed out, a talk at RLUK, had it been pre-recorded, would have been about the University of California working on an Open Access deal with Elsevier; delivered live, it could be about “the deal we announced earlier this week”. Conclusions? Near the end of the discussion, Lisa added: @DataG @griffey @ThatAndromeda I also recommend going forward that the details re what is required of presenters be in the CfP. It was one thing for conferences that pivoted (huge effort!)
but if you write the CfP since the pivot it should say if pre-record, platform used, etc. — Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021 …and Andromeda added: “Strong agree here. I understand that this year everyone was making it up as they went along, but going forward it’d be great to know that in advance.” That means conferences will need to take these needs into account well before the Call for Proposals (CfP) is published. A conference that is thinking now about pre-recording their talks must work through these issues and set expectations with presenters early. As I hoped, the Twitter replies tempered my eagerness for the all-recorded style with some real-world experience. There could be possibilities here, but adapting face-to-face meetings to a world with less travel won’t be simple and will take significant thought beyond the issues of technology platforms. Edward Lim Junhao summarized this nicely: “I favor unpacking what makes up our prof conferences. I’m interested in recreating that shared experience, the networking, & the serendipity of learning sth you didn’t know. I feel in-person conferences now have to offer more in order to justify people traveling to attend them.” Related, Andromeda said: “Also, for a conf that ultimately puts its talks online, it’s critical that it have SOMEthing beyond content delivery during the actual conference to make it worth registering rather than just waiting for youtube. realtime interaction with the speaker is a pretty solid option.” If you have something to add, reach out to me on Twitter. Given enough responses, I’ll create another summary. Let’s keep talking about what that looks like and sharing discoveries with each other. The Tree of Tweets It was a great discussion, and I think I pulled in the major ideas in the summary above. With some guidance from Ed Summers, I’m going to embed the Twitter threads below using Treeverse by Paul Butler. We might be stretching the boundaries of what is possible, so no guarantees that this will be viewable for the long term. Tags: code4lib, covid19, meeting planning, NISOplus Categories: L/IS Profession
docs-google-com-400 ---- Documentation Sprint 1.1.0 - Google Sheets
1 Review Complete Auditor Reviewer Link Status Type Audience Goal PROBLEMS NOTES 2 About https://islandora.github.io/documentation/ Good For Now Conceptual Stranger Explains at a high level what islandora does 3 Concepts Structuring menu item - not a page 4 Reviewed DIG MA ├── Collections https://islandora.github.io/documentation/concepts/collection/ Needs Work Conceptual Newcomer Explain the concept of Collections in Islandora, with reference to bulk management and the interaction of Islandora Defaults. Points to page that does not exist yet (Bulk Editing). Assumes some Basic Drupal knowledge and knowledge of Islandora Defaults, too early (because this is one of the first pages in the documentation). Collections should probably not be the first page in the documentation tree. 'Content Types' should be in the Glossary. Add more links. 5 Audited MH ├── Access Control https://islandora.github.io/documentation/concepts/access-control/ Needs Work Conceptual DevOps, repository manager Explain what mechanism(s) for access control are available and how restrictions affect Islandora repo content Mixes documentation type and audiences; make this conceptual documentation for repository managers that explains which levels of restriction can be configured, how inheritance works (it doesn't), separate out sysadmin/devops documentation about preventing access to other components of the stack, consider moving overview over contrib modules not part of Islandora core/default to a "solution gallery" or cookbook section with recommendations; fix link to documentation page on managing user accounts 6 KC ├── Accessibility https://islandora.github.io/documentation/concepts/accessibility/ Conceptual 7 ├── Component Overview https://islandora.github.io/documentation/installation/component_overview/ Conceptual Stranger Give an understanding of what components Islandora includes and how they work together. This should have a link to the architecture diagram: https://islandora.github.io/documentation/technical-documentation/diagram/ (MA) 8 AB ├── Modelling content in Islandora 8 vs. 7 https://islandora.github.io/documentation/user-documentation/objects_to_resource_nodes/ Conceptual Islandora 7 user Translate between the "object" and "datastreams" model and the "nodes" and "media" model 9 └── Islandora Defaults https://islandora.github.io/documentation/reference/islandora_defaults_reference/ Conceptual Create sensible expectations around configurability and ongoing support 10 Installation Structuring menu item - not a page Proposed page under this menu item: Installation overview, describing why we have so many installation methods 11 ├── Docker Compose (ISLE-DC) https://islandora.github.io/documentation/installation/docker-compose/ Conceptual Reference page: what is ISLE.
Explain "best practices" like Remov tutorial Proposed sub-page: Tutorial Create a Dev-Environment; procedural; geared towards 'baby devs'; Hand-hold walkthrough of creating a local sandbox 12 ├── Ansible Playbook https://islandora.github.io/documentation/installation/playbook/ Needs Work Procedural 13 ├── Manual Installation Structuring menu item - not a page Procedural 14 │ ├── Introduction https://islandora.github.io/documentation/installation/manual/introduction/ Procedural Site Builder Assumes, but does not specify Ubuntu (or similar) operating system 15 CG │ ├── Preparing a LAPP Webserver https://islandora.github.io/documentation/installation/manual/preparing_a_webserver/ Needs Work Procedural Site Builder Remove jargon, check specifications. Is this locked to PHP 7.2? To PostgreSQL? LAPP? Linux Apache PostgreSQL & PHP? 16 │ ├── Installing Composer, Drush, and Drupal https://islandora.github.io/documentation/installation/manual/installing_composer_drush_and_drupal/ Procedural Site Builder 17 │ ├── Installing Tomcat and Cantaloupe https://islandora.github.io/documentation/installation/manual/installing_tomcat_and_cantaloupe/ Procedural Site Builder 18 │ ├── Installing Fedora, Syn, and Blazegraph https://islandora.github.io/documentation/installation/manual/installing_fedora_syn_and_blazegraph/ Procedural Site Builder 19 │ ├── Installing Solr https://islandora.github.io/documentation/installation/manual/installing_solr/ Procedural Site Builder 20 │ ├── Installing Crayfish https://islandora.github.io/documentation/installation/manual/installing_crayfish/ Procedural Site Builder 21 │ ├── Installing Karaf and Alpaca https://islandora.github.io/documentation/installation/manual/installing_karaf_and_alpaca/ Procedural Site Builder 22 │ └── Configuring Drupal https://islandora.github.io/documentation/installation/manual/configuring_drupal/ Procedural Site Builder 23 └── Installing Modules https://islandora.github.io/documentation/technical-documentation/install-enable-drupal-modules/ Procedural Site Builder 24 Tutorials Structuring menu item - not a page 25 Reviewed MC MAC ├── Create a Resource Node https://islandora.github.io/documentation/tutorials/create-a-resource-node/ Good For Now Procedural Islandora/Drupal Novice, Content/Collection Manager Hand holdy walkthrough of creating a resource node with a media file. Note in tutorial to Keep it simple and avoid fields with the autocomplete symbol could stand an explanation for avoiding, or a link to more information elsewhere. 26 Audited MC KC ├── Create a Collection https://islandora.github.io/documentation/tutorials/how-to-create-collection/ Good For Now Procedural Islandora/Drupal Novice, Content/Collection Manager Walkthrough of creating and populating a Collection in UI Minor accuracy issue: References to "Collection Members" tab should be changed to "Children tab" as shown in screenshots. This tutorial has "Introduction" section, while previous tutorial has opening "Overview" section 27 Audited MC ├── Configure Blocks https://islandora.github.io/documentation/tutorials/blocks/ Needs Work Procedural Islandora/Drupal Novice, Site Builder Walkthrough of general Block layout and Context configurations Lack of labeled "Overview" or "Introduction" section. Screenshots and steps in the Using Context section need to be updated to match current release (as seen on public sandbox). 
For example, Context list page on sandbox shows more context groupings than screenshot; text for "Click 'Configure' button" step should read "click 'Edit' option" I found myself wondering if there are Islandora-specific blocks of interest, or if the majority of Islandora-centric configurations are in the Context options (which seems to be the case). 28 Reviewed MC MAC ├── Create or Update a View https://islandora.github.io/documentation/tutorials/create_update_views/ Needs Work Procedural Islandora/Drupal Novice, Site Builder Walkthrough of how to modify existing and create new views Screenshot for step 4.a doesn't match sandbox (different button name). In Create new view section, instructions include selecting "Create a block." Some explanation of relationship with blocks as they are explained in separate page would be helpful. 29 Audited MC └── Video Documentation https://islandora.github.io/documentation/user-documentation/video-docs/ Needs Work Reference Islandora/Drupal Novice, consumers of documentation in video format Provide browsable list of video tutorials available, organized by broad categories Lacks Intro/Overview section in TOC, even though there is intro text. Link to "the playlist" is a link to this page (self-referencing, instead of linking out to YouTube playlist). Text for "Regenerating a Derivative" video link has a typo. The intro text mentions that new videos are added to the playlist (and updated here on this page?) regularly, so it would be nice to place the page's last update info at the top rather than in the footer as it is currently. 30 Documentation Structuring menu item - not a page 31 ├── Introduction https://islandora.github.io/documentation/user-documentation/user-intro/ Conceptual 32 AB KC ├── Intro to Linked Data https://islandora.github.io/documentation/user-documentation/intro-to-ld-for-islandora-8/ Conceptual 33 Audited MA ├── Versioning https://islandora.github.io/documentation/user-documentation/versioning/ Needs Work Conceptual Islandora/Drupal Novice, Site Builder Describes how versioning works in Islandora and Fedora+Islandora, including the workflow Specifically references Islandora 8.x-1.1. This should be updated or made evergreen. This page could also be a good place to introduce/explain semantic versioning? 34 ├── Content in Islandora 8 Structuring menu item - not a page Conceptual 35 Reviewed MC MA │ ├── Resource Nodes https://islandora.github.io/documentation/user-documentation/resource-nodes/ Conceptual Islandora/Drupal Novice, Repository admins Provide detailed explanation of the components and configuration options for resource nodes. Lacks Intro/Overview section in TOC, even though there is intro text. Last update date at top of page doesn't match last update date in footer. Islandora 8 Property/Value table is missing a row for uid. Field section could use expansion covering how to view/manage/configure fields, to be more consistent with other sections on page. Display modes section needs more clarity in last paragraph about order and overrides. Adding links between this page and the Create a Resource page at https://islandora.github.io/documentation/tutorials/create-a-resource-node/ would be helpful.
36 MC │ ├── Media https://islandora.github.io/documentation/user-documentation/media/ Conceptual 37 MC │ ├── Paged Content https://islandora.github.io/documentation/user-documentation/paged-content/ Conceptual 38 MR │ └── Metadata https://islandora.github.io/documentation/user-documentation/metadata/ Good For Now Conceptual Systems Admin, Users, Novice To describe the basic metadata configuration, how it's stored, and ways it can be configured One minor note is that I was a bit confused by the paragraph that began with "Not all content types in your Drupal site need be Islandora "resource nodes"." It took me two reads to grasp what they were talking about. 39 ├── Configuring Islandora Structuring menu item - not a page Procedural 40 AB │ ├── Modify or Create a Content Type https://islandora.github.io/documentation/user-documentation/content_types/ Procedural 41 │ ├── Configure Search https://islandora.github.io/documentation/user-documentation/searching/ Procedural 42 RL │ ├── Configure Context https://islandora.github.io/documentation/user-documentation/context/ Procedural 43 MR MC │ ├── Multilingual https://islandora.github.io/documentation/user-documentation/multilingual/ Procedural 44 Audited MA │ ├── Extending Islandora https://islandora.github.io/documentation/user-documentation/extending/ Good For Now Reference Site builders To describe and link to additional resources for adding non-Islandora Drupal modules. Mostly pointing to the Cookbook. Very brief, just pointing out. Could be improved by adding https://www.drupal.org/project/project_theme as a link when mentioning themes. 45 Audited MA │ ├── Viewers https://islandora.github.io/documentation/user-documentation/file_viewers/ Needs Work Conceptual Site builders Explains how viewers work, including a configuration example Attempts to be procedural, but the example is not quite written step-by-step enough to follow along and accomplish a goal. Audience seems to be Site builders, especially based on context of the other pages in this section, but it's written a little technical. 46 MA │ ├── IIIF https://islandora.github.io/documentation/user-documentation/iiif/ Reference Site builders Explains what IIIF is and how it works in the Islandora context. Crosses the line between procedural and reference, since it both explains, and has some steps for making changes 47 MR │ ├── OAI-PMH https://islandora.github.io/documentation/user-documentation/oai/ Procedural 48 MR │ ├── RDF Generation https://islandora.github.io/documentation/islandora/rdf-mapping/ Procedural 49 MR │ ├── Drupal Bundle Configurations https://islandora.github.io/documentation/islandora/drupal-bundle-configurations/ Procedural 50 │ └── Flysystem https://islandora.github.io/documentation/technical-documentation/flysystem/ Procedural 51 └── Operating an Islandora Repository Structuring menu item - not a page Procedural 52 MC . ├── Create and Manage User Accounts https://islandora.github.io/documentation/user-documentation/users/ Procedural 53 .
└── Usage Stats https://islandora.github.io/documentation/user-documentation/usage-stats/ Procedural 54 System Administrator Documentation Structuring menu item - not a page 55 Reviewed MH MA ├── Updating Drupal https://islandora.github.io/documentation/technical-documentation/updating_drupal/ Needs Work Procedural system administrator explain steps needed to update the Drupal component of the Islandora stack check if described process reflects the approach necessary for ISLE; page says it's missing description on updating Islandora features; 'make backup' admonition should be a step in the process; 'alternate syntax needed' admonition should be a step in the process; highlight more explicitly if Islandora pins versions of Drupal components or modules Missing pages: Describe how to update any other component of the stack that requires special instructions 56 Audited MH RL ├── Uploading large files https://islandora.github.io/documentation/technical-documentation/uploading-large-files/ Good For Now Reference system administrator explain configuration options for use case "I want Islandora users to be able to upload large files" Consider moving to a new "solution gallery" section, or a new "configuration options" page under the Sys Admin documentation 57 Audited MH RL └── JWT Authentication https://islandora.github.io/documentation/technical-documentation/jwt/ Good For Now Reference developer and/or systems administrator lists key storage locations and explains configuration of JWT authentication for secure communication between components Consider moving to installation instructions 58 Documentation for Developers Structuring menu item - not a page 59 Reviewed MH MA ├── Architecture Diagram https://islandora.github.io/documentation/technical-documentation/diagram/ Needs Work Reference developer and system administrator overview of Islandora stack components and their interaction Is "Syn" something that needs to feature in the diagram and list of components? check to make sure the diagram and list of components is up to date 60 ├── REST Documentation Structuring menu item - not a page 61 Audited MH │ ├── Introduction https://islandora.github.io/documentation/technical-documentation/using-rest-endpoints/ Needs Work Reference developer overview of the RESTful API, which allows for programmatic interaction with Islandora content link to Drupal documentation about RESTful API, if it exists; documentation about Authentication should have a separate page 62 Audited MH │ ├── GET https://islandora.github.io/documentation/technical-documentation/rest-get/ Good For Now Reference developer describe how to retrieve metadata for nodes, media and file entities, as well as binary file URLs (a request sketch appears after this table) 63 Audited MH │ ├── POST/PUT https://islandora.github.io/documentation/technical-documentation/rest-create/ Needs Work Reference developer describe how to create a node, media/file entities through the REST API unclear if JSON data in request can contain more than just the required fields (I suppose it can, add an example?); consider creating separate pages for POST and PUT, since the verbs are used for different things (creating node vs. creating file) and are used at slightly different endpoints (Drupal vs.
Islandora); check and document if there are for instance file size limitations for using PUT requests (link to https://islandora.github.io/documentation/technical-documentation/uploading-large-files/) 64 Audited MH │ ├── PATCH https://islandora.github.io/documentation/technical-documentation/rest-patch/ Good For Now Reference developer describe how to update values on fields of nodes or media using the REST API 65 Audited MH │ ├── DELETE https://islandora.github.io/documentation/technical-documentation/rest-delete/ Needs Work Reference developer describe how to delete nodes, media or files using the REST API verify and document if deleting nodes/media through REST API can leave media/files orphaned, and how to mitigate that 66 Audited MH │ └── Signposting https://islandora.github.io/documentation/technical-documentation/rest-signposting/ Good For Now Reference developer, system admin describe which HTTP Link Headers Islandora returns in the response to a GET request perhaps link to https://signposting.org/ for rationale and sample use cases? If the Link Headers provided by either Drupal or Islandora are configurable, document that 67 ├── Tests Structuring menu item - not a page Procedural 68 │ ├── Running Tests https://islandora.github.io/documentation/technical-documentation/running-automated-tests/ Procedural 69 │ └── Testing Notes https://islandora.github.io/documentation/technical-documentation/testing-notes/ Procedural 70 ├── Updating drupal-project https://islandora.github.io/documentation/technical-documentation/drupal-project/ Procedural 71 Audited RL ├── Versioning Policy https://islandora.github.io/documentation/technical-documentation/versioning/ Needs Work Reference developer describe how we version the various components of Islandora? Be the "Versioning policy" that seems necessary. Page could be more explicit about how we release major/minor versions, incorporating more of the semver explanations, such as this page: https://docs.launchdarkly.com/sdk/concepts/versioning Actually, I have questions about whether the Drupal 8/9 modules are still using "core compatibility" as the first number, since Drupal 9 is HERE (the page says no) 72 Audited RL ├── Adding back ?_format=jsonld https://islandora.github.io/documentation/technical-documentation/adding_format_jsonld/ Needs Work Procedural developer Document that we changed behaviour around the 1.0 release so that devs can revert if desired This page doesn't make sense as a standalone page. It is random and bizarre. It should be part of the discussion of what Milliner is, and maybe what a URI is in the context of Islandora and Fedora. I don't think we've had this discussion. 73 ├── Updating a `deb` and adding it to Lyrasis PPA https://islandora.github.io/documentation/technical-documentation/ppa-documentation/ Procedural 74 └── Alpaca Structuring menu item - not a page Procedural 75 . ├── Alpaca Technical Stack https://islandora.github.io/documentation/alpaca/alpaca-technical-stack/ Procedural 76 . 
└── Alpaca Tips https://islandora.github.io/documentation/technical-documentation/alpaca_tips/ Procedural 77 Migration Structuring menu item - not a page 78 ├── Migration Overview https://islandora.github.io/documentation/technical-documentation/migration-overview/ Procedural 79 RL ├── CSV https://islandora.github.io/documentation/technical-documentation/migrate-csv/ Procedural 80 └── Islandora 7 https://islandora.github.io/documentation/technical-documentation/migrate-7x/ Procedural 81 Contributing Structuring menu item - not a page 82 Audited MA ├── How to contribute https://islandora.github.io/documentation/contributing/CONTRIBUTING/ Needs Work Procedural New contributors Explains the avenues and procedures for making contributions to the Islandora codebase and documentation This is based on the CONTRIBUTING.md file that is standard in every Islandora GitHub repo. Because those have to stand alone, it doesn't really read well as part of the larger documentation set, and it could be more verbose in this context, especially in terms of how to contribute to documentation. Example of another CONTRIBUTING.md: https://github.com/Islandora/islandora/blob/7.x/CONTRIBUTING.md 83 Audited MA ├── Resizing a VM https://islandora.github.io/documentation/technical-documentation/resizing_vm/ Needs Work Procedural Testers Instructions for adjusting the size allocated to a Virtual Machine so that larger files can be adjusted. These instructions are great, but it's weird that this is a page all on its own. It should be a section or note in a page about using an Islandora VM 84 Audited MA ├── Checking Coding Standards https://islandora.github.io/documentation/technical-documentation/checking-coding-standards/ Needs Work Procedural Developers Describes the commands to run to check coding standards before making a contribution. This should be verified by someone with a dev background to make sure it's all still relevant, and it probably does not need to be its own page. It could be rolled into the description of how to do a pull request that is included in the "How to contribute" page in this same section. 85 ├── Contributing Workflow https://islandora.github.io/documentation/contributing/contributing-workflow/ Procedural 86 YS ├── Creating GitHub Issues https://islandora.github.io/documentation/contributing/create_issues/ Procedural 87 Audited YS ├── Editing Documentation https://islandora.github.io/documentation/contributing/editing-docs/ Needs Work Procedural documentation contributors, developers, committers Instructions for editing the documentation using the online GitHub code editor and by creating a pull request online. A) explain how markdown is a formatting language and that mkdocs uses it B) Refer to "THIS PROJECT'S Documentation Style Guide" to explain the provenance of the style guide D) mention that you can request a Contributor License Agreement if you don't have one.
E) explain that "Starting from the page you want to edit" refers to any of the github.io versions of this content F) mention that there is a way to contribute docs with Issues as mentioned here, by creating an issue ...https://github.com/Islandora/documentation/blob/24155c50257de067d02aa4e6e48a381ace273d94/CONTRIBUTING.md G) specifically mention that documentation can be built by forking then cloning a local copy of the repo and then one can follow a typical PR process 88 Audited YS ├── How to Build Documentation https://islandora.github.io/documentation/technical-documentation/docs-build/ Needs Work Procedural documentation contributors, developers, committers Instructions on how to build the documentation from the documentation repo, including how to install the mkdocs Python-based software needed to build the docs. A) Provide macOS install syntax referring to "pip3 --user" B) Verify if we need to run git submodule update --init --recursive to build docs. C) Consider spelling out the steps from the linked training video on how to test a doc pull request (download a zip version of the PR branch/commit, mkdocs build --clean, mkdocs serve). D) mention that you can use ctrl-c to quit out of mkdocs on the terminal. 89 Audited YS ├── Documentation Style Guide https://islandora.github.io/documentation/contributing/docs_style_guide/ Good For Now Reference documentation contributors, developers, committers List of suggestions for how to create well-formatted and well-styled documentation. In the bullet that mentions that doc submissions should use GitHub PRs we could link to the "Editing Documentation" page that explains the basics of PRs. This page could cover cross-page linking syntax for this project. 90 Audited MA └── Committers https://islandora.github.io/documentation/contributing/committers/ Needs Work Reference Everyone? Describes the rights and responsibilities of Islandora committers, and how new committers are nominated and approved. Also lists current and Emeritus committers. Alan Stanley is listed as working for Prince Edward Islandora [sic]. 91 Glossary https://islandora.github.io/documentation/user-documentation/glossary/ Reference
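To make the REST rows above concrete (and in the spirit of the "add an example?" note on the POST/PUT row), here is a minimal, hypothetical request sketch; it is not taken from the audited pages, and the host, node ID, and credentials are placeholders. It assumes the JSON serialization described on the GET page is enabled on the site.

```python
# Hypothetical sketch: retrieve a resource node's metadata as JSON from an
# Islandora (Drupal) site. The host, node ID, and credentials are placeholders.
import requests

BASE = "https://islandora.example.org"           # placeholder site

resp = requests.get(f"{BASE}/node/1",
                    params={"_format": "json"},  # "jsonld" where that format is enabled
                    auth=("admin", "password"))  # placeholder credentials
resp.raise_for_status()
node = resp.json()
print(node.get("title"), node.get("type"))
```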
docs-google-com-7032 ---- Islandora Open Meeting: April 27, 2021 - Google Docs
docs-google-com-7108 ---- Documentation Sprint 1.1.0 - Google Sheets
doi-org-3035 ---- A cross disciplinary study of link decay and the effectiveness of mitigation techniques | BMC Bioinformatics | Full Text Volume 14 Supplement 14 Proceedings of the Tenth Annual MCBIOS Conference Proceedings Open Access Published: 09 October 2013 A cross disciplinary study of link decay and the effectiveness of mitigation techniques Jason Hennessey1 & Steven Xijin Ge1 BMC Bioinformatics volume 14, Article number: S5 (2013) Abstract Background The dynamic, decentralized world-wide-web has become an essential part of scientific research and communication. Researchers create thousands of web sites every year to share software, data and services. These valuable resources tend to disappear over time. The problem has been documented in many subject areas. Our goal is to conduct a cross-disciplinary investigation of the problem and test the effectiveness of existing remedies. Results We accessed 14,489 unique web pages found in the abstracts within Thomson Reuters' Web of Science citation index that were published between 1996 and 2010 and found that the median lifespan of these web pages was 9.3 years with 62% of them being archived. Survival analysis and logistic regression were used to find significant predictors of URL lifespan. The availability of a web page is most dependent on the time it is published and the top-level domain names. Similar statistical analysis revealed biases in current solutions: the Internet Archive favors web pages with fewer layers in the Universal Resource Locator (URL) while WebCite is significantly influenced by the source of publication. We also created a prototype for a process to submit web pages to the archives and increased coverage of our list of scientific webpages in the Internet Archive and WebCite by 22% and 255%, respectively. Conclusion Our results show that link decay continues to be a problem across different disciplines and that current solutions for static web pages are helping and can be improved. Background Scholarly Internet resources play an increasingly important role in modern research. We can see this by the increasing number of URLs published in a paper's title or abstract [1](also see Figure 1). Until now, maintaining the availability of scientific contributions has been decentralized, mature and effective, utilizing methods developed over centuries to archive the books and journals in which they were communicated. As the Internet is still a relatively new medium for communicating scientific thought, the community is still figuring out how best to use it in a way that preserves contributions for years to come. One problem is that continued availability of these online resources is at the mercy of the organizations or individuals that host them. Many disappear after publication (and some even disappear before[2]), leading to a well-documented phenomenon referred to as link rot or link decay. Figure 1 Growth of scholarly online resources. Not only are the number of URL-containing articles (those with "http" in the title or abstract) published per year increasing (dotted line), but also the percentage of published items containing URLs (solid line). The annual increase in articles according to a linear fit was 174 with R2 0.97.
The linear trend for the percentage was an increase of 0.010% per year with R2 0.98. Source: Thomson Reuters' Web of Science The problem has been documented in several subject areas, with Table 1 containing a large list of these subject-specific studies. In terms of wide, cross-disciplinary analyses, the closest thus far are those of the biological and medical MEDLINE and PubMed databases by Ducut [1] and Wren [3, 4], in addition to Yang's study of the Social Sciences within the Chinese Social Sciences Citation Index (CSSCI) [5]. Table 1 Link decay has been studied for several years in specific subject areas. Some solutions have been proposed which attack the problem from different angles. The Internet Archive (IA) [6] and WebCite (WC) [7] address the issue by archiving web pages, though their mechanisms for acquiring those pages differ. The IA, beginning from a partnership with the Alexa search engine, employs an algorithm that crawls the Internet at large, storing snapshots of pages it encounters along the way. In contrast, WebCite archives only those pages which are submitted to it, and it is geared toward the scientific community. These two methods, however, can only capture information that is visible from the client. Logic and data housed on the server are not frequently available. Other tools, like the Digital Object Identifier (DOI) System [8] and Persistent Uniform Resource Locator (PURL) [9], provide solutions for when a web resource is moved to a different URL but is still available. The DOI System was created by an international consortium of organizations wishing to assign unique identifiers to items such as movies, television shows, books, journal articles, web sites and data sets. It encompasses several thousand "Naming Authorities" organized under a few "Registration Agencies" that have a lot of flexibility in their business models[10]. Perhaps 30-60% of link rot could be solved using DOIs and PURLs[11, 12]. However, they are not without pitfalls. One is that a researcher or company could stop caring about a particular tool for various reasons and thus not be interested in updating its permanent identifier. Another is that the one wanting the permanent URL (the publishing author) is frequently not the same as the person administering the site itself over the long term, thus we have an imbalance of desire vs. responsibilities between the two parties. A third in the case of the DOI System is that there may be a cost in terms of money and time associated with registering their organization that could be prohibitive to authors that don't already have access to a Naming Authority[1]. One example of a DOI System business model would be that of the California Digital Library's EZID service, which charges a flat rate (currently $2,500 for a research institution) for up to 1 million DOIs per year[13]. In this study, we ask two questions: what are the problem's characteristics in scientific literature as a whole and how is it being addressed? To assess progress in combating the problem, we evaluate the effectiveness of the two most prevalent preservation engines and examine the effectiveness of one prototyped solution. If a URL is published in the abstract, it is assumed that the URL plays a prominent role within that paper, similar to the rationale proposed by Wren [4].
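As a concrete illustration of the kind of client-side check such a study relies on, the following minimal Python sketch (not the authors' code; the example URL and timeout are placeholders) tests whether a URL still answers on the live web and whether the Internet Archive's Wayback Machine reports a snapshot through its public availability endpoint, assuming that endpoint keeps its current JSON shape.

```python
# Minimal sketch (not the study's code): check whether a published URL still
# resolves on the live web and whether the Wayback Machine holds a snapshot.
# Assumes the availability endpoint returns JSON of the form
# {"archived_snapshots": {"closest": {"available": true, ...}}}.
import requests

def is_live(url, timeout=30):
    """True if the URL answers with an HTTP status below 400."""
    try:
        resp = requests.head(url, allow_redirects=True, timeout=timeout)
        if resp.status_code >= 400:  # some servers reject HEAD; fall back to GET
            resp = requests.get(url, allow_redirects=True, timeout=timeout)
        return resp.status_code < 400
    except requests.RequestException:
        return False

def in_wayback(url, timeout=30):
    """True if the Wayback Machine reports a closest archived snapshot."""
    try:
        resp = requests.get("https://archive.org/wayback/available",
                            params={"url": url}, timeout=timeout)
        closest = resp.json().get("archived_snapshots", {}).get("closest")
        return bool(closest and closest.get("available"))
    except (requests.RequestException, ValueError):
        return False

if __name__ == "__main__":
    url = "http://example.org/some/published/tool/"  # placeholder URL
    print(url, "live:", is_live(url), "archived:", in_wayback(url))
```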
Results Our goals are to provide some metrics that are useful in understanding the problem of link decay in a cross-disciplinary fashion and to examine the effectiveness of the existing archival methods while proposing some incremental improvements. To accomplish these tasks, we downloaded 18,231 Web of Science (WOS) abstracts containing "http" in the title or abstract from the years under study (1996-2010), out of which 17,110 URLs (14,489 unique) were extracted and used. We developed Python scripts to access these URLs over a 30-day period. For the period studied, 69% of the published URLs (67% of the unique) were available on the live Internet, the Internet Archive's Wayback Machine had archived 62% (59% unique) of the total and WebCite had 21% (16% unique). Overall, 65% of all URLs (62% unique) were available from one of the two surveyed archival engines. Figure 2 contains a breakdown by year for availability on the live web as well as through the combined archives, and Figure 3 illustrates each archival engine's coverage. The median lifetime for published URLs was found to be 9.3 years (95% CI [9.3,10.0]), with the median lifetime amongst unique URLs also being 9.3 years (95% CI [9.3,9.3]). Subject-specific lifetimes may be found in Table 2. Using a simple linear model, the chances that a URL published in a particular year is still available go down by 3.7% for each year added to its age with an R2 of 0.96. Its chances of being archived go up after an initial period of flux (see Figure 2). Submitting our list of unarchived but living URLs to the archival engines showed dramatic promise, increasing the Internet Archive's coverage of the dataset by 2080 URLs, an increase of 22%, and WebCite's by 6348, an increase of 255%. Figure 2 The accessibility of URLs from a particular year is closely correlated with age. The probability of being available (solid line) declines by 3.7% every year based on a linear model with R2 0.96. The surveyed archival engines have about a 70-80% archival rate (dotted line) following an initial ramp time. Figure 3 URL presence in the archives. Percentage of URLs found in the archives of the Internet Archive (dashed line), WebCite (dotted line) or in any group (solid line). IA is older, and thus accounts for the lion's share of earlier published URLs, though as time goes on WebCite is offering more and more. Table 2 Comparison of certain statistics based on the subject of a given URL. How common are published, scholarly online resources? For WOS, both the percentage of published items which contained a URL as well as their absolute number increased steadily since 1996 as seen in Figure 1. Simple linear fits showed the former's annual increase at a conservative 0.010% per year with an R2 of 0.98 while the latter's increase was 174 papers with an R2 of 0.97. A total of 189 (167 unique) DOI URLs were identified, consisting of 1% of the total, while 9 PURLs (8 unique) were identified. Due to cost[14], it is likely that DOIs will remain useful for tracking commercially published content though not the scholarly online items independent of those publishers. URL survival In order to shed some light on the underlying phenomena of link rot, a survival regression model was fitted with data from the unique URLs.
This model, shown in Table 3, identified 17 top-level domains, the number of times a URL has been published, a URL's directory structure depth (hereafter referred to as "depth", using the same definition as [15]), the number of times the publishing article(s) has been cited, whether articles contain funding text as well as 4 journals as having a significant impact on a URL's lifetime at the P < 0.001 level. This survival regression used the logistic distribution and is interpreted similarly to logistic models. To determine the predicted outcome for a particular URL, one takes the intercept (5.2) and adds to it the coefficients for the individual predictors if those predictors are different from the base level; coefficients here are given in years. If numeric, one first multiplies before adding. The result is then interpreted as the location of the peak of a bell curve for the expected lifetime, instead of a log odds ratio as a regular logistic model would give. Among the two categorical predictors (domains and journals having more than 100 samples), the three having the largest positive impact on lifetimes were the journal Zoological Studies (+16) and the top-level domains org and dk (+8 for both). A URL from an org domain published in Zoological Studies, for example, would therefore have a predicted peak lifetime of roughly 5.2 + 16 + 8 ≈ 29 years, before the numeric predictors are applied. Though smaller in magnitude than the positive ones, the 3 categorical predictors having the largest negative impact were the journals Computer Physics Communications (-4) and Bioinformatics (-2) as well as the domain kr (-3), though the P values associated with the latter two are more marginal than some of the others (.006 and .02 respectively). Table 3 Results of fitting a parametric survival regression using the logistic distribution to the unique URLs. Predictors of availability While examining URL survival and archival, it is not only interesting to ask which factors significantly correlate with a URL lasting but also which account for most of the differences. To that end, we fit logistic models for each of the measured outcomes (live web, Internet Archive and Web Citation availabilities) to help tease out that information. To enhance comparability, a similar list of predictors (differing only in whether the first or last year a URL was published was used) without interaction terms was employed for all 3 methods and unique deviance calculated by dropping each term from the model and measuring the change in residual deviance. Results were then expressed as a percentage of the total uniquely explained deviance and are graphically shown in Figure 4. Figure 4 How important is each predictor in predicting whether a URL is available? This graph compares what portion of the overall deviance is explained uniquely by each predictor for each of the measured outcomes. A similar list of predictors (differing only in whether the first or last year a URL was published) without interaction terms was employed to construct 3 logistic regression models. The dependent variable for each of the outcomes under study (Live Web, Internet Archive and WebCite) was availability at the time of measurement. Unique deviance was calculated by dropping each term and measuring the change in explained deviance in the logistic model. Results were then expressed as a percentage of the total uniquely explained deviance for each of the 3 methods. For live web availability, the most deviance was explained by the last year a URL was published (42%) followed by the domain (26%). That these two predictors are very important agrees with much of the published literature thus far.
For the Internet Archive, by far the most important predictor was the URL depth at 45%. Based on this, it stands to reason that the Internet Archive either prefers more popular URLs which happen to be at lower depths or employs an algorithm that prioritizes breadth over depth. Similar to the IA, WC had a single predictor that accounted for much of the explained deviance, with the publishing journal representing 49% of the explained deviance. This may reflect WC's efforts to work with publishers as the model shows one of the announced early adopters, BioMed Central [7], as having the two measured journals (BMC Bioinformatics and BMC Genomics) with the highest retention rates. Therefore, WC is biased towards a publication's source (journals). Archive site performance Another way to measure the effectiveness of the current solutions to link decay is to look at the number of "saved" URLs, or those missing ones that are available through archival engines. Out of the 31% of URLs (33% of the unique) which were not accessible on the live web, 49% of them (47% of the unique) were available in one of the two engines, with IA having 47% (46% unique) and WC having 7% (6% unique). WC's comparatively lower performance can likely be attributed to a combination of its requirement for human interaction and its still-growing adoption. In order to address the discrepancy, all sites that were still active but not archived were submitted to the engine(s) from which they were missing. Using the information gleaned from probing the sites as well as the archives, URLs missing from one or both of the archives, yet still alive, were submitted programmatically. This included submitting 2,662 to the Wayback Machine as well as 7,477 to WebCite, of which 2,080 and 6,348 were successful, respectively. Discussion Submission of missing URLs to archives Archiving missing URLs in each of the archival engines had its own special nuances. For the Internet Archive, the lack of a practical documented way of submitting URLs (see http://faq.web.archive.org/my-sites-not-archived-how-can-i-add-it/) necessitated trusting a message shown by the Wayback Machine when one finds a URL that isn't archived and clicks the "Latest" button. In this instance, the user is sent to the URL "http://liveweb.archive.org/" which has a banner proclaiming that the page "will become part of the permanent archive in the next few months". Interestingly, as witnessed by requests for a web page hosted on a server for which the authors could monitor the logs, only those items requested by the client were downloaded. This meant that if only a page's text were fetched, supporting items such as images and CSS files would not be archived. To archive the supporting items and avoid duplicating work, wget's "--page-requisites" option was used instead of a custom parser. WebCite has an easy-to-use API for submitting URLs, though limitations during the submission of our dataset presented some issues. The biggest issue was WebCite's abuse detection process, which would flag the robot after it had made a certain number of requests. To account for this and be generally nice users, we added logic to ensure a minimum delay between archival requests submitted to both the IA and WC. Exponential delay logic was implemented for WC when encountering general timeouts, other failures (like MySQL error messages) or the abuse logic.
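That throttling can be sketched as follows; this is a hypothetical illustration rather than the scripts used in the study, and submit_to_archive, the delay constants, and the retry cap are placeholders.

```python
# Hypothetical sketch of polite submission throttling: a fixed minimum delay
# between archival requests plus exponential backoff and a retry cap on
# failure. submit_to_archive() stands in for the actual archival request.
import time

MIN_DELAY = 5        # placeholder: minimum seconds between requests
MAX_BACKOFF = 3600   # placeholder: cap on the backoff interval

def submit_with_backoff(url, submit_to_archive, max_retries=8):
    """Try to archive one URL, sleeping progressively longer after failures."""
    delay = MIN_DELAY
    for _ in range(max_retries):
        time.sleep(delay)                    # stay under the rate limit
        try:
            if submit_to_archive(url):       # True means the engine accepted it
                return True
        except Exception:
            pass                             # treat exceptions as failures
        delay = min(delay * 2, MAX_BACKOFF)  # exponential backoff
    return False                             # give up after max_retries tries
```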
Eventually, we learned that certain URLs would cause WC's crawler to time out indefinitely, requiring the implementation of a maximum retry count (and a failure status) if the error wasn't caused by the abuse logic. To estimate what impact we had on the archives' coverage of the study URLs, we compared a URL survey done directly prior to our submission process to one done afterwards; a period of about 3.5 months. It was assumed that the contribution due to unrelated processes would not be very large given that there was only a modest increase in coverage, 5% for IA and 1% for WC, over the previous period of just under a year and a half. Each of the two archival engines had interesting behaviors which required gauging successful submission of a URL by whether it was archived as of a subsequent survey rather than using the statuses returned by the engines. For the Internet Archive, it was discovered that an error didn't always indicate failure, as there were 872 URLs for which wget returned an error but which were successfully archived. Conversely, WebCite returned an asynchronous status, such that even in the case of a successful return the URL might fail archival; this was the case in 955 out of a total of 7,285. Submitting the 2,662 URLs to IA took a little less than a day, whereas submitting 7,285 to WC took over 2 months. This likely reflects IA's large server capacity, funding and platform maturity due to its age. Generating the list of unique URLs Converting some of the potential predictors from the list of published URLs to the list of unique URLs presented some unique issues. In particular, while converting those based on the URL itself (domain, depth, whether alive or in an archive) was straightforward, those which depended upon a publishing article (number of times URL was published, the number of times an article was cited, publishing journal, whether there was funding text) were estimated by collating the data from each publishing. Only a small amount, 8%, of the unique URLs appeared more than once, and among the measured variables that pertained to the publishing there was not a large amount of variety. Amongst repeatedly-published URLs, 43% appeared in only one journal and the presence of funding text was the same 76% of the time. For calculating the number of times a paper was published, multiple appearances of a URL within a given title/abstract were counted as one. Thus, while efforts were made to provide a representative collated value where appropriate, it's expected that different methods would not have produced significantly different results. Additional sources of error Even though WOS's index appears to have better quality Optical Character Recognition (OCR) than PubMed, it still has OCR artifacts. To compensate for this, the URL extraction script tried to use some heuristics to detect the most common sources of error and correct them. Some of the biggest sources of error were: randomly inserted spaces in URLs, "similar to" being substituted for the tilde character, periods being replaced with commas and extra punctuation being appended to the URL (sometimes due to the logic added to address the first issue). Likely the largest contributors to false negatives are errors in OCR and the attempts to compensate for them. In assessing the effectiveness of our submissions to IA, it is possible that the estimate could be understated due to URLs that had been submitted but not yet made available within the Wayback Machine.
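A rough illustration of clean-up heuristics of the kind just listed; this is not the study's extraction script, and the patterns, their ordering, and the sample string are simplified assumptions.

```python
# Simplified illustration (not the study's code) of OCR clean-up heuristics
# for URLs pulled from abstracts: spelled-out tildes, stray spaces, commas
# misread for periods, and punctuation glued on from the surrounding prose.
import re

def clean_ocr_url(raw):
    url = raw.strip()
    url = url.replace("similar to", "~")   # OCR tends to spell out the tilde
    url = re.sub(r"\s+", "", url)          # drop randomly inserted spaces
    url = url.replace(",", ".")            # periods misread as commas
    return url.rstrip(".,;:!)]}")          # trailing punctuation from prose

print(clean_ocr_url("http://www.exa mple,org/similar tosmith/tool."))
# -> http://www.example.org/~smith/tool
```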
Dynamic websites with interactive content, if only present via an archiving engine, would be a source of false positives, as the person accessing the resource would presumably want to use it rather than view the design of its landing page. If a published web site goes away and another is installed in its place (especially likely if a .com or .net domain is allowed to expire), then the program will not be able to tell the difference, since it will see a valid (though irrelevant) web site. In addition, though page contents can change and lose relevance from their original use [16], dates of archival were not compared to the publication date. Another source of false positive error would be uncaught OCR artifacts that insert spaces within URLs, if they truncated the path but left the correct host intact. The result would be a higher probability that the URL would appear as a higher-level index page; such pages are generally more likely to function than pages at deeper levels [11, 12]. Bibliographic database Web of Science was chosen because, compared to PubMed, it was more cross-sectional and had better OCR quality based on a small sampling. Many of the other evaluation criteria were similar between PubMed and WOS, as both contain scholarly work and have an interface to download bibliographic data. Interestingly, due to the continued presence of OCR issues in newer articles, it appears that bibliographic information for some journals is not yet passed electronically. Conclusions Based on the data gathered in this and other studies, it is apparent that there is still a problem with irretrievable scholarly research on the Internet. We found that roughly 50% of URLs published 11 years prior to the survey (i.e., in 2000) were still available. Interestingly, the rate of decay for recently published URLs (within the past 11 years) appears to be higher than that for older ones, lending credence to what Koehler suggested about eventual decay rate stabilization [17]. Survival rates for living URLs published between 1996 and 1999, inclusive, vary by only 2.4% (1.5% for unique URLs) and have poor linear fits (R2 of .51, or .18 for unique URLs). In contrast, the years 2000-2010 show a linear slope of 0.031 with R2 of .90 (0.036 and R2 of .95 for unique URLs using the first published year), indicating that availability for older URLs is much more stable from year to year, whereas the availability of more recently published resources follows a linear trend with a predictable loss rate. Overall, 84% of URLs (82% of the unique) were available in some manner: either via the live web, IA or WC. Several remedies are available to address different aspects of the link decay problem. For data-based sites that can be archived properly with an engine such as the Internet Archive or WebCite, one remedy is to submit the missing sites which are still alive to the archiving engines. Based on the results of our prototype (illustrated in Figure 5), this method was highly successful, increasing IA's coverage of the study's URLs by 22% and WebCite's by 255%. Journals could require authors to submit URLs to both the Internet Archive and WebCite, or alternatively programs similar to those employed in this study could be used to do it automatically. Another way to increase archival would be for the owners of published sites to ease restrictions for archiving engines, since 507 (352 unique) of the published URLs had archiving disabled via robots.txt according to the Internet Archive. Amongst these, 16% (22% of the unique) had already ceased to be valid.
While some sites may have good reason for blocking automated archivers (such as dynamic content or licensing issues), there may be others that could remove their restrictions entirely or provide an exception for preservation engines. Figure 5: Coverage of the scholarly URL list for each archival engine at different times. All URLs marked as alive in 2011 but missing from an archive were submitted between the 2012 and 2013 surveys. The effect of submitting the URLs is most evident in the WebCite case, though the Internet Archive also showed substantial improvement. Implementing an automated process to do this could vastly improve the retention of scholarly static web pages. To address the control issue for redirection solutions (DOI, PURL) mentioned in the introduction, those who administer cited tools could begin to maintain and publish a permanent URL on the web site itself. Perhaps an even more radical step would be for either these existing tools or some new tool to take a Wikipedia approach and allow end-users to update and search a database of permanent URLs. Considering the studies that have shown around 30% of dead URLs to be locatable using web search engines [3, 18], such a peer-maintained system could be effective and efficient, though spam could be an issue if not properly addressed. For dynamic websites, the current solutions are more technically involved, potentially expensive and less feasible. These include mirroring (hosting a website on another server, possibly at another institution) and providing access to the source code, both of which require time and effort. Once the source is acquired, it can sometimes take considerable expertise to make use of it, as there may be complex library or framework configuration, local assumptions hard-coded into the software, or it could be written for a different platform (GPU, Unix, Windows, etc.). Efforts towards reproducible research, where the underlying logic and data behind the results of a publication are made available to the greater community, share many of the same requirements as preserving dynamic websites [19, 20]. Innovation in this area could thus have multiple benefits beyond archival alone. Methods Data preparation and analysis The then-current year (2011) was excluded to eliminate bias from certain journals being indexed sooner than others. For analysis and statistical modeling, the R program [21] and its "survival" library [22] were used (scripts included in Additional file 1). Wherever possible, statistics are presented in two forms: one representing the raw list of URLs extracted from abstracts and the other representing a deduplicated set of those URLs. The former is most appropriate when thinking about what a researcher would encounter when trying to use a published URL in an article of interest, and it also serves as a way to give weight to multiply-published URLs. The latter is more appropriate when contemplating scholarly URLs as a whole or when using statistical models that assume independence between samples. URLs that were not the focus of this study, such as journal promotions and invalid URLs, were excluded using computational methods as much as possible in order to minimize subjective bias. The first method, which removed 943 URLs (26 unique), looked for identical URLs that comprised a large percentage of a journal's published collection within a given year; upon manual examination, a decision was then made whether to eliminate them.
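A sketch of that first screening step might look like the following: it flags any URL whose repeated, identical appearances account for more than some share of a journal's URLs in a given year, so that it can be reviewed manually. The data layout and the 20% threshold are assumptions made for illustration, not the values used in the study.

from collections import Counter, defaultdict

def flag_promotional_urls(records, share_threshold=0.2):
    # records: iterable of (journal, year, url) tuples for every published URL.
    # Returns {(journal, year): [urls]} for URLs whose identical repetitions
    # make up more than share_threshold of that journal-year's URLs.
    by_group = defaultdict(list)
    for journal, year, url in records:
        by_group[(journal, year)].append(url)

    flagged = {}
    for group, urls in by_group.items():
        counts = Counter(urls)
        suspects = [u for u, n in counts.items()
                    if n > 1 and n / len(urls) > share_threshold]
        if suspects:
            flagged[group] = suspects        # candidates for manual exclusion
    return flagged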
The second method, which identified 18 invalid URLs (all unique), consisted of checking for WebCitation's "UnexpectedXML" error. These URLs were corrupted to the point that they interfered with XML interpretation of the request, due either to an error in our parsing or to OCR artifacts. DOI sites were identified by the presence of "http://dx.doi.org" in the URL, and PURL sites by the presence of "http://purl.". Interestingly, 3 PURL servers were identified through this mechanism: http://purl.oclc.org, http://purl.org and http://purl.access.gpo.gov. To make the results more comparable to prior work and the analysis easier to interpret, a URL was considered available if it successfully responded to at least 90% of the requests and unavailable otherwise. This is similar to the method used by Wren [4], and differs from Ducut's [1] by not using a "variable availability" category defined as being available > 0% and < 90% of the time. Our results show that 466 unique URLs (3.2%) would have fallen into this middle category, a proportion quite similar to those in Wren's and Ducut's studies (3.4% and 3.2%, respectively). Because they are such a small percentage of the total, their treatment is unlikely to affect the analysis much regardless of how they are interpreted. Having binary data also eases interpretation of the statistical models. In addition, due to the low URL counts for 1994 (3) and 1995 (22), these years were excluded from analysis. Survival model Survival analysis was chosen to analyze URL lifetimes because it is a natural fit: like people, URLs have lifetimes, and we are interested in what causes those lifetimes to be longer or shorter and by how much. Lifetimes were calculated by assuming URLs were alive each time they were published, which is a potential source of error [2]. Data were coded as either right- or left-censored: right-censored since living URLs will presumably die at an unknown time in the future, and left-censored because it was unknown when a non-responding URL had died. Ages were coded in months rather than years in order to increase accuracy and precision. Parametric survival regression models were constructed using R's survreg(). In selecting the distribution to use, all of those available were tried, with the logistic distribution showing the best overall fit based on Akaike Information Criterion (AIC) score. Better fits for two of the numeric predictors (number of citations to a publishing paper and number of times a URL was published) were obtained by taking the base 2 logarithm. Collinearity was checked by calculating the variance inflation factor against a logistic regression fit to the web outcome variable. Overall lifetime estimates were made using the survfit() function from R's survival library. Extracting and testing URLs To prepare a list of URLs (and their associated data), a collection of bibliographic data was compiled by searching WOS for "http" in the title or abstract, downloading the results (500 at a time), and then collating them into a single file. A custom program (extract_urls.py in Additional file 1) was then used to extract the URLs and associated metadata from these, after which 5 positive and 2 negative controls were added. A particular URL was only included once per paper. With the extracted URLs in hand, another custom program (check_urls_web.py in Additional file 1) was used to test the availability of the URLs 3 times a day over the course of 30 days, starting April 16, 2011.
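The following sketch conveys the flavor of that repeated liveness testing together with the 90% availability rule described earlier. It uses Python 3's urllib (the original check_urls_web.py targeted the older urllib2) and a spoofed User-Agent, so it is an approximation of the study's script rather than a reproduction of it.

import urllib.request

HEADERS = {"User-Agent": "Mozilla/5.0"}   # some servers reject unrecognized clients

def url_alive(url, timeout=30):
    # One probe: any exception, including HTTP error statuses such as 404, counts as a failure.
    try:
        request = urllib.request.Request(url, headers=HEADERS)
        urllib.request.urlopen(request, timeout=timeout)
        return True
    except Exception:
        return False

def classify(results, threshold=0.9):
    # Apply the 90% rule to the list of per-run True/False outcomes for one URL.
    return "available" if sum(results) / len(results) >= threshold else "unavailable"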
The daily run times were generated randomly by scheduler.py (included in Additional file 1), with the algorithm guaranteeing that no two consecutive runs were closer than 2 hours apart. A given URL was only visited once per run even if it was published multiple times, reducing load on the servers and speeding up the total runtime (which averaged about 25 minutes due to the use of parallelism). Failure was defined as anything that caused an exception in Python's "urllib2" package (which includes error statuses, like 404), with the exception reason being recorded for later analysis. While investigating some of the failed fetches, a curious thing was noted: there were URLs that would consistently work with a web browser but not with the Python program or other command-line downloaders like wget. After some investigation, it was realized that the web servers in question were denying access to unrecognized User-Agent strings. In response, the Python program adopted the User-Agent string of a regular browser, which subsequently reduced the number of failed URLs. At the end of the live web testing period, a custom program (check_urls_archived.py in Additional file 1) was used to programmatically query the archive engines on May 23, 2011. For the Internet Archive's Wayback Machine, this was done using an HTTP HEAD request (which saves resources compared to GET) on the URL formed by appending the URL in question to "http://web.archive.org/web/*/". Status was judged by the resulting HTTP status code, with 200 meaning success, 404 meaning not archived, 403 signifying a page blocked due to robots.txt, and 503 meaning that the server was too busy. Because there were a number of these 503 codes, the script would make up to 4 attempts to access the URL, with increasing back-off delays to keep from overloading IA's servers. The end result still contained 18 URLs returning 503, which were counted as not archived for the analysis. For WebCite, the documented API was used; it supports returning XML, a format well suited to automated parsing [23]. For sites containing multiple statuses, any successful archiving was taken as a success. References 1. Ducut E, Liu F, Fontelo P: An update on Uniform Resource Locator (URL) decay in MEDLINE abstracts and measures for its mitigation. BMC Med Inform Decis Mak. 2008, 8. 2. Aronsky D, Madani S, Carnevale RJ, Duda S, Feyder MT: The prevalence and inaccessibility of Internet references in the biomedical literature at the time of publication. J Am Med Inform Assn. 2007, 14: 232-234. 10.1197/jamia.M2243. 3. Wren JD: URL decay in MEDLINE - a 4-year follow-up study. Bioinformatics. 2008, 24: 1381-1385. 10.1093/bioinformatics/btn127. 4. Wren JD: 404 not found: the stability and persistence of URLs published in MEDLINE. Bioinformatics. 2004, 20: 668-U208. 10.1093/bioinformatics/btg465. 5. Yang SL, Qiu JP, Xiong ZY: An empirical study on the utilization of web academic resources in humanities and social sciences based on web citations. Scientometrics. 2010, 84: 1-19. 10.1007/s11192-009-0142-7. 6. The Internet Archive. [http://www.archive.org/web/web.php] 7. Eysenbach G, Trudell M: Going, going, still there: Using the WebCite service to permanently archive cited web pages. Journal of Medical Internet Research. 2005, 7: 2-6. 10.2196/jmir.7.1.e2. 8. The DOI System. [http://www.doi.org/] 9. PURL Home Page. [http://purl.org] 10. Key Facts on Digital Object Identifier System.
[http://www.doi.org/factsheets/DOIKeyFacts.html] 11. Wren JD, Johnson KR, Crockett DM, Heilig LF, Schilling LM, Dellavalle RP: Uniform resource locator decay in dermatology journals - Author attitudes and preservation practices. Arch Dermatol. 2006, 142: 1147-1152. 10.1001/archderm.142.9.1147. 12. Casserly MF, Bird JE: Web citation availability: Analysis and implications for scholarship. College & Research Libraries. 2003, 64: 300-317. 10.5860/crl.64.4.300. 13. EZID: Pricing. [http://n2t.net/ezid/home/pricing] 14. Wagner C, Gebremichael MD, Taylor MK, Soltys MJ: Disappearing act: decay of uniform resource locators in health care management journals. J Med Libr Assoc. 2009, 97: 122-130. 10.3163/1536-5050.97.2.009. 15. Koehler W: An analysis of Web page and Web site constancy and permanence. J Am Soc Inf Sci. 1999, 50: 162-180. 10.1002/(SICI)1097-4571(1999)50:2<162::AID-ASI7>3.0.CO;2-B. 16. Bar-Ilan J, Peritz BC: Evolution, continuity, and disappearance of documents on a specific topic on the web: A longitudinal study of "informetrics". Journal of the American Society for Information Science and Technology. 2004, 55: 980-990. 10.1002/asi.20049. 17. Koehler W: A longitudinal study of Web pages continued: a consideration of document persistence. Information Research - an International Electronic Journal. 2004, 9. 18. Casserly MF, Bird JE: Web citation availability - A follow-up study. Libr Resour Tech Ser. 2008, 52: 42-53. 10.5860/lrts.52n1.42. 19. Peng RD: Reproducible research and Biostatistics. Biostatistics. 2009, 10: 405-408. 10.1093/biostatistics/kxp014. 20. Ince DC, Hatton L, Graham-Cumming J: The case for open computer programs. Nature. 2012, 482: 485-488. 10.1038/nature10836. 21. R Development Core Team: R: A Language and Environment for Statistical Computing. 2011, R Foundation for Statistical Computing. 22. Therneau T: A Package for Survival Analysis in S. 2012, version 2.36-12. 23. WebCite Technical Background and Best Practices Guide. [http://www.webcitation.org/doc/WebCiteBestPracticesGuide.pdf] 24. Markwell J, Brooks DW: "Link rot" limits the usefulness of web-based educational materials in biochemistry and molecular biology. Biochemistry and Molecular Biology Education. 2003, 31: 69-72. 10.1002/bmb.2003.494031010165. 25. Thorp AW, Brown L: Accessibility of internet references in Annals of Emergency Medicine: Is it time to require archiving? Ann Emerg Med. 2007, 50: 188-192. 10.1016/j.annemergmed.2006.11.019. 26. Carnevale RJ, Aronsky D: The life and death of URLs in five biomedical informatics journals. International Journal of Medical Informatics. 2007, 76: 269-273. 10.1016/j.ijmedinf.2005.12.001. 27. Dimitrova DV, Bugeja M: Consider the source: Predictors of online citation permanence in communication journals. Portal: Libraries and the Academy. 2006, 6: 269-283. 10.1353/pla.2006.0032. 28. Duda JJ, Camp RJ: Ecology in the information age: patterns of use and attrition rates of internet-based citations in ESA journals, 1997-2005. Frontiers in Ecology and the Environment.
2008, 6: 145-151. 10.1890/070022. 29. Rhodes S: Breaking Down Link Rot: The Chesapeake Project Legal Information Archive's Examination of URL Stability. Law Library Journal. 2010, 102: 581-597. 30. Goh DHL, Ng PK: Link decay in leading information science journals. Journal of the American Society for Information Science and Technology. 2007, 58: 15-24. 10.1002/asi.20513. 31. Russell E, Kane J: The missing link - Assessing the reliability of Internet citations in history journals. Technology and Culture. 2008, 49: 420-429. 10.1353/tech.0.0028. 32. Dellavalle RP, Hester EJ, Heilig LF, Drake AL, Kuntzman JW, Graber M, Schilling LM: Information science - Going, going, gone: Lost Internet references. Science. 2003, 302: 787-788. 10.1126/science.1088234. 33. Evangelou E, Trikalinos TA, Ioannidis JPA: Unavailability of online supplementary scientific information from articles published in major journals. Faseb Journal. 2005, 19: 1943-1944. 10.1096/fj.05-4784lsf. 34. Sellitto C: The impact of impermanent web-located citations: A study of 123 scholarly conference publications. Journal of the American Society for Information Science and Technology. 2005, 56: 695-703. 10.1002/asi.20159. 35. Bar-Ilan J, Peritz B: The lifespan of "informetrics" on the Web: An eight year study (1998-2006). Scientometrics. 2009, 79: 7-25. 10.1007/s11192-009-0401-7. 36. Gomes D, Silva MJ: Modelling Information Persistence on the Web. 2006. 37. Markwell J, Brooks DW: Evaluating web-based information: Access and accuracy. Journal of Chemical Education. 2008, 85: 458-459. 10.1021/ed085p458. 38. Wu ZQ: An empirical study of the accessibility of web references in two Chinese academic journals. Scientometrics. 2009, 78: 481-503. 10.1007/s11192-007-1951-1. Acknowledgements The authors would like to thank the South Dakota State University departments of Mathematics & Statistics and Biology & Microbiology for their valuable feedback. Declarations Publication of this article was funded by the National Institutes of Health [GM083226 to SXG]. This article has been published as part of BMC Bioinformatics Volume 14 Supplement 14, 2013: Proceedings of the Tenth Annual MCBIOS Conference. Discovery in a sea of data. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcbioinformatics/supplements/14/S14. Author information Affiliations Department of Mathematics and Statistics, South Dakota State University, Box 2220, Brookings, SD, 57007, USA: Jason Hennessey & Steven Xijin Ge. Corresponding author Correspondence to Steven Xijin Ge. Additional information Competing interests The authors declare that they have no competing interests. Authors' contributions JH implemented the tools for data acquisition and statistical analysis as well as performed a literature review and drafting of the paper. SXG implemented an initial prototype and provided valuable feedback at every step of the process, including critical revision of this manuscript.
Electronic supplementary material Additional file 1: supplement.zip. Contains source code used to perform the study, written in Python and R. README.txt contains descriptions for each file. (ZIP 40 KB) Rights and permissions This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. About this article Cite this article Hennessey, J., Ge, S.X. A cross disciplinary study of link decay and the effectiveness of mitigation techniques. BMC Bioinformatics 14, S5 (2013). https://doi.org/10.1186/1471-2105-14-S14-S5 Published: 09 October 2013. Keywords: Optical Character Recognition, Universal Resource Locator, Internet Archive, Naming Authority, Survival Regression Model. doi-org-9525 ---- "Blockchain Empowers Social Resistance and Terrorism Through Decentrali" by Armin Krishnan Article Title Blockchain Empowers Social Resistance and Terrorism Through Decentralized Autonomous Organizations Authors Armin Krishnan, East Carolina University Author Biography Armin Krishnan, PhD is an Associate Professor and Director of Security Studies at East Carolina University. He is the author of five books on new developments in warfare and conflict, including Killer Robots: The Legality and Ethicality of Autonomous Weapons published by Ashgate and Military Neuroscience and the Coming Age of Neurowarfare published by Routledge. His most recent book is Why Paramilitary Operations Fail published by Palgrave Macmillan. Dr. Krishnan earned his doctorate from the University of Salford, UK, and holds other graduate degrees in Political Science and International Relations from the University of Munich and the University of Salford. He has previously taught Intelligence Studies as a Visiting Assistant Professor at the University of Texas at El Paso. DOI https://doi.org/10.5038/1944-0472.13.1.1743 Subject Area Keywords Cybersecurity, Nonstate actors, Security studies, Social media Abstract The invention of the Internet has changed the way social resistance, revolutionary movements and terror groups are organized, with new features such as loose network organization, netwars, social media campaigns, and lone wolf attacks.
This article argues that blockchain technology will lead to more far-reaching changes in the organization of resistance to authority. Blockchain is a distributed ledger that records transactions using a consensus protocol, and it also enables smart contracts that execute transactions automatically when objective conditions are met. Blockchain technology is not only a system for transferring value, but also a trustless system in which strangers can cooperate without having to trust each other, as computer code governs their interactions. Blockchain will not only allow resistance/terror organizations to easily receive donations globally, to hold assets that a government cannot easily confiscate, and to disseminate censorship-resistant propaganda, but, more importantly, to operate and cooperate across the world in a truly leaderless, coordinated, and highly decentralized fashion. Governments will need to be more proactive in the area of blockchain technology to mitigate some of the dangers to political stability that may emerge from it. Acknowledgements I want to thank the anonymous reviewers of the article for their encouragement, insights, and constructive criticism that helped to improve the quality of the article. Recommended Citation Krishnan, Armin. "Blockchain Empowers Social Resistance and Terrorism Through Decentralized Autonomous Organizations." Journal of Strategic Security 13, no. 1 (2020): 41-58. DOI: https://doi.org/10.5038/1944-0472.13.1.1743 Available at: https://scholarcommons.usf.edu/jss/vol13/iss1/3
ISSN: 1944-0464 (Print); ISSN: 1944-0472 (Online). dp-la-4798 ---- Digital Public Library of America Discover 43,203,601 images, texts, videos, and sounds from across the United States. DPLA Ebooks Ebook services are core to our commitment to a library-led digital future. We've redesigned our DPLA Ebooks site to showcase how we are helping libraries take control of acquisition and delivery and make more diverse materials easily available while advocating for the needs of libraries in the marketplace. Online Exhibitions Recreational Tourism in the Mountain West The Show Must Go On! American Theater in the Great Depression Race to the Moon A History of US Public Libraries In Focus: The Evolution of the Personal Camera Activism in the US Primary Source Sets Voting Rights Act of 1965 Ida B. Wells and Anti-Lynching Activism The Poetry of Maya Angelou The New Woman Lyndon Johnson's Great Society The Fifteenth Amendment Immigration and Americanization, 1880 - 1930 Space Race How can I use DPLA? Education Educators and students explore our Primary Source Sets to discover history and culture through primary sources and ideas for classroom use. Family Research Genealogists use our search tools to find free materials for their family history research projects. Lifelong Learning Lifelong learners enjoy browsing by topic and viewing Online Exhibitions to learn more about their interests. Scholarly Research Scholarly researchers use DPLA to find open access sources from archives across the country through a single portal. If you're new to DPLA, these research guides will give you a head start using our site. The guides reflect a few key activities that attract visitors to DPLA, but you can explore many other interests here too. DPLA News Flexible licensing models from the DPLA Exchange April 13, 2021 DPLA's ebooks program serves our mission of maximizing access to digital content by giving libraries across the country greater control over their acquisition and delivery of ebooks and audiobooks DPLA to host Book Talk on Mistrust, by Ethan Zuckerman, on April 22nd at 1 pm ET April 2, 2021 We are pleased to invite you to join us at the inaugural DPLA Book Talk, which will feature a conversation between Mistrust author Ethan Zuckerman and Wikimedia Foundation CEO and… Join the DPLA Community + Open Board Meeting on April 9th March 25, 2021 With expanded vaccine access, many of us have begun to conceive of what our post-Covid worlds might look like.
These visions are necessarily colored by all that we have learned… duraspace-org-3775 ---- Home - Duraspace.org Help us preserve and provide access to the world's intellectual, cultural and scientific heritage. Latest News 4.22.21 Fedora 6.0: A Migration Story – The Berlin State Library 4.21.21 DSPACE 7.0 Beta 5 Now Available 4.16.21 DSpace 7.0 Testathon: How You Can Help Us Build a Better DSpace Through Testing & Reporting Our Global Community The community DuraSpace serves is alive with ideas and innovation aimed at collaboratively meeting the needs of the scholarly ecosystem that connects us all. Our global community contributes to the advancement of DSpace, Fedora and VIVO. At the same time subscribers to DuraSpace Services are helping to build best practices for delivery of high quality customer service. We are grateful for our community's continued support and engagement in the enterprise we share as we work together to provide enduring access to the world's digital heritage. Open Source Projects The Fedora, DSpace and VIVO community-supported projects are proud to provide more than 2500 users worldwide from more than 120 countries with freely-available open source software. Fedora is a flexible repository platform with native linked data capabilities. DSpace is a turnkey institutional repository application. VIVO creates an integrated record of the scholarly work of your organization. Our Services ArchivesDirect, DSpaceDirect, and DuraCloud services from DuraSpace provide access to institutional resources, preservation of treasured collections, and simplified data management tools. Our services are built on solid open source software platforms, can be set up quickly, and are competitively priced. Staff experts work directly with customers to provide personalized on-boarding and superb customer support. DuraCloud is a hosted service that lets you control where and how your content is preserved in the cloud. DSpaceDirect is a hosted turnkey repository solution. ArchivesDirect is a complete, hosted archiving solution.
duraspace-org-9771 ---- News – Duraspace.org Meet the Members Welcome to the first in a series of blog posts aimed at introducing you to some of the movers and shakers who work tirelessly to advocate, educate and promote Fedora and other community-supported programs like ours. At Fedora, we are strong because of our people and without individuals like this advocating for continued development we... Fedora Migration Paths and Tools Project Update: January 2021 This is the fourth in a series of monthly updates on the Fedora Migration Paths and Tools project – please see last month's post for a summary of the work completed up to that point. This project has been generously funded by the IMLS. The grant team has been focused on completing an initial build... Fedora Migration Paths and Tools Project Update: December 2020 This is the third in a series of monthly updates on the Fedora Migration Paths and Tools project – please see last month's post for a summary of the work completed up to that point. This project has been generously funded by the IMLS. The Principal Investigator, David Wilcox, participated in a presentation for CNI... Fedora 6 Alpha Release is Here Today marks a milestone in our progress toward Fedora 6 – the Alpha Release is now available for download and testing! Over the past year, our dedicated Fedora team, along with an extensive list of active community members and committers, have been working hard to deliver this exciting release to all of our users. So... Fedora Migration Paths and Tools Project Update: October 2020 This is the first in a series of monthly blog posts that will provide updates on the IMLS-funded Fedora Migration Paths and Tools: a Pilot Project. The first phase of the project began in September with kick-off meetings for each pilot partner: the University of Virginia and Whitman College. These meetings established roles and responsibilities... Fedora in the time of COVID-19 The impacts of coronavirus disease 2019 are being felt around the world, and access to digital materials is essential in this time of remote work and study. The Fedora community has been reflecting on the value of our collective digital repositories in helping our institutions and researchers navigate this unprecedented time. Many member institutions have...
NOW AVAILABLE: DSpace 7.0 Beta 2 The DSpace Leadership Group, the DSpace Committers and LYRASIS are proud to announce that DSpace 7.0 Beta 2 is now available for download and testing. Beta 2 is the second scheduled Beta release provided for community feedback and to introduce the new features of the 7.0 platform. As a Beta release, we highly advise against... NOW AVAILABLE: VIVO 1.11.1 VIVO 1.11.1 is now available! VIVO 1.11.1 is a point release containing two patches to the previous 1.11.0 release: – Security patch that now prevents users with self-edit privileges from editing other user profiles [1] – Minor security patch to underlying puppycrawl dependency (CVE-2019-9658) [2] Upgrading from 1.11.0 to 1.11.1 should be a trivial drop-in... NOW AVAILABLE: DSpace 7.0 Beta 1 The DSpace Leadership Group, the DSpace Committers and LYRASIS are proud to announce that DSpace 7.0 Beta 1 is now available for download and testing. Beta1 is the first of several scheduled Beta releases provided for community feedback and to introduce the new features of the 7.0 platform. As a Beta release, we do not... Curriculum Available: Islandora and Fedora Camp in Arizona The curriculum for the upcoming Islandora and Fedora Camp at Arizona State University, February 24-26, 2020 is now available here. Islandora and Fedora Camp, hosted by Arizona State University Libraries, offers everyone a chance to dive in and learn all about the latest versions of Islandora and Fedora. Training will begin with the basics and build... ejournals-bc-edu-5722 ---- Information Technology and Libraries Current Issue Vol 40 No 1 (2021) Published: 2021-03-15 Editorials Reviewers Wanted Letter from the Editor Kenneth J. Varnum The Fourth Industrial Revolution Does It Pose an Existential Threat to Libraries? Brady Lund We Can Do It for Free! Using Freeware for Online Patron Engagement Karin Suni, Christopher A.
Brown Utilizing Technology to Support and Extend Access to Students and Job Seekers during the Pandemic Daniel Berra Articles User Experience Testing in the Open Textbook Adaptation Workflow A Case Study Camille Thomas, Kimberly Vardeman, Jingjing Wu Peer Reading Promotion in University Libraries Based on a Simulation Study about Readers' Opinion Seeking in Social Network Yiping Jiang, Xiaobo Chi, Yan Lou, Lihua Zuo, Yeqi Chu, Qingyi Zhuge Web Content Strategy in Practice within Academic Libraries Courtney McDonald, Heidi Burkhardt Solving SEO Issues in DSpace-based Digital Repositories A Case Study and Assessment of Worldwide Repositories Matus Formanek Development of a Gold-standard Pashto Dataset and a Segmentation App Yan Han, Marek Rychlik Personalization of Search Results Representation of a Digital Library Ljubomir Paskali, Lidija Ivanovic, Georgia Kapitsaki, Dragan Ivanovic, Bojana Dimic Surla, Dusan Surla Communications User Testing with Microinteractions Enhancing a Next-Generation Repository Sara Gonzales, Matthew B. Carson, Guillaume Viger, Lisa O'Keefe, Norrina B. Allen, Joseph P. Ferrie, Kristi Holmes ejournals-bc-edu-7307 ---- None elibtronic-ca-1923 ---- None elibtronic-ca-3082 ---- None emory-zoom-us-7617 ---- Webinar Registration - Zoom Topic Samvera Virtual Connect 2021 Description Samvera Virtual Connect (SVC) is an opportunity for Samvera Community participants to gather online to learn about initiatives taking place across interest groups, working groups, local and collaborative development projects, and other efforts. SVC will give the Samvera community a chance to come together to catch up on developments, make new connections, and learn more about the Community. Webinar is over, you cannot register now. If you have any questions, please contact the webinar host: Heather Greer Klein (she/her/hers).
en-wikipedia-org-1224 ---- The Age of Surveillance Capitalism - Wikipedia The Age of Surveillance Capitalism From Wikipedia, the free encyclopedia Book published in 2019 Author Shoshana Zuboff Subject Politics, cybersecurity Publisher Profile Books Publication date January 15, 2019 ISBN 9781781256855 The Age of Surveillance Capitalism is a 2019 non-fiction book by Professor Shoshana Zuboff which looks at the development of digital companies like Google and Amazon, and suggests that their business models represent a new form of capitalist accumulation that she calls "surveillance capitalism".[1][2] While industrial capitalism exploited and controlled nature with devastating consequences, surveillance capitalism exploits and controls human nature, with a totalitarian order as the endpoint of the development.[3] Premise Zuboff states that surveillance capitalism "unilaterally claims human experience as free raw material for translation into behavioural data [which] are declared as a proprietary behavioural surplus, fed into advanced manufacturing processes known as 'machine intelligence', and fabricated into prediction products that anticipate what you will do now, soon, and later." She states that these new capitalist products "are traded in a new kind of marketplace that I call behavioural futures markets."[4] In a capitalist society, information such as a user's likes and dislikes, observed when they access a platform like Facebook, can be freely used by that platform to improve the user's experience by feeding them content that data from their previous activity suggests they will be interested in. In many cases this is done through an algorithm that automatically filters information. The danger of surveillance capitalism is that platforms and tech companies treat themselves as entitled to this information because it is free for them to access, with very little supervision by governments or by users themselves. Because of this, there has been backlash over how these companies have used the information gathered. For example, Google, described by Zuboff (2019)[5] as "the pioneer of surveillance capitalism", introduced features that used "commercial models…discovered by people in a time and place".[5] This means that advertisements are not only specifically targeted to you through your phone, but now work hand in hand with your environment and habits, such as showing you an advertisement for a local bar while you walk around downtown in the evening. Advertising this technical and specific can easily influence a person's decision-making, both in the activities they choose and in political decisions. The fact that these companies go largely unchecked while having the power to observe and influence thinking is one of the many reasons tech companies such as Google are under so much scrutiny. Furthermore, the freedom allotted to tech companies rests on the idea that "surveillance capitalism does not abandon established capitalist 'laws' such as competitive production, profit maximization, productivity and growth" (Zuboff 2019),[5] since these are principles any business in a capitalist society should aim to excel at in order to be competitive. Zuboff (2019)[5] also argues that this "new logic of accumulation…introduces its own laws of motion".
In other words, this is a new phenomenon in capitalist operations that should be treated as such, with its own specific restrictions and limitations. Lastly, as invasive as platforms have been in accumulating information, they have also led to what is now called a "sharing economy" (Van Dijck 2018),[6] in which individuals can obtain digital information and carry out their own surveillance capitalism with the aid of the platforms themselves. Thus "individuals can greatly benefit from this transformation because it empowers them to set up business" (Van Dijck 2018).[6] Small businesses can also benefit, potentially growing faster than they would have without knowledge of consumer demands and wants. This leaves surveillance capitalism as an exceptionally useful tool for businesses, but also an invasion of users' privacy. Reception The New Yorker listed The Age of Surveillance Capitalism as one of its top non-fiction books of 2019.[7] Former President of the United States Barack Obama also listed it as one of his favourite books of 2019, which journalism researcher Avi Asher-Schapiro noted as an interesting choice, given that the book heavily criticises the "revolving door of personnel who migrated between Google & the Obama admin".[8] Sam DiBella, writing for the LSE Blog, criticised the book's approach, which could "inspire paralysis rather than praxis when it comes to forging collective action to counter systematic corporate surveillance."[9] The Financial Times called the book a "masterwork of original thinking and research".[10] References ^ Bridle, James (2 February 2019). "The Age of Surveillance Capitalism by Shoshana Zuboff review – we are the pawns". The Guardian. ISSN 0261-3077. Retrieved 2020-01-17. ^ Naughton, John (20 January 2019). "'The goal is to automate us': welcome to the age of surveillance capitalism". The Observer. ISSN 0029-7712. Retrieved 2020-01-17. ^ "The new tech totalitarianism". New Statesman. Retrieved 2021-02-21. ^ Naughton, John (2019-01-20). "'The goal is to automate us': welcome to the age of surveillance capitalism". The Observer. ISSN 0029-7712. Retrieved 2020-01-16. ^ Zuboff, Shoshana; Möllers, Norma; Murakami Wood, David; Lyon, David (2019-03-31). "Surveillance Capitalism: An Interview with Shoshana Zuboff". Surveillance & Society. 17 (1/2): 257–266. doi:10.24908/ss.v17i1/2.13238. ISSN 1477-7487. ^ van Dijck, José; Poell, Thomas; de Waal, Martijn (2018-10-18). "The Platform Society". Oxford Scholarship Online. doi:10.1093/oso/9780190889760.001.0001. ISBN 9780190889760. ^ "Our Favorite Nonfiction Books of 2019". The New Yorker. 2019-12-18. ISSN 0028-792X. Retrieved 2020-01-16. ^ Binder, Matt. "Obama praises book that slams his White House for its Google relationship". Mashable. Retrieved 2020-01-16. ^ "Book Review: The Age of Surveillance Capitalism: The Fight for a Human Future at the New Frontier of Power by Shoshana Zuboff". USAPP. 2019-11-10. Retrieved 2020-01-16. ^ "The Age of Surveillance Capitalism by Shoshana Zuboff". FT Business book of the year award. Retrieved 2020-01-16.
Retrieved from "https://en.wikipedia.org/w/index.php?title=The_Age_of_Surveillance_Capitalism&oldid=1019967094" Categories: American non-fiction books 2019 non-fiction books Books critical of capitalism Hidden categories: CS1 maint: discouraged parameter Articles with short description Short description matches Wikidata Navigation menu Personal tools Not logged in Talk Contributions Create account Log in Namespaces Article Talk Variants Views Read Edit View history More Search Navigation Main page Contents Current events Random article About Wikipedia Contact us Donate Contribute Help Learn to edit Community portal Recent changes Upload file Tools What links here Related changes Upload file Special pages Permanent link Page information Cite this page Wikidata item Print/export Download as PDF Printable version Languages Italiano Edit links This page was last edited on 26 April 2021, at 12:27 (UTC). Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization. Privacy policy About Wikipedia Disclaimers Contact Wikipedia Mobile view Developers Statistics Cookie statement en-wikipedia-org-1629 ---- Double-spending - Wikipedia Double-spending From Wikipedia, the free encyclopedia Jump to navigation Jump to search Failure mode of digital cash schemes Double-spending is a potential flaw in a digital cash scheme in which the same single digital token can be spent more than once. Unlike physical cash, a digital token consists of a digital file that can be duplicated or falsified.[1][2] As with counterfeit money, such double-spending leads to inflation by creating a new amount of copied currency that did not previously exist. This devalues the currency relative to other monetary units or goods and diminishes user trust as well as the circulation and retention of the currency. Fundamental cryptographic techniques to prevent double-spending, while preserving anonymity in a transaction, are blind signatures and, particularly in offline systems, secret splitting.[2] Contents 1 Centralized currencies 2 Decentralized currencies 3 51% attack 4 References Centralized currencies[edit] Prevention of double-spending is usually implemented using an online central trusted third party that can verify whether a token has been spent.[2] This normally represents a single point of failure from both availability and trust viewpoints. Decentralized currencies[edit] In a decentralized system, the double-spending problem is significantly harder to solve. To avoid the need for a trusted third party, many servers must store identical up-to-date copies of a public transaction ledger, but as transactions (requests to spend money) are broadcast, they will arrive at each server at slightly different times. If two transactions attempt to spend the same token, each server will consider the first transaction it sees to be valid, and the other invalid. Once the servers disagree, there is no way to determine true balances, as each server's observations are considered equally valid. Most decentralized systems solve this with a consensus algorithm, a way to bring the servers back in sync. Two notable types of consensus mechanisms are proof-of-work and proof-of-stake. 
By 2007, a number of distributed systems for the prevention of double-spending had been proposed.[3][4] The cryptocurrency Bitcoin implemented a solution in early 2009. Its cryptographic protocol used a proof-of-work consensus mechanism where transactions are batched into blocks and chained together using a linked list of hash pointers (blockchain). Any server can produce a block by solving a computationally difficult puzzle (specifically finding a partial hash collision) called mining. The block commits to the entire history of bitcoin transactions as well as the new set of incoming transactions. The miner is rewarded some bitcoins for solving it. The double-spending problem persists, however, if two blocks (with conflicting transactions) are mined at the same approximate time. When servers inevitably disagree on the order of the two blocks, they each keep both blocks temporarily. As new blocks arrive, they must commit to one history or the other, and eventually a single chain will continue on, while the other(s) will not. Since the longest (more technically "heaviest") chain is considered to be the valid data set, miners are incentivized to only build blocks on the longest chain they know about in order for it to become part of that dataset (and for their reward to be valid). Transactions in this system are therefore never technically "final" as a conflicting chain of blocks can always outgrow the current canonical chain. However, as blocks are built on top of a transaction, it becomes increasingly unlikely/costly for another chain to overtake it. 51% attack[edit] The total computational power of a decentralized proof-of-work system is the sum of the computational power of the nodes, which can differ significantly due to the hardware used. Larger computational power increases the chance to win the mining reward for each new block mined, which creates an incentive to accumulate clusters of mining nodes, or mining pools. Any pool that achieves 51% hashing power can effectively overturn network transactions, resulting in double spending. One of the Bitcoin forks, Bitcoin Gold, was hit by such an attack in 2018 and then again in 2020.[5] A given cryptocurrency's susceptibility to attack depends on the existing hashing power of the network since the attacker needs to overcome it. For the attack to be economically viable, the market cap of the currency must be sufficiently large to justify the cost to rent hashing power.[6][7] In 2014, mining pool Ghash.io obtained 51% hashing power in Bitcoin which raised significant controversies about the safety of the network. The pool has voluntarily capped their hashing power at 39.99% and requested other pools to follow in order to restore trust in the network.[8] References[edit] ^ The Double Spending Problem and Cryptocurrencies. Banking & Insurance Journal. Social Science Research Network (SSRN). Accessed 24 December 2017. ^ a b c Mark Ryan. "Digital Cash". School of Computer Science, University of Birmingham. Retrieved 2017-05-27. ^ Jaap-Henk Hoepman (2008). "Distributed Double Spending Prevention". arXiv:0802.0832v1 [cs.CR]. ^ Osipkov, I.; Vasserman, E. Y.; Hopper, N.; Kim, Y. (2007). "Combating Double-Spending Using Cooperative P2P Systems". 27th International Conference on Distributed Computing Systems (ICDCS '07). p. 41. CiteSeerX 10.1.1.120.52. doi:10.1109/ICDCS.2007.91. ^ Canellis, David (2020-01-27). "Bitcoin Gold hit by 51% attacks, $72K in cryptocurrency double-spent". Hard Fork | The Next Web. Retrieved 2020-02-29. 
^ "Cost of a 51% Attack for Different Cryptocurrencies | Crypto51". www.crypto51.app. Retrieved 2020-02-29. ^ Varshney, Neer (2018-05-24). "Why Proof-of-work isn't suitable for small cryptocurrencies". Hard Fork | The Next Web. Retrieved 2018-05-25. ^ "Popular Bitcoin Mining Pool Promises To Restrict Its Compute Power To Prevent Feared '51%' Fiasco". TechCrunch. Retrieved 2020-02-29. v t e Cryptocurrencies Technology Blockchain Cryptocurrency tumbler Cryptocurrency exchange Cryptocurrency wallet Cryptographic hash function Distributed ledger Fork Lightning Network MetaMask Smart contract Consensus mechanisms Proof of authority Proof of personhood Proof of space Proof of stake Proof of work Proof of work currencies SHA-256-based Bitcoin Bitcoin Cash Counterparty LBRY MazaCoin Namecoin Peercoin Titcoin Ethash-based Ethereum Ethereum Classic Scrypt-based Auroracoin Bitconnect Coinye Dogecoin Litecoin Equihash-based Bitcoin Gold Zcash RandomX-based Monero X11-based Dash Petro Other AmbaCoin Firo IOTA Primecoin Verge Vertcoin Proof of stake currencies Cardano EOS.IO Gridcoin Nxt Peercoin Polkadot Steem Tezos TRON ERC-20 tokens Augur Aventus Bancor Basic Attention Token Chainlink Kin KodakCoin Minds The DAO Uniswap Stablecoins Dai Diem Tether USD Coin Other currencies Filecoin GNU Taler Hashgraph Nano NEO Ripple Stellar WhopperCoin Related topics Airdrop BitLicense Blockchain game Complementary currency Crypto-anarchism Cryptocurrency bubble Decentralized Finance Digital currency Double-spending Hyperledger Initial coin offering Initial exchange offering Initiative Q List of cryptocurrencies Non-fungible token Token money Virtual currency Category Commons List Retrieved from "https://en.wikipedia.org/w/index.php?title=Double-spending&oldid=1018658906" Categories: Digital currencies Financial cryptography Payment systems Internet fraud Distributed computing Cryptocurrencies Hidden categories: Articles with short description Short description matches Wikidata Navigation menu Personal tools Not logged in Talk Contributions Create account Log in Namespaces Article Talk Variants Views Read Edit View history More Search Navigation Main page Contents Current events Random article About Wikipedia Contact us Donate Contribute Help Learn to edit Community portal Recent changes Upload file Tools What links here Related changes Upload file Special pages Permanent link Page information Cite this page Wikidata item Print/export Download as PDF Printable version Languages العربية Español فارسی Français Italiano Português Русский 中文 Edit links This page was last edited on 19 April 2021, at 06:08 (UTC). Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization. Privacy policy About Wikipedia Disclaimers Contact Wikipedia Mobile view Developers Statistics Cookie statement en-wikipedia-org-1453 ---- BitTorrent - Wikipedia BitTorrent From Wikipedia, the free encyclopedia Jump to navigation Jump to search Peer-to-peer file sharing protocol This article is about the file sharing protocol. For other uses, see BitTorrent (disambiguation). 
BitTorrent Original author(s): Bram Cohen Developer(s): BitTorrent, Inc Initial release: 2001 Repository: github.com/bittorrent/bittorrent.org Operating systems: Android, iOS, Linux, macOS, Windows, other Standard: The BitTorrent Protocol Specification[1] Type: peer-to-peer file sharing License: Unknown Website: www.bittorrent.org BitTorrent (abbreviated to BT) is a communication protocol for peer-to-peer file sharing (P2P), which enables users to distribute data and electronic files over the Internet in a decentralized manner. BitTorrent is one of the most common protocols for transferring large files, such as digital video files containing TV shows and video clips, or digital audio files containing songs. As of February 2009, P2P networks have been estimated to collectively account for approximately 43% to 70% of Internet traffic, depending on location.[2] In February 2013, BitTorrent was responsible for 3.35% of all worldwide bandwidth – more than half of the 6% of total bandwidth dedicated to file sharing.[3] In 2019, BitTorrent was a dominant file sharing protocol and generated a substantial amount of Internet traffic, with 2.46% of downstream and 27.58% of upstream traffic.[4] To send or receive files, a person uses a BitTorrent client on their Internet-connected computer. A BitTorrent client is a computer program that implements the BitTorrent protocol. Popular clients include μTorrent, Xunlei Thunder,[5][6] Transmission, qBittorrent, Vuze, Deluge, BitComet and Tixati. BitTorrent trackers provide a list of files available for transfer and allow the client to find peer users, known as "seeds", who may transfer the files. Programmer Bram Cohen, a University at Buffalo alumnus,[7] designed the protocol in April 2001 and released the first available version on 2 July 2001.[8] As of June 2020, the most recent version was implemented in 2017.[1] BitTorrent clients are available for a variety of computing platforms and operating systems, including an official client released by BitTorrent, Inc. As of 2013, BitTorrent had 15–27 million concurrent users at any time.[9] As of January 2012, BitTorrent was utilized by 150 million active users.
Based on this figure, the total number of monthly users may be estimated at more than a quarter of a billion (≈ 250 million).[10] Torrenting may sometimes be limited by Internet Service Providers (ISPs) on legal or copyright grounds. In response, users may choose to run seedboxes or Virtual Private Networks (VPNs) as an alternative. On May 15, 2017, BitTorrent released an update to the protocol specification, called BitTorrent v2.[11][12] libtorrent was updated to support the new version on September 6, 2020.[13] [Figure: animation of protocol use. The colored dots beneath each computer represent different parts of the file being shared; by the time a copy of each part reaches one destination computer, copies of that part (or other parts) are already being transferred to other users.] Description [Figure: the middle computer acts as a "seed", providing a file to the other computers, which act as peers.] The BitTorrent protocol can be used to reduce the server and network impact of distributing large files. Rather than downloading a file from a single source server, the BitTorrent protocol allows users to join a "swarm" of hosts to upload to and download from each other simultaneously. The protocol is an alternative to the older single-source, multiple-mirror technique for distributing data, and can work effectively over networks with lower bandwidth. Using the BitTorrent protocol, several basic computers, such as home computers, can replace large servers while efficiently distributing files to many recipients. This lower bandwidth usage also helps prevent large spikes in internet traffic in a given area, keeping internet speeds higher for all users in general, regardless of whether or not they use the BitTorrent protocol. The first release of the BitTorrent client had no search engine and no peer exchange, so users who wanted to upload a file had to create a small torrent descriptor file that they would upload to a torrent index site. The first uploader acted as a seed, and downloaders would initially connect as peers (see the figure above). Those who wished to download the file would download the torrent, which their client would use to connect to a tracker which had a list of the IP addresses of other seeds and peers in the swarm. Once a peer completed a download of the complete file, it could in turn function as a seed. The file being distributed is divided into segments called pieces. As each peer receives a new piece of the file, it becomes a source (of that piece) for other peers, relieving the original seed from having to send that piece to every computer or user wishing a copy. With BitTorrent, the task of distributing the file is shared by those who want it; it is entirely possible for the seed to send only a single copy of the file itself and eventually distribute to an unlimited number of peers.
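As a concrete illustration of the piece model just described, the sketch below splits an in-memory payload into fixed-size pieces and computes one SHA-1 digest per piece, the hashes a v1 client records in the torrent descriptor. The piece size and the stand-in payload are illustrative assumptions.

```python
# Sketch of splitting a payload into fixed-size pieces and hashing each one,
# as a BitTorrent v1 client does when preparing a torrent. The piece size
# and the in-memory payload are illustrative assumptions.
import hashlib

PIECE_SIZE = 256 * 1024  # 256 KiB pieces (typical sizes are powers of two)

def split_into_pieces(data: bytes, piece_size: int = PIECE_SIZE) -> list[bytes]:
    return [data[i:i + piece_size] for i in range(0, len(data), piece_size)]

def piece_hashes(pieces: list[bytes]) -> list[bytes]:
    """One 20-byte SHA-1 digest per piece, as stored in the torrent descriptor."""
    return [hashlib.sha1(p).digest() for p in pieces]

payload = b"example content " * 100_000   # stand-in for a file read from disk
pieces = split_into_pieces(payload)
hashes = piece_hashes(pieces)
print(f"{len(pieces)} pieces, last piece {len(pieces[-1])} bytes")
print("first piece hash:", hashes[0].hex())
```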
Each piece is protected by a cryptographic hash contained in the torrent descriptor.[1] This ensures that any modification of the piece can be reliably detected, and thus prevents both accidental and malicious modifications of any of the pieces received at other nodes. If a node starts with an authentic copy of the torrent descriptor, it can verify the authenticity of the entire file it receives. Pieces are typically downloaded non-sequentially, and are rearranged into the correct order by the BitTorrent client, which monitors which pieces it needs, and which pieces it has and can upload to other peers. Pieces are of the same size throughout a single download (for example, a 10 MB file may be transmitted as ten 1 MB pieces or as forty 256 KB pieces). Due to the nature of this approach, the download of any file can be halted at any time and be resumed at a later date, without the loss of previously downloaded information, which in turn makes BitTorrent particularly useful in the transfer of larger files. This also enables the client to seek out readily available pieces and download them immediately, rather than halting the download and waiting for the next (and possibly unavailable) piece in line, which typically reduces the overall time of the download. This eventual transition from peers to seeders determines the overall "health" of the file (as determined by the number of times a file is available in its complete form). The distributed nature of BitTorrent can lead to a flood-like spreading of a file throughout many peer computer nodes. As more peers join the swarm, the likelihood of a successful download by any particular node increases. Relative to traditional Internet distribution schemes, this permits a significant reduction in the original distributor's hardware and bandwidth resource costs. Distributed downloading protocols in general provide redundancy against system problems, reduce dependence on the original distributor,[14] and provide sources for the file which are generally transient and therefore there is no single point of failure as in one way server-client transfers. Operation[edit] A BitTorrent client is capable of preparing, requesting, and transmitting any type of computer file over a network, using the protocol. Up until 2005, the only way to share files was by creating a small text file called a "torrent". These files contain metadata about the files to be shared and the trackers which keep track of the other seeds and peers. Users that want to download the file first obtain a torrent file for it, and connect to the tracker or seeds. In 2005, first Vuze and then the BitTorrent client introduced distributed tracking using distributed hash tables which allowed clients to exchange data on swarms directly without the need for a torrent file. In 2006, peer exchange functionality was added allowing clients to add peers based on the data found on connected nodes. Though both ultimately transfer files over a network, a BitTorrent download differs from a one way server-client download (as is typical with an HTTP or FTP request, for example) in several fundamental ways: BitTorrent makes many small data requests over different IP connections to different machines, while server-client downloading is typically made via a single TCP connection to a single machine. BitTorrent downloads in a random or in a "rarest-first"[15] approach that ensures high availability, while classic downloads are sequential. 
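A simplified sketch of the rarest-first selection mentioned above: count how many known peers advertise each piece and request the missing piece held by the fewest of them. The peer bitfields below are illustrative assumptions, not any particular client's internals.

```python
# Simplified rarest-first piece selection: among pieces we still need,
# request the one advertised by the fewest peers. The data structures are
# illustrative assumptions, not a specific client's implementation.
from collections import Counter

def rarest_first(have: set[int], peer_bitfields: dict[str, set[int]]):
    """Return the index of the rarest missing piece, or None if nothing is needed."""
    availability = Counter()
    for pieces in peer_bitfields.values():
        availability.update(pieces)
    wanted = [idx for idx in availability if idx not in have]
    if not wanted:
        return None
    return min(wanted, key=lambda idx: availability[idx])

peers = {
    "peer-a": {0, 1, 2, 3},
    "peer-b": {0, 1, 2},
    "peer-c": {0, 1},
}
print(rarest_first(have={0}, peer_bitfields=peers))  # 3: only peer-a has it
```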
Taken together, these differences allow BitTorrent to achieve much lower cost to the content provider, much higher redundancy, and much greater resistance to abuse or to "flash crowds" than regular server software. However, this protection, theoretically, comes at a cost: downloads can take time to rise to full speed because it may take time for enough peer connections to be established, and it may take time for a node to receive sufficient data to become an effective uploader. This contrasts with regular downloads (such as from an HTTP server, for example) that, while more vulnerable to overload and abuse, rise to full speed very quickly, and maintain this speed throughout. In the beginning, BitTorrent's non-contiguous download methods made it harder to support "streaming playback". In 2014, the client Popcorn Time allowed for streaming of BitTorrent video files. Since then, more and more clients are offering streaming options. Search queries[edit] The BitTorrent protocol provides no way to index torrent files. As a result, a comparatively small number of websites have hosted a large majority of torrents, many linking to copyrighted works without the authorization of copyright holders, rendering those sites especially vulnerable to lawsuits.[16] A BitTorrent index is a "list of .torrent files, which typically includes descriptions" and information about the torrent's content.[17] Several types of websites support the discovery and distribution of data on the BitTorrent network. Public torrent-hosting sites such as The Pirate Bay allow users to search and download from their collection of torrent files. Users can typically also upload torrent files for content they wish to distribute. Often, these sites also run BitTorrent trackers for their hosted torrent files, but these two functions are not mutually dependent: a torrent file could be hosted on one site and tracked by another unrelated site. Private host/tracker sites operate like public ones except that they may restrict access to registered users and may also keep track of the amount of data each user uploads and downloads, in an attempt to reduce "leeching". Web search engines allow the discovery of torrent files that are hosted and tracked on other sites; examples include The Pirate Bay, Torrentz, isoHunt and BTDigg. These sites allow the user to ask for content meeting specific criteria (such as containing a given word or phrase) and retrieve a list of links to torrent files matching those criteria. This list can often be sorted with respect to several criteria, relevance (seeders-leechers ratio) being one of the most popular and useful (due to the way the protocol behaves, the download bandwidth achievable is very sensitive to this value). Metasearch engines allow one to search several BitTorrent indices and search engines at once. The Tribler BitTorrent client was among the first to incorporate built-in search capabilities. With Tribler, users can find .torrent files held by random peers and taste buddies.[18] It adds such an ability to the BitTorrent protocol using a gossip protocol, somewhat similar to the eXeem network which was shut down in 2005. The software includes the ability to recommend content as well. 
After a dozen downloads, the Tribler software can roughly estimate the download taste of the user, and recommend additional content.[19] In May 2007, researchers at Cornell University published a paper proposing a new approach to searching a peer-to-peer network for inexact strings,[20] which could replace the functionality of a central indexing site. A year later, the same team implemented the system as a plugin for Vuze called Cubit[21] and published a follow-up paper reporting its success.[22] A somewhat similar facility but with a slightly different approach is provided by the BitComet client through its "Torrent Exchange"[23] feature. Whenever two peers using BitComet (with Torrent Exchange enabled) connect to each other they exchange lists of all the torrents (name and info-hash) they have in the Torrent Share storage (torrent files which were previously downloaded and for which the user chose to enable sharing by Torrent Exchange). Thus each client builds up a list of all the torrents shared by the peers it connected to in the current session (or it can even maintain the list between sessions if instructed). At any time the user can search into that Torrent Collection list for a certain torrent and sort the list by categories. When the user chooses to download a torrent from that list, the .torrent file is automatically searched for (by info-hash value) in the DHT Network and when found it is downloaded by the querying client which can after that create and initiate a downloading task. Downloading torrents and sharing files[edit] Users find a torrent of interest on a torrent index site or by using a search engine built into the client, download it, and open it with a BitTorrent client. The client connects to the tracker(s) or seeds specified in the torrent file, from which it receives a list of seeds and peers currently transferring pieces of the file(s). The client connects to those peers to obtain the various pieces. If the swarm contains only the initial seeder, the client connects directly to it, and begins to request pieces. Clients incorporate mechanisms to optimize their download and upload rates. The effectiveness of this data exchange depends largely on the policies that clients use to determine to whom to send data. Clients may prefer to send data to peers that send data back to them (a "tit for tat" exchange scheme), which encourages fair trading. But strict policies often result in suboptimal situations, such as when newly joined peers are unable to receive any data because they don't have any pieces yet to trade themselves or when two peers with a good connection between them do not exchange data simply because neither of them takes the initiative. To counter these effects, the official BitTorrent client program uses a mechanism called "optimistic unchoking", whereby the client reserves a portion of its available bandwidth for sending pieces to random peers (not necessarily known good partners, so called preferred peers) in hopes of discovering even better partners and to ensure that newcomers get a chance to join the swarm.[24] Although "swarming" scales well to tolerate "flash crowds" for popular content, it is less useful for unpopular or niche market content. Peers arriving after the initial rush might find the content unavailable and need to wait for the arrival of a "seed" in order to complete their downloads. The seed arrival, in turn, may take long to happen (this is termed the "seeder promotion problem"). 
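The tit-for-tat and optimistic-unchoking policy described earlier in this section can be sketched roughly as follows; the slot count, rates, and peer names are illustrative assumptions, not the official client's exact algorithm.

```python
# Rough sketch of a tit-for-tat unchoke round with one optimistic slot:
# unchoke the peers that recently sent us the most data, plus one random
# peer so newcomers can prove themselves. Rates and slot counts are
# illustrative assumptions.
import random

REGULAR_SLOTS = 3  # peers unchoked because they upload to us the fastest

def choose_unchoked(download_rates: dict[str, float]) -> set[str]:
    ranked = sorted(download_rates, key=download_rates.get, reverse=True)
    unchoked = set(ranked[:REGULAR_SLOTS])
    remaining = [peer for peer in ranked if peer not in unchoked]
    if remaining:
        unchoked.add(random.choice(remaining))  # optimistic unchoke
    return unchoked

rates = {"p1": 120.0, "p2": 95.0, "p3": 80.0, "p4": 10.0, "p5": 0.0}
print(choose_unchoked(rates))  # the three fastest peers plus one random extra
```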
Since maintaining seeds for unpopular content entails high bandwidth and administrative costs, this runs counter to the goals of publishers that value BitTorrent as a cheap alternative to a client-server approach. This occurs on a huge scale; measurements have shown that 38% of all new torrents become unavailable within the first month.[25] A strategy adopted by many publishers which significantly increases availability of unpopular content consists of bundling multiple files in a single swarm.[26] More sophisticated solutions have also been proposed; generally, these use cross-torrent mechanisms through which multiple torrents can cooperate to better share content.[27] Creating and publishing torrents[edit] The peer distributing a data file treats the file as a number of identically sized pieces, usually with byte sizes of a power of 2, and typically between 32 kB and 16 MB each. The peer creates a hash for each piece, using the SHA-1 hash function, and records it in the torrent file. Pieces with sizes greater than 512 kB will reduce the size of a torrent file for a very large payload, but is claimed to reduce the efficiency of the protocol.[28] When another peer later receives a particular piece, the hash of the piece is compared to the recorded hash to test that the piece is error-free.[1] Peers that provide a complete file are called seeders, and the peer providing the initial copy is called the initial seeder. The exact information contained in the torrent file depends on the version of the BitTorrent protocol. By convention, the name of a torrent file has the suffix .torrent. Torrent files have an "announce" section, which specifies the URL of the tracker, and an "info" section, containing (suggested) names for the files, their lengths, the piece length used, and a SHA-1 hash code for each piece, all of which are used by clients to verify the integrity of the data they receive. Though SHA-1 has shown signs of cryptographic weakness, Bram Cohen did not initially consider the risk big enough for a backward incompatible change to, for example, SHA-3. As of BitTorrent v2 the hash function has been updated to SHA-256.[29] In the early days, torrent files were typically published to torrent index websites, and registered with at least one tracker. The tracker maintained lists of the clients currently connected to the swarm.[1] Alternatively, in a trackerless system (decentralized tracking) every peer acts as a tracker. Azureus was the first[30] BitTorrent client to implement such a system through the distributed hash table (DHT) method. An alternative and incompatible DHT system, known as Mainline DHT, was released in the Mainline BitTorrent client three weeks later (though it had been in development since 2002)[30] and subsequently adopted by the μTorrent, Transmission, rTorrent, KTorrent, BitComet, and Deluge clients. After the DHT was adopted, a "private" flag – analogous to the broadcast flag – was unofficially introduced, telling clients to restrict the use of decentralized tracking regardless of the user's desires.[31] The flag is intentionally placed in the info section of the torrent so that it cannot be disabled or removed without changing the identity of the torrent. The purpose of the flag is to prevent torrents from being shared with clients that do not have access to the tracker. 
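Putting the torrent-file description above together, the sketch below assembles a minimal single-file v1 metadata dictionary, including the unofficial private flag, bencodes it with a hand-rolled encoder, and derives the info-hash as the SHA-1 of the bencoded "info" section. The tracker URL, file details, placeholder piece hashes, and the minimal bencoder are illustrative assumptions rather than a complete implementation.

```python
# Sketch of a minimal single-file v1 torrent: build the "announce"/"info"
# layout described above, bencode it, and derive the info-hash (SHA-1 of the
# bencoded "info" dictionary). The tiny bencoder handles only the types used
# here; the tracker URL and file details are illustrative assumptions.
import hashlib

def bencode(value) -> bytes:
    if isinstance(value, int):
        return b"i%de" % value
    if isinstance(value, str):
        value = value.encode()
    if isinstance(value, bytes):
        return b"%d:%s" % (len(value), value)
    if isinstance(value, list):
        return b"l" + b"".join(bencode(v) for v in value) + b"e"
    if isinstance(value, dict):  # bencoded dictionaries use sorted keys
        return b"d" + b"".join(bencode(k) + bencode(value[k]) for k in sorted(value)) + b"e"
    raise TypeError(f"cannot bencode {type(value)!r}")

piece_length = 262144
metadata = {
    "announce": "http://tracker.example.org:6969/announce",  # hypothetical tracker
    "info": {
        "name": "example-file.bin",
        "length": 2 * piece_length + 100_000,
        "piece length": piece_length,
        "pieces": b"\x00" * 20 * 3,  # placeholder for 3 concatenated SHA-1 digests
        "private": 1,                # unofficial flag: use only the listed tracker(s)
    },
}

torrent_bytes = bencode(metadata)                                # contents of the .torrent file
info_hash = hashlib.sha1(bencode(metadata["info"])).hexdigest()  # identifies the torrent
print(len(torrent_bytes), "bytes of metadata; info-hash:", info_hash)
```

A real client would also handle multi-file torrents and the many optional keys, but the announce/info split and the info-hash derivation are the parts the surrounding text describes; the placeholder digests stand in for piece hashes produced as in the earlier piece-hashing sketch.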
The flag was requested for inclusion in the official specification in August 2008, but has not been accepted yet.[32] Clients that have ignored the private flag were banned by many trackers, discouraging the practice.[33] Anonymity[edit] BitTorrent does not, on its own, offer its users anonymity. One can usually see the IP addresses of all peers in a swarm in one's own client or firewall program. This may expose users with insecure systems to attacks.[24] In some countries, copyright organizations scrape lists of peers, and send takedown notices to the internet service provider of users participating in the swarms of files that are under copyright. In some jurisdictions, copyright holders may launch lawsuits against uploaders or downloaders for infringement, and police may arrest suspects in such cases. Various means have been used to promote anonymity. For example, the BitTorrent client Tribler makes available a Tor-like onion network, optionally routing transfers through other peers to obscure which client has requested the data. The exit node would be visible to peers in a swarm, but the Tribler organization provides exit nodes. One advantage of Tribler is that clearnet torrents can be downloaded with only a small decrease in download speed from one "hop" of routing. i2p provides a similar anonymity layer although in that case, one can only download torrents that have been uploaded to the i2p network.[34] The bittorrent client Vuze allows users who are not concerned about anonymity to take clearnet torrents, and make them available on the i2p network.[35] Most BitTorrent clients are not designed to provide anonymity when used over Tor,[36] and there is some debate as to whether torrenting over Tor acts as a drag on the network.[37] Private torrent trackers are usually invitation only, and require members to participate in uploading, but have the downside of a single centralized point of failure. Oink's Pink Palace and What.cd are examples of private trackers which have been shut down. Seedbox services download the torrent files first to the company's servers, allowing the user to direct download the file from there.[38][39] One's IP address would be visible to the Seedbox provider, but not to third parties. Virtual private networks encrypt transfers, and substitute a different IP address for the user's, so that anyone monitoring a torrent swarm will only see that address. BitTorrent v2[edit] BitTorrent v2 is intended to work seamlessly with previous versions of the BitTorrent protocol. The main reason for the update was that the old cryptographic hash function, SHA-1 is no longer considered safe from malicious attacks by the developers, and as such, v2 uses SHA-256. To ensure backwards compatibility, the v2 .torrent file format supports a hybrid mode where the torrents are hashed through both the new method and the old method, with the intent that the files will be shared with peers on both v1 and v2 swarms. Another update to the specification is adding a hash tree to speed up time from adding a torrent to downloading files, and to allow more granular checks for file corruption. In addition, each file is now hashed individually, enabling files in the swarm to be deduplicated, so that if multiple torrents include the same files, but seeders are only seeding the file from some, downloaders of the other torrents can still download the file. 
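A simplified illustration of the per-file hashing idea described for BitTorrent v2: hash a file's 16 KiB blocks with SHA-256 and fold them into a Merkle root, so identical files yield identical roots and can be deduplicated across swarms. The padding rule and tree layout here are simplifications, not the v2 specification's exact algorithm.

```python
# Simplified per-file Merkle root over 16 KiB blocks with SHA-256, in the
# spirit of BitTorrent v2's per-file hash trees. Padding and exact layout
# are simplified; this is a sketch, not the specification's algorithm.
import hashlib

BLOCK = 16 * 1024

def merkle_root(data: bytes) -> bytes:
    layer = [hashlib.sha256(data[i:i + BLOCK]).digest()
             for i in range(0, len(data), BLOCK)] or [hashlib.sha256(b"").digest()]
    while len(layer) > 1:
        if len(layer) % 2:              # duplicate the last node on odd layers
            layer.append(layer[-1])     # (the real v2 tree pads differently)
        layer = [hashlib.sha256(layer[i] + layer[i + 1]).digest()
                 for i in range(0, len(layer), 2)]
    return layer[0]

file_a = b"shared content" * 10_000
file_b = b"shared content" * 10_000
print(merkle_root(file_a) == merkle_root(file_b))  # identical files share a root
```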
Magnet links for v2 also support a hybrid mode to ensure support for legacy clients.[40] Adoption[edit] A growing number of individuals and organizations are using BitTorrent to distribute their own or licensed works (e.g. indie bands distributing digital files of their new songs). Independent adopters report that without using BitTorrent technology, and its dramatically reduced demands on their private networking hardware and bandwidth, they could not afford to distribute their files.[41] Some uses of BitTorrent for file sharing may violate laws in some jurisdictions (see legal issues section). Film, video, and music[edit] BitTorrent Inc. has obtained a number of licenses from Hollywood studios for distributing popular content from their websites.[citation needed] Sub Pop Records releases tracks and videos via BitTorrent Inc.[42] to distribute its 1000+ albums. Babyshambles and The Libertines (both bands associated with Pete Doherty) have extensively used torrents to distribute hundreds of demos and live videos. US industrial rock band Nine Inch Nails frequently distributes albums via BitTorrent. Podcasting software is starting to integrate BitTorrent to help podcasters deal with the download demands of their MP3 "radio" programs. Specifically, Juice and Miro (formerly known as Democracy Player) support automatic processing of .torrent files from RSS feeds. Similarly, some BitTorrent clients, such as μTorrent, are able to process web feeds and automatically download content found within them. DGM Live purchases are provided via BitTorrent.[43] VODO, a service which distributes "free-to-share" movies and TV shows via BitTorrent.[44][45][46] Broadcasters[edit] In 2008, the CBC became the first public broadcaster in North America to make a full show (Canada's Next Great Prime Minister) available for download using BitTorrent.[47] The Norwegian Broadcasting Corporation (NRK) has since March 2008 experimented with bittorrent distribution, available online.[48] Only selected works in which NRK owns all royalties are published. Responses have been very positive, and NRK is planning to offer more content. The Dutch VPRO broadcasting organization released four documentaries in 2009 and 2010 under a Creative Commons license using the content distribution feature of the Mininova tracker.[49][50][51] Personal works[edit] The Amazon S3 "Simple Storage Service" is a scalable Internet-based storage service with a simple web service interface, equipped with built-in BitTorrent support.[52] Software[edit] Blizzard Entertainment uses BitTorrent (via a proprietary client called the "Blizzard Downloader", associated with the Blizzard "BattleNet" network) to distribute content and patches for Diablo III, StarCraft II and World of Warcraft, including the games themselves.[53] Wargaming uses BitTorrent in their popular titles World of Tanks, World of Warships and World of Warplanes to distribute game updates.[54] CCP Games, maker of the space Simulation MMORPG Eve Online, has announced that a new launcher will be released that is based on BitTorrent.[55][56] Many software games, especially those whose large size makes them difficult to host due to bandwidth limits, extremely frequent downloads, and unpredictable changes in network traffic, will distribute instead a specialized, stripped down bittorrent client with enough functionality to download the game from the other running clients and the primary server (which is maintained in case not enough peers are available). 
Many major open source and free software projects encourage BitTorrent as well as conventional downloads of their products (via HTTP, FTP etc.) to increase availability and to reduce load on their own servers, especially when dealing with larger files.[57] Government[edit] The British government used BitTorrent to distribute details about how the tax money of British citizens was spent.[58][59] Education[edit] Florida State University uses BitTorrent to distribute large scientific data sets to its researchers.[60] Many universities that have BOINC distributed computing projects have used the BitTorrent functionality of the client-server system to reduce the bandwidth costs of distributing the client-side applications used to process the scientific data. If a BOINC distributed computing application needs to be updated (or merely sent to a user), it can do so with little impact on the BOINC server.[61] The developing Human Connectome Project uses BitTorrent to share their open dataset.[62] Academic Torrents is a BitTorrent tracker for use by researchers in fields that need to share large datasets[63][64] Others[edit] Facebook uses BitTorrent to distribute updates to Facebook servers.[65] Twitter uses BitTorrent to distribute updates to Twitter servers.[66][67] The Internet Archive added BitTorrent to its file download options for over 1.3 million existing files, and all newly uploaded files, in August 2012.[68][69] This method is the fastest means of downloading media from the Archive.[68][70] As of 2011[update], BitTorrent had 100 million users and a greater share of network bandwidth than Netflix and Hulu combined.[71][72] In early 2015, AT&T estimates that BitTorrent represents 20% of all broadband traffic.[73] Routers that use network address translation (NAT) must maintain tables of source and destination IP addresses and ports. Typical home routers are limited to about 2000 table entries[citation needed] while some more expensive routers have larger table capacities. BitTorrent frequently contacts 20–30 servers per second, rapidly filling the NAT tables. This is a known cause of some home routers ceasing to work correctly.[74][75] Technologies built on BitTorrent[edit] Distributed trackers[edit] On 2 May 2005, Azureus 2.3.0.0 (now known as Vuze) was released,[76] introducing support for "trackerless" torrents through a system called the "distributed database." This system is a Distributed hash table implementation which allows the client to use torrents that do not have a working BitTorrent tracker. Instead just bootstrapping server is used (router.bittorrent.com, dht.transmissionbt.com or router.utorrent.com[77][78]). The following month, BitTorrent, Inc. released version 4.2.0 of the Mainline BitTorrent client, which supported an alternative DHT implementation (popularly known as "Mainline DHT", outlined in a draft on their website) that is incompatible with that of Azureus. In 2014, measurement showed concurrent users of Mainline DHT to be from 10 million to 25 million, with a daily churn of at least 10 million.[79] Current versions of the official BitTorrent client, μTorrent, BitComet, Transmission and BitSpirit all share compatibility with Mainline DHT. Both DHT implementations are based on Kademlia.[80] As of version 3.0.5.0, Azureus also supports Mainline DHT in addition to its own distributed database through use of an optional application plugin.[81] This potentially allows the Azureus/Vuze client to reach a bigger swarm. 
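Both DHT flavours mentioned above are based on Kademlia, where node IDs and info-hashes share the same 160-bit space and "closeness" is the XOR of two IDs; a lookup repeatedly queries the known nodes closest to the target info-hash. The sketch below shows that distance metric; the node names are made-up assumptions.

```python
# Sketch of the Kademlia XOR distance used by the DHT implementations noted
# above: node IDs and info-hashes live in one 160-bit space, and a lookup
# queries the known nodes whose IDs are XOR-closest to the target.
# The node names below are illustrative assumptions.
import hashlib

def node_id(seed: str) -> int:
    return int.from_bytes(hashlib.sha1(seed.encode()).digest(), "big")

def xor_distance(a: int, b: int) -> int:
    return a ^ b

target = node_id("example torrent info dict")  # stand-in for an info-hash
known_nodes = {name: node_id(name) for name in ["node-a", "node-b", "node-c", "node-d"]}

closest = sorted(known_nodes, key=lambda n: xor_distance(known_nodes[n], target))[:2]
print("query these nodes first:", closest)
```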
Another idea that has surfaced in Vuze is that of virtual torrents. This idea is based on the distributed tracker approach and is used to describe some web resource. Currently, it is used for instant messaging. It is implemented using a special messaging protocol and requires an appropriate plugin. Anatomic P2P is another approach, which uses a decentralized network of nodes that route traffic to dynamic trackers. Most BitTorrent clients also use Peer exchange (PEX) to gather peers in addition to trackers and DHT. Peer exchange checks with known peers to see if they know of any other peers. With the 3.0.5.0 release of Vuze, all major BitTorrent clients now have compatible peer exchange. Web seeding[edit] Web "seeding" was implemented in 2006 as the ability of BitTorrent clients to download torrent pieces from an HTTP source in addition to the "swarm". The advantage of this feature is that a website may distribute a torrent for a particular file or batch of files and make those files available for download from that same web server; this can simplify long-term seeding and load balancing through the use of existing, cheap, web hosting setups. In theory, this would make using BitTorrent almost as easy for a web publisher as creating a direct HTTP download. In addition, it would allow the "web seed" to be disabled if the swarm becomes too popular while still allowing the file to be readily available. This feature has two distinct specifications, both of which are supported by Libtorrent and the 26+ clients that use it. Hash web seeding[edit] The first was created by John "TheSHAD0W" Hoffman, who created BitTornado.[82][83] This first specification requires running a web service that serves content by info-hash and piece number, rather than filename. HTTP web seeding[edit] The other specification is created by GetRight authors and can rely on a basic HTTP download space (using byte serving).[84][85] Other[edit] In September 2010, a new service named Burnbit was launched which generates a torrent from any URL using webseeding.[86] There are server-side solutions that provide initial seeding of the file from the web server via standard BitTorrent protocol and when the number of external seeders reach a limit, they stop serving the file from the original source.[87] RSS feeds[edit] Main article: Broadcatching A technique called broadcatching combines RSS feeds with the BitTorrent protocol to create a content delivery system, further simplifying and automating content distribution. Steve Gillmor explained the concept in a column for Ziff-Davis in December 2003.[88] The discussion spread quickly among bloggers (Ernest Miller,[89] Chris Pirillo, etc.). In an article entitled Broadcatching with BitTorrent, Scott Raymond explained: I want RSS feeds of BitTorrent files. A script would periodically check the feed for new items, and use them to start the download. Then, I could find a trusted publisher of an Alias RSS feed, and "subscribe" to all new episodes of the show, which would then start downloading automatically – like the "season pass" feature of the TiVo. — Scott Raymond, scottraymond.net[90] The RSS feed will track the content, while BitTorrent ensures content integrity with cryptographic hashing of all data, so feed subscribers will receive uncorrupted content. One of the first and popular software clients (free and open source) for broadcatching is Miro. Other free software clients such as PenguinTV and KatchTV are also now supporting broadcatching. 
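The broadcatching flow described above can be sketched as scanning an RSS feed for torrent enclosures and handing each one to a client. The inline feed is a made-up sample, and start_download is a hypothetical stand-in for whatever client API or command-line call would actually be used.

```python
# Sketch of broadcatching: scan an RSS feed for torrent enclosures and queue
# each URL for download. The feed content is an inline illustrative sample,
# and start_download is a hypothetical placeholder for a real client call.
import xml.etree.ElementTree as ET

FEED = """<rss version="2.0"><channel>
  <item><title>Episode 1</title>
    <enclosure url="http://feeds.example.org/ep1.torrent" type="application/x-bittorrent"/>
  </item>
  <item><title>Episode 2</title>
    <enclosure url="http://feeds.example.org/ep2.torrent" type="application/x-bittorrent"/>
  </item>
</channel></rss>"""

def torrent_enclosures(feed_xml: str) -> list[str]:
    root = ET.fromstring(feed_xml)
    return [enc.get("url") for enc in root.iter("enclosure")
            if enc.get("type") == "application/x-bittorrent"]

def start_download(url: str) -> None:
    print("would queue torrent:", url)   # placeholder for a real client call

for url in torrent_enclosures(FEED):
    start_download(url)
```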
The BitTorrent web-service MoveDigital added the ability to make torrents available to any web application capable of parsing XML through its standard REST-based interface in 2006,[91] though this has since been discontinued. Additionally, Torrenthut is developing a similar torrent API that will provide the same features, and help bring the torrent community to Web 2.0 standards. Alongside this release is a first PHP application built using the API called PEP, which will parse any Really Simple Syndication (RSS 2.0) feed and automatically create and seed a torrent for each enclosure found in that feed.[92] Throttling and encryption[edit] Main article: BitTorrent protocol encryption Since BitTorrent makes up a large proportion of total traffic, some ISPs have chosen to "throttle" (slow down) BitTorrent transfers. For this reason, methods have been developed to disguise BitTorrent traffic in an attempt to thwart these efforts.[93] Protocol header encrypt (PHE) and Message stream encryption/Protocol encryption (MSE/PE) are features of some BitTorrent clients that attempt to make BitTorrent hard to detect and throttle. As of November 2015, Vuze, Bitcomet, KTorrent, Transmission, Deluge, μTorrent, MooPolice, Halite, qBittorrent, rTorrent, and the latest official BitTorrent client (v6) support MSE/PE encryption. In August 2007, Comcast was preventing BitTorrent seeding by monitoring and interfering with the communication between peers. Protection against these efforts is provided by proxying the client-tracker traffic via an encrypted tunnel to a point outside of the Comcast network.[94] In 2008, Comcast called a "truce" with BitTorrent, Inc. with the intention of shaping traffic in a protocol-agnostic manner.[95] Questions about the ethics and legality of Comcast's behavior have led to renewed debate about net neutrality in the United States.[96] In general, although encryption can make it difficult to determine what is being shared, BitTorrent is vulnerable to traffic analysis. Thus, even with MSE/PE, it may be possible for an ISP to recognize BitTorrent and also to determine that a system is no longer downloading but only uploading data, and terminate its connection by injecting TCP RST (reset flag) packets. Multitracker[edit] Another unofficial feature is an extension to the BitTorrent metadata format proposed by John Hoffman[97] and implemented by several indexing websites. It allows the use of multiple trackers per file, so if one tracker fails, others can continue to support file transfer. It is implemented in several clients, such as BitComet, BitTornado, BitTorrent, KTorrent, Transmission, Deluge, μTorrent, rtorrent, Vuze, and Frostwire. Trackers are placed in groups, or tiers, with a tracker randomly chosen from the top tier and tried, moving to the next tier if all the trackers in the top tier fail. Torrents with multiple trackers can decrease the time it takes to download a file, but also have a few consequences: Poorly implemented[98] clients may contact multiple trackers, leading to more overhead-traffic. Torrents from closed trackers suddenly become downloadable by non-members, as they can connect to a seed via an open tracker. Peer selection[edit] As of December 2008[update], BitTorrent, Inc. is working with Oversi on new Policy Discover Protocols that query the ISP for capabilities and network architecture information. 
Oversi's ISP hosted NetEnhancer box is designed to "improve peer selection" by helping peers find local nodes, improving download speeds while reducing the loads into and out of the ISP's network.[99] Implementations[edit] Main article: Comparison of BitTorrent clients The BitTorrent specification is free to use and many clients are open source, so BitTorrent clients have been created for all common operating systems using a variety of programming languages. The official BitTorrent client, μTorrent, qBittorrent, Transmission, Vuze, and BitComet are some of the most popular clients.[100][101][102][103] Some BitTorrent implementations such as MLDonkey and Torrentflux are designed to run as servers. For example, this can be used to centralize file sharing on a single dedicated server which users share access to on the network.[104] Server-oriented BitTorrent implementations can also be hosted by hosting providers at co-located facilities with high bandwidth Internet connectivity (e.g., a datacenter) which can provide dramatic speed benefits over using BitTorrent from a regular home broadband connection. Services such as ImageShack can download files on BitTorrent for the user, allowing them to download the entire file by HTTP once it is finished. The Opera web browser supports BitTorrent,[105] as does Wyzo and Brave.[106] BitLet allows users to download Torrents directly from their browser using a Java applet. An increasing number of hardware devices are being made to support BitTorrent. These include routers and NAS devices containing BitTorrent-capable firmware like OpenWrt. Proprietary versions of the protocol which implement DRM, encryption, and authentication are found within managed clients such as Pando. Legal issues[edit] Main article: Legal issues with BitTorrent Although the protocol itself is legal,[107] problems stem from using the protocol to traffic copyright infringing works, since BitTorrent is often used to download otherwise paid content, such as movies and video games. There has been much controversy over the use of BitTorrent trackers. BitTorrent metafiles themselves do not store file contents. Whether the publishers of BitTorrent metafiles violate copyrights by linking to copyrighted works without the authorization of copyright holders is controversial. Various jurisdictions have pursued legal action against websites that host BitTorrent trackers. High-profile examples include the closing of Suprnova.org, TorrentSpy, LokiTorrent, BTJunkie, Mininova, Oink's Pink Palace and What.cd. The Pirate Bay torrent website, formed by a Swedish group, is noted for the "legal" section of its website in which letters and replies on the subject of alleged copyright infringements are publicly displayed. On 31 May 2006, The Pirate Bay's servers in Sweden were raided by Swedish police on allegations by the MPAA of copyright infringement;[108] however, the tracker was up and running again three days later. In the study used to value NBC Universal in its merger with Comcast, Envisional examined the 10,000 torrent swarms managed by PublicBT which had the most active downloaders. 
After excluding pornographic and unidentifiable content, it was found that only one swarm offered legitimate content.[109] In the United States, more than 200,000 lawsuits have been filed for copyright infringement on BitTorrent since 2010.[110] On 30 April 2012, the High Court of Justice ordered five ISPs to block BitTorrent search engine The Pirate Bay.[111] (see List of websites blocked in the United Kingdom) Security problems[edit] One concern is the UDP flood attack. BitTorrent implementations often use μTP for their communication. To achieve high bandwidths, the underlying protocol used is UDP, which allows spoofing of source addresses of internet traffic. It has been possible to carry out Denial-of-service attacks in a P2P lab environment, where users running BitTorrent clients act as amplifiers for an attack at another service.[112] However this is not always an effective attack because ISPs can check if the source address is correct. Malware[edit] Several studies on BitTorrent found files containing malware, available for download. In particular, one small sample[113] indicated that 18% of all executable programs available for download contained malware. Another study[114] claims that as much as 14.5% of BitTorrent downloads contain zero-day malware, and that BitTorrent was used as the distribution mechanism for 47% of all zero-day malware they have found. See also[edit] Anonymous P2P Napster Gnutella Anti-Counterfeiting Trade Agreement Bencode Cache Discovery Protocol Comparison of BitTorrent clients Comparison of BitTorrent sites Comparison of BitTorrent tracker software FastTrack Glossary of BitTorrent terms Magnet URI scheme μTP (Micro Transport Protocol) Peer-to-peer file sharing Segmented file transfer Simple file verification Super-seeding Torrent file Torrent poisoning VPN References[edit] ^ a b c d e Cohen, Bram (October 2002). "BitTorrent Protocol 1.0". BitTorrent.org. Archived from the original on 8 February 2014. Retrieved 1 June 2020. ^ Schulze, Hendrik; Klaus Mochalski (2009). "Internet Study 2008/2009" (PDF). Leipzig, Germany: ipoque. Archived from the original (PDF) on 26 June 2011. Retrieved 3 October 2011. Peer-to-peer file sharing (P2P) still generates by far the most traffic in all monitored regions – ranging from 43% in Northern Africa to 70% Eastern Europe. ^ "Application Usage & Threat Report". Palo Alto Networks. 2013. Archived from the original on 31 October 2013. Retrieved 7 April 2013. ^ Marozzo, Fabrizio; Talia, Domenico; Trunfio, Paolo (2020). "A Sleep-and-Wake technique for reducing energy consumption in BitTorrent networks". Concurrency and Computation: Practice and Experience. 32 (14). doi:10.1002/cpe.5723. ISSN 1532-0634. S2CID 215841734. ^ Van der Sar, Ernesto (4 December 2009). "Thunder Blasts uTorrent's Market Share Away - TorrentFreak". TorrentFreak. Archived from the original on 20 February 2016. Retrieved 18 June 2018. ^ "迅雷-全球共享计算与区块链创领者". www.xunlei.com. Retrieved 21 November 2019. ^ "UB Engineering Tweeter". University at Buffalo's School of Engineering and Applied Sciences. Archived from the original on 11 November 2013. ^ Cohen, Bram (2 July 2001). "BitTorrent – a new P2P app". Yahoo eGroups. Archived from the original on 29 January 2008. Retrieved 15 April 2007. ^ Wang, Liang; Kangasharju, J. (1 September 2013). "Measuring large-scale distributed systems: Case of Bit Torrent Mainline DHT". IEEE P2P 2013 Proceedings. pp. 1–10. doi:10.1109/P2P.2013.6688697. ISBN 978-1-4799-0515-7. S2CID 5659252. 
en-wikipedia-org-2210 ---- Chilling effect - Wikipedia

Chilling effect

For other uses, see Chilling effect (disambiguation). Not to be confused with Chilling Effects.
In a legal context, a chilling effect is the inhibition or discouragement of the legitimate exercise of natural and legal rights by the threat of legal sanction.[1] The right that is most often described as being suppressed by a chilling effect is the US constitutional right to free speech. A chilling effect may be caused by legal actions such as the passing of a law, the decision of a court, or the threat of a lawsuit; any legal action that would cause people to hesitate to exercise a legitimate right (freedom of speech or otherwise) for fear of legal repercussions. When that fear is brought about by the threat of a libel lawsuit, it is called libel chill.[2] A lawsuit initiated specifically for the purpose of creating a chilling effect may be called a Strategic Lawsuit Against Public Participation ("SLAPP"). "Chilling" in this context normally implies an undesirable slowing. Outside the legal context, in common usage any coercion or threat of coercion (or other unpleasantness) can have a chilling effect on a group of people regarding a specific behavior, and the effect can often be statistically measured or plainly observed. Examples include the news headline "Flood insurance [price] spikes have chilling effect on some home sales"[3] and the title of a two-part survey of 160 college students involved in dating relationships, "The chilling effect of aggressive potential on the expression of complaints in intimate relationships."[4]

Usage

In United States and Canadian law, the term chilling effects refers to the stifling effect that vague or excessively broad laws may have on legitimate speech activity.[5] However, the term is also now commonly used outside American legal jargon, such as the chilling effects of high prices[3] or of corrupt police, or of "anticipated aggressive repercussions" in, say, personal relationships.[4] A chilling effect is an effect that reduces, suppresses, discourages, delays, or otherwise retards the reporting of concerns of any kind. An example of the "chilling effect" in Canadian case law can be found in Iorfida v. MacIntyre, where the constitutionality of a criminal law prohibiting the publication of literature depicting illicit drug use was challenged.
The court found that the law had a "chilling effect" on legitimate forms of expression and could stifle political debate on issues such as the legalization of marijuana.[6] The court noted that it did not adopt the same "chilling effect" analysis used in American law but considered the chilling effect of the law as a part of its own analysis.[7] Regarding the Ömer Faruk Gergerlioğlu case in Turkey, a press release of the Office of the United Nations High Commissioner for Human Rights (OHCHR) stated that Turkey's misuse of counter-terrorism measures can have a chilling effect on the enjoyment of fundamental freedoms and human rights.[8]

History

In 1644 John Milton expressed the chilling effect of censorship in Areopagitica: For to distrust the judgement and the honesty of one who hath but a common repute in learning and never yet offended, as not to count him fit to print his mind without a tutor or examiner, lest he should drop a schism or something of corruption, is the greatest displeasure and indignity to a free and knowing spirit that can be put upon him.[9] The term chilling effect has been in use in the United States since as early as 1950.[10] The United States Supreme Court first referred to the "chilling effect" in the context of the United States Constitution in Wieman v. Updegraff in 1952.[11] It became further established as a legal term when William J. Brennan, a justice of the United States Supreme Court, used it in a judicial decision (Lamont v. Postmaster General) which overturned a law requiring a postal patron receiving "communist political propaganda"[12] to specifically authorize the delivery.[13] The Lamont case, however, did not center on a law that explicitly stifles free speech. The "chilling effect" referred to at the time was a "deterrent effect" on freedom of expression, even when there is no law explicitly prohibiting it. However, in general, "chilling effect" is now often used in reference to laws or actions that do not explicitly prohibit legitimate speech, but that impose undue burdens.[13][failed verification]

Chilling effects on Wikipedia users

Edward Snowden disclosed in 2013 that the US government's Upstream program was collecting data on people reading Wikipedia articles. This revelation had a significant impact on readers' self-censorship, as shown by the fact that there were substantially fewer views of articles related to terrorism and security.[14] The court case Wikimedia Foundation v. NSA has since followed.

See also

Censorship Culture of fear Opinion corridor Fear mongering Media transparency Prior restraint Self-censorship Strategic lawsuit against public participation

References

^ chilling effect. (n.d.). Retrieved October 19, 2011, from http://law.yourdictionary.com/chilling-effect ^ Green, Allen (October 15, 2009). "Banish the libel chill". The Guardian. ^ a b "Flood insurance spikes have chilling effect on some home sales". WWL-TV Eyewitness News. October 15, 2013. Archived from the original on November 19, 2013. Realtors say [price spikes are] already causing home sales to fall through when buyers realize they can't afford the flood insurance. ^ a b Cloven, Denise H.; Roloff, Michael E. (1993). "The Chilling Effect of Aggressive Potential on The Expression of Complaints in Intimate Relationships". Communication Monographs. 60 (3): 199–219. doi:10.1080/03637759309376309. A two-part survey of 160 college students involved in dating relationships....
This chilling effect was greater when individuals who generally feared conflict anticipated aggressive repercussions (p < .001), and when people anticipated symbolic aggression from relationally independent partners (p < .05). ^ "censorship-reports-striking-a-balance-hate-speech-freedom-of-expression-and-nondiscrimination-1992-431-pp". doi:10.1163/2210-7975_hrd-2210-0079. ^ Iorfida v. MacIntyre, 1994 CanLII 7341 (ON SC) at para. 20, archived copy, archived from the original on July 13, 2012, retrieved October 25, 2011. ^ Iorfida v. MacIntyre, 1994 CanLII 7341 (ON SC) at para. 37, archived copy, archived from the original on July 13, 2012, retrieved October 25, 2011. ^ https://www.ohchr.org/EN/NewsEvents/Pages/DisplayNews.aspx?NewsID=26934&LangID=E&s=09 ^ John Milton (1644) Areopagitica, edited by George H. Sabine (1951), page 29, Appleton-Century-Crofts. ^ Freund, Paul A. "The Supreme Court and Civil Liberties". 4 Vanderbilt Law Review 533, at 539 (1950–1951). ^ "The Chilling Effect in Constitutional Law". Columbia Law Review. 69 (5): 808–842. May 1969. doi:10.2307/1121147. JSTOR 1121147. ^ Safire, William (July 20, 2005). "Safire Urges Federal Journalist Shield Law". Center For Individual Freedom. Retrieved June 18, 2008. Justice Brennan reported having written a 1965 decision striking down a state's intrusion on civil liberty because of its "chilling effect upon the exercise of First Amendment rights..." ^ a b "Lamont v. Postmaster General, 381 U.S. 301 (1965)". Justia. Retrieved June 18, 2008. ^ Penney, Jonathon W. (2016). "Chilling Effects: Online Surveillance and Wikipedia Use". Berkeley Technology Law Journal. doi:10.15779/z38ss13. Retrieved August 20, 2019.

External links

Lumen, containing many current examples of alleged chilling effects Terms associated with libel cases Cato Policy Analysis No. 270 Chilling The Internet?
Lessons from FCC Regulation of Radio Broadcasting Libel Reform Campaign The Chilling Effect of English libel law
en-wikipedia-org-2238 ---- Simple Mail Transfer Protocol - Wikipedia

Simple Mail Transfer Protocol

Internet protocol used for relaying e-mails. "SMTP" redirects here. For the email delivery company, see SMTP (company). For Short Message Transfer Protocol, see GSM 03.40.

The Simple Mail Transfer Protocol (SMTP) is an Internet standard communication protocol for electronic mail transmission. Mail servers and other message transfer agents use SMTP to send and receive mail messages. User-level email clients typically use SMTP only for sending messages to a mail server for relaying, and typically submit outgoing email to the mail server on port 587 or 465, per RFC 8314. For retrieving messages, IMAP and POP3 are standard, but proprietary servers also often implement proprietary protocols, e.g., Exchange ActiveSync. Since SMTP's introduction in 1981, it has been updated, modified and extended multiple times. The protocol version in common use today has an extensible structure with various extensions for authentication, encryption, binary data transfer, and internationalized email addresses. SMTP servers commonly use the Transmission Control Protocol on port number 25 (for plaintext) and 587 (for encrypted communications).
History

Predecessors to SMTP

Various forms of one-to-one electronic messaging were used in the 1960s. Users communicated using systems developed for specific mainframe computers. As more computers were interconnected, especially in the U.S. Government's ARPANET, standards were developed to permit exchange of messages between different operating systems. SMTP grew out of these standards developed during the 1970s. SMTP traces its roots to two implementations described in 1971: the Mail Box Protocol, whose implementation has been disputed[1] but which is discussed in RFC 196 and other RFCs, and the SNDMSG program, which, according to RFC 2235, Ray Tomlinson of BBN invented for TENEX computers to send mail messages across the ARPANET.[2][3][4] Fewer than 50 hosts were connected to the ARPANET at this time.[5] Further implementations include FTP Mail[6] and Mail Protocol, both from 1973.[7] Development work continued throughout the 1970s, until the ARPANET transitioned into the modern Internet around 1980.

Original SMTP

In 1980, Jon Postel published RFC 772, which proposed the Mail Transfer Protocol as a replacement for the use of the File Transfer Protocol (FTP) for mail. RFC 780 of May 1981 removed all references to FTP and allocated port 57 for TCP and UDP, an allocation that has since been removed by IANA.[citation needed] In November 1981, Postel published RFC 788 "Simple Mail Transfer Protocol". The SMTP standard was developed around the same time as Usenet, a one-to-many communication network with some similarities.[citation needed] SMTP became widely used in the early 1980s. At the time, it was a complement to the Unix to Unix Copy Program (UUCP), which was better suited for handling email transfers between machines that were intermittently connected. SMTP, on the other hand, works best when both the sending and receiving machines are connected to the network all the time. Both used a store and forward mechanism and are examples of push technology.
Though Usenet's newsgroups were still propagated with UUCP between servers,[8] UUCP as a mail transport has virtually disappeared[9] along with the "bang paths" it used as message routing headers.[10] Sendmail, released with 4.1cBSD in 1982, soon after RFC 788 was published in November 1981, was one of the first mail transfer agents to implement SMTP.[11] Over time, as BSD Unix became the most popular operating system on the Internet, Sendmail became the most common MTA (mail transfer agent).[12] The original SMTP protocol supported only unauthenticated, unencrypted 7-bit ASCII text communications, susceptible to trivial man-in-the-middle attacks, spoofing, and spamming, and requiring any binary data to be encoded to readable text before transmission. Due to the absence of a proper authentication mechanism, by design every SMTP server was an open mail relay. The Internet Mail Consortium (IMC) reported that 55% of mail servers were open relays in 1998,[13] but less than 1% in 2002.[14] Because of spam concerns most email providers blocklist open relays,[15] making original SMTP essentially impractical for general use on the Internet.

Modern SMTP

In November 1995, RFC 1869 defined Extended Simple Mail Transfer Protocol (ESMTP), which established a general structure for all existing and future extensions which aimed to add in the features missing from the original SMTP. ESMTP defines consistent and manageable means by which ESMTP clients and servers can be identified and servers can indicate supported extensions. Message submission (RFC 2476) and SMTP-AUTH (RFC 2554) were introduced in 1998 and 1999, both describing new trends in email delivery. Originally, SMTP servers were typically internal to an organization, receiving mail for the organization from the outside, and relaying messages from the organization to the outside. But as time went on, SMTP servers (mail transfer agents), in practice, were expanding their roles to become message submission agents for mail user agents, some of which were now relaying mail from outside of an organization (e.g. a company executive wishes to send email while on a trip using the corporate SMTP server). This issue, a consequence of the rapid expansion and popularity of the World Wide Web, meant that SMTP had to include specific rules and methods for relaying mail and authenticating users to prevent abuses such as relaying of unsolicited email (spam). Work on message submission (RFC 2476) was originally started because popular mail servers would often rewrite mail in an attempt to fix problems in it, for example, adding a domain name to an unqualified address. This behavior is helpful when the message being fixed is an initial submission, but dangerous and harmful when the message originated elsewhere and is being relayed. Cleanly separating mail into submission and relay was seen as a way to permit and encourage rewriting submissions while prohibiting rewriting relay. As spam became more prevalent, it was also seen as a way to provide authorization for mail being sent out from an organization, as well as traceability. This separation of relay and submission quickly became a foundation for modern email security practices. As this protocol started out purely ASCII text-based, it did not deal well with binary files, or characters in many non-English languages. Standards such as Multipurpose Internet Mail Extensions (MIME) were developed to encode binary files for transfer through SMTP.
Mail transfer agents (MTAs) developed after Sendmail also tended to be implemented 8-bit-clean, so that the alternate "just send eight" strategy could be used to transmit arbitrary text data (in any 8-bit ASCII-like character encoding) via SMTP. Mojibake was still a problem due to differing character set mappings between vendors, although the email addresses themselves still allowed only ASCII. 8-bit-clean MTAs today tend to support the 8BITMIME extension, permitting some binary files to be transmitted almost as easily as plain text (limits on line length and permitted octet values still apply, so that MIME encoding is needed for most non-text data and some text formats). In 2012, the SMTPUTF8 extension was created to support UTF-8 text, allowing international content and addresses in non-Latin scripts like Cyrillic or Chinese. Many people contributed to the core SMTP specifications, among them Jon Postel, Eric Allman, Dave Crocker, Ned Freed, Randall Gellens, John Klensin, and Keith Moore.

Mail processing model

(Diagram caption: blue arrows depict implementation of SMTP variations.)

Email is submitted by a mail client (mail user agent, MUA) to a mail server (mail submission agent, MSA) using SMTP on TCP port 587. Most mailbox providers still allow submission on traditional port 25. The MSA delivers the mail to its mail transfer agent (MTA). Often, these two agents are instances of the same software launched with different options on the same machine. Local processing can be done either on a single machine, or split among multiple machines; mail agent processes on one machine can share files, but if processing is on multiple machines, they transfer messages between each other using SMTP, where each machine is configured to use the next machine as a smart host. Each process is an MTA (an SMTP server) in its own right. The boundary MTA uses DNS to look up the MX (mail exchanger) record for the recipient's domain (the part of the email address on the right of @). The MX record contains the name of the target MTA. Based on the target host and other factors, the sending MTA selects a recipient server and connects to it to complete the mail exchange. Message transfer can occur in a single connection between two MTAs, or in a series of hops through intermediary systems. A receiving SMTP server may be the ultimate destination, an intermediate "relay" (that is, it stores and forwards the message) or a "gateway" (that is, it may forward the message using some protocol other than SMTP). Per RFC 5321 section 2.1, each hop is a formal handoff of responsibility for the message, whereby the receiving server must either deliver the message or properly report the failure to do so. Once the final hop accepts the incoming message, it hands it to a mail delivery agent (MDA) for local delivery. An MDA saves messages in the relevant mailbox format. As with sending, this reception can be done using one or multiple computers, but in the diagram above the MDA is depicted as one box near the mail exchanger box. An MDA may deliver messages directly to storage, or forward them over a network using SMTP or another protocol such as Local Mail Transfer Protocol (LMTP), a derivative of SMTP designed for this purpose. Once delivered to the local mail server, the mail is stored for batch retrieval by authenticated mail clients (MUAs).
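The MX lookup step described above can be illustrated with a short script. This is a minimal sketch, assuming the third-party dnspython package as the resolver (the article does not prescribe any particular library), and the recipient address is a placeholder:

```python
# Minimal sketch of MX-based mail routing, assuming the third-party
# dnspython package (pip install dnspython); not part of the article.
import dns.resolver

def mx_hosts(recipient: str):
    """Return candidate mail exchangers for a recipient address,
    lowest preference value (highest priority) first."""
    domain = recipient.rsplit("@", 1)[-1]
    try:
        answers = dns.resolver.resolve(domain, "MX")
    except dns.resolver.NoAnswer:
        # A conformant relay may fall back to the domain's A record.
        return [domain]
    records = sorted(answers, key=lambda r: r.preference)
    return [str(r.exchange).rstrip(".") for r in records]

print(mx_hosts("alice@example.com"))
```

The sending MTA would then try the returned hosts in order of preference, mirroring the selection behaviour described above.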
Mail is retrieved by end-user applications, called email clients, using Internet Message Access Protocol (IMAP), a protocol that both facilitates access to mail and manages stored mail, or the Post Office Protocol (POP), which typically uses the traditional mbox mail file format, or a proprietary system such as Microsoft Exchange/Outlook or Lotus Notes/Domino. Webmail clients may use either method, but the retrieval protocol is often not a formal standard. SMTP defines message transport, not the message content. Thus, it defines the mail envelope and its parameters, such as the envelope sender, but not the header (except trace information) nor the body of the message itself. STD 10 and RFC 5321 define SMTP (the envelope), while STD 11 and RFC 5322 define the message (header and body), formally referred to as the Internet Message Format.

Protocol overview

SMTP is a connection-oriented, text-based protocol in which a mail sender communicates with a mail receiver by issuing command strings and supplying necessary data over a reliable ordered data stream channel, typically a Transmission Control Protocol (TCP) connection. An SMTP session consists of commands originated by an SMTP client (the initiating agent, sender, or transmitter) and corresponding responses from the SMTP server (the listening agent, or receiver) so that the session is opened, and session parameters are exchanged. A session may include zero or more SMTP transactions. An SMTP transaction consists of three command/reply sequences:
MAIL command, to establish the return address, also called return-path,[16] reverse-path,[17] bounce address, mfrom, or envelope sender.
RCPT command, to establish a recipient of the message. This command can be issued multiple times, one for each recipient. These addresses are also part of the envelope.
DATA, to signal the beginning of the message text; the content of the message, as opposed to its envelope. It consists of a message header and a message body separated by an empty line. DATA is actually a group of commands, and the server replies twice: once to the DATA command itself, to acknowledge that it is ready to receive the text, and the second time after the end-of-data sequence, to either accept or reject the entire message.
Besides the intermediate reply for DATA, each server's reply can be either positive (2xx reply codes) or negative. Negative replies can be permanent (5xx codes) or transient (4xx codes). A reject is a permanent failure and the client should send a bounce message to the server it received it from. A drop is a positive response followed by message discard rather than delivery. The initiating host, the SMTP client, can be either an end-user's email client, functionally identified as a mail user agent (MUA), or a relay server's mail transfer agent (MTA), that is an SMTP server acting as an SMTP client, in the relevant session, in order to relay mail. Fully capable SMTP servers maintain queues of messages for retrying message transmissions that resulted in transient failures. A MUA knows the outgoing mail SMTP server from its configuration. A relay server typically determines which server to connect to by looking up the MX (Mail eXchange) DNS resource record for each recipient's domain name. If no MX record is found, a conformant relaying server (not all are) instead looks up the A record. Relay servers can also be configured to use a smart host.
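As a purely illustrative sketch (not from the article), the reply-code classes just described can be expressed in a few lines of code; the category names are informal labels, not protocol terminology:

```python
# Illustrative only: classify SMTP reply codes along the lines described
# above (2xx positive, 3xx intermediate, 4xx transient, 5xx permanent).
def classify_reply(code: int) -> str:
    if 200 <= code <= 299:
        return "positive"           # command accepted
    if 300 <= code <= 399:
        return "intermediate"       # e.g. 354 after DATA
    if 400 <= code <= 499:
        return "transient failure"  # sender may queue and retry later
    if 500 <= code <= 599:
        return "permanent failure"  # sender should generate a bounce
    return "unknown"

for code in (250, 354, 421, 550):
    print(code, classify_reply(code))
```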
A relay server initiates a TCP connection to the server on the "well-known port" for SMTP: port 25, or for connecting to an MSA, port 587. The main difference between an MTA and an MSA is that connecting to an MSA requires SMTP Authentication.

SMTP vs mail retrieval

SMTP is a delivery protocol only. In normal use, mail is "pushed" to a destination mail server (or next-hop mail server) as it arrives. Mail is routed based on the destination server, not the individual user(s) to which it is addressed. Other protocols, such as the Post Office Protocol (POP) and the Internet Message Access Protocol (IMAP), are specifically designed for use by individual users retrieving messages and managing mail boxes. To permit an intermittently-connected mail server to pull messages from a remote server on demand, SMTP has a feature to initiate mail queue processing on a remote server (see Remote Message Queue Starting below). POP and IMAP are unsuitable protocols for relaying mail by intermittently-connected machines; they are designed to operate after final delivery, when information critical to the correct operation of mail relay (the "mail envelope") has been removed.

Remote Message Queue Starting

Remote Message Queue Starting enables a remote host to start processing of the mail queue on a server so it may receive messages destined to it by sending a corresponding command. The original TURN command was deemed insecure and was extended in RFC 1985 with the ETRN command, which operates more securely using an authentication method based on Domain Name System information.[18]

Outgoing mail SMTP server

An email client needs to know the IP address of its initial SMTP server and this has to be given as part of its configuration (usually as a DNS name). This server will deliver outgoing messages on behalf of the user.

Outgoing mail server access restrictions

Server administrators need to impose some control on which clients can use the server. This enables them to deal with abuse, for example spam. Two solutions have been in common use:
In the past, many systems imposed usage restrictions by the location of the client, only permitting usage by clients whose IP address is one that the server administrators control. Usage from any other client IP address is disallowed.
Modern SMTP servers typically offer an alternative system that requires authentication of clients by credentials before allowing access.

Restricting access by location

Under this system, an ISP's SMTP server will not allow access by users who are outside the ISP's network. More precisely, the server may only allow access to users with an IP address provided by the ISP, which is equivalent to requiring that they are connected to the Internet using that same ISP. A mobile user may often be on a network other than that of their normal ISP, and will then find that sending email fails because the configured SMTP server choice is no longer accessible. This system has several variations. For example, an organisation's SMTP server may only provide service to users on the same network, enforcing this by firewalling to block access by users on the wider Internet. Or the server may perform range checks on the client's IP address. These methods were typically used by corporations and institutions such as universities which provided an SMTP server for outbound mail only for use internally within the organisation. However, most of these bodies now use client authentication methods, as described below.
Where a user is mobile, and may use different ISPs to connect to the internet, this kind of usage restriction is onerous, and altering the configured outbound email SMTP server address is impractical. It is highly desirable to be able to use email client configuration information that does not need to change.

Client authentication

Modern SMTP servers typically require authentication of clients by credentials before allowing access, rather than restricting access by location as described earlier. This more flexible system is friendly to mobile users and allows them to have a fixed choice of configured outbound SMTP server. SMTP Authentication, often abbreviated SMTP AUTH, is an extension of SMTP that allows clients to log in using an authentication mechanism.

Ports

Communication between mail servers generally uses the standard TCP port 25 designated for SMTP. Mail clients, however, generally don't use this, instead using specific "submission" ports. Mail services generally accept email submission from clients on one of:
587 (Submission), as formalized in RFC 6409 (previously RFC 2476)
465 (this port was deprecated after RFC 2487, until the issue of RFC 8314)
Port 2525 and others may be used by some individual providers, but have never been officially supported. Many Internet service providers now block all outgoing port 25 traffic from their customers, mainly as an anti-spam measure,[19] but also to avoid the higher cost they incur when leaving it open, perhaps by charging more to the few customers that require it open.

SMTP transport example

A typical example of sending a message via SMTP to two mailboxes (alice and theboss) located in the same mail domain (example.com or localhost.com) is reproduced in the following session exchange. (In this example, the conversation parts are prefixed with S: and C:, for server and client, respectively; these labels are not part of the exchange.) After the message sender (SMTP client) establishes a reliable communications channel to the message receiver (SMTP server), the session is opened with a greeting by the server, usually containing its fully qualified domain name (FQDN), in this case smtp.example.com. The client initiates its dialog by responding with a HELO command identifying itself in the command's parameter with its FQDN (or an address literal if none is available).[20]

S: 220 smtp.example.com ESMTP Postfix
C: HELO relay.example.com
S: 250 smtp.example.com, I am glad to meet you
C: MAIL FROM:<bob@example.org>
S: 250 Ok
C: RCPT TO:<alice@example.com>
S: 250 Ok
C: RCPT TO:<theboss@example.com>
S: 250 Ok
C: DATA
S: 354 End data with <CR><LF>.<CR><LF>
C: From: "Bob Example" <bob@example.org>
C: To: Alice Example <alice@example.com>
C: Cc: theboss@example.com
C: Date: Tue, 15 Jan 2008 16:02:43 -0500
C: Subject: Test message
C:
C: Hello Alice.
C: This is a test message with 5 header fields and 4 lines in the message body.
C: Your friend,
C: Bob
C: .
S: 250 Ok: queued as 12345
C: QUIT
S: 221 Bye
{The server closes the connection}

The client notifies the receiver of the originating email address of the message in a MAIL FROM command. This is also the return or bounce address in case the message cannot be delivered. In this example the email message is sent to two mailboxes on the same SMTP server: one for each recipient listed in the To and Cc header fields. The corresponding SMTP command is RCPT TO. Each successful reception and execution of a command is acknowledged by the server with a result code and response message (e.g., 250 Ok).
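The same exchange can be driven from a client program. The following is a minimal sketch using Python's standard smtplib; the host and mailbox names are the placeholders from the example above, and enabling debug output prints each command and server reply so the dialogue can be observed directly:

```python
# Sketch: drive the example session above with Python's standard smtplib.
# Host and addresses are the example placeholders, not real servers.
import smtplib

message = """\
From: "Bob Example" <bob@example.org>
To: Alice Example <alice@example.com>
Cc: theboss@example.com
Date: Tue, 15 Jan 2008 16:02:43 -0500
Subject: Test message

Hello Alice.
This is a test message with 5 header fields and 4 lines in the message body.
Your friend,
Bob
"""

with smtplib.SMTP("smtp.example.com", 25) as server:
    server.set_debuglevel(1)   # print each SMTP command and reply
    server.sendmail(
        "bob@example.org",                              # envelope sender (MAIL FROM)
        ["alice@example.com", "theboss@example.com"],   # one RCPT TO per recipient
        message,                                        # header and body (DATA)
    )
```

Note that the envelope addresses passed to sendmail() are what appear in MAIL FROM and RCPT TO; the To and Cc header fields inside the message are not consulted for routing, which is the envelope/content distinction described earlier.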
The transmission of the body of the mail message is initiated with a DATA command after which it is transmitted verbatim line by line and is terminated with an end-of-data sequence. This sequence consists of a new-line (<CR><LF>), a single full stop (period), followed by another new-line (<CR><LF>). Since a message body can contain a line with just a period as part of the text, the client sends two periods every time a line starts with a period; correspondingly, the server replaces every sequence of two periods at the beginning of a line with a single one. Such escaping method is called dot-stuffing. The server's positive reply to the end-of-data, as exemplified, implies that the server has taken the responsibility of delivering the message. A message can be doubled if there is a communication failure at this time, e.g. due to a power shortage: until the sender has received that 250 reply, it must assume the message was not delivered. On the other hand, after the receiver has decided to accept the message, it must assume the message has been delivered to it. Thus, during this time span, both agents have active copies of the message that they will try to deliver.[21] The probability that a communication failure occurs exactly at this step is directly proportional to the amount of filtering that the server performs on the message body, most often for anti-spam purposes. The limiting timeout is specified to be 10 minutes.[22] The QUIT command ends the session. If the email has other recipients located elsewhere, the client would QUIT and connect to an appropriate SMTP server for subsequent recipients after the current destination(s) had been queued. The information that the client sends in the HELO and MAIL FROM commands is added (not seen in the example) as additional header fields to the message by the receiving server. It adds a Received and Return-Path header field, respectively. Some clients are implemented to close the connection after the message is accepted (250 Ok: queued as 12345), so the last two lines may actually be omitted. This causes an error on the server when trying to send the 221 reply.

SMTP Extensions

Extension discovery mechanism

Clients learn a server's supported options by using the EHLO greeting, as exemplified below, instead of the original HELO. Clients fall back to HELO only if the server does not support the EHLO greeting.[23] Modern clients may use the ESMTP extension keyword SIZE to query the server for the maximum message size that will be accepted. Older clients and servers may try to transfer excessively sized messages that will be rejected after consuming network resources, including connect time to network links that is paid by the minute.[24] Users can manually determine in advance the maximum size accepted by ESMTP servers. The client replaces the HELO command with the EHLO command.

S: 220 smtp2.example.com ESMTP Postfix
C: EHLO bob.example.com
S: 250-smtp2.example.com Hello bob.example.org [192.0.2.201]
S: 250-SIZE 14680064
S: 250-PIPELINING
S: 250 HELP

Thus smtp2.example.com declares that it can accept a fixed maximum message size no larger than 14,680,064 octets (8-bit bytes). In the simplest case, an ESMTP server declares a maximum SIZE immediately after receiving an EHLO. According to RFC 1870, however, the numeric parameter to the SIZE extension in the EHLO response is optional.
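From the client side, the keywords returned to EHLO can be inspected programmatically. A minimal sketch using Python's standard smtplib follows; the host name is the placeholder from the example above, and, as just noted, the SIZE parameter may be absent or carry no numeric value:

```python
# Sketch: ESMTP extension discovery with Python's standard smtplib.
# The host is the placeholder from the example above.
import smtplib

with smtplib.SMTP("smtp2.example.com", 25) as server:
    code, banner = server.ehlo("bob.example.org")   # send EHLO instead of HELO
    print(code, banner.decode(errors="replace"))    # multi-line 250 reply
    print(server.esmtp_features)                    # e.g. {'size': '14680064', 'pipelining': '', 'help': ''}
    if server.has_extn("size") and server.esmtp_features["size"]:
        print("Server accepts up to", int(server.esmtp_features["size"]), "octets")
```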
Clients may instead, when issuing a MAIL FROM command, include a numeric estimate of the size of the message they are transferring, so that the server can refuse receipt of overly-large messages.

Binary data transfer

Original SMTP supports only a single body of ASCII text, therefore any binary data needs to be encoded as text into that body of the message before transfer, and then decoded by the recipient. Binary-to-text encodings, such as uuencode and BinHex, were typically used. The 8BITMIME command was developed to address this. It was standardized in 1994 as RFC 1652.[25] It facilitates the transparent exchange of e-mail messages containing octets outside the seven-bit ASCII character set by encoding them as MIME content parts, typically encoded with Base64.

Mail delivery mechanism extensions

On-Demand Mail Relay

Main article: On-Demand Mail Relay
On-Demand Mail Relay (ODMR) is an SMTP extension standardized in RFC 2645 that allows an intermittently-connected SMTP server to receive email queued for it when it is connected.

Internationalization extension

Main article: International email
Original SMTP supports email addresses composed of ASCII characters only, which is inconvenient for users whose native script is not Latin based, or who use diacritics not in the ASCII character set. This limitation was alleviated via extensions enabling UTF-8 in address names. RFC 5336 introduced the experimental[26] UTF8SMTP command, which was later superseded by RFC 6531, which introduced the SMTPUTF8 command. These extensions provide support for multi-byte and non-ASCII characters in email addresses, such as those with diacritics and other language characters such as Greek and Chinese.[27] Current support is limited, but there is strong interest in broad adoption of RFC 6531 and the related RFCs in countries like China that have a large user base where Latin (ASCII) is a foreign script.

Extensions

Like SMTP, ESMTP is a protocol used to transport Internet mail. It is used as both an inter-server transport protocol and (with restricted behavior enforced) a mail submission protocol. The main identification feature for ESMTP clients is to open a transmission with the command EHLO (Extended HELLO), rather than HELO (Hello, the original RFC 821 standard). A server will respond with success (code 250), failure (code 550) or error (code 500, 501, 502, 504, or 421), depending on its configuration. An ESMTP server returns the code 250 OK in a multi-line reply with its domain and a list of keywords to indicate supported extensions. An RFC 821 compliant server returns error code 500, allowing ESMTP clients to try either HELO or QUIT. Each service extension is defined in an approved format in subsequent RFCs and registered with the Internet Assigned Numbers Authority (IANA). The first definitions were the RFC 821 optional services: SEND, SOML (Send or Mail), SAML (Send and Mail), EXPN, HELP, and TURN. The format of additional SMTP verbs, and of new parameters to MAIL and RCPT, was also set.
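Because 8BITMIME is not universally supported, binary content is in practice wrapped in MIME parts and Base64-encoded, as described under Binary data transfer above. A minimal sketch using Python's standard email package follows; the file name, addresses and host are placeholders:

```python
# Sketch: package binary content as a Base64-encoded MIME attachment and
# hand it to an SMTP server. File name, addresses and host are placeholders.
import smtplib
from email.message import EmailMessage

msg = EmailMessage()
msg["From"] = "bob@example.org"
msg["To"] = "alice@example.com"
msg["Subject"] = "Report attached"
msg.set_content("The report is attached.")

with open("report.pdf", "rb") as f:
    msg.add_attachment(f.read(),
                       maintype="application", subtype="pdf",
                       filename="report.pdf")   # Base64 content-transfer-encoding by default

with smtplib.SMTP("smtp.example.com", 25) as server:
    server.send_message(msg)   # envelope derived from the From/To/Cc headers
```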
Some relatively common keywords (not all of them corresponding to commands) used today are:
8BITMIME – 8-bit data transmission, RFC 6152
ATRN – Authenticated TURN for On-Demand Mail Relay, RFC 2645
AUTH – Authenticated SMTP, RFC 4954
CHUNKING – Chunking, RFC 3030
DSN – Delivery status notification, RFC 3461 (see Variable envelope return path)
ETRN – Extended version of remote message queue starting command TURN, RFC 1985
HELP – Supply helpful information, RFC 821
PIPELINING – Command pipelining, RFC 2920
SIZE – Message size declaration, RFC 1870
STARTTLS – Transport Layer Security, RFC 3207 (2002)
SMTPUTF8 – Allow UTF-8 encoding in mailbox names and header fields, RFC 6531
UTF8SMTP – Allow UTF-8 encoding in mailbox names and header fields, RFC 5336 (deprecated[28])

The ESMTP format was restated in RFC 2821 (superseding RFC 821) and updated to the latest definition in RFC 5321 in 2008. Support for the EHLO command in servers became mandatory, and HELO was designated a required fallback. Non-standard, unregistered service extensions can be used by bilateral agreement; these services are indicated by an EHLO message keyword starting with "X", with any additional parameters or verbs similarly marked. SMTP commands are case-insensitive. They are presented here in capitalized form for emphasis only. An SMTP server that requires a specific capitalization is in violation of the standard.[citation needed]

8BITMIME
At least the following servers advertise the 8BITMIME extension:
Apache James (since 2.3.0a1)[29]
Citadel (since 7.30)
Courier Mail Server
Gmail[30]
IceWarp
IIS SMTP Service
Kerio Connect
Lotus Domino
Microsoft Exchange Server (as of Exchange Server 2000)
Novell GroupWise
OpenSMTPD
Oracle Communications Messaging Server
Postfix
Sendmail (since 6.57)

The following servers can be configured to advertise 8BITMIME, but do not perform conversion of 8-bit data to 7-bit when connecting to non-8BITMIME relays:
Exim and qmail do not translate eight-bit messages to seven-bit when attempting to relay 8-bit data to non-8BITMIME peers, as is required by the RFC.[31] This does not cause problems in practice, since virtually all modern mail relays are 8-bit clean.[32]
Microsoft Exchange Server 2003 advertises 8BITMIME by default, but relaying to a non-8BITMIME peer results in a bounce. This is allowed by RFC 6152 section 3.

SMTP-AUTH
Main article: SMTP Authentication
The SMTP-AUTH extension provides an access control mechanism. It consists of an authentication step through which the client effectively logs into the mail server during the process of sending mail. Servers that support SMTP-AUTH can usually be configured to require clients to use this extension, ensuring the true identity of the sender is known. The SMTP-AUTH extension is defined in RFC 4954. SMTP-AUTH can be used to allow legitimate users to relay mail while denying relay service to unauthorized users, such as spammers. It does not necessarily guarantee the authenticity of either the SMTP envelope sender or the RFC 2822 "From:" header. For example, spoofing, in which one sender masquerades as someone else, is still possible with SMTP-AUTH unless the server is configured to limit message from-addresses to addresses this AUTHed user is authorized for. The SMTP-AUTH extension also allows one mail server to indicate to another that the sender has been authenticated when relaying mail.
In general this requires the recipient server to trust the sending server, meaning that this aspect of SMTP-AUTH is rarely used on the Internet.[citation needed]

SMTPUTF8
Supporting servers include:
Postfix (version 3.0 and later)[33]
Momentum (versions 4.1[34] and 3.6.5, and later)
Sendmail (under development)
Exim (experimental as of the 4.86 release)
CommuniGate Pro as of version 6.2.2[35]
Courier-MTA as of version 1.0[36]
Halon as of version 4.0[37]
Microsoft Exchange Server as of protocol revision 14.0[38]
Haraka and other servers.[39]
Oracle Communications Messaging Server as of release 8.0.2.[40]

Security extensions
Mail delivery can occur over both plain-text and encrypted connections; however, the communicating parties might not know in advance of the other party's ability to use a secure channel.

STARTTLS or "Opportunistic TLS"
Main articles: Opportunistic TLS and Email encryption
The STARTTLS extension enables a supporting SMTP server to notify connecting clients that it supports TLS-encrypted communication and offers the opportunity for clients to upgrade their connection by sending the STARTTLS command. Servers supporting the extension do not inherently gain any security benefit from its implementation on its own, as upgrading to a TLS-encrypted session is dependent on the connecting client deciding to exercise this option, hence the term opportunistic TLS. STARTTLS is effective only against passive observation attacks, since the STARTTLS negotiation happens in plain text and an active attacker can trivially remove STARTTLS commands. This type of man-in-the-middle attack is sometimes referred to as STRIPTLS, where the encryption negotiation information sent from one end never reaches the other. In this scenario both parties take the invalid or unexpected responses as an indication that the other does not properly support STARTTLS, defaulting to traditional plain-text mail transfer.[41] Note that STARTTLS is also defined for IMAP and POP3 in other RFCs, but these protocols serve different purposes: SMTP is used for communication between message transfer agents, while IMAP and POP3 are for end clients and message transfer agents. The Electronic Frontier Foundation maintains a "STARTTLS Everywhere" list that, similarly to the "HTTPS Everywhere" list, allows relying parties to discover others supporting secure communication without prior communication.[42] RFC 8314 officially declared plain text obsolete and recommended always using TLS, adding ports with implicit TLS.

SMTP MTA Strict Transport Security
A newer 2018 standard, RFC 8461, called "SMTP MTA Strict Transport Security (MTA-STS)", aims to address the problem of an active adversary by defining a protocol for mail servers to declare their ability to use secure channels in specific files on the server and specific DNS TXT records. The relying party would regularly check for the existence of such a record, cache it for the amount of time specified in the record, and never communicate over insecure channels until the record expires.[41] Note that MTA-STS records apply only to SMTP traffic between mail servers; communications between an end client and the mail server are protected by HTTPS and HTTP Strict Transport Security. In April 2019 Google Mail announced support for MTA-STS.[43]

SMTP TLS Reporting
A number of protocols allow secure delivery of messages, but they can fail due to misconfiguration or deliberate active interference, leading to undelivered messages or delivery over unencrypted or unauthenticated channels.
RFC 8460 "SMTP TLS Reporting" describes a reporting mechanism and format for sharing statistics and specific information about potential failures with recipient domains. Recipient domains can then use this information to both detect potential attacks and diagnose unintentional misconfigurations. In April 2019 Google Mail announced support for SMTP TLS Reporting.[43] Spoofing and spamming[edit] Main articles: Anti-spam techniques and Email authentication The original design of SMTP had no facility to authenticate senders, or check that servers were authorized to send on their behalf, with the result that email spoofing is possible, and commonly used in email spam and phishing. Occasional proposals are made to modify SMTP extensively or replace it completely. One example of this is Internet Mail 2000, but neither it, nor any other has made much headway in the face of the network effect of the huge installed base of classic SMTP. Instead, mail servers now use a range of techniques, such as stricter enforcement of standards such as RFC 5322,[44][45] DomainKeys Identified Mail, Sender Policy Framework and DMARC, DNSBLs and greylisting to reject or quarantine suspicious emails.[46] Implementations[edit] There is also SMTP proxy implementation as for example nginx.[47] Main articles: List of mail server software and Comparison of mail servers Related requests for comments[edit] RFC 1123 – Requirements for Internet Hosts—Application and Support (STD 3) RFC 1870 – SMTP Service Extension for Message Size Declaration (оbsoletes: RFC 1653) RFC 2505 – Anti-Spam Recommendations for SMTP MTAs (BCP 30) RFC 2821 – Simple Mail Transfer Protocol RFC 2920 – SMTP Service Extension for Command Pipelining (STD 60) RFC 3030 – SMTP Service Extensions for Transmission of Large and Binary MIME Messages RFC 3207 – SMTP Service Extension for Secure SMTP over Transport Layer Security (obsoletes RFC 2487) RFC 3461 – SMTP Service Extension for Delivery Status Notifications (obsoletes RFC 1891) RFC 3463 – Enhanced Status Codes for SMTP (obsoletes RFC 1893, updated by RFC 5248) RFC 3464 – An Extensible Message Format for Delivery Status Notifications (obsoletes RFC 1894) RFC 3798 – Message Disposition Notification (updates RFC 3461) RFC 3834 – Recommendations for Automatic Responses to Electronic Mail RFC 3974 – SMTP Operational Experience in Mixed IPv4/v6 Environments RFC 4952 – Overview and Framework for Internationalized Email (updated by RFC 5336) RFC 4954 – SMTP Service Extension for Authentication (obsoletes RFC 2554, updates RFC 3463, updated by RFC 5248) RFC 5068 – Email Submission Operations: Access and Accountability Requirements (BCP 134) RFC 5248 – A Registry for SMTP Enhanced Mail System Status Codes (BCP 138) (updates RFC 3463) RFC 5321 – The Simple Mail Transfer Protocol (obsoletes RFC 821 aka STD 10, RFC 974, RFC 1869, RFC 2821, updates RFC 1123) RFC 5322 – Internet Message Format (obsoletes RFC 822 aka STD 11, and RFC 2822) RFC 5504 – Downgrading Mechanism for Email Address Internationalization RFC 6409 – Message Submission for Mail (STD 72) (obsoletes RFC 4409, RFC 2476) RFC 6522 – The Multipart/Report Content Type for the Reporting of Mail System Administrative Messages (obsoletes RFC 3462, and in turn RFC 1892) RFC 6531 – SMTP Extension for Internationalized Email Addresses (updates RFC 2821, RFC 2822, RFC 4952, and RFC 5336) RFC 8314 – Cleartext Considered Obsolete: Use of Transport Layer Security (TLS) for Email Submission and Access See also[edit] Bounce address CRAM-MD5 (a SASL mechanism for ESMTPA) 
RFC 2195 Email Email encryption DKIM Ident List of mail server software List of SMTP server return codes POP before SMTP / SMTP after POP Internet Message Access Protocol Binary Content Extension RFC 3516 Sender Policy Framework (SPF) Simple Authentication and Security Layer (SASL) RFC 4422 SMTP Authentication Variable envelope return path Comparison of email clients for information about SMTP support Notes[edit] ^ The History of Electronic Mail, Tom Van Vleck: "It is not clear this protocol was ever implemented" ^ The First Network Email, Ray Tomlinson, BBN ^ Picture of "The First Email Computer" by Dan Murphy, a PDP-10 ^ Dan Murphy's TENEX and TOPS-20 Papers Archived November 18, 2007, at the Wayback Machine ^ RFC 2235 ^ RFC 469 – Network Mail Meeting Summary ^ RFC 524 – A Proposed Mail Protocol ^ Tldp.org ^ draft-barber-uucp-project-conclusion-05 – The Conclusion of the UUCP Mapping Project ^ The article about sender rewriting contains technical background info about the early SMTP history and source routing before RFC 1123. ^ Eric Allman (1983), Sendmail – An Internetwork Mail Router (PDF), BSD UNIX documentation set, Berkeley: University of California, retrieved June 29, 2012 ^ Craig Partridge (2008), The Technical Development of Internet Email (PDF), IEEE Annals of the History of Computing, 30, IEEE Computer Society, pp. 3–29, doi:10.1109/MAHC.2008.32, S2CID 206442868, archived from the original (PDF) on May 12, 2011 ^ Paul Hoffman (February 1, 1998). "Allowing Relaying in SMTP: A Survey". Internet Mail Consortium. Retrieved May 30, 2010. CS1 maint: discouraged parameter (link) ^ Paul Hoffman (August 2002). "Allowing Relaying in SMTP: A Series of Surveys". Internet Mail Consortium. Archived from the original on January 18, 2007. Retrieved May 30, 2010. CS1 maint: discouraged parameter (link) ^ "In Unix, what is an open mail relay? - Knowledge Base". web.archive.org. June 17, 2007. Retrieved March 15, 2021. ^ "The MAIL, RCPT, and DATA verbs", [D. J. Bernstein] ^ RFC 5321 Section-7.2 ^ Systems, Message. "Message Systems Introduces Latest Version Of Momentum With New API-Driven Capabilities". www.prnewswire.com. Retrieved July 19, 2020. ^ Cara Garretson (2005). "ISPs Pitch In to Stop Spam". PC World. Retrieved January 18, 2016. Last month, the Anti-Spam Technical Alliance, formed last year by Yahoo, America Online, EarthLink, and Microsoft, issued a list of antispam recommendations that includes filtering Port 25. ^ RFC 5321, Simple Mail Transfer Protocol, J. Klensin, The Internet Society (October 2008) ^ RFC 1047 ^ rfc5321#section-4.5.3.2.6 ^ John Klensin; Ned Freed; Marshall T. Rose; Einar A. Stefferud; Dave Crocker (November 1995). SMTP Service Extensions. IETF. doi:10.17487/RFC1869. RFC 1869. ^ "MAIL Parameters". IANA. Retrieved April 3, 2016. ^ Which was obsoleted in 2011 by RFC 6152 corresponding to the then new STD 71 ^ "MAIL Parameters". November 15, 2018. ^ Jiankang Yao (December 19, 2014). "Chinese email address". EAI (Mailing list). IETF. Retrieved May 24, 2016. ^ "SMTP Service Extension Parameters". IANA. Retrieved November 5, 2013. ^ James Server - ChangeLog. James.apache.org. Retrieved on 2013-07-17. ^ 8BITMIME service advertised in response to EHLO on gmail-smtp-in.l.google.com port 25, checked 23 November 2011 ^ Qmail bugs and wishlist. Home.pages.de. Retrieved on 2013-07-17. ^ The 8BITMIME extension. Cr.yp.to. Retrieved on 2013-07-17. 
^ "Postfix SMTPUTF8 support is enabled by default", February 8, 2015, postfix.org ^ "Message Systems Introduces Latest Version Of Momentum With New API-Driven Capabilities" (Press release). ^ "Version 6.2 Revision History". CommuniGate.com. ^ Sam Varshavchik (September 18, 2018). "New releases of Courier packages". courier-announce (Mailing list). ^ changelog ^ "MS-OXSMTP: Simple Mail Transfer Protocol (SMTP) Extensions". July 24, 2018. ^ "EAI Readiness in TLDs" (PDF). February 12, 2019. ^ "Communications Messaging Server Release Notes". oracle.com. October 2017. ^ a b "Introducing MTA Strict Transport Security (MTA-STS) | Hardenize Blog". www.hardenize.com. Retrieved April 25, 2019. ^ "STARTTLS Everywhere". EFF. Retrieved August 15, 2019. ^ a b Cimpanu, Catalin. "Gmail becomes first major email provider to support MTA-STS and TLS Reporting". ZDNet. Retrieved April 25, 2019. ^ Message Non Compliant with RFC 5322 ^ Message could not be delivered. Please ensure the message is RFC 5322 compliant. ^ Why are the emails sent to Microsoft Account rejected for policy reasons? ^ "NGINX Docs | Configuring NGINX as a Mail Proxy Server". References[edit] Hughes, L (1998). Internet E-mail: Protocols, Standards and Implementation. Artech House Publishers. ISBN 978-0-89006-939-4. Hunt, C (2003). sendmail Cookbook. O'Reilly Media. ISBN 978-0-596-00471-2. Johnson, K (2000). Internet Email Protocols: A Developer's Guide. Addison-Wesley Professional. ISBN 978-0-201-43288-6. Loshin, P (1999). Essential Email Standards: RFCs and Protocols Made Practical. John Wiley & Sons. ISBN 978-0-471-34597-8. Rhoton, J (1999). Programmer's Guide to Internet Mail: SMTP, POP, IMAP, and LDAP. Elsevier. ISBN 978-1-55558-212-8. Wood, D (1999). Programming Internet Mail. O'Reilly. ISBN 978-1-56592-479-6. External links[edit] IANA registry of mail parameters includes service extension keywords RFC 1869 SMTP Service Extensions RFC 5321 Simple Mail Transfer Protocol RFC 4954 SMTP Service Extension for Authentication (obsoletes RFC 2554) RFC 3848 SMTP and LMTP Transmission Types Registration (with ESMTPA) RFC 6409 Message Submission for Mail (obsoletes RFC 4409, which obsoletes RFC 2476) v t e Email clients Free software Current Alpine Balsa Citadel/UX Claws Mail Cleancode eMail Cone Evolution fetchmail fdm Geary getmail GNUMail Gnus Gnuzilla IMP KMail Mahogany Mailpile Mailx Mailx (Heirloom Project) Modest Mozilla Thunderbird Mulberry Mutt nmh / MH OfflineIMAP Roundcube SeaMonkey SquirrelMail Sylpheed Trojitá YAM Zimbra Discontinued Arachne Beonex Communicator BlitzMail Classilla Columbia MM Elm FossaMail Hula Mailody Mozilla Mail & Newsgroups Nylas N1 Spicebird Proprietary Freeware eM Client EmailTray Foxmail i.Scribe Mailbird Opera Mail Spark Spike TouchMail Retail Hiri Bloomba/WordPerfect Mail Newton IBM Notes InScribe Apple Mail Mail (Windows) Microsoft Outlook Novell GroupWise Airmail Postbox Shareware Becky! Forté Agent GyazMail The Bat! 
en-wikipedia-org-319 ---- InterPlanetary File System - Wikipedia InterPlanetary File System
Content-addressable, peer-to-peer hypermedia distribution protocol
Original author(s): Juan Benet and Protocol Labs[1]
Developer(s): Protocol Labs
Initial release: February 2015[1]
Stable release: 0.8.0 / 18 February 2021[2]
Repository: github.com/ipfs/ipfs
Written in: protocol implementations in Go (reference implementation), JavaScript, C,[3] Python; client libraries in Go, Java, JavaScript, Python, Scala, Haskell, Swift, Common Lisp, Rust, Ruby, PHP, C#, Erlang
Operating system: Linux, FreeBSD, OpenBSD, macOS, Windows
Available in: Go, JavaScript, Python
Type: Protocol, distributed file system, content delivery network
License: MIT license, Apache license 2.0
Website: ipfs.io

The InterPlanetary File System (IPFS) is a protocol and peer-to-peer network for storing and sharing data in a distributed file system. IPFS uses content-addressing to uniquely identify each file in a global namespace connecting all computing devices.[4]

Design
IPFS allows users to host and receive content in a manner similar to BitTorrent. As opposed to a centrally located server, IPFS is built around a decentralized system[5] of user-operators who hold a portion of the overall data, creating a resilient system of file storage and sharing. Any user in the network can serve a file by its content address, and other peers in the network can find and request that content from any node who has it using a distributed hash table (DHT). In contrast to BitTorrent, IPFS aims to create a single global network.
This means that if Alice and Bob publish a block of data with the same hash, the peers downloading the content from Alice will exchange data with the ones downloading it from Bob.[6] IPFS aims to replace protocols used for static webpage delivery by using gateways which are accessible with HTTP.[7] Users may choose not to install an IPFS client on their device and instead use a public gateway. A list of these gateways is maintained on the IPFS GitHub page.[8] History[edit] This section needs expansion. You can help by adding to it. (June 2020) IPFS was launched in an alpha version in February 2015, and by October of the same year was described by TechCrunch as "quickly spreading by word of mouth."[1] The Catalan independence referendum, taking place in September–October 2017, was deemed illegal by the Constitutional Court of Spain and many related websites were blocked. Subsequently, the Catalan Pirate Party mirrored the website on IPFS to bypass the High Court of Justice of Catalonia order of blocking.[9][10] Phishing attacks have also been distributed through Cloudflare's IPFS gateway since July 2018. The phishing scam HTML is stored on IPFS, and displayed via Cloudflare's gateway. The connection shows as secure via a Cloudflare SSL certificate.[11] The IPStorm botnet, first detected in June 2019, uses IPFS, so it can hide its command-and-control amongst the flow of legitimate data on the IPFS network.[12] Security researchers had worked out previously the theoretical possibility of using IPFS as a botnet command-and-control system.[13][14] Other notable uses[edit] During the block of Wikipedia in Turkey, IPFS was used to create a mirror of Wikipedia, which allows access to the content of Wikipedia despite the ban.[15] That archived version of Wikipedia is a limited immutable copy that cannot be updated. Filecoin, also inter-related to IPFS and developed by Juan Benet and Protocol Labs, is an IPFS-based cooperative storage cloud.[16] Cloudflare runs a distributed web gateway to simplify, speed up, and secure access to IPFS without needing a local node.[17] Microsoft's self-sovereign identity system, Microsoft ION, builds on the Bitcoin blockchain and IPFS through a Sidetree-based DID network.[18] Brave uses Origin Protocol and IPFS to host its decentralized merchandise store[19] and in 2021 added support into their browser.[20] Opera for Android has default support for IPFS, allowing mobile users to browse ipfs:// links to access data on the IPFS network.[21] See also[edit] Content addressable storage Dat (software) Distributed file system Freenet GNUnet ZeroNet References[edit] ^ a b c Case, Amber (4 October 2015). "Why The Internet Needs IPFS Before It's Too Late". TechCrunch. Retrieved 16 July 2019. ^ https://github.com/ipfs/go-ipfs/releases ^ Agorise (23 October 2017). "c-ipfs: IPFS implementation in C. Why C? Think Bitshares' Stealth backups, OpenWrt routers (decentralize the internet/meshnet!), Android TV, decentralized Media, decentralized websites, decent." Github.com. Retrieved 25 October 2017. ^ Finley, Klint (20 June 2016). "The Inventors of the Internet Are Trying to Build a Truly Permanent Web". Wired. ^ Krishnan, Armin (2020). "Blockchain Empowers Social Resistance and Terrorism Through Decentralized Autonomous Organizations". Journal of Strategic Security. 13 (1): 41–58. doi:10.5038/1944-0472.13.1.1743. ISSN 1944-0464. JSTOR 26907412. ^ "Content addressing". docs.ipfs.io. Retrieved 29 August 2020. ^ "IPFS Gateway". docs.ipfs.io. Retrieved 29 August 2020. 
^ "Public Gateway Checker | IPFS". ipfs.github.io. Retrieved 29 August 2020. ^ Balcell, Marta Poblet (5 October 2017). "Inside Catalonia's cypherpunk referendum". Eureka Street. ^ Hill, Paul (30 September 2017). "Catalan referendum app removed from Google Play Store". Neowin. Retrieved 6 October 2017. ^ Abrams, Lawrence (4 October 2018). "Phishing Attacks Distributed Through Cloudflare's IPFS Gateway". Bleeping Computer. Retrieved 31 August 2019. ^ Palmer, Danny (11 June 2019). "This unusual Windows malware is controlled via a P2P network". ZDNet. Retrieved 31 August 2019. ^ Patsakis, Constantinos; Casino, Fran (4 June 2019). "Hydras and IPFS: a decentralised playground for malware". International Journal of Information Security. 18 (6): 787–799. arXiv:1905.11880. doi:10.1007/s10207-019-00443-0. S2CID 167217444. ^ Bruno Macabeus; Marcus Vinicius; Jo ̃ao Paolo Cavalcante; Cidcley Teixeira de Souza (6 May 2018). "Protocolos IPFS e IPNS como meio para o controle de botnet: prova de conceito" (PDF). WSCDC - SBRC 2018 (in Portuguese). Retrieved 27 April 2021. ^ Dale, Brady (10 May 2017). "Turkey Can't Block This Copy of Wikipedia". Observer Media. Archived from the original on 18 October 2017. Retrieved 20 December 2017. ^ Johnson, Steven (16 January 2018). "Beyond the Bitcoin Bubble". The New York Times. Retrieved 26 September 2018. ^ Orcutt, Mike (5 October 2018). "A big tech company is working to free the internet from big tech companies". MIT Technology Review. Retrieved 21 April 2020. ^ Simons, Alex (13 May 2019). "Toward scalable decentralized identifier systems". Azure Active Directory Identity Blog. Retrieved 27 April 2021. ^ "Brave Launches New Swag Store Powered by Origin". Brave.com (Press release). 24 March 2020. Retrieved 21 April 2020. ^ Porter, Jon (19 January 2021). "Brave browser takes step toward enabling a decentralized web". The Verge. Retrieved 29 January 2021. ^ "Opera introduces major updates to its blockchain-browser on Android". Opera Blog (Press release). 3 March 2020. Retrieved 21 April 2020. External links[edit] Official website v t e File systems Comparison of file systems distributed Unix filesystem Disk ADFS AdvFS Amiga FFS Amiga OFS APFS AthFS bcachefs BeeGFS BFS Be File System Boot File System Btrfs CVFS CXFS DFS EFS Encrypting File System Extent File System Episode ext ext2 ext3 ext3cow ext4 FFS/FFS2 FAT exFAT Files-11 Fossil GPFS HAMMER HAMMER2 HFS HFS+ HPFS HTFS JFS LFS MFS Macintosh File System TiVo Media File System MINIX NetWare File System Next3 NILFS NILFS2 NSS NTFS OneFS PFS QFS QNX4FS ReFS ReiserFS Reiser4 Reliance Reliance Nitro RFS SFS SNFS Soup (Apple) Tux3 UBIFS UFS soft updates WAPBL VxFS WAFL Xiafs XFS Xsan zFS ZFS Optical disc HSF ISO 9660 ISO 13490 UDF Flash memory and SSD APFS FAT exFAT CHFS TFAT EROFS FFS2 F2FS HPFS JFFS JFFS2 JFS LogFS NILFS NILFS2 NVFS YAFFS UBIFS Distributed CXFS GFS2 Google File System OCFS2 OrangeFS PVFS QFS Xsan more... NAS 9P AFS (OpenAFS) AFP Coda DFS Google File System GPFS Lustre NCP NFS POHMELFS Hadoop SMB (CIFS) SSHFS more... 
en-wikipedia-org-3301 ---- Link rot - Wikipedia Link rot
For link rot in Wikipedia, see Wikipedia:Link rot.
Phenomenon of URLs tending to cease functioning
Link rot (also called link death, link breaking, or reference rot) is the phenomenon of hyperlinks tending over time to cease to point to their originally targeted file, web page, or server due to that resource being relocated to a new address or becoming permanently unavailable. A link that no longer points to its target, often called a broken or dead link, is a specific form of dangling pointer. The rate of link rot is a subject of study and research due to its significance to the internet's ability to preserve information. Estimates of that rate vary dramatically between studies.

Prevalence
A number of studies have examined the prevalence of link rot within the World Wide Web, in academic literature that uses URLs to cite web content, and within digital libraries. A 2003 study found that on the Web, about one link out of every 200 broke each week,[1] suggesting a half-life of 138 weeks. This rate was largely confirmed by a 2016–2017 study of links in Yahoo! Directory (which had stopped updating in 2014 after 21 years of development) that found the half-life of the directory's links to be two years.[2] A 2004 study showed that subsets of Web links (such as those targeting specific file types or those hosted by academic institutions) could have dramatically different half-lives.[3] The URLs selected for publication appear to have greater longevity than the average URL.
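The half-life figures above follow directly from a constant weekly breakage rate. For example, the 2003 estimate of one broken link per 200 per week works out as follows; this is a small worked check, not a calculation taken from the cited study itself:

    import math

    weekly_breakage = 1 / 200                     # about one link in 200 breaks per week
    survival_per_week = 1 - weekly_breakage

    # The half-life t satisfies survival_per_week ** t == 0.5
    half_life_weeks = math.log(0.5) / math.log(survival_per_week)
    print(round(half_life_weeks))                 # about 138 weeks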
A 2015 study by Weblock analyzed more than 180,000 links from references in the full-text corpora of three major open access publishers and found a half-life of about 14 years,[4] generally confirming a 2005 study that found that half of the URLs cited in D-Lib Magazine articles were active 10 years after publication.[5] Other studies have found higher rates of link rot in academic literature but typically suggest a half-life of four years or greater.[6][7] A 2013 study in BMC Bioinformatics analyzed nearly 15,000 links in abstracts from Thomson Reuters's Web of Science citation index and found that the median lifespan of web pages was 9.3 years, and just 62% were archived.[8] A 2002 study suggested that link rot within digital libraries is considerably slower than on the web, finding that about 3% of the objects were no longer accessible after one year[9] (equating to a half-life of nearly 23 years). Causes[edit] Link rot can result from several occurrences. A target web page may be removed. The server that hosts the target page could fail, be removed from service, or relocate to a new domain name. A domain name's registration may lapse or be transferred to another party. Some causes will result in the link failing to find any target and returning an error such as HTTP 404. Other causes will cause a link to target content other than what was intended by the link's author. Other reasons for broken links include: the restructuring of websites that causes changes in URLs (e.g. domain.net/pine_tree might be moved to domain.net/tree/pine) relocation of formerly free content to behind a paywall a change in server architecture that results in code such as PHP functioning differently dynamic page content such as search results that changes by design the presence of user-specific information (such as a login name) within the link deliberate blocking by content filters or firewalls the removal of gTLDs[10] Prevention and detection[edit] Strategies for preventing link rot can focus on placing content where its likelihood of persisting is higher, authoring links that are less likely to be broken, taking steps to preserve existing links, or repairing links whose targets have been relocated or removed. The creation of URLs that will not change with time is the fundamental method of preventing link rot. Preventive planning has been championed by Tim Berners-Lee and other web pioneers.[11] Strategies pertaining to the authorship of links include: linking to primary rather than secondary sources and prioritizing stable sites[citation needed] avoiding links that point to resources on researchers' personal pages[5] using clean URLs[12] or otherwise employing URL normalization or URL canonicalization using permalinks and persistent identifiers such as ARKs, DOIs, Handle System references, and PURLs avoiding linking to documents other than web pages[12] avoiding deep linking linking to web archives such as the Internet Archive,[13] WebCite,[14] Archive.is, Perma.cc,[15] or Amber[16] Strategies pertaining to the protection of existing links include: using redirection mechanisms such as HTTP 301 to automatically refer browsers and crawlers to relocated content using content management systems which can automatically update links when content within the same site is relocated or automatically replace links with canonical URLs[17] integrating search resources into HTTP 404 pages[18] The detection of broken links may be done manually or automatically. 
Automated methods include plug-ins for content management systems as well as standalone broken-link checkers such as like Xenu's Link Sleuth. Automatic checking may not detect links that return a soft 404 or links that return a 200 OK response but point to content that has changed.[19] See also[edit] Software rot Digital preservation Deletionism and inclusionism in Wikipedia Further reading[edit] Markwell, John; Brooks, David W. (2002). "Broken Links: The Ephemeral Nature of Educational WWW Hyperlinks". Journal of Science Education and Technology. 11 (2): 105–108. doi:10.1023/A:1014627511641. Gomes, Daniel; Silva, Mário J. (2006). "Modelling Information Persistence on the Web" (PDF). Proceedings of the 6th International Conference on Web Engineering. ICWE'06. Archived from the original (PDF) on 2011-07-16. Retrieved 14 September 2010. Dellavalle, Robert P.; Hester, Eric J.; Heilig, Lauren F.; Drake, Amanda L.; Kuntzman, Jeff W.; Graber, Marla; Schilling, Lisa M. (2003). "Going, Going, Gone: Lost Internet References". Science. 302 (5646): 787–788. doi:10.1126/science.1088234. PMID 14593153. Koehler, Wallace (1999). "An Analysis of Web Page and Web Site Constancy and Permanence". Journal of the American Society for Information Science. 50 (2): 162–180. doi:10.1002/(SICI)1097-4571(1999)50:2<162::AID-ASI7>3.0.CO;2-B. Sellitto, Carmine (2005). "The impact of impermanent Web-located citations: A study of 123 scholarly conference publications" (PDF). Journal of the American Society for Information Science and Technology. 56 (7): 695–703. CiteSeerX 10.1.1.473.2732. doi:10.1002/asi.20159. Notes & references[edit] Notes References ^ Fetterly, Dennis; Manasse, Mark; Najork, Marc; Wiener, Janet (2003). "A large-scale study of the evolution of web pages". Proceedings of the 12th international conference on World Wide Web. Archived from the original on 9 July 2011. Retrieved 14 September 2010. ^ van der Graaf, Hans. "The half-life of a link is two year". ZOMDir's blog. Archived from the original on 2017-10-17. Retrieved 2019-01-31. ^ Koehler, Wallace (2004). "A longitudinal study of web pages continued: a consideration of document persistence". Information Research. 9 (2). Archived from the original on 2017-09-11. Retrieved 2019-01-31. ^ "All-Time Weblock Report". August 2015. Archived from the original on 4 March 2016. Retrieved 12 January 2016. ^ a b McCown, Frank; Chan, Sheffan; Nelson, Michael L.; Bollen, Johan (2005). "The Availability and Persistence of Web References in D-Lib Magazine" (PDF). Proceedings of the 5th International Web Archiving Workshop and Digital Preservation (IWAW'05). Archived from the original (PDF) on 2012-07-17. Retrieved 2005-10-12. ^ Spinellis, Diomidis (2003). "The Decay and Failures of Web References". Communications of the ACM. 46 (1): 71–77. CiteSeerX 10.1.1.12.9599. doi:10.1145/602421.602422. Archived from the original on 2020-07-23. Retrieved 2007-09-29. ^ Steve Lawrence; David M. Pennock; Gary William Flake; et al. (March 2001). "Persistence of Web References in Scientific Research". Computer. 34 (3): 26–31. CiteSeerX 10.1.1.97.9695. doi:10.1109/2.901164. ISSN 0018-9162. Wikidata Q21012586. ^ Hennessey, Jason; Xijin Ge, Steven (2013). "A Cross Disciplinary Study of Link Decay and the Effectiveness of Mitigation Techniques". BMC Bioinformatics. 14: S5. doi:10.1186/1471-2105-14-S14-S5. PMC 3851533. PMID 24266891. ^ Nelson, Michael L.; Allen, B. Danette (2002). "Object Persistence and Availability in Digital Libraries". D-Lib Magazine. 8 (1). 
doi:10.1045/january2002-nelson. Archived from the original on 2020-07-19. Retrieved 2019-09-24. ^ "The death of a TLD". blog.benjojo.co.uk. Archived from the original on 2018-07-26. Retrieved 2018-07-27. ^ Berners-Lee, Tim (1998). "Cool URIs Don't Change". Archived from the original on 2000-03-02. Retrieved 2019-01-31. ^ a b Kille, Leighton Walter (8 November 2014). "The Growing Problem of Internet "Link Rot" and Best Practices for Media and Online Publishers". Journalist's Resource, Harvard Kennedy School. Archived from the original on 12 January 2015. Retrieved 16 January 2015. ^ "Internet Archive: Digital Library of Free Books, Movies, Music & Wayback Machine". 2001-03-10. Archived from the original on 26 January 1997. Retrieved 7 October 2013. ^ Eysenbach, Gunther; Trudel, Mathieu (2005). "Going, going, still there: Using the WebCite service to permanently archive cited web pages". Journal of Medical Internet Research. 7 (5): e60. doi:10.2196/jmir.7.5.e60. PMC 1550686. PMID 16403724. ^ Zittrain, Jonathan; Albert, Kendra; Lessig, Lawrence (12 June 2014). "Perma: Scoping and Addressing the Problem of Link and Reference Rot in Legal Citations" (PDF). Legal Information Management. 14 (2): 88–99. doi:10.1017/S1472669614000255. Archived (PDF) from the original on 1 November 2020. Retrieved 10 June 2020. ^ "Harvard University's Berkman Center Releases Amber, a "Mutual Aid" Tool for Bloggers & Website Owners to Help Keep the Web Available | Berkman Center". cyber.law.harvard.edu. Archived from the original on 2016-02-02. Retrieved 2016-01-28. ^ Rønn-Jensen, Jesper (2007-10-05). "Software Eliminates User Errors And Linkrot". Justaddwater.dk. Archived from the original on 11 October 2007. Retrieved 5 October 2007. ^ Mueller, John (2007-12-14). "FYI on Google Toolbar's Latest Features". Google Webmaster Central Blog. Archived from the original on 13 September 2008. Retrieved 9 July 2008. ^ Bar-Yossef, Ziv; Broder, Andrei Z.; Kumar, Ravi; Tomkins, Andrew (2004). "Sic transit gloria telae: towards an understanding of the Web's decay". Proceedings of the 13th international conference on World Wide Web – WWW '04. pp. 328–337. CiteSeerX 10.1.1.1.9406. doi:10.1145/988672.988716. ISBN 978-1581138443. External links[edit] The Wikibook Authoring Webpages has a page on the topic of: Preventing link rot Future-Proofing Your URIs Jakob Nielsen, "Fighting Linkrot", Jakob Nielsen's Alertbox, June 14, 1998. 
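As a rough sketch of the automated detection approach described in the Prevention and detection section above (and subject to the soft-404 caveat noted there), a checker can issue an HTTP request per link and flag error responses. The example below uses only the Python standard library and treats 4xx/5xx statuses and network failures as broken; the URLs shown are placeholders.

    from urllib.request import Request, urlopen
    from urllib.error import HTTPError, URLError

    def check_link(url, timeout=10):
        # Return (url, status) where status is an HTTP code or an error description.
        try:
            req = Request(url, method="HEAD")        # HEAD avoids downloading the body
            with urlopen(req, timeout=timeout) as resp:
                return url, resp.status
        except HTTPError as err:
            return url, err.code                     # e.g. 404 for a dead link
        except URLError as err:
            return url, str(err.reason)              # DNS failure, refused connection, ...

    for url in ["https://example.com/", "https://example.com/missing"]:
        print(check_link(url))

A checker like this cannot distinguish a soft 404 or a page whose content has silently changed, which is why the article notes that a 200 OK response is not proof that the link is still valid.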
Retrieved from "https://en.wikipedia.org/w/index.php?title=Link_rot&oldid=1016420788" Categories: URL Data quality Product expiration Hidden categories: Articles with short description Short description is different from Wikidata All articles with unsourced statements Articles with unsourced statements from January 2019 Articles prone to spam from November 2015 Navigation menu Personal tools Not logged in Talk Contributions Create account Log in Namespaces Article Talk Variants Views Read Edit View history More Search Navigation Main page Contents Current events Random article About Wikipedia Contact us Donate Contribute Help Learn to edit Community portal Recent changes Upload file Tools What links here Related changes Upload file Special pages Permanent link Page information Cite this page Wikidata item Print/export Download as PDF Printable version In other projects Wikimedia Commons Languages Bosanski Dansk Deutsch Eesti Español فارسی Français 한국어 Bahasa Indonesia Nederlands 日本語 Norsk bokmål Polski Português Русский Suomi Svenska ไทย Türkçe Edit links This page was last edited on 7 April 2021, at 02:19 (UTC). Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization. Privacy policy About Wikipedia Disclaimers Contact Wikipedia Mobile view Developers Statistics Cookie statement en-wikipedia-org-3361 ---- Unix philosophy - Wikipedia Unix philosophy From Wikipedia, the free encyclopedia Jump to navigation Jump to search Philosophy on developing software Ken Thompson and Dennis Ritchie, key proponents of the Unix philosophy The Unix philosophy, originated by Ken Thompson, is a set of cultural norms and philosophical approaches to minimalist, modular software development. It is based on the experience of leading developers of the Unix operating system. Early Unix developers were important in bringing the concepts of modularity and reusability into software engineering practice, spawning a "software tools" movement. Over time, the leading developers of Unix (and programs that ran on it) established a set of cultural norms for developing software; these norms became as important and influential as the technology of Unix itself; this has been termed the "Unix philosophy." The Unix philosophy emphasizes building simple, short, clear, modular, and extensible code that can be easily maintained and repurposed by developers other than its creators. The Unix philosophy favors composability as opposed to monolithic design. Contents 1 Origin 2 The UNIX Programming Environment 3 Program Design in the UNIX Environment 4 Doug McIlroy on Unix programming 5 Do One Thing and Do It Well 6 Eric Raymond's 17 Unix Rules 7 Mike Gancarz: The UNIX Philosophy 8 "Worse is better" 9 Criticism 10 See also 11 Notes 12 References 13 External links Origin[edit] The Unix philosophy is documented by Doug McIlroy[1] in the Bell System Technical Journal from 1978:[2] Make each program do one thing well. To do a new job, build afresh rather than complicate old programs by adding new "features". Expect the output of every program to become the input to another, as yet unknown, program. Don't clutter output with extraneous information. Avoid stringently columnar or binary input formats. Don't insist on interactive input. Design and build software, even operating systems, to be tried early, ideally within weeks. 
Don't hesitate to throw away the clumsy parts and rebuild them. Use tools in preference to unskilled help to lighten a programming task, even if you have to detour to build the tools and expect to throw some of them out after you've finished using them. It was later summarized by Peter H. Salus in A Quarter-Century of Unix (1994):[1] Write programs that do one thing and do it well. Write programs to work together. Write programs to handle text streams, because that is a universal interface. In their award-winning Unix paper of 1974[citation needed], Ritchie and Thompson quote the following design considerations:[3] Make it easy to write, test, and run programs. Interactive use instead of batch processing. Economy and elegance of design due to size constraints ("salvation through suffering"). Self-supporting system: all Unix software is maintained under Unix. The whole philosophy of UNIX seems to stay out of assembler. — Michael Sean Mahoney[4] The UNIX Programming Environment[edit] In their preface to the 1984 book, The UNIX Programming Environment, Brian Kernighan and Rob Pike, both from Bell Labs, give a brief description of the Unix design and the Unix philosophy:[5] Rob Pike, co-author of The UNIX Programming Environment Even though the UNIX system introduces a number of innovative programs and techniques, no single program or idea makes it work well. Instead, what makes it effective is the approach to programming, a philosophy of using the computer. Although that philosophy can't be written down in a single sentence, at its heart is the idea that the power of a system comes more from the relationships among programs than from the programs themselves. Many UNIX programs do quite trivial things in isolation, but, combined with other programs, become general and useful tools. The authors further write that their goal for this book is "to communicate the UNIX programming philosophy."[5] Program Design in the UNIX Environment[edit] Brian Kernighan has written at length about the Unix philosophy In October 1984, Brian Kernighan and Rob Pike published a paper called Program Design in the UNIX Environment. In this paper, they criticize the accretion of program options and features found in some newer Unix systems such as 4.2BSD and System V, and explain the Unix philosophy of software tools, each performing one general function:[6] Much of the power of the UNIX operating system comes from a style of program design that makes programs easy to use and, more important, easy to combine with other programs. This style has been called the use of software tools, and depends more on how the programs fit into the programming environment and how they can be used with other programs than on how they are designed internally. [...] This style was based on the use of tools: using programs separately or in combination to get a job done, rather than doing it by hand, by monolithic self-sufficient subsystems, or by special-purpose, one-time programs. The authors contrast Unix tools such as cat, with larger program suites used by other systems.[6] The design of cat is typical of most UNIX programs: it implements one simple but general function that can be used in many different applications (including many not envisioned by the original author). Other commands are used for other functions. For example, there are separate commands for file system tasks like renaming files, deleting them, or telling how big they are. 
Other systems instead lump these into a single "file system" command with an internal structure and command language of its own. (The PIP file copy program found on operating systems like CP/M or RSX-11 is an example.) That approach is not necessarily worse or better, but it is certainly against the UNIX philosophy.

Doug McIlroy on Unix programming
[Image: Doug McIlroy (left) with Dennis Ritchie]
McIlroy, then head of the Bell Labs Computing Sciences Research Center, and inventor of the Unix pipe,[7] summarized the Unix philosophy as follows:[1] This is the Unix philosophy: Write programs that do one thing and do it well. Write programs to work together. Write programs to handle text streams, because that is a universal interface. Beyond these statements, he has also emphasized simplicity and minimalism in Unix programming:[1] The notion of "intricate and beautiful complexities" is almost an oxymoron. Unix programmers vie with each other for "simple and beautiful" honors — a point that's implicit in these rules, but is well worth making overt. Conversely, McIlroy has criticized modern Linux as having software bloat, remarking that, "adoring admirers have fed Linux goodies to a disheartening state of obesity."[8] He contrasts this with the earlier approach taken at Bell Labs when developing and revising Research Unix:[9] Everything was small... and my heart sinks for Linux when I see the size of it. [...] The manual page, which really used to be a manual page, is now a small volume, with a thousand options... We used to sit around in the Unix Room saying, 'What can we throw out? Why is there this option?' It's often because there is some deficiency in the basic design — you didn't really hit the right design point. Instead of adding an option, think about what was forcing you to add that option.

Do One Thing and Do It Well
As stated by McIlroy, and generally accepted throughout the Unix community, Unix programs have always been expected to follow the concept of DOTADIW, or "Do One Thing And Do It Well." There are limited sources for the acronym DOTADIW on the Internet, but it is discussed at length during the development and packaging of new operating systems, especially in the Linux community. Patrick Volkerding, the project lead of Slackware Linux, invoked this design principle in a criticism of the systemd architecture, stating that, "attempting to control services, sockets, devices, mounts, etc., all within one daemon flies in the face of the Unix concept of doing one thing and doing it well."[10]

Eric Raymond's 17 Unix Rules
In his book The Art of Unix Programming, first published in 2003,[11] Eric S. Raymond, an American programmer and open source advocate, summarizes the Unix philosophy as the KISS principle of "Keep it Simple, Stupid."[12] He provides a series of design rules:[1]
Build modular programs
Write readable programs
Use composition
Separate mechanisms from policy
Write simple programs
Write small programs
Write transparent programs
Write robust programs
Make data complicated when required, not the program
Build on potential users' expected knowledge
Avoid unnecessary output
Write programs which fail in a way that is easy to diagnose
Value developer time over machine time
Write abstract programs that generate code instead of writing code by hand
Prototype software before polishing it
Write flexible and open programs
Make the program and protocols extensible.
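The composition and text-stream precepts above are easiest to see in a pipeline. The following sketch, which assumes a Unix-like system with the standard ls and wc tools on the PATH, wires two small single-purpose programs together from Python using the subprocess module:

    import subprocess

    # Equivalent of the shell pipeline: ls | wc -l
    ls = subprocess.Popen(["ls"], stdout=subprocess.PIPE)
    wc = subprocess.Popen(["wc", "-l"], stdin=ls.stdout,
                          stdout=subprocess.PIPE, text=True)
    ls.stdout.close()                 # let ls receive SIGPIPE if wc exits first
    count, _ = wc.communicate()       # each tool does one thing; text is the interface
    print(count.strip(), "directory entries")

Neither tool knows about the other; the plain text flowing between them is the only contract, which is the point of treating text streams as a universal interface.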
Mike Gancarz: The UNIX Philosophy[edit] In 1994, Mike Gancarz (a member of the team that designed the X Window System), drew on his own experience with Unix, as well as discussions with fellow programmers and people in other fields who depended on Unix, to produce The UNIX Philosophy which sums it up in nine paramount precepts: Small is beautiful. Make each program do one thing well. Build a prototype as soon as possible. Choose portability over efficiency. Store data in flat text files. Use software leverage to your advantage. Use shell scripts to increase leverage and portability. Avoid captive user interfaces. Make every program a filter. "Worse is better"[edit] Main article: Worse is better Richard P. Gabriel suggests that a key advantage of Unix was that it embodied a design philosophy he termed "worse is better", in which simplicity of both the interface and the implementation are more important than any other attributes of the system—including correctness, consistency, and completeness. Gabriel argues that this design style has key evolutionary advantages, though he questions the quality of some results. For example, in the early days Unix used a monolithic kernel (which means that user processes carried out kernel system calls all on the user stack). If a signal was delivered to a process while it was blocked on a long-term I/O in the kernel, then what should be done? Should the signal be delayed, possibly for a long time (maybe indefinitely) while the I/O completed? The signal handler could not be executed when the process was in kernel mode, with sensitive kernel data on the stack. Should the kernel back-out the system call, and store it, for replay and restart later, assuming that the signal handler completes successfully? In these cases Ken Thompson and Dennis Ritchie favored simplicity over perfection. The Unix system would occasionally return early from a system call with an error stating that it had done nothing—the "Interrupted System Call", or an error number 4 (EINTR) in today's systems. Of course the call had been aborted in order to call the signal handler. This could only happen for a handful of long-running system calls such as read(), write(), open(), and select(). On the plus side, this made the I/O system many times simpler to design and understand. The vast majority of user programs were never affected because they did not handle or experience signals other than SIGINT and would die right away if one was raised. For the few other programs—things like shells or text editors that respond to job control key presses—small wrappers could be added to system calls so as to retry the call right away if this EINTR error was raised. Thus, the problem was solved in a simple manner. Criticism[edit] In a 1981 article entitled "The truth about Unix: The user interface is horrid"[13] published in Datamation, Don Norman criticized the design philosophy of Unix for its lack of concern for the user interface. Writing from his background in cognitive science and from the perspective of the then-current philosophy of cognitive engineering,[4] he focused on how end-users comprehend and form a personal cognitive model of systems—or, in the case of Unix, fail to understand, with the result that disastrous mistakes (such as losing an hour's worth of work) are all too easy. 
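Returning to the EINTR behaviour described in the "Worse is better" section above, the small retry wrapper it mentions can be sketched as follows. This is only an illustration of the pattern: since PEP 475, Python itself retries most system calls interrupted by a signal, so such a loop is rarely needed in modern Python code.

    import os

    def read_retrying(fd, n):
        # Call os.read, retrying if the call is interrupted by a signal (errno EINTR).
        while True:
            try:
                return os.read(fd, n)
            except InterruptedError:      # raised when a signal handler interrupts the call
                continue                  # the "small wrapper": simply retry the call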
See also[edit] Cognitive engineering Unix architecture Minimalism (computing) Software engineering KISS principle Hacker ethic List of software development philosophies Everything is a file Worse is better Notes[edit] ^ a b c d e Raymond, Eric S. (2004). "Basics of the Unix Philosophy". The Art of Unix Programming. Addison-Wesley Professional (published 2003-09-23). ISBN 0-13-142901-9. Retrieved 2016-11-01. ^ Doug McIlroy, E. N. Pinson, B. A. Tague (8 July 1978). "Unix Time-Sharing System: Foreword". The Bell System Technical Journal. Bell Laboratories: 1902–1903.CS1 maint: multiple names: authors list (link) ^ Dennis Ritchie; Ken Thompson (1974), "The UNIX time-sharing system" (PDF), Communications of the ACM, 17 (7): 365–375, doi:10.1145/361011.361061, S2CID 53235982 ^ a b "An Oral History of Unix". Princeton University History of Science. ^ a b Kernighan, Brian W. Pike, Rob. The UNIX Programming Environment. 1984. viii ^ a b Rob Pike; Brian W. Kernighan (October 1984). "Program Design in the UNIX Environment" (PDF). ^ Dennis Ritchie (1984), "The Evolution of the UNIX Time-Sharing System" (PDF), AT&T Bell Laboratories Technical Journal, 63 (8): 1577–1593, doi:10.1002/j.1538-7305.1984.tb00054.x ^ Douglas McIlroy. "Remarks for Japan Prize award ceremony for Dennis Ritchie, May 19, 2011, Murray Hill, NJ" (PDF). Retrieved 2014-06-19. ^ Bill McGonigle. "Ancestry of Linux — How the Fun Began (2005)". Retrieved 2014-06-19. ^ "Interview with Patrick Volkerding of Slackware". linuxquestions.org. 2012-06-07. Retrieved 2015-10-24. ^ Raymond, Eric (2003-09-19). The Art of Unix Programming. Addison-Wesley. ISBN 0-13-142901-9. Retrieved 2009-02-09. ^ Raymond, Eric (2003-09-19). "The Unix Philosophy in One Lesson". The Art of Unix Programming. Addison-Wesley. ISBN 0-13-142901-9. Retrieved 2009-02-09. ^ Norman, Don (1981). "The truth about Unix: The user interface is horrid" (PDF). Datamation. 27 (12). References[edit] The Unix Programming Environment by Brian Kernighan and Rob Pike, 1984 Program Design in the UNIX Environment – The paper by Pike and Kernighan that preceded the book. Notes on Programming in C, Rob Pike, September 21, 1989 A Quarter Century of Unix, Peter H. Salus, Addison-Wesley, May 31, 1994 ( ISBN 0-201-54777-5) Philosophy — from The Art of Unix Programming, Eric S. Raymond, Addison-Wesley, September 17, 2003 ( ISBN 0-13-142901-9) Final Report of the Multics Kernel Design Project by M. D. Schroeder, D. D. Clark, J. H. Saltzer, and D. H. Wells, 1977. 
The UNIX Philosophy, Mike Gancarz, ISBN 1-55558-123-4 External links[edit] Basics of the Unix Philosophy – by Catb.org The Unix Philosophy: A Brief Introduction – by The Linux Information Project (LINFO) Why the Unix Philosophy still matters
en-wikipedia-org-338 ---- Evaluation strategy - Wikipedia Evaluation strategies are used by programming languages to determine two things—when to evaluate the arguments of a function call and what kind of value to pass to the function. To illustrate, a function application may evaluate the argument before evaluating the function's body and pass the ability to look up the argument's current value and modify it via assignment.[1] The notion of reduction strategy in lambda calculus is similar but distinct. In practical terms, many modern programming languages like C# and Java have converged on a call-by-value/call-by-reference evaluation strategy for function calls.[clarification needed] Some languages, especially lower-level languages such as C++, combine several notions of parameter passing. Historically, call by value and call by name date back to ALGOL 60, which was designed in the late 1950s. Call by reference is used by PL/I and some Fortran systems.[2] Purely functional languages like Haskell, as well as non-purely functional languages like R, use call by need. Evaluation strategy is specified by the programming language definition, and is not a function of any specific implementation.
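As a small illustration of the first of those two questions (when arguments are evaluated), the C sketch below shows strict evaluation together with a detail the language definition deliberately leaves open; the functions are invented for the example:

#include <stdio.h>

int left(void)  { printf("left evaluated\n");  return 1; }
int right(void) { printf("right evaluated\n"); return 2; }

int add(int a, int b) { return a + b; }

int main(void)
{
    /* C is strict: both arguments are fully evaluated before add() runs.
       However, the order in which left() and right() are evaluated is
       unspecified, so either trace line may be printed first. */
    printf("%d\n", add(left(), right()));
    return 0;
}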
Contents 1 Strict evaluation 1.1 Applicative order 1.2 Call by value 1.2.1 Implicit limitations 1.3 Call by reference 1.4 Call by sharing 1.5 Call by copy-restore 1.6 Partial evaluation 2 Non-strict evaluation 2.1 Normal order 2.2 Call by name 2.3 Call by need 2.4 Call by macro expansion 3 Nondeterministic strategies 3.1 Full β-reduction 3.2 Call by future 3.3 Optimistic evaluation 4 See also 5 References 6 Further reading Strict evaluation[edit] Main article: Eager evaluation In strict evaluation, the arguments to a function are always evaluated completely before the function is applied. Under Church encoding, eager evaluation of operators maps to strict evaluation of functions; for this reason, strict evaluation is sometimes called "eager". Most existing programming languages use strict evaluation for functions. Applicative order[edit] Applicative order evaluation is an evaluation strategy in which an expression is evaluated by repeatedly evaluating its leftmost innermost reducible expression. This means that a function's arguments are evaluated before the function is applied.[3] Call by value[edit] Call by value (also known as pass by value) is the most common evaluation strategy, used in languages as different as C and Scheme. In call by value, the argument expression is evaluated, and the resulting value is bound to the corresponding variable in the function (frequently by copying the value into a new memory region). If the function or procedure is able to assign values to its parameters, only its local variable is assigned—that is, anything passed into a function call is unchanged in the caller's scope when the function returns. Call by value is not a single evaluation strategy, but rather the family of evaluation strategies in which a function's argument is evaluated before being passed to the function. While many programming languages (such as Common Lisp, Eiffel and Java) that use call by value evaluate function arguments left-to-right, some evaluate functions and their arguments right-to-left, and others (such as Scheme, OCaml and C) do not specify order. Implicit limitations[edit] In some cases, the term "call by value" is problematic, as the value which is passed is not the value of the variable as understood by the ordinary meaning of value, but an implementation-specific reference to the value. The effect is that what syntactically looks like call by value may end up rather behaving like call by reference or call by sharing, often depending on very subtle aspects of the language semantics. The reason for passing a reference is often that the language technically does not provide a value representation of complicated data, but instead represents them as a data structure while preserving some semblance of value appearance in the source code. Exactly where the boundary is drawn between proper values and data structures masquerading as such is often hard to predict. In C, an array (of which strings are special cases) is a data structure but the name of an array is treated as (has as value) the reference to the first element of the array, while a struct variable's name refers to a value even if it has fields that are vectors. In Maple, a vector is a special case of a table and therefore a data structure, but a list (which gets rendered and can be indexed in exactly the same way) is a value. In Tcl, values are "dual-ported" such that the value representation is used at the script level, and the language itself manages the corresponding data structure, if one is required. 
Modifications made via the data structure are reflected back to the value representation and vice versa. The description "call by value where the value is a reference" is common (but should not be understood as being call by reference); another term is call by sharing. Thus the behaviour of call by value Java or Visual Basic and call by value C or Pascal are significantly different: in C or Pascal, calling a function with a large structure as an argument will cause the entire structure to be copied (except if it's actually a reference to a structure), potentially causing serious performance degradation, and mutations to the structure are invisible to the caller. However, in Java or Visual Basic only the reference to the structure is copied, which is fast, and mutations to the structure are visible to the caller. Call by reference[edit] Call by reference (or pass by reference) is an evaluation strategy where a function receives an implicit reference to a variable used as argument, rather than a copy of its value. This typically means that the function can modify (i.e., assign to) the variable used as argument—something that will be seen by its caller. Call by reference can therefore be used to provide an additional channel of communication between the called function and the calling function. A call-by-reference language makes it more difficult for a programmer to track the effects of a function call, and may introduce subtle bugs. A simple litmus test for whether a language supports call-by-reference semantics is if it's possible to write a traditional swap(a, b) function in the language.[4] Many languages support call by reference in some form, but few use it by default. FORTRAN II is an early example of a call-by-reference language. A few languages, such as C++, PHP, Visual Basic .NET, C# and REALbasic, default to call by value, but offer a special syntax for call-by-reference parameters. C++ additionally offers call by reference to const. Call by reference can be simulated in languages that use call by value and don't exactly support call by reference, by making use of references (objects that refer to other objects), such as pointers (objects representing the memory addresses of other objects). Languages such as C, ML and Rust use this technique. It is not a separate evaluation strategy—the language calls by value—but sometimes it is referred to as "call by address" or "pass by address". In ML, references are type- and memory-safe, similar to Rust. A similar effect is achieved by call by sharing (passing an object, which can then be mutated), used in languages like Java, Python, and Ruby. In purely functional languages there is typically no semantic difference between the two strategies (since their data structures are immutable, so there is no possibility for a function to modify any of its arguments), so they are typically described as call by value even though implementations frequently use call by reference internally for the efficiency benefits. Following is an example that demonstrates call by reference in the E programming language:

def modify(var p, &q) {
    p := 27  # passed by value: only the local parameter is modified
    q := 27  # passed by reference: variable used in call is modified
}
? var a := 1   # value: 1
? var b := 2   # value: 2
? modify(a, &b)
? a            # value: 1
? b            # value: 27

Following is an example of call by address that simulates call by reference in C:

void modify(int p, int* q, int* r) {
    p = 27;  // passed by value: only the local parameter is modified
    *q = 27; // passed by value or reference, check call site to determine which
    *r = 27; // passed by value or reference, check call site to determine which
}

int main() {
    int a = 1;
    int b = 1;
    int x = 1;
    int* c = &x;
    modify(a, &b, c); // a is passed by value, b is passed by reference by creating a pointer (call by value),
                      // c is a pointer passed by value
                      // b and x are changed
    return 0;
}

Call by sharing[edit] Call by sharing (also known as "call by object" or "call by object-sharing") is an evaluation strategy first noted by Barbara Liskov in 1974 for the CLU language.[5] It is used by languages such as Python,[6] Java (for object references), Ruby, JavaScript, Scheme, OCaml, AppleScript, and many others. However, the term "call by sharing" is not in common use; the terminology is inconsistent across different sources. For example, in the Java community, they say that Java is call by value.[7] Call by sharing implies that values in the language are based on objects rather than primitive types, i.e., that all values are "boxed". Because they are boxed they can be said to pass by copy of reference (where primitives are boxed before passing and unboxed at called function). The semantics of call by sharing differ from call by reference: "In particular it is not call by value because mutations of arguments performed by the called routine will be visible to the caller. And it is not call by reference because access is not given to the variables of the caller, but merely to certain objects".[8] So, for example, if a variable was passed, it is not possible to simulate an assignment on that variable in the callee's scope.[9] However, since the function has access to the same object as the caller (no copy is made), mutations to those objects, if the objects are mutable, within the function are visible to the caller, which may appear to differ from call by value semantics. Mutations of a mutable object within the function are visible to the caller because the object is not copied or cloned—it is shared. For example, in Python, lists are mutable, so:

def f(a_list):
    a_list.append(1)

m = []
f(m)
print(m)

outputs [1] because the append method modifies the object on which it is called. Assignments within a function are not noticeable to the caller, because, in these languages, passing the variable only means passing (access to) the actual object referred to by the variable, not access to the original (caller's) variable. Since the rebound variable only exists within the scope of the function, the counterpart in the caller retains its original binding. Compare the Python mutation above with the code below, which binds the formal argument to a new object:

def f(a_list):
    a_list = [1]

m = []
f(m)
print(m)

outputs [], because the statement a_list = [1] reassigns a new list to the variable rather than to the location it references. For immutable objects, there is no real difference between call by sharing and call by value, except if object identity is visible in the language.
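The same distinction can be sketched in C, which is plainly call by value, with an explicit pointer playing the role of the shared reference; this is only a rough analogue of the two Python functions above, written for this illustration:

#include <stdio.h>

struct box { int value; };

/* Mutating through the pointer is visible to the caller,
   like a_list.append(1) above. */
void mutate(struct box *b) {
    b->value = 1;
}

/* Reassigning the parameter only rebinds the local pointer,
   like a_list = [1] above; the caller's object is untouched. */
void rebind(struct box *b) {
    struct box other = { 1 };
    b = &other;
    (void)b;
}

int main(void) {
    struct box m = { 0 };
    mutate(&m);
    printf("%d\n", m.value);  /* prints 1: the shared object was mutated */
    m.value = 0;
    rebind(&m);
    printf("%d\n", m.value);  /* prints 0: the rebinding was invisible */
    return 0;
}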
The use of call by sharing with mutable objects is an alternative to input/output parameters: the parameter is not assigned to (the argument is not overwritten and object identity is not changed), but the object (argument) is mutated.[10] Although this term has widespread usage in the Python community, identical semantics in other languages such as Java and Visual Basic are often described as call by value, where the value is implied to be a reference to the object.[citation needed] Call by copy-restore[edit] Call by copy-restore—also known as "copy-in copy-out", "call by value result", "call by value return" (as termed in the Fortran community)—is a special case of call by reference where the provided reference is unique to the caller. This variant has gained attention in multiprocessing contexts and Remote procedure call:[11] if a parameter to a function call is a reference that might be accessible by another thread of execution, its contents may be copied to a new reference that is not; when the function call returns, the updated contents of this new reference are copied back to the original reference ("restored"). The semantics of call by copy-restore also differ from those of call by reference, where two or more function arguments alias one another (i.e., point to the same variable in the caller's environment). Under call by reference, writing to one will affect the other; call by copy-restore avoids this by giving the function distinct copies, but leaves the result in the caller's environment undefined depending on which of the aliased arguments is copied back first—will the copies be made in left-to-right order both on entry and on return? When the reference is passed to the callee uninitialized, this evaluation strategy may be called "call by result". Partial evaluation[edit] Main article: Partial evaluation In partial evaluation, evaluation may continue into the body of a function that has not been applied. Any sub-expressions that do not contain unbound variables are evaluated, and function applications whose argument values are known may be reduced. If there are side effects, complete partial evaluation may produce unintended results, which is why systems that support partial evaluation tend to do so only for "pure" expressions (i.e., those without side effects) within functions. Non-strict evaluation[edit] In non-strict evaluation, arguments to a function are not evaluated unless they are actually used in the evaluation of the function body. Under Church encoding, lazy evaluation of operators maps to non-strict evaluation of functions; for this reason, non-strict evaluation is often referred to as "lazy". Boolean expressions in many languages use a form of non-strict evaluation called short-circuit evaluation, where evaluation returns as soon as it can be determined that an unambiguous Boolean will result—for example, in a disjunctive expression (OR) where true is encountered, or in a conjunctive expression (AND) where false is encountered, and so forth. Conditional expressions also usually use lazy evaluation, where evaluation returns as soon as an unambiguous branch will result. Normal order[edit] Normal order evaluation is an evaluation strategy in which an expression is evaluated by repeatedly evaluating its leftmost outermost reducible expression.
This means that a function's arguments are not evaluated before the function is applied.[12] Call by name[edit] Call by name is an evaluation strategy where the arguments to a function are not evaluated before the function is called—rather, they are substituted directly into the function body (using capture-avoiding substitution) and then left to be evaluated whenever they appear in the function. If an argument is not used in the function body, the argument is never evaluated; if it is used several times, it is re-evaluated each time it appears. (See Jensen's Device.) Call-by-name evaluation is occasionally preferable to call-by-value evaluation. If a function's argument is not used in the function, call by name will save time by not evaluating the argument, whereas call by value will evaluate it regardless. If the argument is a non-terminating computation, the advantage is enormous. However, when the function argument is used, call by name is often slower, requiring a mechanism such as a thunk. An early use was ALGOL 60. Today's .NET languages can simulate call by name using delegates or Expression<T> parameters. The latter results in an abstract syntax tree being given to the function. Eiffel provides agents, which represent an operation to be evaluated when needed. Seed7 provides call by name with function parameters. Java programs can accomplish similar lazy evaluation using lambda expressions and the java.util.function.Supplier interface. Call by need[edit] Main article: Lazy evaluation Call by need is a memoized variant of call by name, where, if the function argument is evaluated, that value is stored for subsequent use. If the argument is pure (i.e., free of side effects), this produces the same results as call by name, saving the cost of recomputing the argument. Haskell is a well-known language that uses call-by-need evaluation. Because evaluation of expressions may happen arbitrarily far into a computation, Haskell only supports side effects (such as mutation) via the use of monads. This eliminates any unexpected behavior from variables whose values change prior to their delayed evaluation. In R's implementation of call by need, all arguments are passed, meaning that R allows arbitrary side effects. Lazy evaluation is the most common implementation of call-by-need semantics, but variations like optimistic evaluation exist. .NET languages implement call by need using the type Lazy<T>. Call by macro expansion[edit] Call by macro expansion is similar to call by name, but uses plain textual substitution rather than capture-avoiding substitution. Macro substitution may therefore cause mistakes, resulting in variable capture and undesired behavior. Hygienic macros avoid this problem by checking for and replacing shadowed variables that are not parameters. Nondeterministic strategies[edit] Full β-reduction[edit] Under "full β-reduction", any function application may be reduced (substituting the function's argument into the function using capture-avoiding substitution) at any time. This may be done even within the body of an unapplied function. Call by future[edit] See also: Futures and promises "Call by future", also known as "parallel call by name", is a concurrent evaluation strategy in which the value of a future expression is computed concurrently with the flow of the rest of the program with promises, also known as futures.
When the promise's value is needed, the main program blocks until the promise has a value (the promise or one of the promises finishes computing, if it has not already completed by then). This strategy is non-deterministic, as the evaluation can occur at any time between creation of the future (i.e., when the expression is given) and use of the future's value. It is similar to call by need in that the value is only computed once, and computation may be deferred until the value is needed, but it may be started before. Further, if the value of a future is not needed, such as if it is a local variable in a function that returns, the computation may be terminated partway through. If implemented with processes or threads, creating a future will spawn one or more new processes or threads (for the promises), accessing the value will synchronize these with the main thread, and terminating the computation of the future corresponds to killing the promises computing its value. If implemented with a coroutine, as in .NET async/await, creating a future calls a coroutine (an async function), which may yield to the caller, and in turn be yielded back to when the value is used, cooperatively multitasking. Optimistic evaluation[edit] Optimistic evaluation is another call-by-need variant where the function's argument is partially evaluated for some amount of time (which may be adjusted at runtime). After that time has passed, evaluation is aborted and the function is applied using call by need.[13] This approach avoids some of the call-by-need strategy's runtime expenses while retaining desired termination characteristics. See also[edit] Beta normal form Comparison of programming languages eval Lambda calculus Call-by-push-value Parameter (computer science) References[edit] ^ Daniel P. Friedman; Mitchell Wand (2008). Essentials of Programming Languages (third ed.). Cambridge, MA: The MIT Press. ISBN 978-0262062794. ^ Some Fortran systems use call by copy-restore. ^ "Applicative order reduction". Encyclopedia2.thefreedictionary.com. Retrieved 2019-11-19. ^ "Java is Pass-by-Value, Dammit!". Retrieved 2016-12-24. ^ Liskov, Barbara; Atkinson, Russ; Bloom, Toby; Moss, Eliot; Schaffert, Craig; Scheifler, Craig; Snyder, Alan (October 1979). "CLU Reference Manual" (PDF). Laboratory for Computer Science. Massachusetts Institute of Technology. Archived from the original (PDF) on 2006-09-22. Retrieved 2011-05-19. ^ Lundh, Fredrik. "Call By Object". effbot.org. Retrieved 2011-05-19. ^ "Java is Pass-by-Value, Dammit!". Retrieved 2016-12-24. ^ CLU Reference Manual (1974), p. 14-15. ^ Note: in CLU language, "variable" corresponds to "identifier" and "pointer" in modern standard usage, not to the general/usual meaning of variable. ^ "CA1021: Avoid out parameters". Microsoft. ^ "RPC: Remote Procedure Call Protocol Specification Version 2". tools.ietf.org. IETF. Retrieved 7 April 2018. ^ "Normal order reduction". Encyclopedia2.thefreedictionary.com. Retrieved 2019-11-19. ^ Ennals, Robert; Jones, Simon Peyton (August 2003). "Optimistic Evaluation: a fast evaluation strategy for non-strict programs". Further reading[edit] Abelson, Harold; Sussman, Gerald Jay (1996).
Structure and Interpretation of Computer Programs (Second ed.). Cambridge, Massachusetts: The MIT Press. ISBN 978-0-262-01153-2. Baker-Finch, Clem; King, David; Hall, Jon; Trinder, Phil (1999-03-10). "An Operational Semantics for Parallel Call-by-Need" (ps). Research report. Faculty of Mathematics & Computing, The Open University. 99 (1). Ennals, Robert; Peyton Jones, Simon (2003). Optimistic Evaluation: A Fast Evaluation Strategy for Non-Strict Programs (PDF). International Conference on Functional Programming. ACM Press. Ludäscher, Bertram (2001-01-24). "CSE 130 lecture notes". CSE 130: Programming Languages: Principles & Paradigms. Pierce, Benjamin C. (2002). Types and Programming Languages. MIT Press. ISBN 0-262-16209-1. Sestoft, Peter (2002). Mogensen, T; Schmidt, D; Sudborough, I. H. (eds.). Demonstrating Lambda Calculus Reduction (PDF). The Essence of Computation: Complexity, Analysis, Transformation. Essays Dedicated to Neil D. Jones. Lecture Notes in Computer Science. 2566. Springer-Verlag. pp. 420–435. ISBN 3-540-00326-6. "Call by Value and Call by Reference in C Programming". Call by Value and Call by Reference in C Programming explained. Archived from the original on 2013-01-21.
en-wikipedia-org-3552 ---- API - Wikipedia Set of subroutine definitions, protocols, and tools for building software and applications In computing, an application programming interface (API) is an interface that defines interactions between multiple software applications or mixed hardware-software intermediaries.[1] It defines the kinds of calls or requests that can be made, how to make them, the data formats that should be used, the conventions to follow, etc.
It can also provide extension mechanisms so that users can extend existing functionality in various ways and to varying degrees.[2] An API can be entirely custom, specific to a component, or designed based on an industry-standard to ensure interoperability. Through information hiding, APIs enable modular programming, allowing users to use the interface independently of the implementation. Reference to Web APIs is currently the most common use of the term.[3] There are also APIs for programming languages, software libraries, computer operating systems, and computer hardware. APIs originated in the 1940s, though the term API did not emerge until the 1960s and 70s. Contents 1 Purpose 2 History of the term 3 Usage 3.1 Libraries and frameworks 3.2 Operating systems 3.3 Remote APIs 3.4 Web APIs 4 Design 5 Release policies 5.1 Public API implications 6 Documentation 7 Dispute over copyright protection for APIs 8 Examples 9 See also 10 References 11 Further reading Purpose[edit] In building applications, an API (application programming interface) simplifies programming by abstracting the underlying implementation and only exposing objects or actions the developer needs. While a graphical interface for an email client might provide a user with a button that performs all the steps for fetching and highlighting new emails, an API for file input/output might give the developer a function that copies a file from one location to another without requiring that the developer understand the file system operations occurring behind the scenes.[4] History of the term[edit] A diagram from 1978 proposing the expansion of the idea of the API to become a general programming interface, beyond application programs alone.[5] The meaning of the term API has expanded over its history. It first described an interface only for end-user-facing programs, known as application programs. This origin is still reflected in the name "application programming interface." Today, the term API is broader, including also utility software and even hardware interfaces.[6] The idea of the API is much older than the term. British computer scientists Wilkes and Wheeler worked on modular software libraries in the 1940s for the EDSAC computer. Their book The Preparation of Programs for an Electronic Digital Computer contains the first published API specification. Joshua Bloch claims that Wilkes and Wheeler "latently invented" the API, because it is more of a concept that is discovered than invented.[6] Although the people who coined the term API were implementing software on a Univac 1108, the goal of their API was to make hardware independent programs possible.[7] The term "application program interface" (without an -ing suffix) is first recorded in a paper called Data structures and techniques for remote computer graphics presented at an AFIPS conference in 1968.[8][6] The authors of this paper use the term to describe the interaction of an application — a graphics program in this case — with the rest of the computer system. A consistent application interface (consisting of Fortran subroutine calls) was intended to free the programmer from dealing with idiosyncrasies of the graphics display device, and to provide hardware independence if the computer or the display were replaced.[7] The term was introduced to the field of databases by C. J. 
Date[9] in a 1974 paper called The Relational and Network Approaches: Comparison of the Application Programming Interface.[10] An API became a part of ANSI/SPARC framework for database management systems. This framework treated the application programming interface separately from other interfaces, such as the query interface. Database professionals in the 1970s observed these different interfaces could be combined; a sufficiently rich application interface could support the other interfaces as well.[5] This observation led to APIs that supported all types of programming, not just application programming. By 1990, the API was defined simply as "a set of services available to a programmer for performing certain tasks" by technologist Carl Malamud.[11] The conception of the API was expanded again with the dawn of web APIs. Roy Fielding's dissertation Architectural Styles and the Design of Network-based Software Architectures at UC Irvine in 2000 outlined Representational state transfer (REST) and described the idea of a "network-based Application Programming Interface" that Fielding contrasted with traditional "library-based" APIs.[12] XML and JSON web APIs saw widespread commercial adoption beginning in 2000 and continuing as of 2021. The web API is now the most common meaning of the term API.[3] When used in this way, the term API has some overlap in meaning with the terms communication protocol and remote procedure call. The Semantic Web proposed by Tim Berners-Lee in 2001 included "semantic APIs" that recast the API as an open, distributed data interface rather than a software behavior interface.[13] Instead, proprietary interfaces and agents became more widespread. Usage[edit] Libraries and frameworks[edit] The interface to a software library is one type of API. The API describes and prescribes the "expected behavior" (a specification) while the library is an "actual implementation" of this set of rules. A single API can have multiple implementations (or none, being abstract) in the form of different libraries that share the same programming interface. The separation of the API from its implementation can allow programs written in one language to use a library written in another. For example, because Scala and Java compile to compatible bytecode, Scala developers can take advantage of any Java API.[14] API use can vary depending on the type of programming language involved. An API for a procedural language such as Lua could consist primarily of basic routines to execute code, manipulate data or handle errors while an API for an object-oriented language, such as Java, would provide a specification of classes and its class methods.[15][16] Language bindings are also APIs. By mapping the features and capabilities of one language to an interface implemented in another language, a language binding allows a library or service written in one language to be used when developing in another language.[17] Tools such as SWIG and F2PY, a Fortran-to-Python interface generator, facilitate the creation of such interfaces.[18] An API can also be related to a software framework: a framework can be based on several libraries implementing several APIs, but unlike the normal use of an API, the access to the behavior built into the framework is mediated by extending its content with new classes plugged into the framework itself. 
Moreover, the overall program flow of control can be out of the control of the caller and in the framework's hands by inversion of control or a similar mechanism.[19][20] Operating systems[edit] An API can specify the interface between an application and the operating system.[21] POSIX, for example, specifies a set of common APIs that aim to enable an application written for a POSIX conformant operating system to be compiled for another POSIX conformant operating system. Linux and Berkeley Software Distribution are examples of operating systems that implement the POSIX APIs.[22] Microsoft has shown a strong commitment to a backward-compatible API, particularly within its Windows API (Win32) library, so older applications may run on newer versions of Windows using an executable-specific setting called "Compatibility Mode".[23] An API differs from an application binary interface (ABI) in that an API is source code based while an ABI is binary based. For instance, POSIX provides APIs while the Linux Standard Base provides an ABI.[24][25] Remote APIs[edit] Remote APIs allow developers to manipulate remote resources through protocols, specific standards for communication that allow different technologies to work together, regardless of language or platform. For example, the Java Database Connectivity API allows developers to query many different types of databases with the same set of functions, while the Java remote method invocation API uses the Java Remote Method Protocol to allow invocation of functions that operate remotely, but appear local to the developer.[26][27] Therefore, remote APIs are useful in maintaining the object abstraction in object-oriented programming; a method call, executed locally on a proxy object, invokes the corresponding method on the remote object, using the remoting protocol, and acquires the result to be used locally as a return value. A modification of the proxy object will also result in a corresponding modification of the remote object.[28] Web APIs[edit] Main article: Web API Web APIs are the defined interfaces through which interactions happen between an enterprise and applications that use its assets, which also is a Service Level Agreement (SLA) to specify the functional provider and expose the service path or URL for its API users. An API approach is an architectural approach that revolves around providing a program interface to a set of services to different applications serving different types of consumers.[29] When used in the context of web development, an API is typically defined as a set of specifications, such as Hypertext Transfer Protocol (HTTP) request messages, along with a definition of the structure of response messages, usually in an Extensible Markup Language (XML) or JavaScript Object Notation (JSON) format. An example might be a shipping company API that can be added to an eCommerce-focused website to facilitate ordering shipping services and automatically include current shipping rates, without the site developer having to enter the shipper's rate table into a web database. 
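To make the shipping example above concrete, a request to such a web API and its response might look roughly like the following. The endpoint, host, parameters, and fields are entirely hypothetical; real carrier APIs define their own paths and schemas, but the overall shape (an HTTP request returning structured JSON) is typical:

GET /v1/rates?origin=NL&destination=US&weight_kg=2.5 HTTP/1.1
Host: api.example-carrier.com
Accept: application/json

HTTP/1.1 200 OK
Content-Type: application/json

{
  "currency": "EUR",
  "options": [
    { "service": "standard", "price": 12.50, "days": 5 },
    { "service": "express",  "price": 29.00, "days": 2 }
  ]
}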
While "web API" historically has been virtually synonymous with web service, the recent trend (so-called Web 2.0) has been moving away from Simple Object Access Protocol (SOAP) based web services and service-oriented architecture (SOA) towards more direct representational state transfer (REST) style web resources and resource-oriented architecture (ROA).[30] Part of this trend is related to the Semantic Web movement toward Resource Description Framework (RDF), a concept to promote web-based ontology engineering technologies. Web APIs allow the combination of multiple APIs into new applications known as mashups.[31] In the social media space, web APIs have allowed web communities to facilitate sharing content and data between communities and applications. In this way, content that is created in one place dynamically can be posted and updated to multiple locations on the web.[32] For example, Twitter's REST API allows developers to access core Twitter data and the Search API provides methods for developers to interact with Twitter Search and trends data.[33] Design[edit] The design of an API has significant impact on its usage.[4] The principle of information hiding describes the role of programming interfaces as enabling modular programming by hiding the implementation details of the modules so that users of modules need not understand the complexities inside the modules.[34] Thus, the design of an API attempts to provide only the tools a user would expect.[4] The design of programming interfaces represents an important part of software architecture, the organization of a complex piece of software.[35] Release policies[edit] APIs are one of the more common ways technology companies integrate. Those that provide and use APIs are considered as being members of a business ecosystem.[36] The main policies for releasing an API are:[37] Private: The API is for internal company use only. Partner: Only specific business partners can use the API. For example, vehicle for hire companies such as Uber and Lyft allow approved third-party developers to directly order rides from within their apps. This allows the companies to exercise quality control by curating which apps have access to the API, and provides them with an additional revenue stream.[38] Public: The API is available for use by the public. For example, Microsoft makes the Windows API public, and Apple releases its API Cocoa, so that software can be written for their platforms. Not all public APIs are generally accessible by everybody. For example, Internet service providers like Cloudflare or Voxility, use RESTful APIs to allow customers and resellers access to their infrastructure information, DDoS stats, network performance or dashboard controls.[39] Access to such APIs is granted either by “API tokens”, or customer status validations.[40] Public API implications[edit] An important factor when an API becomes public is its "interface stability". Changes to the API—for example adding new parameters to a function call—could break compatibility with the clients that depend on that API.[41] When parts of a publicly presented API are subject to change and thus not stable, such parts of a particular API should be documented explicitly as "unstable". For example, in the Google Guava library, the parts that are considered unstable, and that might change soon, are marked with the Java annotation @Beta.[42] A public API can sometimes declare parts of itself as deprecated or rescinded. 
This usually means that part of the API should be considered a candidate for being removed, or modified in a backward incompatible way. Therefore, these changes allow developers to transition away from parts of the API that will be removed or not supported in the future.[43] Client code may contain innovative or opportunistic usages that were not intended by the API designers. In other words, for a library with a significant user base, when an element becomes part of the public API, it may be used in diverse ways.[44] On February 19, 2020, Akamai published their annual “State of the Internet” report, showcasing the growing trend of cybercriminals targeting public API platforms at financial services worldwide. From December 2017 through November 2019, Akamai witnessed 85.42 billion credential violation attacks. About 20%, or 16.55 billion, were against hostnames defined as API endpoints. Of these, 473.5 million have targeted financial services sector organizations.[45] Documentation[edit] API documentation describes what services an API offers and how to use those services, aiming to cover everything a client would need to know for practical purposes. Documentation is crucial for the development and maintenance of applications using the API.[46] API documentation is traditionally found in documentation files but can also be found in social media such as blogs, forums, and Q&A websites.[47] Traditional documentation files are often presented via a documentation system, such as Javadoc or Pydoc, that has a consistent appearance and structure. However, the types of content included in the documentation differ from API to API.[48] In the interest of clarity, API documentation may include a description of classes and methods in the API as well as "typical usage scenarios, code snippets, design rationales, performance discussions, and contracts", but implementation details of the API services themselves are usually omitted. Restrictions and limitations on how the API can be used are also covered by the documentation. For instance, documentation for an API function could note that its parameters cannot be null, or that the function itself is not thread safe.[49] Because API documentation tends to be comprehensive, it is a challenge for writers to keep the documentation updated and for users to read it carefully, potentially yielding bugs.[41] API documentation can be enriched with metadata information like Java annotations. This metadata can be used by the compiler, tools, and by the run-time environment to implement custom behaviors or custom handling.[50] It is possible to generate API documentation in a data-driven manner. By observing many programs that use a given API, it is possible to infer the typical usages, as well as the required contracts and directives.[51] Then, templates can be used to generate natural language from the mined data. Dispute over copyright protection for APIs[edit] Main article: Oracle America, Inc. v. Google, Inc. In 2010, Oracle Corporation sued Google for having distributed a new implementation of Java embedded in the Android operating system.[52] Google had not acquired any permission to reproduce the Java API, although permission had been given to the similar OpenJDK project. Judge William Alsup ruled in the Oracle v.
Google case that APIs cannot be copyrighted in the U.S. and that a victory for Oracle would have widely expanded copyright protection to a "functional set of symbols" and allowed the copyrighting of simple software commands: To accept Oracle's claim would be to allow anyone to copyright one version of code to carry out a system of commands and thereby bar all others from writing its different versions to carry out all or part of the same commands.[53][54] In 2014, however, Alsup's ruling was overturned on appeal to the Court of Appeals for the Federal Circuit, though the question of whether such use of APIs constitutes fair use was left unresolved.[55][56] In 2016, following a two-week trial, a jury determined that Google's reimplementation of the Java API constituted fair use, but Oracle vowed to appeal the decision.[57] Oracle won on its appeal, with the Court of Appeals for the Federal Circuit ruling that Google's use of the APIs did not qualify for fair use.[58] In 2019, Google appealed to the Supreme Court of the United States over both the copyrightability and fair use rulings, and the Supreme Court granted review.[59] Due to the COVID-19 pandemic, the oral hearings in the case were delayed until October 2020.[60] Examples[edit] Main category: Application programming interfaces ASPI for SCSI device interfacing Cocoa and Carbon for the Macintosh DirectX for Microsoft Windows EHLLAPI Java APIs ODBC for Microsoft Windows OpenAL cross-platform sound API OpenCL cross-platform API for general-purpose computing for CPUs & GPUs OpenGL cross-platform graphics API OpenMP API that supports multi-platform shared memory multiprocessing programming in C, C++, and Fortran on many architectures, including Unix and Microsoft Windows platforms. Server Application Programming Interface (SAPI) Simple DirectMedia Layer (SDL) See also[edit] API testing API writer Augmented web Calling convention Common Object Request Broker Architecture (CORBA) Comparison of application virtual machines Document Object Model (DOM) Double-chance function Foreign function interface Front and back ends Interface (computing) Interface control document List of 3D graphics APIs Microservices Name mangling Open API Open Service Interface Definitions Parsing Plugin RAML (software) Software development kit (SDK) Web API Web content vendor XPCOM References[edit] ^ "What is an API". Hubspire. ^ Fisher, Sharon (1989). "OS/2 EE to Get 3270 Interface Early". Google Books. ^ a b Lane, Kin (October 10, 2019). "Intro to APIs: History of APIs". Postman. Retrieved September 18, 2020. When you hear the acronym “API” or its expanded version “Application Programming Interface,” it is almost always in reference to our modern approach, in that we use HTTP to provide access to machine readable data in a JSON or XML format, often simply referred to as “web APIs.” APIs have been around almost as long as computing, but modern web APIs began taking shape in the early 2000s. ^ a b c Clarke, Steven (2004). "Measuring API Usability". Dr. Dobb's. Retrieved 29 July 2016. ^ a b Database architectures—a feasibility workshop (Report). Washington D.C.: U.S. Department of Commerce, National Bureau of Standards. April 1981. pp. 45–47. hdl:2027/mdp.39015077587742. LCCN 81600004. NBS special publication 500-76. Retrieved September 18, 2020. ^ a b c Bloch, Joshua (August 8, 2018). A Brief, Opinionated History of the API (Speech). QCon. San Francisco: InfoQ. Retrieved September 18, 2020. ^ a b Cotton, Ira W.; Greatorex, Frank S. (December 1968).
"Data structures and techniques for remote computer graphics". AFIPS '68: Proceedings of the December 9-11, 1968, Fall Joint Computer Conference. AFIPS 1968 Fall Joint Computer Conference. I. San Francisco, California: Association for Computing Machinery. pp. 533–544. doi:10.1145/1476589.1476661. ISBN 978-1450378994. OCLC 1175621908. ^ "application program interface". Oxford English Dictionary (Online ed.). Oxford University Press. (Subscription or participating institution membership required.) ^ Date, C. J. (July 18, 2019). E. F. Codd and Relational Theory: A Detailed Review and Analysis of Codd's Major Database Writings. p. 135. ISBN 978-1684705276. ^ Date, C. J.; Codd, E. F. (January 1975). "The relational and network approaches: Comparison of the application programming interfaces". In Randall Rustin (ed.). Proceedings of 1974 ACM-SIGMOD Workshop on Data Description, Access and Control. SIGMOD Workshop 1974. 2. Ann Arbor, Michigan: Association for Computing Machinery. pp. 83–113. doi:10.1145/800297.811532. ISBN 978-1450374187. OCLC 1175623233. ^ Carl, Malamud (1990). Analyzing Novell Networks. Van Nostrand Reinhold. p. 294. ISBN 978-0442003647. ^ Fielding, Roy (2000). Architectural Styles and the Design of Network-based Software Architectures (PhD). Retrieved September 18, 2020. ^ Dotsika, Fefie (August 2010). "Semantic APIs: Scaling up towards the Semantic Web". International Journal of Information Management. 30 (4): 335–342. doi:10.1016/j.ijinfomgt.2009.12.003. ^ Odersky, Martin; Spoon, Lex; Venners, Bill (10 December 2008). "Combining Scala and Java". www.artima.com. Retrieved 29 July 2016. ^ de Figueiredo, Luiz Henrique; Ierusalimschy, Roberto; Filho, Waldemar Celes. "The design and implementation of a language for extending applications". TeCGraf Grupo de Tecnologia Em Computacao Grafica. CiteSeerX 10.1.1.47.5194. S2CID 59833827. Retrieved 29 July 2016. ^ Sintes, Tony (13 July 2001). "Just what is the Java API anyway?". JavaWorld. Retrieved 2020-07-18. ^ Emery, David. "Standards, APIs, Interfaces and Bindings". Acm.org. Archived from the original on 2015-01-16. Retrieved 2016-08-08. ^ "F2PY.org". F2PY.org. Retrieved 2011-12-18. ^ Fowler, Martin. "Inversion Of Control". ^ Fayad, Mohamed. "Object-Oriented Application Frameworks". ^ Lewine, Donald A. (1991). POSIX Programmer's Guide. O'Reilly & Associates, Inc. p. 1. ISBN 9780937175736. Retrieved 2 August 2016. ^ West, Joel; Dedrick, Jason (2001). "Open source standardization: the rise of Linux in the network era" (PDF). Knowledge, Technology & Policy. 14 (2): 88–112. Retrieved 2 August 2016. ^ Microsoft (October 2001). "Support for Windows XP". Microsoft. p. 4. Archived from the original on 2009-09-26. ^ "LSB Introduction". Linux Foundation. 21 June 2012. Retrieved 2015-03-27. ^ Stoughton, Nick (April 2005). "Update on Standards" (PDF). USENIX. Retrieved 2009-06-04. ^ Bierhoff, Kevin (23 April 2009). "API Protocol Compliance in Object-Oriented Software" (PDF). CMU Institute for Software Research. Retrieved 29 July 2016. ^ Wilson, M. Jeff (10 November 2000). "Get smart with proxies and RMI". JavaWorld. Retrieved 2020-07-18. ^ Henning, Michi; Vinoski, Steve (1999). Advanced CORBA Programming with C++. Addison-Wesley. ISBN 978-0201379273. Retrieved 16 June 2015. ^ "API-fication" (PDF download). www.hcltech.com. August 2014. ^ Benslimane, Djamal; Schahram Dustdar; Amit Sheth (2008). "Services Mashups: The New Generation of Web Applications". IEEE Internet Computing, vol. 12, no. 5. Institute of Electrical and Electronics Engineers. 
pp. 13–15. Archived from the original on 2011-09-28. Retrieved 2019-10-01. ^ Niccolai, James (2008-04-23), "So What Is an Enterprise Mashup, Anyway?", PC World ^ Parr, Ben. "The Evolution of the Social Media API". Mashable. Retrieved 26 July 2016. ^ "GET trends/place". developer.twitter.com. Retrieved 2020-04-30. ^ Parnas, D.L. (1972). "On the Criteria To Be Used in Decomposing Systems into Modules" (PDF). Communications of the ACM. 15 (12): 1053–1058. doi:10.1145/361598.361623. S2CID 53856438. ^ Garlan, David; Shaw, Mary (January 1994). "An Introduction to Software Architecture" (PDF). Advances in Software Engineering and Knowledge Engineering. 1. Retrieved 8 August 2016. ^ de Ternay, Guerric (Oct 10, 2015). "Business Ecosystem: Creating an Economic Moat". BoostCompanies. Retrieved 2016-02-01. ^ Boyd, Mark (2014-02-21). "Private, Partner or Public: Which API Strategy Is Best for Business?". ProgrammableWeb. Retrieved 2 August 2016. ^ Weissbrot, Alison (7 July 2016). "Car Service APIs Are Everywhere, But What's In It For Partner Apps?". AdExchanger. ^ "Cloudflare API v4 Documentation". cloudflare. 25 February 2020. Retrieved 27 February 2020. ^ Liew, Zell (17 January 2018). "Car Service APIs Are Everywhere, But What's In It For Partner Apps". Smashing Magazine. Retrieved 27 February 2020. ^ a b Shi, Lin; Zhong, Hao; Xie, Tao; Li, Mingshu (2011). An Empirical Study on Evolution of API Documentation. International Conference on Fundamental Approaches to Software Engineering. Lecture Notes in Computer Science. 6603. pp. 416–431. doi:10.1007/978-3-642-19811-3_29. ISBN 978-3-642-19810-6. Retrieved 22 July 2016. ^ "guava-libraries - Guava: Google Core Libraries for Java 1.6+ - Google Project Hosting". 2014-02-04. Retrieved 2014-02-11. ^ Oracle. "How and When to Deprecate APIs". Java SE Documentation. Retrieved 2 August 2016. ^ Mendez, Diego; Baudry, Benoit; Monperrus, Martin (2013). "Empirical evidence of large-scale diversity in API usage of object-oriented software". 2013 IEEE 13th International Working Conference on Source Code Analysis and Manipulation (SCAM). pp. 43–52. arXiv:1307.4062. doi:10.1109/SCAM.2013.6648183. ISBN 978-1-4673-5739-5. S2CID 6890739. ^ Takanashi, Dean (19 February 2020). "Akamai: Cybercriminals are attacking APIs at financial services firms". Venture Beat. Retrieved 27 February 2020. ^ Dekel, Uri; Herbsleb, James D. (May 2009). "Improving API Documentation Usability with Knowledge Pushing". Institute for Software Research, School of Computer Science. CiteSeerX 10.1.1.446.4214. ^ Parnin, Chris; Treude, Cristoph (May 2011). "Measuring API Documentation on the Web". Web2SE: 25–30. doi:10.1145/1984701.1984706. ISBN 9781450305952. S2CID 17751901. Retrieved 22 July 2016. ^ Maalej, Waleed; Robillard, Martin P. (April 2012). "Patterns of Knowledge in API Reference Documentation" (PDF). IEEE Transactions on Software Engineering. Retrieved 22 July 2016. ^ Monperrus, Martin; Eichberg, Michael; Tekes, Elif; Mezini, Mira (3 December 2011). "What should developers be aware of? An empirical study on the directives of API documentation". Empirical Software Engineering. 17 (6): 703–737. arXiv:1205.6363. doi:10.1007/s10664-011-9186-4. S2CID 8174618. ^ "Annotations". Sun Microsystems. Archived from the original on 2011-09-25. Retrieved 2011-09-30.. ^ Bruch, Marcel; Mezini, Mira; Monperrus, Martin (2010). "Mining subclassing directives to improve framework reuse". 2010 7th IEEE Working Conference on Mining Software Repositories (MSR 2010). pp. 141–150. CiteSeerX 10.1.1.434.15. 
doi:10.1109/msr.2010.5463347. ISBN 978-1-4244-6802-7. S2CID 1026918. ^ "Oracle and the End of Programming As We Know It". DrDobbs. 2012-05-01. Retrieved 2012-05-09. ^ "APIs Can't be Copyrighted Says Judge in Oracle Case". TGDaily. 2012-06-01. Retrieved 2012-12-06. ^ "Oracle America, Inc. vs. Google Inc" (PDF). Wired. 2012-05-31. Retrieved 2013-09-22. ^ "Oracle Am., Inc. v. Google Inc., No. 13-1021, Fed. Cir. 2014". ^ Rosenblatt, Seth (May 9, 2014). "Court sides with Oracle over Android in Java patent appeal". CNET. Retrieved 2014-05-10. ^ "Google beats Oracle—Android makes "fair use" of Java APIs". Ars Technica. 2016-05-26. Retrieved 2016-07-28. ^ Decker, Susan (March 27, 2018). "Oracle Wins Revival of Billion-Dollar Case Against Google". Bloomberg Businessweek. Retrieved March 27, 2018. ^ Lee, Timothy (January 25, 2019). "Google asks Supreme Court to overrule disastrous ruling on API copyrights". Ars Technica. Retrieved February 8, 2019. ^ vkimber (2020-09-28). "Google LLC v. Oracle America, Inc". LII / Legal Information Institute. Retrieved 2021-03-06. Further reading[edit] Taina Bucher (16 November 2013). "Objects of Intense Feeling: The Case of the Twitter API". Computational Culture (3). ISSN 2047-2390. Argues that "APIs are far from neutral tools" and form a key part of contemporary programming, understood as a fundamental part of culture. What is an API? - in the U.S. supreme court opinion, Google v. Oracle 2021, pp.3-7 - "For each task, there is computer code; API (also known as Application Program Interface) is the method for calling that 'computer code' (instruction - like a recipe - rather than cooking instruction, this is machine instruction) to be carry out"
en-wikipedia-org-359 ---- Half-life - Wikipedia

Half-life

This article is about the scientific and mathematical concept. For the video game, see Half-Life (video game). For other uses, see Half-Life (disambiguation).

Number of half-lives elapsed   Fraction remaining   Percentage remaining
0        1/1       100
1        1/2       50
2        1/4       25
3        1/8       12.5
4        1/16      6.25
5        1/32      3.125
6        1/64      1.5625
7        1/128     0.78125
...      ...       ...
n        1/2^n     100/2^n

Half-life (symbol t1/2) is the time required for a quantity to reduce to half of its initial value. The term is commonly used in nuclear physics to describe how quickly unstable atoms undergo radioactive decay or how long stable atoms survive. The term is also used more generally to characterize any type of exponential or non-exponential decay. For example, the medical sciences refer to the biological half-life of drugs and other chemicals in the human body. The converse of half-life is doubling time. The original term, half-life period, dating to Ernest Rutherford's discovery of the principle in 1907, was shortened to half-life in the early 1950s.[1] Rutherford applied the principle of a radioactive element's half-life to studies of age determination of rocks by measuring the decay period of radium to lead-206. Half-life is constant over the lifetime of an exponentially decaying quantity, and it is a characteristic unit for the exponential decay equation. The accompanying table shows the reduction of a quantity as a function of the number of half-lives elapsed.
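The table above is simply N(t)/N0 = (1/2)^n. A minimal sketch in Python (not part of the original article; the function names are illustrative) that reproduces the percentage column and evaluates the remaining fraction after an arbitrary elapsed time; the carbon-14 half-life below is the commonly cited value of about 5730 years.

```python
def remaining_fraction(n_half_lives: float) -> float:
    """Fraction of the original quantity left after n half-lives: (1/2)**n."""
    return 0.5 ** n_half_lives

def remaining_after_time(t: float, t_half: float) -> float:
    """Fraction left after elapsed time t, given half-life t_half (same units)."""
    return 0.5 ** (t / t_half)

if __name__ == "__main__":
    # Reproduce the percentage column of the table above.
    for n in range(8):
        print(n, f"{100 * remaining_fraction(n):.5g}%")
    # Example: two carbon-14 half-lives leave a quarter of the original amount.
    print(remaining_after_time(11460, 5730))  # -> 0.25
```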
Probabilistic nature

[Figure: simulation of many identical atoms undergoing radioactive decay, starting with either 4 atoms per box (left) or 400 (right); the number at the top shows how many half-lives have elapsed. With more atoms, the overall decay is more regular and more predictable, a consequence of the law of large numbers.]

A half-life usually describes the decay of discrete entities, such as radioactive atoms. In that case, it does not work to use the definition that states "half-life is the time required for exactly half of the entities to decay". For example, if there is just one radioactive atom, and its half-life is one second, there will not be "half of an atom" left after one second. Instead, the half-life is defined in terms of probability: "Half-life is the time required for exactly half of the entities to decay on average". In other words, the probability of a radioactive atom decaying within its half-life is 50%.[2] For example, the simulation shown in the figure involves many identical atoms undergoing radioactive decay. Note that after one half-life there are not exactly one-half of the atoms remaining, only approximately, because of the random variation in the process. Nevertheless, when there are many identical atoms decaying (right boxes), the law of large numbers suggests that it is a very good approximation to say that half of the atoms remain after one half-life. Various simple exercises can demonstrate probabilistic decay, for example involving flipping coins or running a statistical computer program.[3][4][5]

Formulas for half-life in exponential decay

Main article: Exponential decay

An exponential decay can be described by any of the following three equivalent formulas:[6]:109-112

N(t) = N_0 \left(\frac{1}{2}\right)^{t/t_{1/2}}
N(t) = N_0 e^{-t/\tau}
N(t) = N_0 e^{-\lambda t}

where

N_0 is the initial quantity of the substance that will decay (this quantity may be measured in grams, moles, number of atoms, etc.),
N(t) is the quantity that still remains and has not yet decayed after a time t,
t_{1/2} is the half-life of the decaying quantity,
\tau is a positive number called the mean lifetime of the decaying quantity,
\lambda is a positive number called the decay constant of the decaying quantity.

The three parameters t_{1/2}, \tau, and \lambda are all directly related in the following way:

t_{1/2} = \frac{\ln(2)}{\lambda} = \tau \ln(2)

where ln(2) is the natural logarithm of 2 (approximately 0.693).[6]:112

Half-life and reaction orders

The value of the half-life depends on the reaction order:

Zero-order kinetics: The rate of this kind of reaction does not depend on the substrate concentration. The rate law of zero-order kinetics is

[A] = [A]_0 - kt

To find the half-life, we substitute [A]_0/2 for the concentration and solve for the time.
Doing so gives the half-life of a zero-order reaction:

t_{1/2} = \frac{[A]_0}{2k}

This formula shows that the half-life of a zero-order reaction depends on both the initial concentration and the rate constant.

First-order kinetics: In first-order reactions, the concentration of the reactant decreases as time progresses until it reaches zero, and the half-life is constant, independent of concentration. The time for [A] to decrease from [A]_0 to ½[A]_0 in a first-order reaction is given by the following equation:

k t_{1/2} = -\ln\left(\frac{\tfrac{1}{2}[A]_0}{[A]_0}\right) = -\ln\frac{1}{2} = \ln 2

For a first-order reaction, the half-life of a reactant is independent of its initial concentration. Therefore, if the concentration of A at some arbitrary stage of the reaction is [A], then it will have fallen to ½[A] after a further interval of (ln 2)/k. Hence, the half-life of a first-order reaction is given as:

t_{1/2} = \frac{\ln 2}{k}

The half-life of a first-order reaction is independent of its initial concentration and depends solely on the reaction rate constant, k.

Second-order kinetics: In second-order reactions, the concentration of the reactant decreases according to

\frac{1}{[A]} = kt + \frac{1}{[A]_0}

Substituting [A]_0/2 for [A] and solving for the time gives the half-life of reactant A:

t_{1/2} = \frac{1}{k[A]_0}

Thus the half-life of a second-order reaction depends on the initial concentration as well as the rate constant.

Decay by two or more processes

Some quantities decay by two exponential-decay processes simultaneously. In this case, the actual half-life T_{1/2} can be related to the half-lives t_1 and t_2 that the quantity would have if each of the decay processes acted in isolation:

\frac{1}{T_{1/2}} = \frac{1}{t_1} + \frac{1}{t_2}

For three or more processes, the analogous formula is:

\frac{1}{T_{1/2}} = \frac{1}{t_1} + \frac{1}{t_2} + \frac{1}{t_3} + \cdots

For a proof of these formulas, see Exponential decay § Decay by two or more processes.

Examples

[Figure: half-life demonstrated using dice in a classroom experiment.]

Further information: Exponential decay § Applications and examples

There is a half-life describing any exponential-decay process. For example:

As noted above, in radioactive decay the half-life is the length of time after which there is a 50% chance that an atom will have undergone nuclear decay. It varies depending on the atom type and isotope, and is usually determined experimentally. See List of nuclides.
The current flowing through an RC circuit or RL circuit decays with a half-life of ln(2)RC or ln(2)L/R, respectively. For this example the term half time tends to be used rather than "half-life", but they mean the same thing.
In a chemical reaction, the half-life of a species is the time it takes for the concentration of that substance to fall to half of its initial value. In a first-order reaction the half-life of the reactant is ln(2)/λ, where λ is the reaction rate constant.
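A short numerical check of the relationships above (not from the article; variable names and the example values are illustrative): the three exponential-decay forms agree once t1/2 = ln(2)/λ = τ·ln(2), and combining two decay channels obeys the reciprocal-sum rule.

```python
import math

lam = 0.35                     # decay constant λ (arbitrary example value)
tau = 1 / lam                  # mean lifetime
t_half = math.log(2) / lam     # half-life

def n_half_life(t, n0=1.0):    # N0 * (1/2)**(t / t_half)
    return n0 * 0.5 ** (t / t_half)

def n_mean_life(t, n0=1.0):    # N0 * exp(-t / tau)
    return n0 * math.exp(-t / tau)

def n_decay_const(t, n0=1.0):  # N0 * exp(-lambda * t)
    return n0 * math.exp(-lam * t)

# The three equivalent formulas give the same remaining quantity.
for t in (0.0, 1.0, 5.0, 10.0):
    assert math.isclose(n_half_life(t), n_mean_life(t))
    assert math.isclose(n_mean_life(t), n_decay_const(t))

# Two simultaneous decay channels: 1/T_half = 1/t1 + 1/t2,
# which is the same as adding the decay constants, λ_total = λ1 + λ2.
t1, t2 = 3.0, 7.0
combined = 1 / (1 / t1 + 1 / t2)
lam_total = math.log(2) / t1 + math.log(2) / t2
assert math.isclose(combined, math.log(2) / lam_total)
print(t_half, combined)
```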
In non-exponential decay

The term "half-life" is almost exclusively used for decay processes that are exponential (such as radioactive decay or the other examples above), or approximately exponential (such as biological half-life discussed below). In a decay process that is not even close to exponential, the half-life will change dramatically while the decay is happening. In this situation it is generally uncommon to talk about half-life in the first place, but sometimes people will describe the decay in terms of its "first half-life", "second half-life", etc., where the first half-life is defined as the time required for decay from the initial value to 50%, the second half-life is from 50% to 25%, and so on.[7]

In biology and pharmacology

See also: Biological half-life

A biological half-life or elimination half-life is the time it takes for a substance (drug, radioactive nuclide, or other) to lose one-half of its pharmacologic, physiologic, or radiological activity. In a medical context, the half-life may also describe the time that it takes for the concentration of a substance in blood plasma to reach one-half of its steady-state value (the "plasma half-life"). The relationship between the biological and plasma half-lives of a substance can be complex, due to factors including accumulation in tissues, active metabolites, and receptor interactions.[8]

While a radioactive isotope decays almost perfectly according to so-called "first order kinetics", where the rate constant is a fixed number, the elimination of a substance from a living organism usually follows more complex chemical kinetics. For example, the biological half-life of water in a human being is about 9 to 10 days,[9] though this can be altered by behavior and other conditions. The biological half-life of caesium in human beings is between one and four months.

The concept of a half-life has also been utilized for pesticides in plants,[10] and certain authors maintain that pesticide risk and impact assessment models rely on and are sensitive to information describing dissipation from plants.[11]

In epidemiology, the concept of half-life can refer to the length of time for the number of incident cases in a disease outbreak to drop by half, particularly if the dynamics of the outbreak can be modeled exponentially.[12][13]

See also

Half time (physics)
List of radioactive nuclides by half-life
Mean lifetime
Median lethal dose

References

^ John Ayto, 20th Century Words (1989), Cambridge University Press. ^ Muller, Richard A. (April 12, 2010). Physics and Technology for Future Presidents. Princeton University Press. pp. 128-129. ISBN 9780691135045. ^ Chivers, Sidney (March 16, 2003). "Re: What happens during half-lifes [sic] when there is only one atom left?". MADSCI.org. ^ "Radioactive-Decay Model". Exploratorium.edu. Retrieved 2012-04-25. ^ Wallin, John (September 1996). "Assignment #2: Data, Simulations, and Analytic Science in Decay". Astro.GLU.edu. Archived from the original on 2011-09-29. ^ a b Rösch, Frank (September 12, 2014). Nuclear- and Radiochemistry: Introduction. 1. Walter de Gruyter. ISBN 978-3-11-022191-6. ^ Jonathan Crowe; Tony Bradshaw (2014). Chemistry for the Biosciences: The Essential Concepts. p. 568. ISBN 9780199662883. ^ Lin VW; Cardenas DD (2003). Spinal cord medicine. Demos Medical Publishing, LLC. p. 251. ISBN 978-1-888799-61-3. ^ Pang, Xiao-Feng (2014). Water: Molecular Structure and Properties. New Jersey: World Scientific. p. 451. ISBN 9789814440424.
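To make the "first half-life / second half-life" idea concrete, the second-order kinetics described earlier is a non-exponential case in which each successive half-life is twice as long as the previous one. A small sketch under that assumption (not part of the article; names are illustrative):

```python
def conc_second_order(t, a0, k):
    """Concentration for second-order decay: 1/[A] = k*t + 1/[A]0."""
    return 1.0 / (k * t + 1.0 / a0)

def successive_half_lives(a0, k, n=3):
    """Durations needed to go from a0 -> a0/2 -> a0/4 -> ..., one per step."""
    times, t, target = [], 0.0, a0 / 2
    while len(times) < n:
        # Solve 1/target = k*t_new + 1/a0 for the time at which [A] == target.
        t_new = (1.0 / target - 1.0 / a0) / k
        times.append(t_new - t)
        t, target = t_new, target / 2
    return times

# Sanity check of the closed form: at t = 2 with a0 = 1, k = 0.5, [A] = 0.5.
assert abs(conc_second_order(2.0, 1.0, 0.5) - 0.5) < 1e-12

print(successive_half_lives(a0=1.0, k=0.5))
# -> [2.0, 4.0, 8.0]: each "half-life" doubles, unlike exponential decay.
```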
^ Australian Pesticides and Veterinary Medicines Authority (31 March 2015). "Tebufenozide in the product Mimic 700 WP Insecticide, Mimic 240 SC Insecticide". Australian Government. Retrieved 30 April 2018. ^ Fantke, Peter; Gillespie, Brenda W.; Juraske, Ronnie; Jolliet, Olivier (11 July 2014). "Estimating Half-Lives for Pesticide Dissipation from Plants". Environmental Science & Technology. 48 (15): 8588-8602. Bibcode:2014EnST...48.8588F. doi:10.1021/es500434p. PMID 24968074. ^ Balkew, Teshome Mogessie (December 2010). The SIR Model When S(t) is a Multi-Exponential Function (Thesis). East Tennessee State University. ^ Ireland, MW, ed. (1928). The Medical Department of the United States Army in the World War, vol. IX: Communicable and Other Diseases. Washington, U.S.: U.S. Government Printing Office. pp. 116-7.

External links

Welcome to Nucleonica, Nucleonica.net (archived 2017)
wiki: Decay Engine, Nucleonica.net (archived 2016)
System Dynamics – Time Constants, Bucknell.edu
Researchers Nikhef and UvA measure slowest radioactive decay ever: Xe-124 with 18 billion trillion years
en-wikipedia-org-5784 ---- Unix philosophy - Wikipedia

Unix philosophy

[Photo: Ken Thompson and Dennis Ritchie, key proponents of the Unix philosophy.]

The Unix philosophy, originated by Ken Thompson, is a set of cultural norms and philosophical approaches to minimalist, modular software development. It is based on the experience of leading developers of the Unix operating system. Early Unix developers were important in bringing the concepts of modularity and reusability into software engineering practice, spawning a "software tools" movement. Over time, the leading developers of Unix (and programs that ran on it) established a set of cultural norms for developing software; these norms became as important and influential as the technology of Unix itself; this has been termed the "Unix philosophy." The Unix philosophy emphasizes building simple, short, clear, modular, and extensible code that can be easily maintained and repurposed by developers other than its creators. The Unix philosophy favors composability as opposed to monolithic design.

Origin

The Unix philosophy is documented by Doug McIlroy[1] in the Bell System Technical Journal from 1978:[2] Make each program do one thing well. To do a new job, build afresh rather than complicate old programs by adding new "features". Expect the output of every program to become the input to another, as yet unknown, program. Don't clutter output with extraneous information. Avoid stringently columnar or binary input formats. Don't insist on interactive input. Design and build software, even operating systems, to be tried early, ideally within weeks. Don't hesitate to throw away the clumsy parts and rebuild them.
Use tools in preference to unskilled help to lighten a programming task, even if you have to detour to build the tools and expect to throw some of them out after you've finished using them. It was later summarized by Peter H. Salus in A Quarter-Century of Unix (1994):[1] Write programs that do one thing and do it well. Write programs to work together. Write programs to handle text streams, because that is a universal interface. In their award-winning Unix paper of 1974[citation needed], Ritchie and Thompson quote the following design considerations:[3] Make it easy to write, test, and run programs. Interactive use instead of batch processing. Economy and elegance of design due to size constraints ("salvation through suffering"). Self-supporting system: all Unix software is maintained under Unix. The whole philosophy of UNIX seems to stay out of assembler. — Michael Sean Mahoney[4] The UNIX Programming Environment[edit] In their preface to the 1984 book, The UNIX Programming Environment, Brian Kernighan and Rob Pike, both from Bell Labs, give a brief description of the Unix design and the Unix philosophy:[5] Rob Pike, co-author of The UNIX Programming Environment Even though the UNIX system introduces a number of innovative programs and techniques, no single program or idea makes it work well. Instead, what makes it effective is the approach to programming, a philosophy of using the computer. Although that philosophy can't be written down in a single sentence, at its heart is the idea that the power of a system comes more from the relationships among programs than from the programs themselves. Many UNIX programs do quite trivial things in isolation, but, combined with other programs, become general and useful tools. The authors further write that their goal for this book is "to communicate the UNIX programming philosophy."[5] Program Design in the UNIX Environment[edit] Brian Kernighan has written at length about the Unix philosophy In October 1984, Brian Kernighan and Rob Pike published a paper called Program Design in the UNIX Environment. In this paper, they criticize the accretion of program options and features found in some newer Unix systems such as 4.2BSD and System V, and explain the Unix philosophy of software tools, each performing one general function:[6] Much of the power of the UNIX operating system comes from a style of program design that makes programs easy to use and, more important, easy to combine with other programs. This style has been called the use of software tools, and depends more on how the programs fit into the programming environment and how they can be used with other programs than on how they are designed internally. [...] This style was based on the use of tools: using programs separately or in combination to get a job done, rather than doing it by hand, by monolithic self-sufficient subsystems, or by special-purpose, one-time programs. The authors contrast Unix tools such as cat, with larger program suites used by other systems.[6] The design of cat is typical of most UNIX programs: it implements one simple but general function that can be used in many different applications (including many not envisioned by the original author). Other commands are used for other functions. For example, there are separate commands for file system tasks like renaming files, deleting them, or telling how big they are. Other systems instead lump these into a single "file system" command with an internal structure and command language of its own. 
(The PIP file copy program found on operating systems like CP/M or RSX-11 is an example.) That approach is not necessarily worse or better, but it is certainly against the UNIX philosophy. Doug McIlroy on Unix programming[edit] Doug McIlroy (left) with Dennis Ritchie McIlroy, then head of the Bell Labs Computing Sciences Research Center, and inventor of the Unix pipe,[7] summarized the Unix philosophy as follows:[1] This is the Unix philosophy: Write programs that do one thing and do it well. Write programs to work together. Write programs to handle text streams, because that is a universal interface. Beyond these statements, he has also emphasized simplicity and minimalism in Unix programming:[1] The notion of "intricate and beautiful complexities" is almost an oxymoron. Unix programmers vie with each other for "simple and beautiful" honors — a point that's implicit in these rules, but is well worth making overt. Conversely, McIlroy has criticized modern Linux as having software bloat, remarking that, "adoring admirers have fed Linux goodies to a disheartening state of obesity."[8] He contrasts this with the earlier approach taken at Bell Labs when developing and revising Research Unix:[9] Everything was small... and my heart sinks for Linux when I see the size of it. [...] The manual page, which really used to be a manual page, is now a small volume, with a thousand options... We used to sit around in the Unix Room saying, 'What can we throw out? Why is there this option?' It's often because there is some deficiency in the basic design — you didn't really hit the right design point. Instead of adding an option, think about what was forcing you to add that option. Do One Thing and Do It Well[edit] As stated by McIlroy, and generally accepted throughout the Unix community, Unix programs have always been expected to follow the concept of DOTADIW, or "Do One Thing And Do It Well." There are limited sources for the acronym DOTADIW on the Internet, but it is discussed at length during the development and packaging of new operating systems, especially in the Linux community. Patrick Volkerding, the project lead of Slackware Linux, invoked this design principle in a criticism of the systemd architecture, stating that, "attempting to control services, sockets, devices, mounts, etc., all within one daemon flies in the face of the Unix concept of doing one thing and doing it well."[10] Eric Raymond's 17 Unix Rules[edit] In his book The Art of Unix Programming that was first published in 2003,[11] Eric S. Raymond, an American programmer and open source advocate, summarizes the Unix philosophy as KISS Principle of "Keep it Simple, Stupid."[12] He provides a series of design rules:[1] Build modular programs Write readable programs Use composition Separate mechanisms from policy Write simple programs Write small programs Write transparent programs Write robust programs Make data complicated when required, not the program Build on potential users' expected knowledge Avoid unnecessary output Write programs which fail in a way that is easy to diagnose Value developer time over machine time Write abstract programs that generate code instead of writing code by hand Prototype software before polishing it Write flexible and open programs Make the program and protocols extensible. 
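The "do one thing, communicate via text streams" idea that runs through McIlroy's and Raymond's rules is easy to demonstrate outside the shell as well. Below is a minimal sketch in Python (not from the book or the article; the script name and pipeline are hypothetical) of a line-oriented filter that reads standard input and writes standard output, so it composes with other tools in a pipeline.

```python
#!/usr/bin/env python3
"""A tiny line-oriented filter: lower-case each input line and drop blanks.

Because it only reads stdin and writes stdout, it can be combined with
other programs, e.g.:  cat words.txt | python3 lowercase.py | sort | uniq -c
"""
import sys

def main() -> int:
    for line in sys.stdin:
        line = line.strip().lower()
        if line:                      # don't clutter output with blank lines
            sys.stdout.write(line + "\n")
    return 0

if __name__ == "__main__":
    sys.exit(main())
```

The design choice is the point: the filter does one narrow job and leaves sorting and counting to programs that already exist.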
Mike Gancarz: The UNIX Philosophy[edit] In 1994, Mike Gancarz (a member of the team that designed the X Window System), drew on his own experience with Unix, as well as discussions with fellow programmers and people in other fields who depended on Unix, to produce The UNIX Philosophy which sums it up in nine paramount precepts: Small is beautiful. Make each program do one thing well. Build a prototype as soon as possible. Choose portability over efficiency. Store data in flat text files. Use software leverage to your advantage. Use shell scripts to increase leverage and portability. Avoid captive user interfaces. Make every program a filter. "Worse is better"[edit] Main article: Worse is better Richard P. Gabriel suggests that a key advantage of Unix was that it embodied a design philosophy he termed "worse is better", in which simplicity of both the interface and the implementation are more important than any other attributes of the system—including correctness, consistency, and completeness. Gabriel argues that this design style has key evolutionary advantages, though he questions the quality of some results. For example, in the early days Unix used a monolithic kernel (which means that user processes carried out kernel system calls all on the user stack). If a signal was delivered to a process while it was blocked on a long-term I/O in the kernel, then what should be done? Should the signal be delayed, possibly for a long time (maybe indefinitely) while the I/O completed? The signal handler could not be executed when the process was in kernel mode, with sensitive kernel data on the stack. Should the kernel back-out the system call, and store it, for replay and restart later, assuming that the signal handler completes successfully? In these cases Ken Thompson and Dennis Ritchie favored simplicity over perfection. The Unix system would occasionally return early from a system call with an error stating that it had done nothing—the "Interrupted System Call", or an error number 4 (EINTR) in today's systems. Of course the call had been aborted in order to call the signal handler. This could only happen for a handful of long-running system calls such as read(), write(), open(), and select(). On the plus side, this made the I/O system many times simpler to design and understand. The vast majority of user programs were never affected because they did not handle or experience signals other than SIGINT and would die right away if one was raised. For the few other programs—things like shells or text editors that respond to job control key presses—small wrappers could be added to system calls so as to retry the call right away if this EINTR error was raised. Thus, the problem was solved in a simple manner. Criticism[edit] In a 1981 article entitled "The truth about Unix: The user interface is horrid"[13] published in Datamation, Don Norman criticized the design philosophy of Unix for its lack of concern for the user interface. Writing from his background in cognitive science and from the perspective of the then-current philosophy of cognitive engineering,[4] he focused on how end-users comprehend and form a personal cognitive model of systems—or, in the case of Unix, fail to understand, with the result that disastrous mistakes (such as losing an hour's worth of work) are all too easy. 
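The small EINTR retry wrappers mentioned in the "Worse is better" section above can be sketched in a few lines. This is an illustration only, not code from any Unix source; note that since PEP 475 modern Python already retries most interrupted system calls itself, so the pattern is shown purely to make the idea concrete.

```python
import errno

def retry_on_eintr(syscall, *args, **kwargs):
    """Call syscall(...) and simply retry if it was interrupted by a signal.

    This mirrors the wrappers described above: instead of the kernel
    restarting the call transparently, the caller loops on EINTR.
    """
    while True:
        try:
            return syscall(*args, **kwargs)
        except InterruptedError:        # OSError subclass with errno == EINTR
            continue
        except OSError as exc:
            if exc.errno == errno.EINTR:
                continue
            raise

# Hypothetical usage: data = retry_on_eintr(os.read, fd, 4096)
```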
See also[edit] Cognitive engineering Unix architecture Minimalism (computing) Software engineering KISS principle Hacker ethic List of software development philosophies Everything is a file Worse is better Notes[edit] ^ a b c d e Raymond, Eric S. (2004). "Basics of the Unix Philosophy". The Art of Unix Programming. Addison-Wesley Professional (published 2003-09-23). ISBN 0-13-142901-9. Retrieved 2016-11-01. ^ Doug McIlroy, E. N. Pinson, B. A. Tague (8 July 1978). "Unix Time-Sharing System: Foreword". The Bell System Technical Journal. Bell Laboratories: 1902–1903.CS1 maint: multiple names: authors list (link) ^ Dennis Ritchie; Ken Thompson (1974), "The UNIX time-sharing system" (PDF), Communications of the ACM, 17 (7): 365–375, doi:10.1145/361011.361061, S2CID 53235982 ^ a b "An Oral History of Unix". Princeton University History of Science. ^ a b Kernighan, Brian W. Pike, Rob. The UNIX Programming Environment. 1984. viii ^ a b Rob Pike; Brian W. Kernighan (October 1984). "Program Design in the UNIX Environment" (PDF). ^ Dennis Ritchie (1984), "The Evolution of the UNIX Time-Sharing System" (PDF), AT&T Bell Laboratories Technical Journal, 63 (8): 1577–1593, doi:10.1002/j.1538-7305.1984.tb00054.x ^ Douglas McIlroy. "Remarks for Japan Prize award ceremony for Dennis Ritchie, May 19, 2011, Murray Hill, NJ" (PDF). Retrieved 2014-06-19. ^ Bill McGonigle. "Ancestry of Linux — How the Fun Began (2005)". Retrieved 2014-06-19. ^ "Interview with Patrick Volkerding of Slackware". linuxquestions.org. 2012-06-07. Retrieved 2015-10-24. ^ Raymond, Eric (2003-09-19). The Art of Unix Programming. Addison-Wesley. ISBN 0-13-142901-9. Retrieved 2009-02-09. ^ Raymond, Eric (2003-09-19). "The Unix Philosophy in One Lesson". The Art of Unix Programming. Addison-Wesley. ISBN 0-13-142901-9. Retrieved 2009-02-09. ^ Norman, Don (1981). "The truth about Unix: The user interface is horrid" (PDF). Datamation. 27 (12). References[edit] The Unix Programming Environment by Brian Kernighan and Rob Pike, 1984 Program Design in the UNIX Environment – The paper by Pike and Kernighan that preceded the book. Notes on Programming in C, Rob Pike, September 21, 1989 A Quarter Century of Unix, Peter H. Salus, Addison-Wesley, May 31, 1994 ( ISBN 0-201-54777-5) Philosophy — from The Art of Unix Programming, Eric S. Raymond, Addison-Wesley, September 17, 2003 ( ISBN 0-13-142901-9) Final Report of the Multics Kernel Design Project by M. D. Schroeder, D. D. Clark, J. H. Saltzer, and D. H. Wells, 1977. 
The UNIX Philosophy, Mike Gancarz, ISBN 1-55558-123-4

External links

Basics of the Unix Philosophy – by Catb.org
The Unix Philosophy: A Brief Introduction – by The Linux Information Project (LINFO)
Why the Unix Philosophy still matters

en-wikipedia-org-5954 ---- Communal work - Wikipedia

Communal work

See also: Mutual aid (organization theory)

[Photo: a quilting bee is a form of communal work.]

Communal work is a gathering for mutually accomplishing a task or for communal fundraising. Communal work provided manual labour to others, especially for major projects such as barn raising, bees of various kinds, log rolling, and subbotniks. Different words have been used to describe such gatherings. They are less common in today's more individualistic cultures, where there is less reliance on others than in preindustrial agricultural and hunter-gatherer societies.

Major jobs such as clearing a field of timber or raising a barn needed many workers. It was often both a social and a utilitarian event. Jobs like corn husking or sewing could be done as a group to allow socializing during an otherwise tedious chore. Such gatherings often included refreshments and entertainment. In more modern societies, the word "bee" has also been used for some time for other social gatherings without communal work, for example for competitions such as a spelling bee.
In specific cultures

Africa

East Africa

Harambee (Swahili pronunciation: [haramˈbeː]) is an East African (Kenyan, Tanzanian and Ugandan) tradition of community self-help events, e.g. fundraising or development activities. Harambee literally means "all pull together" in Swahili, and is also the official motto of Kenya and appears on its coat of arms.

Rwanda

Umuganda is a national day of community service held on the last Saturday of each month in Rwanda. In 2009, umuganda was institutionalized in the country. It is translated as "coming together in common purpose to achieve an outcome."[1]

Ethiopia

A social event is held to build a house or a farm, especially for the elderly and widows who do not have the physical strength to do it on their own.

Sudan

Naffīr (نفير) is an Arabic word used in parts of Sudan (including Kordofan, Darfur, parts of the Nuba mountains and Kassala) to describe particular types of communal work undertakings. Naffīr has been described as including a group recruited through family networks, in-laws and village neighbors for some particular purpose, which then disbands when that purpose is fulfilled.[2] An alternative, more recent, definition describes naffīr as "to bring someone together from the neighborhood or community to carry out a certain project, such as building a house or providing help during the harvest season."[3]

The word may be related to the standard Arabic word nafr (نفر), which describes a band, party, group or troop, typically mobilized for war. In standard Arabic, a naffīr āmm (نفير عام) refers to a general call to arms.[4] Naffīr has also been used in a military context in Sudan. For example, the term was used to refer to the an-Naffīr ash-Sha'abī or "People's Militias" that operated in the central Nuba Mountains region in the early 1990s.[5]

Asia

Indonesia

[Photo: the traditional communal slametan unggahan ceremony of Bonokeling village, Banyumas, Central Java, in which the participants literally perform the notion of gotong royong (carrying together).]

Gotong-royong is a conception of sociality familiar to Indonesia and, arguably in wider extent, also Malaysia, Brunei and Singapore. In Indonesian languages, especially Javanese, gotong means "carrying a burden using one's shoulder", while royong means "together" or "communally"; thus the combined phrase gotong royong can be translated literally as "joint bearing of burdens". It translates to working together, helping each other, or mutual assistance.[6] A village's public facilities, such as irrigation works, streets, and houses of worship (the village's mosque, church or pura), are usually constructed in a gotong royong way, with the funds and materials collected mutually. Traditional communal events, such as the slametan ceremony, are also usually held in this gotong royong ethos of communal work, in which each member of society is expected to contribute and participate in the endeavour harmoniously.
The phrase has been translated into English in many ways, most of which hearken to the conception of reciprocity or mutual aid. For M. Nasroen, gotong royong forms one of the core tenets of Indonesian philosophy. Paul Michael Taylor and Lorraine V. Aragon state that "gotong royong [is] cooperation among many people to attain a shared goal."[7] Background[edit] In a 1983 essay Clifford Geertz points to the importance of gotong royong in Indonesian life: An enormous inventory of highly specific and often quite intricate institutions for effecting the cooperation in work, politics, and personal relations alike, vaguely gathered under culturally charged and fairly well indefinable value-images--rukun ("mutual adjustment"), gotong royong ("joint bearing of burdens"), tolong-menolong ("reciprocal assistance")--governs social interaction with a force as sovereign as it is subdued.[8] Anthropologist Robert A. Hahn writes: Javanese culture is stratified by social class and by level of adherence to Islam. ...Traditional Javanese culture does not emphasize material wealth. ...There is respect for those who contribute to the general village welfare over personal gain. And the spirit of gotong royong, or volunteerism, is promoted as a cultural value.[9] Gotong royong has long functioned as the scale of the village, as a moral conception of the political economy. Pottier records the impact of the Green Revolution in Java: "Before the GR, 'Java' had relatively 'open' markets, in which many local people were rewarded in kind. With the GR, rural labour markets began to foster 'exclusionary practices'... This resulted in a general loss of rights, especially secure harvesting rights within a context of mutual cooperation, known as gotong royong." Citing Ann Laura Stoler's ethnography from the 1970s, Pottier writes that cash was replacing exchange, that old patron-client ties were breaking, and that social relations were becoming characterized more by employer-employee qualities.[10] Political appropriation[edit] For Prime Minister Muhammad Natsir, gotong royong was an ethical principle of sociality, in marked contrast to both the "unchecked" feudalism of the West, and the social anomie of capitalism.[11] Ideas of reciprocity, ancient and deeply enmeshed aspects of kampung morality, were seized upon by postcolonial politicians. John Sidel writes: "Ironically, national-level politicians drew on " village conceptions of adat and gotong royong. They drew on notions "of traditional community to justify new forms of authoritarian rule."[12] During the presidency of Sukarno, the idea of gotong royong was officially elevated to a central tenet of Indonesian life. For Sukarno, the new nation was to be synonymous with gotong royong. He said that the Pancasila could be reduced to the idea of gotong royong. On June 1, 1945, Sukarno said of the Pancasila: The first two principles, nationalism and internationalism, can be pressed to one, which I used to call 'socionationalism.' Similarly with democracy 'which is not the democracy of the West' together with social justice for all can be pressed down to one, and called socio democracy. Finally – belief in God. 'And so what originally was five has become three: socio nationalism, socio democracy, and belief in God.' 'If I press down five to get three, and three to get one, then I have a genuine Indonesian term – GOTONG ROYONG [mutual co-operation]. The state of Indonesia which we are to establish should be a state of mutual co-operation. How fine that is ! 
A Gotong Royong state![13] In 1960, Sukarno dissolved the elected parliament and implemented the Gotong Royong Parliament. Governor of Jakarta, Ali Sadikin, spoke of a desire to reinvigorate urban areas with village sociality, with gotong royong.[14] Suharto's New Order was characterized by much discourse about tradition. During the New Order, Siskamling harnessed the idea of gotong royong. By the 1990s, if not sooner, gotong royong had been "fossilized" by New Order sloganeering.[15] During the presidency of Megawati, the Gotong Royong Cabinet was implemented. It lasted from 2001 to 2004. Philippines[edit] Members of the community volunteering to move a house to new location. Though no longer commonplace, this method of moving houses has become a traditional symbol for the concept of bayanihan. Bayanihan (pronounced [ˌbajɐˈniːhan]) is a Filipino term taken from the word bayan, referring to a nation, country,[16] town or community. The whole term bayanihan refers to a spirit of communal unity or effort to achieve a particular objective. It is focused on doing things as a group as it relates to one's community.[17] Etymology[edit] The origin of the term bayanihan can be traced from a common tradition in Philippine towns where community members volunteer to help a family move to a new place by volunteering to transport the house to a specific location. The process, which is the classic illustration of the term,[18] involves literally carrying the house to its new location. This is done by putting bamboo poles forming a strong frame to lift the stilts from the ground and carrying the whole house with the men positioned at the ends of each pole. The tradition also features a small fiesta hosted by the family to express gratitude to the volunteers. Usage[edit] In society, bayanihan has been adopted as a term to refer to a local civil effort to resolve national issues. One of the first groups to use the term is the Bayanihan Philippine National Folk Dance Company which travels to countries to perform traditional folk dances of the country with the objective of promoting Philippine culture. The concept is related to damayán ("to help one another"). In computing, the term bayanihan has evolved into many meanings and incorporated as codenames to projects that depict the spirit of cooperative effort involving a community of members. An example of these projects is the Bayanihan Linux project which is a Philippines-based desktop-focused Linux distribution. In ethnic newspapers, Bayanihan News is the name of community newspaper for the Philippine community in Australia. It is in English and in Filipino with regular news and articles on Philippine current events and history. It was established in October 1998 in Sydney, Australia. Turkey[edit] Imece is a name given for a traditional Turkish village-scale collaboration. For example, if a couple is getting married, villagers participate in the overall organization of the ceremony including but not limited to preparation of the celebration venue, food, building and settlement of the new house for the newly weds. Tasks are often distributed according to expertise and has no central authority to govern activities. Europe[edit] Finland & the Baltics[edit] A tent is being raised in a talkoot for midsummer in Ylimuonio in 2005. Talkoot (from Finnish: talkoo, almost always used in plural, talkoot) is a Finnish expression for a gathering of friends and neighbors organized to accomplish a task. 
The word is borrowed into Finland Swedish as talko[19] but is unknown to most Swedes. However, cognate terms and in approximately the same context are used in Estonia (talgu(d)),[20] Latvia (noun talka, verb talkot), and Lithuania (noun talka, verb talkauti). It is the cultural equivalent of communal work in a village community, although adapted to the conditions of Finland, where most families traditionally lived in isolated farms often miles away from the nearest village. A talkoot is by definition voluntary, and the work is unpaid. The voluntary nature might be imaginary due to social pressure, especially in small communities, and one's honour and reputation may be severely damaged by non-attendance or laziness. The task of the talkoot may be something that is a common concern for the good of the group, or it may be to help someone with a task that exceeds his or her own capacity. For instance, elderly neighbours or relatives can need help if their house or garden is damaged by a storm, or siblings can agree to arrange a party for a parent's special birthday as a talkoot. Typically, club houses, landings, churches, and parish halls can be repaired through a talkoot, or environmental tasks for the neighborhood are undertaken. The parents of pre-school children may gather to improve the playground, or the tenants of a tenement house may arrange a talkoot to put their garden in order for the summer or winter. A person unable to contribute with actual work may contribute food for the talkoot party, or act as a baby-sitter. When a talkoot is for the benefit of an individual, he or she is the host of the talkoot party and is obliged to offer food and drink. Russia, Ukraine, Belarus, Poland[edit] Toloka[21] or Taloka (also pomoch) in Russian (Toloka in Ukrainian and Talaka in Belarusian, Tłoka in Polish) is the form of communal voluntary work. Neighbours gathered together to build something or to harvest crops. Hungary[edit] Kaláka (ˈkɒlaːkɒ) is the Hungarian word for working together for a common goal. This can be building a house or doing agricultural activities together, or any other communal work on a volunteer basis. Ireland[edit] Meitheal (Irish pronunciation: [ˈmɛhəl]) is the Irish word for a work team, gang, or party and denotes the co-operative labour system in Ireland where groups of neighbours help each other in turn with farming work such as harvesting crops.[22] The term is used in various writings of Irish language authors. It can convey the idea of community spirit in which neighbours respond to each other's needs. In modern use for example, a meitheal could be a party of neighbours and friends invited to help decorate a house in exchange for food and drink, or in scouting, where volunteer campsite wardens maintain campsites around Ireland. Asturias[edit] Andecha (from Latin indictia 'announcement) it is a voluntary, unpaid and punctual aid to help a neighbor carry out agricultural tasks (cutting hay, taking out potatoes, building a barn, picking up the apple to make cider, etc.). The work is rewarded with a snack or a small party and the tacit commitment that the person assisted will come with his family to the call of another andecha when another neighbor requests it.[23] Very similar to Irish Meitheal. It should not be confused with another Asturian collective work institution, the Sestaferia. 
In this, the provision of the service is mandatory (under penalty of fine) and is not called a to help of an individual but the provision of common services (repair of bridges, cleaning of roads, etc.) Norway[edit] Dugnad is a Norwegian term for voluntary work done together with other people.[24] It's a very core phenomenon for Norwegians, and the word was voted as the Norwegian word of the year 2004 in the TV programme «Typisk norsk» ("Typically Norwegian"). Participation in a dugnad is often followed by a common meal, served by the host, or consisting of various dishes brought by the participants, thus the meal is also a dugnad. In urban areas, the dugnad is most commonly identified with outdoor spring cleaning and gardening in housing co-operatives. Dugnader (dugnads) are also a phenomenon in kindergartens and elementary schools to make the area nice, clean and safe and to do decorating etc. such as painting and other types of maintenance. Dugnader occur more widely in remote and rural areas. Neighbours sometimes participate during house or garage building, and organizations (such as kindergartens or non-profit organisations) may arrange annual dugnader. The Norwegian word "dugnadsånd" is translatable to the spirit of will to work together for a better community. Many Norwegians will describe this as a typical Norwegian thing to have. The word dugnad was used to unite the people of Norway to cooperate and shut down public activities to fight the pandemic of 2020.[25] Serbia[edit] Moba (Serbian: моба) is an old Serbian tradition of communal self-help in villages. It was a request for help in labor-intensive activities, like harvesting wheat, building a church or repairing village roads. Work was entirely voluntary and no compensation, except possibly meals for workers, was expected. North America[edit] Cherokee[edit] Gadugi (Cherokee:ᎦᏚᎩ) is a term used in the Cherokee language which means "working together"[26] or "cooperative labor" within a community.[27] Historically, the word referred to a labor gang of men and/or women working together for projects such as harvesting crops or tending to gardens of elderly or infirm tribal members.[28] The word Gadugi was derived from the Cherokee word for "bread", which is Gadu. In recent years the Cherokee Nation tribal government has promoted the concept of Gadugi. The GaDuGi Health Center is a tribally run clinic in Tahlequah, Oklahoma, the capital of the Cherokee Nation. The concept is becoming more widely known. In Lawrence, Kansas, in 2004 the rape crisis center affiliated with the University of Kansas, adopted the name, the Gadugi Safe Center, for its programs to aid all people affected by sexual violence.[26] Gadugi is the name of a font included with Microsoft Windows 8 that includes support for the Cherokee language along with other languages of the Americas such as Inuktitut. Latin America[edit] Mexico[edit] Tequio [es]. Zapoteca Quechua[edit] Main article: Minka (communal work) Mink'a or minka (Quechua[29][30] or Kichwa,[31] Hispanicized minca, minga) is a type of traditional communal work in the Andes in favor of the whole community (ayllu). Participants are traditionally paid in kind. Mink'a is still practiced in indigenous communities in Peru, Ecuador, Bolivia, and Chile, especially among the Quechua and the Aymara. 
Chile[edit] In rural southern Chile, labor reciprocity and communal work remained common through the twentieth century and into the twenty-first, particularly in rural communities on the Archipelago of Chiloé.[32] Referred to as "mingas," the practice can be traced to pre-contact Mapuche and Huilliche traditions of communal labor.[33] In Chiloé, mingas took the form either of días cambiados (tit for tat exchanges of labor between neighbors) or large-scale work parties hosted by a particular family, accompanied by food and drink, and often lasting several days.[34] Most agricultural work and community construction projects were done by way of mingas. The tiradura de casa ("house pull") involved moving a house from one location to another. Panama In rural Panama, especially in the Azuero peninsula region and its diaspora, it is common to hold a 'junta' party[35] as a communal labor event. Most commonly these events are used to harvest rice, clear brush with machetes, or to build houses. Workers generally work without compensation but are provided with meals and often alcoholic beverages such as fermented chicha fuerte and seco. Bee[edit] History[edit] This use of the word bee is common in literature describing colonial North America. One of the earliest documented occurrences is found in the Boston Gazette for October 16, 1769, where it is reported that "Last Thursday about twenty young Ladies met at the house of Mr. L. on purpose for a Spinning Match; (or what is called in the Country a Bee)."[36] It was, and continues to be, commonly used in Australia also, most often as "working bee".[37][38] In literature[edit] Uses in literature include: "There was a bee to-day for making a road up to the church." – Anne Langton "The cellar … was dug by a bee in a single day." – S. G. Goodrich "I made a bee; that is, I collected as many of the most expert and able-bodied of the settlers to assist at the raising." – John Galt, Lawrie Todd (1830) "When one of the pioneers had chopped down timber and got it in shape, he would make a logging bee, get two or three gallons of New England Rum, and the next day the logs were in great heaps. ... after a while there was a carding and jutting mill started where people got their wool made into rolls, when the women spun and wove it. Sometimes the women would have spinning bees. They would put rolls among their neighbors and on a certain day they would all bring in their yarn and at night the boys would come with their fiddles for a dance. ... He never took a salary, had a farm of 80 acres [324,000 m2] and the church helped him get his wood (cut and drawn by a bee), and also his hay." – James Slocum "'I am in a regular quandary', said the mistress of the house, when the meal was about half over. Mr. Van Brunt looked up for an instant, and asked, 'What about?' 'Why, how I am ever going to do to get those apples and sausage-meat done. If I go to doing 'em myself I shall about get through by spring.' 'Why don't you make a bee?' said Mr. Van Brunt." – Susan Warner, The Wide, Wide World (1850)[39] "She is gone out with Cousin Deborah to an apple bee." – Charlotte Mary Yonge, The Trial; or More Links of the Daisy Chain (1864) Etymology[edit] The origin of the word "bee" in this sense is debated. Because it describes people working together in a social group, a common belief is that it derives from the insect of the same name and similar social behavior. 
This derivation appears in, for example, the Oxford English Dictionary.[40] Other dictionaries, however, regard this as a false etymology, and suggest that the word comes from dialectal been or bean (meaning "help given by neighbors"), derived in turn from Middle English bene (meaning "prayer", "boon" and "extra service by a tenant to his lord").[41][42] See also[edit] Sharing References[edit] ^ "Umuganda". Rwanda Governance Board. Retrieved 2019-12-03. ^ Manger, Leif O. (1987). "Communal Labour in the Sudan". University of Bergen: 7. Cite journal requires |journal= (help) ^ 'Conceptual analysis of volunteer', 2004 ^ Wehr, Hans. A Dictionary of Modern Written Arabic, Arabic - English. Beirut: Librarie Du Liban. ^ Kevlihan, Rob (2005). "Developing Connectors in Humanitarian Emergencies: Is it possible in Sudan?" (PDF). Humanitarian Exchange. 30. ^ "Gotong Royong - KBBI Daring". kbbi.kemdikbud.go.id. Retrieved 2020-05-23. ^ Taylor, Paul Michael; Aragon, Lorraine V (1991). Beyond the Java Sea: Art of Indonesia's Outer Islands. Abrams. p. 10. ISBN 0-8109-3112-5. ^ Geertz, Clifford. "Local Knowledge: Fact and Law in Comparative Perspective," pp. 167–234 in Geertz Local Knowledge: Further Essays in Interpretive Anthropology, NY: Basic Books. 1983. ^ Hahn, Robert A. (1999). Anthropology in Public Health: Bridging Differences in Culture and Society. Oxford, UK: Oxford University Press. ^ Pottier, Johan (1999). Anthropology of Food: The Social Dynamics of Food Security. Oxford, UK: Blackwell. p. 84. ^ Natsir, Muhammad. "The Indonesian Revolution." In Kurzman, Charles Liberal Islam: A Sourcebook, p. 62. Oxford, UK: Oxford University Press. 1998. ^ Sidel, John Thayer (2006). Riots, Pogroms, Jihad: Religious Violence in Indonesia. Ithaca, NY: Cornell University Press. p. 32. ^ "BUNG KARNO: 6 JUNE - 21 JUNE". Antenna. Retrieved 25 March 2013. ^ Kusno, Abidin (2003). Behind the Postcolonial: Architecture, Urban Space and Political Cultures. NY: Routledge. p. 152. ^ Anderson, Benedict (1990). Language and Power: Exploring Political Cultures in Indonesia. Ithaca, NY: Cornell UP. p. 148. ^ Visser, Wayne; Tolhurst, Nick (2017). The World Guide to CSR: A Country-by-Country Analysis of Corporate Sustainability and Responsibility. Routledge. ISBN 978-1-351-27890-4. Retrieved 9 April 2020. ^ Gripaldo, Rolando M. (2005). Filipino Cultural Traits: Claro R. Ceniza Lectures. CRVP. p. 173. ISBN 978-1-56518-225-7. Retrieved 9 April 2020. ^ Smith, Bradford; Shue, Sylvia; Villarreal, Joseph (1992). Asian and Hispanic philanthropy: sharing and giving money, goods, and services in the Chinese, Japanese, Filipino, Mexican, and Guatemalan communities in the San Francisco Bay Area. University of San Francisco, Institute for Nonprofit Organization Management, College of Professional Studies. p. 113. Retrieved 9 April 2020. ^ Mikael Reuter: En/ett iögonfallande talko? (in Swedish). Retrieved: 2010-10-04. ^ "[EKSS] "Eesti keele seletav sõnaraamat"". eki.ee. ^ "Vasmer's Etymological Dictionary". dic.academic.ru. ^ "Meitheal". Irish Dictionary Online. englishirishdictionary.com. Archived from the original on 10 July 2011. Retrieved 28 March 2013. ^ https://dej.rae.es/lema/andecha ^ Ottar Brox; John M. Bryden; Robert Storey (2006). The political economy of rural development: modernisation without centralisation?. Eburon Uitgeverij B.V. p. 79. ISBN 90-5972-086-5. ^ One Word Spared Norway From COVID-19 Disaster Kelsey L.O. July 20, 2020 ^ a b "GaDuGi SafeCenter's Mission Statement and Vision Statement". GaDuGi SafeCenter. 
Retrieved 25 March 2013. ^ Feeling, Durbin (1975). Cherokee-English Dictionary. Cherokee Nation of Oklahoma. p. 73. ^ Dunaway, Wilma. "The Origin of Gadugi". Cherokee Nation. Retrieved 28 March 2013. ^ Teofilo Laime Ajacopa, Diccionario Bilingüe Iskay simipi yuyayk'ancha, La Paz, 2007 (Quechua-Spanish dictionary) ^ Diccionario Quechua - Español - Quechua, Academía Mayor de la Lengua Quechua, Gobierno Regional Cusco, Cusco 2005 (Quechua-Spanish dictionary) ^ Fabián Potosí C. et al., Ministerio de Educación del Ecuador: Kichwa Yachakukkunapa Shimiyuk Kamu, Runa Shimi - Mishu Shimi, Mishu Shimi - Runa Shimi. Quito (DINEIB, Ecuador) 2009. (Kichwa-Spanish dictionary) ^ Daughters, Anton. "Solidarity and Resistance on the Island of Llingua." Anthropology Now 7:1 pp.1-11 (April 2015) ^ Cárdenas Álvarez, Renato, Daniel Montiel Vera, and Catherine Grace Hall. Los Chonos y los Veliche de Chiloé (Santiago, Chile: Ediciones Olimpho) 1991 ^ Daughters, Anton. "Southern Chile's Archipelago of Chiloé: Shifting Identities in a New Economy." Journal of Latin American and Caribbean Anthropology 21:2 pp.317.335 (July 2016) ^ "Folklore.PanamaTipico.com (English)". folklore.panamatipico.com. Retrieved 2018-10-30. ^ Boston Gazette, October 16, 1769. ^ The Australian Bosses roll up for Tony Abbott's working bee August 11, 2012 Retrieved 3 March 2015. ^ "Brisbane working bee hits streets". abc.net.au. January 15, 2011. Retrieved March 3, 2015. ^ Warner, Susan (1851). The Wide, Wide World. 1. New York: Putnam. p. 277. ^ "bee, n.". Oxford English Dictionary (Online ed.). Oxford University Press. (Subscription or participating institution membership required.) ^ "Bee". Dictionary.com. Retrieved March 3, 2015. ^ "Bee". Merriam-Webster. Retrieved 27 December 2020. Wikimedia Commons has media related to Communal work. 
en-wikipedia-org-699 ---- GatorBox - Wikipedia

The GatorBox is a LocalTalk-to-Ethernet bridge, a router used on Macintosh-based networks to allow AppleTalk communications between clients on LocalTalk and Ethernet physical networks. The GatorSystem software also allowed TCP/IP and DECnet protocols to be carried to LocalTalk-equipped clients via tunneling, providing them with access to these normally Ethernet-only systems. When the GatorBox is running GatorPrint software, computers on the Ethernet network can send print jobs to printers on the LocalTalk network using the 'lpr' print spool command. When the GatorBox is running GatorShare software, computers on the LocalTalk network can access Network File System (NFS) hosts on Ethernet.

Specifications
The original GatorBox (model: 10100) is a desktop model that has a 10 MHz Motorola 68000 CPU, 1 MB RAM, 128k EPROM for boot program storage, 2 kB NVRAM for configuration storage, a LocalTalk Mini-DIN-8 connector, a serial port Mini-DIN-8 connector, a BNC connector, and an AUI connector, and is powered by an external power supply (a 16 VAC, 1 A transformer connected by a 2.5 mm plug). This model requires a software download when it is powered on to be able to operate. The GatorBox CS (model: 10101) is a desktop model that uses an internal power supply (120/240 V, 1.0 A, 50–60 Hz). The GatorMIM CS is a media interface module that fits in a Cabletron Multi-Media Access Center (MMAC). The GatorBox CS/Rack (model: 10104) is a rack-mountable version of the GatorBox CS that uses an internal power supply (120/240 V, 1.0 A, 50–60 Hz).
The GatorStar GXM integrates the GatorMIM CS with a 24 port LocalTalk repeater.[1] The GatorStar GXR integrates the GatorBox CS/Rack with a 24 port LocalTalk repeater.[2] This model does not have a BNC connector and the serial port is a female DE-9 connector. All "CS" models have 2 MB of memory and can boot from images of the software that have been downloaded into the EPROM using the GatorInstaller application.

Software
There are three disks in the GatorBox software package. Note that the content of the disks for an original GatorBox is different from that of the GatorBox CS models.
Configuration - contains GatorKeeper, MacTCP folder and either GatorInstaller (for CS models) or GatorBox TFTP and GatorBox UDP-TFTP (for original GatorBox model)
Application - contains GatorSystem, GatorPrint or GatorShare, which is the software that runs in the GatorBox. The application software for the GatorBox CS product family has a "CS" at the end of the filename. GatorPrint includes GatorSystem functionality. GatorShare includes GatorSystem and GatorPrint functionality.
Network Applications - NCSA Telnet, UnStuffit

Software Requirements
The GatorKeeper 2.0 application requires:
Macintosh System version 6.0.2 up to 7.5.1 and Finder version 6.1 (or later)
MacTCP (not Open Transport)[3]

See also
Kinetics FastPath
Line Printer Daemon protocol – Print Spooling
LocalTalk-to-Ethernet bridge – Other LocalTalk-to-Ethernet bridges/routers
MacIP – TCP/IP Gateway

References
McCoy, Michael (August 1991). Setting Up Your GatorBox - Hardware Installation Guide. Cayman Systems. pp. 1–1, A-1–2. ^ Data Communication Network at the ASRM Facility - See 3.1.9 ^ "Glossary of Macintosh Networking terms - See GatorStar". Archived from the original on 2006-10-03. Retrieved 2007-01-25. ^ Christopher, Mason. "GatorBox Software".

External links
GatorBox CS configuration information
Internet Archive copy of a configuration guide produced by the University of Illinois
Juiced.GS magazine Volume 10, Issue 4 (Dec 2005) contains an article on how to set up a GatorBox for use with an Apple IIgs
Software and scanned manuals for the GatorBox and GatorBox CS
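Returning to the GatorPrint capability described above: a minimal sketch of how a Unix host on the Ethernet side might queue a print job to a LocalTalk printer through the standard BSD lpr command. This is not taken from the GatorBox manual; the queue name "localtalk-lw" is a hypothetical example, and the real name would come from the site's lpd/GatorBox configuration.

import subprocess

def print_via_gatorprint(path: str, queue: str = "localtalk-lw") -> None:
    # Queue a file on an lpd print queue; -P selects the named printer queue.
    subprocess.run(["lpr", "-P", queue, path], check=True)

if __name__ == "__main__":
    print_via_gatorprint("report.ps")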
en-wikipedia-org-7147 ---- Hackathon - Wikipedia

A hackathon (also known as a hack day, hackfest, datathon or codefest; a portmanteau of "hacking marathon") is a design-sprint-like event in which computer programmers and others involved in software development, including graphic designers, interface designers, project managers, domain experts, and others, collaborate intensively on software projects. The goal of a hackathon is to create functioning software or hardware by the end of the event.[1] Hackathons tend to have a specific focus, which can include the programming language used, the operating system, an application, an API, or the subject and the demographic group of the programmers. In other cases, there is no restriction on the type of software being created.

Etymology
The word "hackathon" is a portmanteau of the words "hack" and "marathon", where "hack" is used in the sense of exploratory programming, not its alternate meaning as a reference to breaching computer security. OpenBSD's apparent first use of the term referred to a cryptographic development event held in Calgary on June 4, 1999,[2] where ten developers came together to avoid legal problems caused by export regulations of cryptographic software from the United States. Since then, a further three to five events per year have occurred around the world to advance development, generally on university campuses. For Sun Microsystems, the usage referred to an event at the JavaOne conference from June 15 to June 19, 1999; there John Gage challenged attendees to write a program in Java for the new Palm V using the infrared port to communicate with other Palm users and register it on the Internet. Starting in the mid-to-late 2000s, hackathons became significantly more widespread and began to be increasingly viewed by companies and venture capitalists as a way to quickly develop new software technologies, and to locate new areas for innovation and funding. Some major companies were born from these hackathons, such as GroupMe, which began as a project at a hackathon at the TechCrunch Disrupt 2010 conference; in 2011 it was acquired by Skype for $85 million. The software PhoneGap began as a project at the iPhoneDevCamp (later renamed iOSDevCamp) in 2008;[3] the company whose engineers developed PhoneGap, Nitobi, refocused itself around PhoneGap, and Nitobi was bought by Adobe in 2011 for an undisclosed amount.[4]

Structure
Hackathons typically start with communication via a presentation or a web page from the hosting organization that mentions the objectives, terms, and details of the hackathon. Developers register to participate in the hackathon and are qualified after the organization screens their background and skills. When the hackathon event begins, the participating individuals or teams start their programming work.
The administrator of the hackathon is typically able to answer questions and offer help when issues come up during the event. Hackathons can last from several hours to several days. For hackathons that last 24 hours or longer, especially competitive ones, eating is often informal, with participants often subsisting on food like pizza and energy drinks. Sometimes sleeping is informal as well, with participants sleeping on-site in sleeping bags. At the end of hackathons, there is usually a series of demonstrations in which each group presents its results. To capture the ideas and work in progress, people often post a video of the demonstrations, blog about the results with screenshots and details, share links and progress on social media, suggest a place for the open source code, and generally make it possible for others to share, learn from, and possibly build on the ideas generated and the initial work completed. There is sometimes a contest element as well, in which a panel of judges selects the winning teams and prizes are given. At many hackathons, the judges are made up of organisers and sponsors. At BarCamp-style hackathons that are organised by the development community, such as iOSDevCamp, the judges are usually made up of peers and colleagues in the field. Such prizes are sometimes a substantial amount of money: a social gaming hackathon at the TechCrunch Disrupt conference offered $250,000 in funding to the winners, while a controversial[5] 2013 hackathon run by Salesforce.com had a payout of $1 million to the winners, billed as the largest-ever prize.[6]

Types of hackathons

For an application type
Some hackathons focus on a particular platform such as mobile apps, a desktop operating system, web development or video game development. Mobile app hackathons like Over the Air, held at Phoenix Park, Ireland, can see a large amount of corporate sponsorship and interest.[7][8] Music Hack Day, a hackathon for music-related software and hardware applications, is a popular event, having been held over 30 times around the world since 2009.[9] Music Tech Fest, a three-day interdisciplinary festival for music ideas that brings together musicians with hackers, researchers and industry, also features a hackathon.[10] Similarly, Science Hack Day, a hackathon for making things with science, has been held over 45 times in over 15 countries around the world since 2010.[11] Hackathons have been held to develop applications that run on various mobile device operating systems, such as Android,[12] iOS[13] and MeeGo.[14] Hackathons have also been held to develop video-based applications and computer games.[15] Hackathons where video games are developed are sometimes called game jams. "TV Hackfest" events have been held in both London[16] and San Francisco,[17] focusing mainly on social television and second screen technologies. In TV Hackfests, challenge briefs are typically submitted by content producers and brands, in the form of broadcast industry metadata or video content, while sponsors supply APIs, SDKs and pre-existing open source software code.[18] Hackathons have also been used in the life sciences to advance the informatics infrastructure that supports research.
The Open Bioinformatics Foundation ran two hackathons for its member projects in 2002 and 2003, and since 2010 has held 2-day "codefests" preceding its annual conference.[19] The National Evolutionary Synthesis Center has co-organized and sponsored hackathons for evolutionary bioinformatics since 2006.[20][21] BioHackathon[22] is an annual event that started in 2008 targeted at advancing standards to enable interoperable bioinformatics tools and Web services. Neuroscientists have also used hackathons to bring developers and scientists together to address issues that range from focusing on a specific information system (e.g., Neurosynth Hackathon[23] and the Allen Brain Atlas Hackathon[24]) and providing reserved time for broad scientific inquiry (e.g., Brainhack),[25] to using specific challenges that focus hacking activity (e.g., HBM Hackathon).[26] There has been an emergence of 'datathons' or data-focused hackathons in recent years.[27][28][29] These events challenge data scientists and others to use creativity and data analysis skills and platforms to build, test and explore solutions and dashboards which analyse huge datasets in a limited amount of time. These are increasingly being used to deliver insights in big public and private datasets in various disciplines including business,[30] health care[31][32] news media[33] and for social causes.[34] Using a specific programming language, API, or framework[edit] There have been hackathons devoted to creating applications that use a specific language or framework, like JavaScript,[35] Node.js,[36] HTML5[37] and Ruby on Rails.[38] Some hackathons focus on applications that make use of the application programming interface, or API, from a single company or data source. Open Hack, an event run publicly by Yahoo! since 2006 (originally known as "Hack Day", then "Open Hack Day"), has focused on usage of the Yahoo! API, in addition to APIs of websites owned by Yahoo!, like Flickr.[39] The company's Open Hack India event in 2012 had over 700 attendees.[40] Google has run similar events for their APIs,[41] as has the travel guide company Lonely Planet.[42] The website Foursquare notably held a large, global hackathon in 2011, in which over 500 developers at over 30 sites around the world competed to create applications using the Foursquare API.[43] A second Foursquare hackathon, in 2013, had around 200 developers.[44] The IETF organizes Hackathons for each IETF meetings which are focused on IETF Internet Draft and IETF RFC implementation for better inter-operability and improved Internet Standards.[45] For a cause or purpose[edit] There have been a number of hackathons devoted to improving government, and specifically to the cause of open government.[46] One such event, in 2011, was hosted by the United States Congress.[47] Starting in 2012, NASA has been annually hosting the International Space Apps Challenge. In 2014, the British government and HackerNest ran DementiaHack,[48] the world's first hackathon dedicated to improving the lives of people living with dementia and their caregivers.[49][50] The series continues in 2015, adding the Canadian government and Facebook as major sponsors.[51] The Global Game Jam, the largest video game development hackathon,[52] often includes optional requirements called 'diversifiers'[53] that aim to promote game accessibility and other causes. 
Various hackathons have been held to improve city transit systems.[54] Hackathons aimed at improvements to city local services are increasing, with one of the London Councils (Hackney) creating a number of successful local solutions with a two-day Hackney-thon.[55] There have also been a number of hackathons devoted to improving education, including Education Hack Day[56] and on a smaller scale, looking specifically at the challenges of field work based geography education, the Field Studies Council[57] hosted FSCHackday.[58] Random Hacks of Kindness is another popular hackathon, devoted to disaster management and crisis response.[59] ThePort[60] instead is a hackathon devoted to solving humanitarian, social and public interest challenges. It's hosted by CERN with partners from other non-governmental organizations such as ICRC and UNDP. In March 2020, numerous world-wide initiatives led by entrepreneurs and governmental representatives from European countries resulted in a series of anti-crisis hackathons Hack the Crisis, with first to happen in Estonia,[61] followed up by Poland,[62] Latvia, and Ukraine. As a tribute or a memorial[edit] A number of hackathons around the world have been planned in memory of computer programmer and internet activist Aaron Swartz, who died in 2013.[63][64][65][66] For a demographic group[edit] Some hackathons are intended only for programmers within a certain demographic group, like teenagers, college students, or women.[67] Hackathons at colleges have become increasingly popular, in the United States and elsewhere. These are usually annual or semiannual events that are open to college students at all universities. They are often competitive, with awards provided by the University or programming-related sponsors. Many of them are supported by the organization Major League Hacking, which was founded in 2013 to assist with the running of collegiate hackathons. PennApps at the University of Pennsylvania was the first student-run college hackathon; in 2015 it became the largest college hackathon with its 12th iteration hosting over 2000 people and offering over $60k in prizes.[68][69] The University of Mauritius Computer Club and Cyberstorm.mu organized a Hackathon dubbed "Code Wars" focused on implementing an IETF RFC in Lynx in 2017.[70][71] ShamHacks at Missouri University of Science and Technology is held annually as an outreach activity of the campus's Curtis Laws Wilson Library. ShamHacks 2018[72] focused on problem statements to better quality of life factors for US veterans, by pairing with veteran-owned company sponsors.[73] For internal innovation and motivation[edit] Some companies hold internal hackathons to promote new product innovation by the engineering staff. For example, Facebook's Like button was conceived as part of a hackathon.[74] To connect local tech communities[edit] Some hackathons (such as StartupBus, founded in 2010 in Australia) combine the competitive element with a road trip, to connect local tech communities in multiple cities along the bus routes. This is now taking place across North America, Europe, Africa and Australasia.[75] Code sprints[edit] Not to be confused with Scrum (software development) § Sprint. In some hackathons, all work is on a single application, such as an operating system, programming language, or content management system. 
Such events are often known as "code sprints", and are especially popular for open source software projects, where such events are sometimes the only opportunity for developers to meet face-to-face.[76] Code sprints typically last from one week to three weeks and often take place near conferences that most of the team attend. Unlike other hackathons, these events rarely include a competitive element. The annual hackathon to work on the operating system OpenBSD, held since 1999, is one such event; it may have originated the word "hackathon".[citation needed]

Edit-a-thon
An edit-a-thon (a portmanteau of editing marathon) is an event where editors of online communities such as Wikipedia, OpenStreetMap (also called a "mapathon"), and LocalWiki edit and improve a specific topic or type of content. The events typically include basic editing training for new editors.

Controversies
A team at the September 2013 TechCrunch Disrupt Hackathon presented the TitStare app, which allowed users to post and view pictures of men staring at women's cleavage.[77] TechCrunch issued an apology later that day.[78] A November 2013 hackathon run by Salesforce.com, billed as having the largest-ever grand prize at $1 million, was accused of impropriety after it emerged that the winning entrants, a two-person startup called Upshot, had been developing the technology that they demoed for over a year and that one of the two was a former Salesforce employee.[5] Major League Hacking expelled a pair of hackers from the September 2015 hackathon Hack the North at the University of Waterloo for making jokes that were interpreted as bomb threats, leading many hackers to criticize the organization.[79] As a result of the controversy, Victor Vucicevich resigned from the Hack the North organizing team.[80] Use of hackathon participants as de facto unpaid laborers by some commercial ventures has been criticized as exploitative.[81][82]:193–194

Notable events
MHacks HackMIT Junction (hackathon)

See also
Game Jam Installfest Editathon Charrette Startup Weekend Campus Party

References
^ "Hackathon definition". dictionary.com. ^ "OpenBSD Hackathons". OpenBSD. Retrieved 2015-04-10. ^ PhoneGap: It's Like AIR for the IPhone Archived 2013-03-10 at the Wayback Machine, Dave Johnson, PhoneGap Blog, 18 September 2008 ^ Adobe Acquires Developer Of HTML5 Mobile App Framework PhoneGap Nitobi, Leena Rao, TechCrunch, October 3, 2011 ^ a b Biddle, Sam (November 22, 2013). "The "Biggest Hackathon Prize In History" Was Won By Cheaters". Valleywag. ^ Williams, Alex (November 21, 2013). "Two Harvard University Alum Win Disputed Salesforce $1M Hackathon Prize At Dreamforce [Updated]". TechCrunch. ^ Hackers Get Hired At Bletchley Park Archived 2011-09-26 at the Wayback Machine, HuffPost Tech UK, September 19, 2011 ^ "Mobile App Hackathon - TechVenture 2011". 21 December 2011. Archived from the original on 21 December 2011. Retrieved 16 March 2018. ^ "Music Hack Day homepage". Musichackday.org. Retrieved 2013-10-09. ^ Rich, L. J. (2014-04-20). "Music Hackathon at Music Tech Fest in Boston". BBC News. BBC.com. Retrieved 2015-03-05. ^ "Science Hack Day homepage". Sciencehackday.org. Retrieved 2014-12-09. ^ "Android Hackathon". Android Hackathon. 2010-03-13. Retrieved 2013-10-09. ^ "iOSDevCamp 2011 Hackathon".
Iosdevcamp.org. Retrieved 2013-10-09. ^ "N9 Hackathon" (in German). Metalab.at. Retrieved 2013-10-09. ^ "Nordeus 2011 Game Development Hackathon". Seehub.me. Archived from the original on 2013-10-29. Retrieved 2013-10-09. ^ "TV Hackfest homepage". Hackfest.tv. Retrieved 2013-10-09. ^ "Article on TV Hackfest San Francisco". Techzone360.com. 2012-12-19. Retrieved 2013-10-09. ^ "PDF of Feature article on TV Hackfest in AIB The Channel" (PDF). Archived from the original (PDF) on 2014-02-26. Retrieved 2013-10-09. ^ "OBF Hackathons". Open-bio.org. 2013-03-12. Retrieved 2013-10-09. ^ "NESCent-sponsored Hackathons". Informatics.nescent.org. Retrieved 2013-10-09. ^ T Hill (2007-12-14). "Hilmar Lapp, Sendu Bala, James P. Balhoff, Amy Bouck, Naohisa Goto, Mark Holder, Richard Holland, et al. 2007. "The 2006 NESCent Phyloinformatics Hackathon: A Field Report." Evolutionary Bioinformatics Online 3: 287–296". La-press.com. Retrieved 2013-10-09. ^ "biohackathon.org". biohackathon.org. Retrieved 2013-10-09. ^ "hackathon.neurosynth.org". hackathon.neurosynth.org. Archived from the original on 2013-12-02. Retrieved 2013-10-09. ^ "2012 Allen Brain Atlas Hackathon - Hackathon - Allen Brain Atlas User Community". Community.brain-map.org. 2012-09-04. Archived from the original on 2013-12-02. Retrieved 2013-10-09. ^ "Brainhack.org". Brainhack.org. Retrieved 2013-10-09. ^ "HBM Hackathon - Organization for Human Brain Mapping". Humanbrainmapping.org. Retrieved 2013-10-09. ^ "Datathon 2020 the International Sata Science Hackathon". Data Science Society. Retrieved 16 December 2020. ^ "Datathon 2020". Data Republic. Retrieved 16 December 2020. ^ "WiDS Datathon 2021". Women in Data Science. Retrieved 16 December 2020. ^ "KPMG Datathon Challenge". KPMG Malaysia. ^ PubMed: US National Library of Medicine https://www.ncbi.nlm.nih.gov/pmc/?term=datathon. Retrieved 16 December 2020. Missing or empty |title= (help) ^ Aboab, Jerome; Celi, Leo; Charlton, Peter; Feng, Mengling (6 April 2016). "A "datathon" model to support cross-disciplinary collaboration". Science Translational Medicine. 8 (333): 8. doi:10.1126/scitranslmed.aad9072. PMC 5679209. PMID 27053770. ^ "Hack the News Datathon". Data Science Society. ^ "Datathon for Social Good". Our Community. Retrieved 16 December 2020. ^ DownCityJS, the Providence JavaScript Hackathon Archived 2014-03-25 at the Wayback Machine ^ Knockout, Node. "Node Knockout". www.nodeknockout.com. Retrieved 16 March 2018. ^ HTML5 App Hackathon Archived 2014-03-25 at the Wayback Machine, May 5–6, 2012, Berlin, Germany ^ "Pune Rails Hackathon: July 29-30, 2006". Punehackathon.pbworks.com. Retrieved 2013-10-09. ^ Open! Hack! Day!, Flickr blog, September 3, 2008 ^ Purple in Bangalore – Inside Yahoo! Open Hack India 2012 Archived 2013-10-21 at the Wayback Machine, Pushpalee Johnson, August 11, 2012, YDN Blog ^ "Google Hackathon • Vivacity 2015". Vivacity. 2014-12-25. Archived from the original on 2015-01-26. Retrieved 2015-01-10. ^ "Melbourne Hack Day: List Of Presentations And Winners". Archived from the original on 2011-04-22. ^ The hackathon heard round the world! Archived 2012-03-01 at the Wayback Machine, Foursquare blog, September 20, 2011 ^ If you build it, they will come. Check out all the cool new things you can do with Foursquare! #hackathon Archived 2013-04-29 at the Wayback Machine, Foursquare blog, January 8, 2013 ^ "IETF Hackathon". www.ietf.org. Retrieved 2017-12-18. 
^ Open government hackathons matter, Mark Headd, govfresh, August 24, 2011 ^ In #HackWeTrust - The House of Representatives Opens Its Doors to Transparency Through Technology, Daniel Schuman, Sunlight Foundation blog, December 8, 2011 ^ Toronto dementia hackathon 12-14 September, Dr. John Preece, British Foreign & Commonwealth Office Blogs, August 8, 2014 ^ Toronto hackathon to target dementia challenges with innovative ideas, British High Commission Ottawa, GOV.UK, July 25, 2014 ^ HackerNest hooks up with British Consulate-General Toronto for new DementiaHack, Joseph Czikk, Betakit, August 12, 2014 ^ "DementiaHack - HackerNest". Archived from the original on 2014-12-16. Retrieved 2015-09-03. ^ "About the Global Game Jam". GlobalGameJam. 2013-09-13. Retrieved 19 April 2016. ^ "Global Game Jam Diversifiers". GlobalGameJam. 2014-01-21. Retrieved 19 April 2016. ^ All aboard the transit hackathon express Archived 2012-01-08 at the Wayback Machine, Roberto Rocha, The Gazette, December 16, 2011 ^ "Hackney Hackathon succeeds in new services". 2014-11-20. Retrieved 17 July 2015. ^ "Education Hack Day". Education Hack Day. Retrieved 2013-10-09. ^ Council, Field Studies. "Page Not Found - FSC". www.field-studies-council.org. Retrieved 16 March 2018. Cite uses generic title (help) ^ "fschackday.org". fschackday.org. Retrieved 2013-10-09. ^ NASA, Microsoft, Google Hosting Hackathon, Elizabeth Montalbano, InformationWeek, June 7, 2010 ^ "THE Port". theport.ch. Retrieved 2017-12-13. ^ "Estonia organized a public-private e-hackatlon to hack the crisis". Retrieved 16 December 2020. ^ "Anti-crisis hackers join forces to find COVID-19 solutions". Retrieved 16 December 2020. ^ Rocheleau, Matt. "In Aaron Swartz' memory, hackathons to be held across globe, including at MIT, next month". Boston Globe. Retrieved 17 October 2013. ^ Doctorow, Cory. "Aaron Swartz hackathon". Boing Boing. Retrieved 17 October 2013. ^ Sifry, Micah L. "techPresident". Personal Democracy Media. Retrieved 11 October 2013. ^ "Aaron Swartz Hackathon". Archived from the original on 29 March 2014. Retrieved 30 October 2013. ^ Female Geeks Flex Their Skills At Ladies-Only Hackathon, Jed Lipinski, Fast Company, September 14, 2011 ^ World's largest student hackathon descends on Wells Fargo Center, Philadelphia Business Journal ^ Student computer whizzes compete at PennApps Hackathon, Philly.com ^ "Code Wars". University Of Mauritius Computer Club. 2017-09-13. Retrieved 2017-10-20. ^ "UoM CodeWars 2017 - Real life code implementations ! - Codarren". Codarren. 2017-09-26. Retrieved 2017-10-20. ^ Goetz, Nicole (1 September 2017). "ShamHacks: Missouri S&T hackathon". ShamHacks. Retrieved 4 April 2018. ^ Sheeley, Andrew (15 February 2018). "ShamHacks' first hackathon benefits veterans and students". Phelps County Focus. Retrieved 5 April 2018. ^ "Stay focused and keep hacking". www.facebook.com. Retrieved 16 March 2018. ^ "Local Talent Drives Startup Culture In Tampa Bay". 83Degrees. Retrieved 2017-08-15. ^ A.Sigfridsson, G. Avram, A. Sheehan and D. K. Sullivan "Sprint-driven development: working, learning and the process of enculturation in the PyPy community" in the Proceedings of the Third International Conference on Open Source Systems, Limerick, Ireland, June 11–13, 2007, Springer, pp. 133-146 ^ "Meet 'Titstare,' the Tech World's Latest 'Joke' from the Minds of Brogrammers". The Wire. 2013-09-09. Retrieved 2015-11-09. ^ "An Apology From". TechCrunch. Retrieved 2015-11-09. ^ Mike Swift (2015-09-19). "When Jokes go too Far". 
Major League Hacking. Retrieved 2016-06-06. ^ Victor Vucicevich (2015-09-23). "Leaving Hack the North". Medium. Retrieved 2016-06-06. ^ "Sociologists Examine Hackathons and See Exploitation". Wired. ISSN 1059-1028. Retrieved 2020-11-26. ^ Dariusz Jemielniak; Aleksandra Przegalinska (18 February 2020). Collaborative Society. MIT Press. ISBN 978-0-262-35645-9.

External links
"Media-Making Strategies to Support Community and Learning at Hackathons". MIT Center for Civic Media. June 30, 2014.
"Demystifying the hackathon". Article from McKinsey, October 2015.

en-wikipedia-org-7772 ---- Distributed hash table - Wikipedia

A distributed hash table (DHT) is a distributed system that provides a lookup service similar to a hash table: key-value pairs are stored in a DHT, and any participating node can efficiently retrieve the value associated with a given key. The main advantage of a DHT is that nodes can be added or removed with minimum work around re-distributing keys.
Keys are unique identifiers which map to particular values, which in turn can be anything from addresses, to documents, to arbitrary data.[1] Responsibility for maintaining the mapping from keys to values is distributed among the nodes, in such a way that a change in the set of participants causes a minimal amount of disruption. This allows a DHT to scale to extremely large numbers of nodes and to handle continual node arrivals, departures, and failures. DHTs form an infrastructure that can be used to build more complex services, such as anycast, cooperative web caching, distributed file systems, domain name services, instant messaging, multicast, and also peer-to-peer file sharing and content distribution systems. Notable distributed networks that use DHTs include BitTorrent's distributed tracker, the Coral Content Distribution Network, the Kad network, the Storm botnet, the Tox instant messenger, Freenet, the YaCy search engine, and the InterPlanetary File System. Distributed hash tables Contents 1 History 2 Properties 3 Structure 3.1 Keyspace partitioning 3.1.1 Consistent hashing 3.1.2 Rendezvous hashing 3.1.3 Locality-preserving hashing 3.2 Overlay network 3.3 Algorithms for overlay networks 4 Security 5 Implementations 6 Examples 6.1 DHT protocols and implementations 6.2 Applications using DHTs 7 See also 8 References 9 External links History[edit] DHT research was originally motivated, in part, by peer-to-peer (P2P) systems such as Freenet, Gnutella, BitTorrent and Napster, which took advantage of resources distributed across the Internet to provide a single useful application. In particular, they took advantage of increased bandwidth and hard disk capacity to provide a file-sharing service.[2] These systems differed in how they located the data offered by their peers. Napster, the first large-scale P2P content delivery system, required a central index server: each node, upon joining, would send a list of locally held files to the server, which would perform searches and refer the queries to the nodes that held the results. This central component left the system vulnerable to attacks and lawsuits. Gnutella and similar networks moved to a query flooding model – in essence, each search would result in a message being broadcast to every other machine in the network. While avoiding a single point of failure, this method was significantly less efficient than Napster. Later versions of Gnutella clients moved to a dynamic querying model which vastly improved efficiency.[3] Freenet is fully distributed, but employs a heuristic key-based routing in which each file is associated with a key, and files with similar keys tend to cluster on a similar set of nodes. Queries are likely to be routed through the network to such a cluster without needing to visit many peers.[4] However, Freenet does not guarantee that data will be found. Distributed hash tables use a more structured key-based routing in order to attain both the decentralization of Freenet and Gnutella, and the efficiency and guaranteed results of Napster. One drawback is that, like Freenet, DHTs only directly support exact-match search, rather than keyword search, although Freenet's routing algorithm can be generalized to any key type where a closeness operation can be defined.[5] In 2001, four systems—CAN,[6] Chord,[7] Pastry, and Tapestry—ignited DHTs as a popular research topic. 
A project called the Infrastructure for Resilient Internet Systems (Iris) was funded by a $12 million grant from the United States National Science Foundation in 2002.[8] Researchers included Sylvia Ratnasamy, Ion Stoica, Hari Balakrishnan and Scott Shenker.[9] Outside academia, DHT technology has been adopted as a component of BitTorrent and in the Coral Content Distribution Network.

Properties
DHTs characteristically emphasize the following properties:
Autonomy and decentralization: the nodes collectively form the system without any central coordination.
Fault tolerance: the system should be reliable (in some sense) even with nodes continuously joining, leaving, and failing.[10]
Scalability: the system should function efficiently even with thousands or millions of nodes.
A key technique used to achieve these goals is that any one node needs to coordinate with only a few other nodes in the system – most commonly, O(log n) of the n participants (see below) – so that only a limited amount of work needs to be done for each change in membership. Some DHT designs seek to be secure against malicious participants[11] and to allow participants to remain anonymous, though this is less common than in many other peer-to-peer (especially file sharing) systems; see anonymous P2P. Finally, DHTs must deal with more traditional distributed systems issues such as load balancing, data integrity, and performance (in particular, ensuring that operations such as routing and data storage or retrieval complete quickly).

Structure
The structure of a DHT can be decomposed into several main components.[12][13] The foundation is an abstract keyspace, such as the set of 160-bit strings. A keyspace partitioning scheme splits ownership of this keyspace among the participating nodes. An overlay network then connects the nodes, allowing them to find the owner of any given key in the keyspace. Once these components are in place, a typical use of the DHT for storage and retrieval might proceed as follows. Suppose the keyspace is the set of 160-bit strings. To index a file with a given filename and data in the DHT, the SHA-1 hash of filename is generated, producing a 160-bit key k, and a message put(k, data) is sent to any node participating in the DHT. The message is forwarded from node to node through the overlay network until it reaches the single node responsible for key k as specified by the keyspace partitioning. That node then stores the key and the data. Any other client can then retrieve the contents of the file by again hashing filename to produce k and asking any DHT node to find the data associated with k with a message get(k). The message will again be routed through the overlay to the node responsible for k, which will reply with the stored data. The keyspace partitioning and overlay network components are described below with the goal of capturing the principal ideas common to most DHTs; many designs differ in the details.
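To make the storage-and-retrieval walkthrough above concrete, here is a minimal sketch in Python. It is not the API of any particular DHT implementation: the DHTNode class, its put/get methods, and the single-process stand-in for the overlay are hypothetical; a real node would forward the message through the overlay rather than store everything locally.

import hashlib

def key_for(filename: str) -> int:
    # 160-bit key derived from the filename with SHA-1, as in the walkthrough.
    return int.from_bytes(hashlib.sha1(filename.encode()).digest(), "big")

class DHTNode:
    def __init__(self):
        self.store = {}          # stand-in for "the node responsible for k"

    def put(self, k: int, data: bytes) -> None:
        self.store[k] = data     # a real node forwards when k is outside its range

    def get(self, k: int):
        return self.store.get(k) # a real node routes the lookup through the overlay

node = DHTNode()                 # "any node participating in the DHT"
k = key_for("example.txt")
node.put(k, b"file contents")
assert node.get(key_for("example.txt")) == b"file contents"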
Keyspace partitioning
Most DHTs use some variant of consistent hashing or rendezvous hashing to map keys to nodes. The two algorithms appear to have been devised independently and simultaneously to solve the distributed hash table problem. Both consistent hashing and rendezvous hashing have the essential property that removal or addition of one node changes only the set of keys owned by the nodes with adjacent IDs, and leaves all other nodes unaffected. Contrast this with a traditional hash table, in which addition or removal of one bucket causes nearly the entire keyspace to be remapped. Since any change in ownership typically corresponds to bandwidth-intensive movement of objects stored in the DHT from one node to another, minimizing such reorganization is required to efficiently support high rates of churn (node arrival and failure).

Consistent hashing
Further information: Consistent hashing
Consistent hashing employs a function δ(k1, k2) that defines an abstract notion of the distance between the keys k1 and k2, which is unrelated to geographical distance or network latency. Each node is assigned a single key called its identifier (ID). A node with ID ix owns all the keys km for which ix is the closest ID, measured according to δ(km, ix). For example, the Chord DHT uses consistent hashing, which treats nodes as points on a circle, and δ(k1, k2) is the distance traveling clockwise around the circle from k1 to k2. Thus, the circular keyspace is split into contiguous segments whose endpoints are the node identifiers. If i1 and i2 are two adjacent IDs, with a shorter clockwise distance from i1 to i2, then the node with ID i2 owns all the keys that fall between i1 and i2.

Rendezvous hashing
Further information: Rendezvous hashing
In rendezvous hashing, also called highest random weight (HRW) hashing, all clients use the same hash function h() (chosen ahead of time) to associate a key with one of the n available servers. Each client has the same list of identifiers {S1, S2, ..., Sn}, one for each server. Given some key k, a client computes the n hash weights w1 = h(S1, k), w2 = h(S2, k), ..., wn = h(Sn, k). The client associates that key with the server corresponding to the highest hash weight for that key. A server with ID Sx owns all the keys km for which the hash weight h(Sx, km) is higher than the hash weight of any other node for that key.

Locality-preserving hashing
Further information: Locality-preserving hashing
Locality-preserving hashing ensures that similar keys are assigned to similar objects. This can enable a more efficient execution of range queries; however, in contrast to consistent hashing, there is no longer any assurance that the keys (and thus the load) are uniformly and randomly distributed over the key space and the participating peers. DHT protocols such as Self-Chord and Oscar[14] address such issues. Self-Chord decouples object keys from peer IDs and sorts keys along the ring with a statistical approach based on the swarm intelligence paradigm.[15] Sorting ensures that similar keys are stored by neighbour nodes and that discovery procedures, including range queries, can be performed in logarithmic time. Oscar constructs a navigable small-world network based on random walk sampling, also assuring logarithmic search time.
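A minimal sketch of the two ownership rules described above, assuming a 160-bit circular keyspace and SHA-1 as the hash; the node and server names are illustrative, and the way the server ID and key are combined for HRW scoring is just one simple choice, not a prescribed scheme.

import hashlib

SPACE = 2 ** 160  # 160-bit circular keyspace

def h(value: str) -> int:
    return int.from_bytes(hashlib.sha1(value.encode()).digest(), "big")

# Consistent hashing, Chord style: ownership is decided by clockwise distance
# on the ring, so a key belongs to the first node ID reached going clockwise.
def clockwise_distance(a: int, b: int) -> int:
    return (b - a) % SPACE

def ring_owner(key: int, node_ids: list[int]) -> int:
    return min(node_ids, key=lambda nid: clockwise_distance(key, nid))

# Rendezvous (HRW) hashing: score every server for the key, pick the highest.
def hrw_owner(key: str, server_ids: list[str]) -> str:
    return max(server_ids, key=lambda sid: h(sid + key))

nodes = [h(f"node-{i}") for i in range(4)]          # illustrative node IDs
print(ring_owner(h("example.txt"), nodes))
print(hrw_owner("example.txt", ["S1", "S2", "S3"]))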
Overlay network
Each node maintains a set of links to other nodes (its neighbors or routing table). Together, these links form the overlay network.[16] A node picks its neighbors according to a certain structure, called the network's topology. All DHT topologies share some variant of the most essential property: for any key k, each node either has a node ID that owns k or has a link to a node whose node ID is closer to k, in terms of the keyspace distance defined above. It is then easy to route a message to the owner of any key k using the following greedy algorithm (which is not necessarily globally optimal): at each step, forward the message to the neighbor whose ID is closest to k. When there is no such neighbor, then we must have arrived at the closest node, which is the owner of k as defined above. This style of routing is sometimes called key-based routing. Beyond basic routing correctness, two important constraints on the topology are to guarantee that the maximum number of hops in any route (route length) is low, so that requests complete quickly; and that the maximum number of neighbors of any node (maximum node degree) is low, so that maintenance overhead is not excessive. Of course, having shorter routes requires higher maximum degree. Some common choices for maximum degree and route length are as follows, where n is the number of nodes in the DHT, using Big O notation:

Max. degree | Max. route length | Used in | Note
O(1) | O(n) | | Worst lookup lengths, with likely much slower lookup times
O(1) | O(log n) | Koorde (with constant degree) | More complex to implement, but acceptable lookup time can be found with a fixed number of connections
O(log n) | O(log n) | Chord, Kademlia, Pastry, Tapestry | Most common, but not optimal (degree/route length). Chord is the most basic version, with Kademlia seeming the most popular optimized variant (should have improved average lookup)
O(log n) | O(log n / log(log n)) | Koorde (with optimal lookup) | More complex to implement, but lookups might be faster (have a lower worst-case bound)
O(√n) | O(1) | | Worst local storage needs, with much communication after any node connects or disconnects

The most common choice, O(log n) degree/route length, is not optimal in terms of the degree/route length tradeoff, but such topologies typically allow more flexibility in choice of neighbors. Many DHTs use that flexibility to pick neighbors that are close in terms of latency in the physical underlying network. In general, all DHTs construct navigable small-world network topologies, which trade off route length against network degree.[17] Maximum route length is closely related to diameter: the maximum number of hops in any shortest path between nodes. Clearly, the network's worst-case route length is at least as large as its diameter, so DHTs are limited by the degree/diameter tradeoff[18] that is fundamental in graph theory. Route length can be greater than diameter, since the greedy routing algorithm may not find shortest paths.[19]
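A minimal sketch of the greedy key-based routing described above, under the same Chord-style clockwise distance used earlier. The Node class and its neighbor lists are hypothetical; a real DHT would also handle routing-table maintenance, timeouts, and parallel lookups, and correctness of the greedy walk relies on the topology property stated in the text (every node knows a strictly closer neighbor unless it owns the key).

SPACE = 2 ** 160

def distance(a: int, b: int) -> int:
    # Clockwise keyspace distance from a to b.
    return (b - a) % SPACE

class Node:
    def __init__(self, node_id: int):
        self.node_id = node_id
        self.neighbors: list["Node"] = []

    def route(self, key: int) -> "Node":
        current = self
        while True:
            best = min(current.neighbors,
                       key=lambda n: distance(key, n.node_id),
                       default=None)
            if best is None or distance(key, best.node_id) >= distance(key, current.node_id):
                return current   # no strictly closer neighbor: current node owns the key
            current = best       # otherwise forward the message one hop closer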
Algorithms for overlay networks
Aside from routing, there exist many algorithms that exploit the structure of the overlay network for sending a message to all nodes, or a subset of nodes, in a DHT.[20] These algorithms are used by applications to do overlay multicast, range queries, or to collect statistics. Two systems that are based on this approach are Structella,[21] which implements flooding and random walks on a Pastry overlay, and DQ-DHT, which implements a dynamic querying search algorithm over a Chord network.[22]

Security
Because of the decentralization, fault tolerance, and scalability of DHTs, they are inherently more resilient against a hostile attacker than a centralized system.[vague] Open systems for distributed data storage that are robust against massive hostile attackers are feasible.[23] A DHT system that is carefully designed to have Byzantine fault tolerance can defend against a security weakness, known as the Sybil attack, which affects all current DHT designs.[24][25] Petar Maymounkov, one of the original authors of Kademlia, has proposed a way to circumvent the weakness to the Sybil attack by incorporating social trust relationships into the system design.[26] The new system, codenamed Tonika and also known by its domain name as 5ttt, is based on an algorithm design known as "electric routing" and co-authored with the mathematician Jonathan Kelner.[27] Maymounkov has now undertaken a comprehensive implementation effort of this new system. However, research into effective defences against Sybil attacks is generally considered an open question, and a wide variety of potential defences are proposed every year in top security research conferences.[citation needed]

Implementations
The most notable differences encountered in practical instances of DHT implementations include at least the following:
The address space is a parameter of the DHT. Several real-world DHTs use a 128-bit or 160-bit key space.
Some real-world DHTs use hash functions other than SHA-1.
In the real world the key k could be a hash of a file's content rather than a hash of a file's name, to provide content-addressable storage, so that renaming of the file does not prevent users from finding it. Some DHTs may also publish objects of different types. For example, key k could be the node ID and the associated data could describe how to contact this node. This allows publication-of-presence information and is often used in IM applications, etc. In the simplest case, ID is just a random number that is directly used as key k (so in a 160-bit DHT, ID will be a 160-bit number, usually randomly chosen). In some DHTs, publishing of nodes' IDs is also used to optimize DHT operations.
Redundancy can be added to improve reliability. The (k, data) key pair can be stored in more than one node corresponding to the key. Usually, rather than selecting just one node, real-world DHT algorithms select i suitable nodes, with i being an implementation-specific parameter of the DHT.
In some DHT designs, nodes agree to handle a certain keyspace range, the size of which may be chosen dynamically, rather than hard-coded.
Some advanced DHTs like Kademlia perform iterative lookups through the DHT first in order to select a set of suitable nodes and send put(k, data) messages only to those nodes, thus drastically reducing useless traffic, since published messages are only sent to nodes that seem suitable for storing the key k; and iterative lookups cover just a small set of nodes rather than the entire DHT, reducing useless forwarding. In such DHTs, forwarding of put(k, data) messages may only occur as part of a self-healing algorithm: if a target node receives a put(k, data) message, but believes that k is out of its handled range and a closer node (in terms of DHT keyspace) is known, the message is forwarded to that node.
Otherwise, data are indexed locally. This leads to a somewhat self-balancing DHT behavior. Of course, such an algorithm requires nodes to publish their presence data in the DHT so the iterative lookups can be performed. Since on most machines sending messages is much more expensive than local hash table accesses, it makes sense to bundle many messages concerning a particular node into a single batch. Assuming each node has a local batch consisting of at most b operations, the bundling procedure is as follows. Each node first sorts its local batch by the identifier of the node responsible for the operation. Using bucket sort, this can be done in O(b + n), where n is the number of nodes in the DHT. When there are multiple operations addressing the same key within one batch, the batch is condensed before being sent out. For example, multiple lookups of the same key can be reduced to one or multiple increments can be reduced to a single add operation. This reduction can be implemented with the help of a temporary local hash table. Finally, the operations are sent to the respective nodes.[28] Examples[edit] DHT protocols and implementations[edit] Apache Cassandra BATON Overlay Mainline DHT – standard DHT used by BitTorrent (based on Kademlia as provided by Khashmir)[29] Content addressable network (CAN) Chord Koorde Kademlia Pastry P-Grid Riak Tapestry TomP2P Voldemort Applications using DHTs[edit] BTDigg: BitTorrent DHT search engine Codeen: web caching Coral Content Distribution Network Freenet: a censorship-resistant anonymous network GlusterFS: a distributed file system used for storage virtualization GNUnet: Freenet-like distribution network including a DHT implementation I2P: An open-source anonymous peer-to-peer network I2P-Bote: serverless secure anonymous email IPFS: A content-addressable, peer-to-peer hypermedia distribution protocol JXTA: open-source P2P platform Oracle Coherence: an in-memory data grid built on top of a Java DHT implementation Perfect Dark: a peer-to-peer file-sharing application from Japan Retroshare: a Friend-to-friend network[30] Jami: a privacy-preserving voice, video and chat communication platform, based on a Kademlia-like DHT Tox: an instant messaging system intended to function as a Skype replacement Twister: a microblogging peer-to-peer platform YaCy: a distributed search engine See also[edit] Couchbase Server: a persistent, replicated, clustered distributed object storage system compatible with memcached protocol. Memcached: a high-performance, distributed memory object caching system. Prefix hash tree: sophisticated querying over DHTs. Merkle tree: tree having every non-leaf node labelled with the hash of the labels of its children nodes. Most distributed data stores employ some form of DHT for lookup. Skip graphs are an efficient data structure for implementing DHTs. References[edit] ^ Stoica, I.; Morris, R.; Karger, D.; Kaashoek, M. F.; Balakrishnan, H. (2001). "Chord: A scalable peer-to-peer lookup service for internet applications" (PDF). ACM SIGCOMM Computer Communication Review. 31 (4): 149. doi:10.1145/964723.383071. A value can be an address, a document, or an arbitrary data item. ^ Liz, Crowcroft; et al. (2005). "A survey and comparison of peer-to-peer overlay network schemes" (PDF). IEEE Communications Surveys & Tutorials. 7 (2): 72–93. CiteSeerX 10.1.1.109.6124. doi:10.1109/COMST.2005.1610546. ^ Richter, Stevenson; et al. (2009). "Analysis of the impact of dynamic querying models on client-server relationships". 
Trends in Modern Computing: 682–701. ^ Searching in a Small World Chapters 1 & 2 (PDF), retrieved 2012-01-10 ^ "Section 5.2.2" (PDF), A Distributed Decentralized Information Storage and Retrieval System, retrieved 2012-01-10 ^ Ratnasamy; et al. (2001). "A Scalable Content-Addressable Network" (PDF). In Proceedings of ACM SIGCOMM 2001. Retrieved 2013-05-20. Cite journal requires |journal= (help) ^ Hari Balakrishnan, M. Frans Kaashoek, David Karger, Robert Morris, and Ion Stoica. Looking up data in P2P systems. In Communications of the ACM, February 2003. ^ David Cohen (October 1, 2002). "New P2P network funded by US government". New Scientist. Retrieved November 10, 2013. ^ "MIT, Berkeley, ICSI, NYU, and Rice Launch the IRIS Project". Press release. MIT. September 25, 2002. Archived from the original on September 26, 2015. Retrieved November 10, 2013. ^ R Mokadem, A Hameurlain and AM Tjoa. Resource discovery service while minimizing maintenance overhead in hierarchical DHT systems. Proc. iiWas, 2010 ^ Guido Urdaneta, Guillaume Pierre and Maarten van Steen. A Survey of DHT Security Techniques. ACM Computing Surveys 43(2), January 2011. ^ Moni Naor and Udi Wieder. Novel Architectures for P2P Applications: the Continuous-Discrete Approach. Proc. SPAA, 2003. ^ Gurmeet Singh Manku. Dipsea: A Modular Distributed Hash Table Archived 2004-09-10 at the Wayback Machine. Ph. D. Thesis (Stanford University), August 2004. ^ Girdzijauskas, Šarūnas; Datta, Anwitaman; Aberer, Karl (2010-02-01). "Structured overlay for heterogeneous environments". ACM Transactions on Autonomous and Adaptive Systems. 5 (1): 1–25. doi:10.1145/1671948.1671950. ISSN 1556-4665. ^ Forestiero, Agostino; Leonardi, Emilio; Mastroianni, Carlo; Meo, Michela (October 2010). "Self-Chord: A Bio-Inspired P2P Framework for Self-Organizing Distributed Systems". IEEE/ACM Transactions on Networking. 18 (5): 1651–1664. doi:10.1109/TNET.2010.2046745. ^ Galuba, Wojciech; Girdzijauskas, Sarunas (2009), "Peer to Peer Overlay Networks: Structure, Routing and Maintenance", in LIU, LING; ÖZSU, M. TAMER (eds.), Encyclopedia of Database Systems, Springer US, pp. 2056–2061, doi:10.1007/978-0-387-39940-9_1215, ISBN 9780387399409 ^ Girdzijauskas, Sarunas (2009). Designing peer-to-peer overlays a small-world perspective. epfl.ch. EPFL. ^ The (Degree,Diameter) Problem for Graphs, Maite71.upc.es, archived from the original on 2012-02-17, retrieved 2012-01-10 ^ Gurmeet Singh Manku, Moni Naor, and Udi Wieder. "Know thy Neighbor's Neighbor: the Power of Lookahead in Randomized P2P Networks". Proc. STOC, 2004. ^ Ali Ghodsi. "Distributed k-ary System: Algorithms for Distributed Hash Tables", Archived 22 May 2007 at the Wayback Machine. KTH-Royal Institute of Technology, 2006. ^ Castro, Miguel; Costa, Manuel; Rowstron, Antony (1 January 2004). "Should we build Gnutella on a structured overlay?" (PDF). ACM SIGCOMM Computer Communication Review. 34 (1): 131. CiteSeerX 10.1.1.221.7892. doi:10.1145/972374.972397. ^ Talia, Domenico; Trunfio, Paolo (December 2010). "Enabling Dynamic Querying over Distributed Hash Tables". Journal of Parallel and Distributed Computing. 70 (12): 1254–1265. doi:10.1016/j.jpdc.2010.08.012. ^ Baruch Awerbuch, Christian Scheideler. "Towards a scalable and robust DHT". 2006. doi:10.1145/1148109.1148163 ^ Maxwell Young; Aniket Kate; Ian Goldberg; Martin Karsten. "Practical Robust Communication in DHTs Tolerating a Byzantine Adversary". ^ Natalya Fedotova; Giordano Orzetti; Luca Veltri; Alessandro Zaccagnini. 
"Byzantine agreement for reputation management in DHT-based peer-to-peer networks". doi:10.1109/ICTEL.2008.4652638 ^ Chris Lesniewski-Laas. "A Sybil-proof one-hop DHT" (PDF): 20. Cite journal requires |journal= (help) ^ Jonathan Kelner, Petar Maymounkov (2009). "Electric routing and concurrent flow cutting". arXiv:0909.2859. Bibcode:2009arXiv0909.2859K. Cite journal requires |journal= (help) ^ Sanders, Peter; Mehlhorn, Kurt; Dietzfelbinger, Martin; Dementiev, Roman (2019). Sequential and Parallel Algorithms and Data Structures: The Basic Toolbox. Springer International Publishing. ISBN 978-3-030-25208-3. ^ Tribler wiki Archived December 4, 2010, at the Wayback Machine retrieved January 2010. ^ Retroshare FAQ retrieved December 2011 External links[edit] Distributed Hash Tables, Part 1 by Brandon Wiley. Distributed Hash Tables links Carles Pairot's Page on DHT and P2P research kademlia.scs.cs.nyu.edu Archive.org snapshots of kademlia.scs.cs.nyu.edu Eng-Keong Lua; Crowcroft, Jon; Pias, Marcelo; Sharma, Ravi; Lim, Steve (2005). "IEEE Survey on overlay network schemes". CiteSeerX 10.1.1.111.4197: Cite journal requires |journal= (help) covering unstructured and structured decentralized overlay networks including DHTs (Chord, Pastry, Tapestry and others). Mainline DHT Measurement at Department of Computer Science, University of Helsinki, Finland. v t e BitTorrent Companies BitTorrent, Inc. Vuze, Inc. People Bram Cohen Ross Cohen Eric Klinker Ashwin Navin Justin Sun Technology Glossary Broadcatching Distributed hash tables DNA I2P index Local Peer Discovery Peer exchange Protocol encryption Super-seeding Tracker Torrent file TCP UDP µTP WebRTC WebTorrent Clients (comparison, usage share) Ares Galaxy BitTorrent (original client) BitComet BitLord Deluge Free Download Manager Flashget FrostWire Getright Go!Zilla KTorrent libtorrent (library) LimeWire µTorrent Miro MLDonkey qBittorrent rTorrent Shareaza Tixati Transmission Tribler Vuze (formerly Azureus) WebTorrent Desktop Xunlei Tracker software (comparison) opentracker PeerTracker TorrentPier XBT Tracker Search engines (comparison) 1337x BTDigg Demonoid etree ExtraTorrent EZTV isoHunt Karagarga KickassTorrents Nyaa Torrents The Pirate Bay RARBG Tamil Rockers Torrentz YIFY yourBittorrent Defunct websites BTJunkie Burnbit LokiTorrent Mininova Oink's Pink Palace OpenBitTorrent Suprnova.org t411 Torrent Project TorrentSpy What.CD YouTorrent Related topics aXXo BitTorrent Open Source License Glossary of BitTorrent terms Popcorn Time Slyck.com TorrentFreak Category Commons Retrieved from "https://en.wikipedia.org/w/index.php?title=Distributed_hash_table&oldid=1013584054" Categories: Distributed data storage File sharing Distributed data structures Hash based data structures Network architecture Hashing Hidden categories: CS1 errors: missing periodical Webarchive template wayback links Articles with short description Short description is different from Wikidata Articles needing additional references from September 2020 All articles needing additional references All Wikipedia articles needing clarification Wikipedia articles needing clarification from June 2016 All articles with unsourced statements Articles with unsourced statements from May 2020 Navigation menu Personal tools Not logged in Talk Contributions Create account Log in Namespaces Article Talk Variants Views Read Edit View history More Search Navigation Main page Contents Current events Random article About Wikipedia Contact us Donate Contribute Help Learn to edit Community portal Recent changes 
Upload file Tools What links here Related changes Upload file Special pages Permanent link Page information Cite this page Wikidata item Print/export Download as PDF Printable version Languages Български Català Deutsch Español فارسی Français 한국어 Italiano Magyar Nederlands 日本語 Norsk bokmål Polski Português Русский Српски / srpski Suomi Svenska Türkçe Українська Tiếng Việt 中文 Edit links This page was last edited on 22 March 2021, at 12:23 (UTC). Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization. Privacy policy About Wikipedia Disclaimers Contact Wikipedia Mobile view Developers Statistics Cookie statement en-wikipedia-org-7867 ---- Shure SM57 - Wikipedia Shure SM57 From Wikipedia, the free encyclopedia Jump to navigation Jump to search The Shure SM57 microphone The Shure SM57 is a low-impedance cardioid dynamic microphone made by Shure Incorporated and commonly used in live sound reinforcement and studio recording. It is one of the best-selling microphones in the world. It is used extensively in amplified music and has been used for speeches by every U.S. president since its introduction in 1965.[1] In 2004, honoring its four decades of "solid, dependable performance", it was inducted into the first-ever TEC Awards TECnology Hall of Fame.[1] Contents 1 Background 2 Characteristics 3 Use 4 Specifications 5 See also 6 References 7 External links Background[edit] The origin of SM57 may be traced to 1937, when Shure engineer Benjamin Bauer developed the first single-element directional microphone, the Unidyne, which had a cardioid pickup pattern.[1] In 1959, another Shure engineer, Ernie Seeler, advanced the art of microphone design significantly with the Unidyne III.[1] Seeler torture-tested the Unidyne III during three years of research and development and thereby, produced the SM series of rugged and reliable Shure microphone capsules.[1] The "SM" stands for Studio Microphone;[2] Seeler was an aficionado of classical music and expected the SM57 to be used for orchestras. Because he "despised" rock music, the TEC Foundation said that it was ironic that the microphone has become "a mainstay of rock music."[1] Characteristics[edit] The SM57 uses the same capsule as the popular SM58. Like the SM58, the SM57 is fitted with an XLR connector and features a balanced output, which helps to minimize electrical hum and noise pickup. According to Shure, the frequency response extends from 40 Hertz (Hz) to 15 kHz. The SM57 is manufactured in the United States, Mexico, and China. The Shure A2WS is an accessory windscreen for the SM57 that attenuates wind noise and plosives ("pop" sounds), and protects the microphone capsule. Use[edit] Shure SM57 microphones with A2WS windscreens installed on the lectern of former United States President Barack Obama. The microphone kit (two SM57 microphones, windscreens, microphone stands, and black right-angle XLR cables) is referred to as the VIP/high-profile microphone kit. The SM57 is a popular choice of musicians due to its sturdy construction and ability to work well with instruments that produce high sound pressure levels, such as percussion instruments and electric guitars. 
The School of Audio Engineering (SAE) recommends the SM57 (along with other makes and models) for four roles in a drum kit: kick drum, snare drum, rack toms, and floor tom.[3] The cardioid pickup pattern of the microphone reduces the pickup of unwanted background sound and the generation of acoustic feedback. SM57s have also been a staple when reinforcing the sound from guitar amplifiers. In a more unconventional fashion, the SM57 has been favoured by some as a vocal mic, both live and in the studio. Notable singers known to have recorded vocals with an SM57 include Anthony Kiedis, Brandon Flowers,[4] Madonna,[5] David Bowie,[6] John Lennon,[7] Jack White,[8] Bjork,[9] Peter Gabriel,[10] Paul Rodgers,[11] Tom Waits,[12] Wayne Coyne,[13] Tom Petty [14]Alice Cooper, Erykah Badu,[15] Caleb Followill[16] and Raphael Saadiq.[17] An early model of the mic, the Unidyne 545 was used on Pet Sounds for Brian Wilson's vocal tracks. Every U.S. president since Lyndon B. Johnson has delivered speeches through an SM57.[1] It became the lectern microphone of the White House Communications Agency in 1965, the year of its introduction, and remains so.[18] Due to its popularity, the SM57 has been counterfeited frequently by manufacturers in China and Thailand.[19] Shure Distribution UK reports that the SM57, SM58, Beta 57A, and Beta 58A are their microphones that are most commonly counterfeited.[20] In 2006, Shure mounted a campaign against the trading of counterfeit microphones.[21] Specifications[edit] SM57 Unidyne III, ca. 1984 Type Dynamic Frequency response 40 to 15,000 Hz Polar pattern Cardioid Sensitivity (at 1,000 Hz open circuit voltage) −56.0 dBV/Pa (at 1,000 Hz) Impedance Rated impedance is 150 ohms (300 ohms actual) for connection to microphone inputs rated low impedance Connector Three-pin professional audio connector (male XLR type) Produced 1965–present See also[edit] Shure SM58 References[edit] ^ a b c d e f g TECnology Hall of Fame: 2004 Archived 2013-12-13 at the Wayback Machine ^ History of Shure Incorporated Archived 2008-04-28 at the Wayback Machine ^ "Microphone Placement: Let's take a look at a standard drum kit". SAE. Retrieved April 6, 2011. CS1 maint: discouraged parameter (link) ^ https://reverb.com/news/gear-tribute-the-shure-sm57-from-rumours-to-the-white-house ^ https://web.archive.org/web/20110830193214/http://www.sheppettibone.com/sp_erotica_diaries.htm ^ https://timpalmer.com/wp-content/themes/timpalmer/pdfs/Melody_maker_1989.pdf ^ https://www.soundonsound.com/people/john-lennon-whatever-gets-you-thru-night ^ https://www.soundonsound.com/techniques/inside-track-jack-white ^ http://www.moredarkthanshark.org/eno_int_audproint-oct08.html ^ https://www.youtube.com/watch?v=scmYG1Pv1_Q&feature=youtu.be&t=35m45s ^ https://www.analogplanet.com/content/royal-sessions-finds-paul-rodgers-fine-voice ^ https://www.soundonsound.com/people/bones-howe-tom-waits ^ https://www.youtube.com/watch?v=zzk4AkZw9vc&feature=youtu.be&t=256 ^ https://www.soundonsound.com/techniques/inside-track-tom-pettys-hypnotic-eye ^ https://web.archive.org/web/20171122013017/http://www.emusician.com/gear/1332/earth-sun-moon/39259 ^ https://www.mixonline.com/recording/kings-leon-365832 ^ Farinella, David John (January 1, 2009). "Music: Raphael Saadiq". Mix. Archived from the original on September 21, 2012. Retrieved April 21, 2012. CS1 maint: discouraged parameter (link) ^ Charles J. Kouri; Rose L. Shure; Hayward Blake; John Lee (2001). Shure: sound people, products, and values. 1. Shure Inc. p. xiii. 
ISBN 0-9710738-0-5. ^ Home Recording. Joe Shambro, Spotting a Fake Shure Microphone: How to tell if your mic is genuine—or not ^ Shure Distribution UK. What is a counterfeit? Archived 2009-03-03 at the Wayback Machine ^ Shure Distribution UK. Shure Distribution UK Clamp Down on Counterfeiters Archived 2009-04-25 at the Wayback Machine External links[edit] SM57 official page Sound&Recording - 50 Years of Shure SM57 Retrieved from "https://en.wikipedia.org/w/index.php?title=Shure_SM57&oldid=1000474223" Categories: Microphones Hidden categories: Webarchive template wayback links CS1 maint: discouraged parameter Navigation menu Personal tools Not logged in Talk Contributions Create account Log in Namespaces Article Talk Variants Views Read Edit View history More Search Navigation Main page Contents Current events Random article About Wikipedia Contact us Donate Contribute Help Learn to edit Community portal Recent changes Upload file Tools What links here Related changes Upload file Special pages Permanent link Page information Cite this page Wikidata item Print/export Download as PDF Printable version In other projects Wikimedia Commons Languages Català Deutsch Español Français Italiano Nederlands 日本語 Suomi Edit links This page was last edited on 15 January 2021, at 07:40 (UTC). Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization. Privacy policy About Wikipedia Disclaimers Contact Wikipedia Mobile view Developers Statistics Cookie statement en-wikipedia-org-8598 ---- Shure SM58 - Wikipedia Shure SM58 From Wikipedia, the free encyclopedia Jump to navigation Jump to search The Shure SM58 microphone The Shure SM58 is a professional cardioid dynamic microphone, commonly used in live vocal applications. Produced since 1966 by Shure Incorporated, it has built a strong reputation among musicians for its durability and sound, and half a century later it is still considered the industry standard for live vocal performance microphones.[1][2][3] The SM58 and its sibling, the SM57, are the best-selling microphones in the world.[4] The SM stands for Studio Microphone.[5] Like all directional microphones, the SM58 is subject to proximity effect, a low frequency boost when used close to the source. The cardioid response reduces pickup from the side and rear, helping to avoid feedback onstage. There are wired (with and without on/off switch) and wireless versions. The wired version provides balanced audio through a male XLR connector. The SM58 uses an internal shock mount to reduce handling noise. A distinctive feature of the SM58 is its pneumatic suspension system for the microphone capsule.[6] The capsule, a readily replaceable component, is surrounded by a soft rubber balloon, rather than springs or solid rubber. This gives notably good isolation from handling noise; one reason for its being a popular microphone for stage vocalists. Microphones with this feature are intended primarily for hand-held use, rather than on a stand or for instrument miking. The SM58 is unswitched, while the otherwise identical SM58S has a sliding on-off switch on the body. 
Other suffixes refer to any accessories supplied with the microphone: when a cable is provided, the model is actually SM58-CN, while the SM58-LC has no provided cable; the SM58-X2u kit consists of the SM58-LC and an inline X2u XLR-to-USB signal adaptor (capable of providing phantom power for condenser microphones, and offering an in-built headphone jack for monitoring).[7] Contents 1 Specifications 2 Awards 3 Counterfeiting 4 See also 5 References 6 External links Specifications[edit] Robert Lockwood, Jr using an SM58 Patti Smith performing with an SM58 in Finland Randall Bramblett with an SM58 Lower-cost 588SD, circa 1970[8] Type: Dynamic[9] (moving coil) Frequency Response 50 to 15,000 Hz[9] Polar Pattern Cardioid,[9] rotationally symmetrical about microphone axis, uniform with frequency Sensitivity (at 1,000 Hz Open Circuit Voltage) −54.5 dBV/Pa (1.85 mV); 1 Pa = 94 dB SPL[9] Impedance Rated impedance is 150 ohms (300 ohms actual) for connection to microphone inputs rated low impedance[9] Polarity Positive pressure on diaphragm produces positive voltage on pin 2 with respect to pin 3[9] Connector Three-pin male XLR[9] Net Weight 298 grams (10.5 oz)[9] Awards[edit] In 2008, for the second year running, the SM58 microphone won the MI Pro Retail Survey "Best Live Microphone" award.[10] In 2011, Acoustic Guitar magazine honored the SM58 with a Gold Medal in the Player's Choice Awards.[11] Counterfeiting[edit] The SM58 and SM57 have been extensively counterfeited.[12][13][14][15][16][17] Most of these counterfeit microphones are at least functional, but have poorer performance and do not have the pneumatic suspension. There are many other subtle details which can reveal most of these fakes.[18][19] See also[edit] Shure SM57 Shure Beta 58A References[edit] ^ Live Sound International, September/October 2002. Real World: Wired Vocal Microphones Archived 2009-01-07 at the Wayback Machine ^ Miller, Peter L. (2001). Speaking Skills for Every Occasion. Blake's Guides. Pascal Press. p. 30. ISBN 1741250463. ^ Morris, Tee; Tomasi, Chuck; Terra, Evo (2008). Podcasting For Dummies (2 ed.). John Wiley & Sons. p. 36. ISBN 047027557X. ^ Paul Stamler, Shure SM57 Impedance Modification, Recording Magazine, archived from the original on 2014-04-21, retrieved 2014-04-20 CS1 maint: discouraged parameter (link) ^ History of Shure Incorporated ^ Goodwyn, Peterson. "Shure's Secret, Invisible Shockmount". Recording Hacks. Retrieved 1 November 2013. CS1 maint: discouraged parameter (link) ^ "SM58+X2u USB Digital Bundle". Shure Europe. ^ Shure webpage ^ a b c d e f g h Product Specifications (PDF), Shure, retrieved 2012-10-06 CS1 maint: discouraged parameter (link) ^ http://www.shure.com/americas/about-shure/history/index.htm ^ Gerken, Teja. "Acoustic Guitar Player's Choice Awards 2011 - Shure SM58". Acoustic Guitar. String Letter Publishing. Retrieved August 19, 2012. CS1 maint: discouraged parameter (link) ^ "Sennheiser, Shure Team Up For Counterfeit Raid", December 21, 2001, MIX ^ "Counterfeit Shure Microphones Destroyed", October 9, 2002, MIX ^ "Thai-based counterfeit ring smashed", February 1, 2006, Music Trades. "Among the products in this shipment was a large quantity of counterfeit SM58 microphones destined for retail outlets around Thailand." ^ "Auction websites' threat to legitimate brands", January 1, 2007, Pro Sound News Europe. "The SM57, SM58, Beta 57 and Beta 58 are among the fixities proving most attractive to counterfeiters." 
^ ""Shure Seizes Counterfeit Microphones in China", November 14, 2007, MIX ^ "Counterfeit Shure Gear Seized: Thousands of counterfeit microphones were recently confiscated in Peru and Paraguay by customs officials", February 2, 2012, Broadcasting & Cable ^ "Spotting a Fake Shure Microphone: How to tell if your mic is genuine -- or not". About.com Home Recording ^ [1]"5 Tips on Spotting a Fake Shure SM58" External links[edit] SM58 official page Shure Asia SM58 official page Shure SM58 history page Retrieved from "https://en.wikipedia.org/w/index.php?title=Shure_SM58&oldid=975527289" Categories: Microphones Hidden categories: Webarchive template wayback links CS1 maint: discouraged parameter Navigation menu Personal tools Not logged in Talk Contributions Create account Log in Namespaces Article Talk Variants Views Read Edit View history More Search Navigation Main page Contents Current events Random article About Wikipedia Contact us Donate Contribute Help Learn to edit Community portal Recent changes Upload file Tools What links here Related changes Upload file Special pages Permanent link Page information Cite this page Wikidata item Print/export Download as PDF Printable version In other projects Wikimedia Commons Languages Català Deutsch Español Français Italiano Nederlands 日本語 Русский Suomi Edit links This page was last edited on 29 August 2020, at 01:20 (UTC). Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization. Privacy policy About Wikipedia Disclaimers Contact Wikipedia Mobile view Developers Statistics Cookie statement en-wikipedia-org-8991 ---- Randori - Wikipedia Randori From Wikipedia, the free encyclopedia Jump to navigation Jump to search Free-style practice in Japanese martial arts This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed. Find sources: "Randori" – news · newspapers · books · scholar · JSTOR (December 2009) (Learn how and when to remove this template message) Randori Japanese name Kanji 乱取り Hiragana らんどり Transcriptions Revised Hepburn randori Randori (乱取り) is a term used in Japanese martial arts to describe free-style practice (sparring). The term denotes an exercise in 取り tori, applying technique to a random ( 乱 ran) succession of uke attacks. The actual connotation of randori depends on the martial art it is used in. In judo, jujutsu, and Shodokan aikido, among others, it most often refers to one-on-one sparring where partners attempt to resist and counter each other's techniques. In other styles of aikido, in particular Aikikai, it refers to a form of practice in which a designated aikidoka defends against multiple attackers in quick succession without knowing how they will attack or in what order. Contents 1 In Japan 2 In Judo 3 In Tenshin Aikido 4 In Kendo 5 In Karate 6 In ninjutsu 7 See also 8 References 9 External links In Japan[edit] The term is used in aikido, judo, and Brazilian jiu-jitsu dojos outside Japan. In Japan, this form of practice is called taninzu-gake (多人数掛け), which literally means multiple attackers. 
In Judo[edit] The term was described by Jigoro Kano, the founder of Judo, in a speech at the 1932 Los Angeles Olympic Games: "Randori, meaning "free exercise", is practiced under conditions of actual contest. It includes throwing, choking, holding the opponent down, and bending or twisting of the arms. The two combatants may use whatever methods they like provided they do not hurt each other and obey the rules of Judo concerning etiquette, which are essential to its proper working." [1] There are 2 types of Randori.[2] [3] In Tenshin Aikido[edit] In Steven Seagal's Tenshin Aikido Federation (affiliated with the Aikikai), randori is different from that of Aikikai, in that the attackers can do anything to the defender (e.g. punch, grab, kick, etc.), and the randori continues on the ground until a pin. In Kendo[edit] In kendo, jigeiko means "friendly" free combat, as in competition, but without counting points. In Karate[edit] Although in karate the word kumite is usually reserved for sparring, some schools also employ the term randori with regard to "mock-combat" in which both karateka move with speed, parrying and attacking with all four limbs (including knees, elbows, etc.). In these schools, the distinction between randori and kumite is that in randori, the action is uninterrupted when a successful technique is applied. (Also known as ju kumite or soft sparring.) In ninjutsu[edit] Randori is also practiced in Bujinkan ninjutsu and usually represented to the practitioner when he reaches the "Shodan" level. In ninjutsu, randori puts the practitioner in a position where he is armed or unarmed and is attacked by multiple attackers. See also[edit] Kata Sparring Randori-no-kata References[edit] ^ Original text of this speech available at The Judo Information Site at http://judoinfo.com/kano1.htm ^ Ohlenkamp, Neil (16 May 2018). Black Belt Judo. New Holland. ISBN 9781845371098 – via Google Books. ^ Tello, Rodolfo (1 August 2016). Judo: Seven Steps to Black Belt (An Introductory Guide for Beginners). Amakella Publishing. ISBN 9781633870086 – via Google Books. 
External links[edit] Judo Information Site YouTube: Randori In Tenshin Aikido
Privacy policy About Wikipedia Disclaimers Contact Wikipedia Mobile view Developers Statistics Cookie statement en-wikipedia-org-9073 ---- Sybil attack - Wikipedia Sybil attack From Wikipedia, the free encyclopedia Jump to navigation Jump to search Attack done by multiple fake identities In a Sybil attack, the attacker subverts the reputation system of a network service by creating a large number of pseudonymous identities and uses them to gain a disproportionately large influence. It is named after the subject of the book Sybil, a case study of a woman diagnosed with dissociative identity disorder.[1] The name was suggested in or before 2002 by Brian Zill at Microsoft Research.[2] The term pseudospoofing had previously been coined by L. Detweiler on the Cypherpunks mailing list and used in the literature on peer-to-peer systems for the same class of attacks prior to 2002, but this term did not gain as much influence as "Sybil attack".[3] Sybil attacks are also called sock puppetry. Contents 1 Description 2 Example 3 Prevention 3.1 Identity validation 3.2 Social trust graphs 3.3 Economic costs 3.4 Personhood validation 3.5 Application-specific defenses 4 See also 5 References 6 External links Description[edit] The Sybil attack in computer security is an attack wherein a reputation system is subverted by creating multiple identities.[4] A reputation system's vulnerability to a Sybil attack depends on how cheaply identities can be generated, the degree to which the reputation system accepts inputs from entities that do not have a chain of trust linking them to a trusted entity, and whether the reputation system treats all entities identically. As of 2012[update], evidence showed that large-scale Sybil attacks could be carried out in a very cheap and efficient way in extant realistic systems such as BitTorrent Mainline DHT.[5][6] An entity on a peer-to-peer network is a piece of software which has access to local resources. An entity advertises itself on the peer-to-peer network by presenting an identity. More than one identity can correspond to a single entity. In other words, the mapping of identities to entities is many to one. Entities in peer-to-peer networks use multiple identities for purposes of redundancy, resource sharing, reliability and integrity. In peer-to-peer networks, the identity is used as an abstraction so that a remote entity can be aware of identities without necessarily knowing the correspondence of identities to local entities. By default, each distinct identity is usually assumed to correspond to a distinct local entity. In reality, many identities may correspond to the same local entity. An adversary may present multiple identities to a peer-to-peer network in order to appear and function as multiple distinct nodes. The adversary may thus be able to acquire a disproportionate level of control over the network, such as by affecting voting outcomes. In the context of (human) online communities, such multiple identities are sometimes known as sockpuppets. Example[edit] A notable Sybil attack (in conjunction with a traffic confirmation attack) was launched against the Tor anonymity network for several months in 2014 by unknown perpetrators.[7][8] Prevention[edit] Known approaches to Sybil attack prevention include identity validation, social trust graph algorithms, or economic costs, personhood validation, and application-specific defenses. Identity validation[edit] Validation techniques can be used to prevent Sybil attacks and dismiss masquerading hostile entities. 
A local entity may accept a remote identity based on a central authority which ensures a one-to-one correspondence between an identity and an entity and may even provide a reverse lookup. An identity may be validated either directly or indirectly. In direct validation the local entity queries the central authority to validate the remote identities. In indirect validation the local entity relies on already-accepted identities which in turn vouch for the validity of the remote identity in question. Practical network applications and services often use a variety of identity proxies to achieve limited Sybil attack resistance, such as telephone number verification, credit card verification, or even based on the IP address of a client. These methods have the limitations that it is usually possible to obtain multiple such identity proxies at some cost—or even to obtain many at low cost through techniques such as SMS spoofing or IP address spoofing. Use of such identity proxies can also exclude those without ready access to the required identity proxy: e.g., those without their own mobile phone or credit card, or users located behind carrier-grade network address translation who share their IP addresses with many others. Identity-based validation techniques generally provide accountability at the expense of anonymity, which can be an undesirable tradeoff especially in online forums that wish to permit censorship-free information exchange and open discussion of sensitive topics. A validation authority can attempt to preserve users' anonymity by refusing to perform reverse lookups, but this approach makes the validation authority a prime target for attack. Protocols using threshold cryptography can potentially distribute the role of such a validation authority among multiple servers, protecting users' anonymity even if one or a limited number of validation servers is compromised.[9] Social trust graphs[edit] Sybil prevention techniques based on the connectivity characteristics of social graphs can also limit the extent of damage that can be caused by a given Sybil attacker while preserving anonymity. Examples of such prevention techniques include SybilGuard,[10] SybilLimit,[11] the Advogato Trust Metric,[12] and the sparsity based metric to identify Sybil clusters in a distributed P2P based reputation system.[13] These techniques cannot prevent Sybil attacks entirely, and may be vulnerable to widespread small-scale Sybil attacks. In addition, it is not clear whether real-world online social networks will satisfy the trust or connectivity assumptions that these algorithms assume.[14] Economic costs[edit] Alternatively, imposing economic costs as artificial barriers to entry may be used to make Sybil attacks more expensive. Proof of work, for example, requires a user to prove that they expended a certain amount of computational effort to solve a cryptographic puzzle. In Bitcoin and related permissionless cryptocurrencies, miners compete to append blocks to a blockchain and earn rewards roughly in proportion to the amount of computational effort they invest in a given time period. Investments in other resources such as storage or stake in existing cryptocurrency may similarly be used to impose economic costs. 
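As a rough illustration of the economic-cost idea, the sketch below implements a toy hash puzzle of the proof-of-work kind just described. The SHA-256 choice, the leading-zero-bits difficulty criterion, and the function names are assumptions made for this example, not a description of any specific deployed system; the point is the asymmetry that creating each identity costs roughly 2^difficulty_bits hash evaluations while verification costs one.

    import hashlib
    import os

    def solve_puzzle(identity: bytes, difficulty_bits: int) -> bytes:
        # Search for a nonce such that SHA-256(identity || nonce) falls below a
        # target, i.e. starts with difficulty_bits zero bits. Expected cost is
        # about 2**difficulty_bits hashes, which is what makes mass identity
        # creation expensive.
        target = 1 << (256 - difficulty_bits)
        nonce = 0
        while True:
            digest = hashlib.sha256(identity + nonce.to_bytes(8, "big")).digest()
            if int.from_bytes(digest, "big") < target:
                return nonce.to_bytes(8, "big")
            nonce += 1

    def verify_puzzle(identity: bytes, nonce: bytes, difficulty_bits: int) -> bool:
        # Verification is a single hash, so honest participants pay almost nothing.
        digest = hashlib.sha256(identity + nonce).digest()
        return int.from_bytes(digest, "big") < (1 << (256 - difficulty_bits))

    # Example admission check a service might run before accepting a new identity.
    identity = os.urandom(16)
    proof = solve_puzzle(identity, difficulty_bits=16)  # roughly 65,000 hashes on average
    assert verify_puzzle(identity, proof, difficulty_bits=16)

Raising difficulty_bits raises the cost of each additional identity without changing the cost of verification, which is the property the economic-cost defences above rely on.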
Personhood validation[edit] As an alternative to identity verification that attempts to maintain a strict "one-per-person" allocation rule, a validation authority can use some mechanism other than knowledge of a user's real identity - such as verification of an unidentified person's physical presence at a particular place and time as in a pseudonym party[15] - to enforce a one-to-one correspondence between online identities and real-world users. Such proof of personhood approaches have been proposed as a basis for permissionless blockchains and cryptocurrencies in which each human participant would wield exactly one vote in consensus.[16][17] A variety of approaches to proof of personhood have been proposed, some with deployed implementations, although many usability and security issues remain.[18] Application-specific defenses[edit] A number of distributed protocols have been designed with Sybil attack protection in mind. SumUp[19] and DSybil[20] are Sybil-resistant algorithms for online content recommendation and voting. Whānau is a Sybil-resistant distributed hash table algorithm.[21] I2P's implementation of Kademlia also has provisions to mitigate Sybil attacks.[22] See also[edit] Astroturfing Ballot stuffing Social bot Sockpuppetry References[edit] ^ Lynn Neary (20 October 2011). Real 'Sybil' Admits Multiple Personalities Were Fake. NPR. Retrieved 8 February 2017. ^ Douceur, John R (2002). "The Sybil Attack". Peer-to-Peer Systems. Lecture Notes in Computer Science. 2429. pp. 251–60. doi:10.1007/3-540-45748-8_24. ISBN 978-3-540-44179-3. ^ Oram, Andrew. Peer-to-peer: harnessing the benefits of a disruptive technology. ^ Trifa, Zied; Khemakhem, Maher (2014). "Sybil Nodes as a Mitigation Strategy Against Sybil Attack". Procedia Computer Science. 32: 1135–40. doi:10.1016/j.procs.2014.05.544. ^ Wang, Liang; Kangasharju, Jussi (2012). "Real-world sybil attacks in BitTorrent mainline DHT". 2012 IEEE Global Communications Conference (GLOBECOM). pp. 826–32. doi:10.1109/GLOCOM.2012.6503215. ISBN 978-1-4673-0921-9. ^ Wang, Liang; Kangasharju, Jussi (2013). "Measuring large-scale distributed systems: case of BitTorrent Mainline DHT". IEEE P2P 2013 Proceedings. pp. 1–10. doi:10.1109/P2P.2013.6688697. ISBN 978-1-4799-0515-7. ^ (30 July 2014). Tor security advisory: "relay early" traffic confirmation attack. ^ Dan Goodin (31 July 2014). Active attack on Tor network tried to decloak users for five months. ^ John Maheswaran, Daniel Jackowitz, Ennan Zhai, David Isaac Wolinsky, and Bryan Ford (9 March 2016). Building Privacy-Preserving Cryptographic Credentials from Federated Online Identities (PDF). 6th ACM Conference on Data and Application Security and Privacy (CODASPY).CS1 maint: uses authors parameter (link) ^ Yu, Haifeng; Kaminsky, Michael; Gibbons, Phillip B; Flaxman, Abraham (2006). SybilGuard: defending against sybil attacks via social networks. 2006 conference on Applications, technologies, architectures, and protocols for computer communications - SIGCOMM '06. pp. 267–78. doi:10.1145/1159913.1159945. ISBN 978-1-59593-308-9. ^ SybilLimit: A Near-Optimal Social Network Defense against Sybil Attacks. IEEE Symposium on Security and Privacy. 19 May 2008. ^ O'Whielacronx, Zooko. "Levien's attack-resistant trust metric". . gmane.org. Retrieved 10 February 2012. CS1 maint: discouraged parameter (link) ^ Kurve, Aditya; Kesidis, George (2011). "Sybil Detection via Distributed Sparse Cut Monitoring". 2011 IEEE International Conference on Communications (ICC). pp. 1–6. doi:10.1109/icc.2011.5963402. 
ISBN 978-1-61284-232-5. ^ Bimal Viswanath, Ansley Post, Krishna Phani Gummadi, and Alan E Mislove (August 2010). "An analysis of social network-based Sybil defenses". ACM SIGCOMM Computer Communication Review. doi:10.1145/1851275.1851226.CS1 maint: uses authors parameter (link) ^ Ford, Bryan; Strauss, Jacob (1 April 2008). An Offline Foundation for Online Accountable Pseudonyms. 1st Workshop on Social Network Systems - SocialNets '08. pp. 31–6. doi:10.1145/1435497.1435503. ISBN 978-1-60558-124-8. ^ Maria Borge, Eleftherios Kokoris-Kogias, Philipp Jovanovic, Linus Gasser, Nicolas Gailly, Bryan Ford (29 April 2017). Proof-of-Personhood: Redemocratizing Permissionless Cryptocurrencies. IEEE Security & Privacy on the Blockchain (IEEE S&B).CS1 maint: uses authors parameter (link) ^ Ford, Bryan (December 2020). "Technologizing Democracy or Democratizing Technology? A Layered-Architecture Perspective on Potentials and Challenges". In Lucy Bernholz; Hélène Landemore; Rob Reich (eds.). Digital Technology and Democratic Theory. University of Chicago Press. ISBN 9780226748573. ^ Divya Siddarth, Sergey Ivliev, Santiago Siri, Paula Berman (13 October 2020). "Who Watches the Watchmen? A Review of Subjective Approaches for Sybil-resistance in Proof of Personhood Protocols". arXiv:2008.05300.CS1 maint: uses authors parameter (link) ^ Nguyen Tran, Bonan Min, Jinyang Li, and Lakshminarayanan Subramanian (22 April 2009). Sybil-Resilient Online Content Voting (PDF). NSDI ’09: 6th USENIX Symposium on Networked Systems Design and Implementation.CS1 maint: uses authors parameter (link) ^ Haifeng Yu, Chenwei Shi, Michael Kaminsky, Phillip B. Gibbons, and Feng Xiao (19 May 2009). DSybil: Optimal Sybil-Resistance for Recommendation Systems. 30th IEEE Symposium on Security and Privacy.CS1 maint: uses authors parameter (link) ^ Chris Lesniewski-Laas and M. Frans Kaashoek (28 April 2010). Whānau: A Sybil-proof Distributed Hash Table (PDF). 7th USENIX Symposium on Network Systems Design and Implementation (NSDI).CS1 maint: uses authors parameter (link) ^ "The Network Database - I2P". External links[edit] Querci, Daniele; Hailes, Stephen (2010). "Sybil Attacks Against Mobile Users: Friends and Foes to the Rescue". 2010 Proceedings IEEE INFOCOM. pp. 1–5. CiteSeerX 10.1.1.360.8730. doi:10.1109/INFCOM.2010.5462218. ISBN 978-1-4244-5836-3. Bazzi, Rida A; Konjevod, Goran (2006). "On the establishment of distinct identities in overlay networks". Distributed Computing. 19 (4): 267–87. doi:10.1007/s00446-006-0012-y. Lesniewski-Laas, Chris (2008). "A Sybil-proof one-hop DHT". Proceedings of the 1st workshop on Social network systems - SocialNets '08. pp. 19–24. doi:10.1145/1435497.1435501. ISBN 978-1-60558-124-8. Newsome, James; Shi, Elaine; Song, Dawn; Perrig, Adrian (2004). "The sybil attack in sensor networks". Proceedings of the third international symposium on Information processing in sensor networks - IPSN'04. pp. 259–68. doi:10.1145/984622.984660. ISBN 978-1581138467. A Survey of Solutions to the Sybil Attack On Network formation: Sybil attacks and Reputation systems Seigneur, Jean-Marc; Gray, Alan; Jensen, Christian Damsgaard (2005). "Trust Transfer: Encouraging Self-recommendations Without Sybil Attack". Trust Management. Lecture Notes in Computer Science. 3477. pp. 321–37. CiteSeerX 10.1.1.391.5003. doi:10.1007/11429760_22. ISBN 978-3-540-26042-4. A Survey of DHT Security Techniques by Guido Urdaneta, Guillaume Pierre and Maarten van Steen. ACM Computing surveys, 2009. 
An experiment on the weakness of reputation algorithms used in professional social networks: the case of Naymz by Marco Lazzari. Proceedings of the IADIS International Conference e-Society 2010. Retrieved from "https://en.wikipedia.org/w/index.php?title=Sybil_attack&oldid=1000481849" Categories: Computer network security Reputation management Hidden categories: CS1 maint: uses authors parameter CS1 maint: discouraged parameter Articles with short description Short description matches Wikidata Articles containing potentially dated statements from 2012 All articles containing potentially dated statements Use dmy dates from April 2011 Navigation menu Personal tools Not logged in Talk Contributions Create account Log in Namespaces Article Talk Variants Views Read Edit View history More Search Navigation Main page Contents Current events Random article About Wikipedia Contact us Donate Contribute Help Learn to edit Community portal Recent changes Upload file Tools What links here Related changes Upload file Special pages Permanent link Page information Cite this page Wikidata item Print/export Download as PDF Printable version Languages Deutsch Español فارسی Français Italiano Português Русский Українська Edit links This page was last edited on 15 January 2021, at 08:15 (UTC). Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization. Privacy policy About Wikipedia Disclaimers Contact Wikipedia Mobile view Developers Statistics Cookie statement en-wikipedia-org-9317 ---- Fear of missing out - Wikipedia Fear of missing out From Wikipedia, the free encyclopedia Jump to navigation Jump to search "FOMO" redirects here. For the album by Liam Finn, see FOMO (album). type of social anxiety Smartphones enable people to remain in contact with their social and professional network continuously. This may result in compulsive checking for status updates and messages, for fear of missing an opportunity.[1] Fear of missing out (FOMO) is a social anxiety[2] stemming from the belief that others might be having fun while the person experiencing the anxiety is not present. It is characterized by a desire to stay continually connected with what others are doing.[3] FOMO is also defined as a fear of regret,[4] which may lead to concerns that one might miss an opportunity for social interaction, a novel experience or a profitable investment.[5] It is the fear that deciding not to participate is the wrong choice.[4][6] Social networking creates many opportunities for FOMO. While it provides opportunities for social engagement,[3] it offers an endless stream of activities in which any given person is not involved. Psychological dependence on social networks can result in anxiety and can lead to FOMO[7] or even pathological internet use.[8] FOMO is claimed to negatively influence psychological health and well-being.[4] Contents 1 History 2 Definition 3 Effects 4 Causes 5 Marketing technique 6 See also 7 References History[edit] The phenomenon was first identified in 1996 by marketing strategist Dr. Dan Herman, who conducted research for Adam Bellouch and published the first academic paper on the topic in 2000 in The Journal of Brand Management.[9] Author Patrick J. McGinnis coined the term FOMO[10] and popularized it in a 2004 op-ed in The Harbus, the magazine of Harvard Business School. 
The article was titled McGinnis' Two FOs: Social Theory at HBS, and also referred to another related condition, Fear of a Better Option (FoBO), and their role in the school's social life.[11][12][13] The origin of FOMO has also been traced to the 2004 Harbus article by academic Joseph Reagle.[14] Definition[edit] FOMO refers to the apprehension that one is either not in the know or missing out on information, events, experiences, or decisions that could make one's life better.[3] Those affected by it may not know exactly what they are missing but may still worry that others are having a much better time or doing something better than they are, without them.[2] FOMO could result from not knowing about a conversation,[15] missing a T.V. show, not attending a wedding or party,[16] or hearing that others have discovered a new restaurant.[17] Within video games, FOMO is also used to describe the similar anxiety around missing the ability to obtain in-game items or complete activities that are only available for a limited time.[18] Effects[edit] A study by JWTIntelligence suggests that FOMO can influence the formation of long-term goals and self-perceptions.[2] In this study, around half of the respondents stated that they are overwhelmed by the amount of information needed to stay up-to-date, and that it is impossible to not miss out on something. The process of relative deprivation creates FOMO and dissatisfaction. It reduces psychological well-being.[3][4][19] FOMO led to negative social and emotional experiences, such as boredom and loneliness.[20] A 2013 study found that it negatively impacts mood and life satisfaction,[3] reduces self-esteem, and affects mindfulness.[21] According to John M. Grohol, founder and Editor-in-Chief of Psych Central, FOMO may lead to a constant search for new connections with others, abandoning current connections to do so. Moreover, the desire to stay in touch may endanger personal safety, e.g., while driving.[22] A 2019 University of Glasgow study surveyed 467 adolescents, and found that the respondents felt societal pressure to always be available.[23] FOMO-sufferers may increasingly seek access to others' social lives, and consume an escalating amount of real-time information.[24] Causes[edit] FOMO arises from situational or long-term deficits in psychological needs satisfaction, which are not a new phenomenon.[3] Before the Internet, a related phenomenon, "keeping up with the Jones'", was widely experienced. FOMO generalized and intensified this experience because so much more of people's lives became publicly documented and easily accessed. Further, a common tendency is to post about positive experiences (that great restaurant) rather than negative ones (bad first date). Self-determination theory contends that an individual's psychological satisfaction in their competence, autonomy, and relatedness consists of three basic psychological needs for human beings.[25] Test subjects with lower levels of basic psychological satisfaction reported a higher level of FOMO. Basic psychological satisfaction and FOMO were positively correlated.[3] Four in ten young people reported FOMO sometimes or often.[2] FOMO was found to be negatively correlated with age, and men were more likely than women to report It.[3] Social media platforms that are associated with FOMO include Snapchat,[26] Facebook,[27] Instagram,[28] and Twitter. Marketing technique[edit] Advertising and marketing campaigns may seek to intensify FOMO within a marketing strategy. 
Examples include AT&T's "Don't be left behind" campaign, Duracell's Powermat "Stay in charge" campaign and Heineken's "Sunrise" campaign.[2] The "Sunrise" campaign, in particular, aimed to encourage responsible drinking by portraying excessive drinking as a way to miss the best parts of a party, rather than claiming that excessive drinking is a risk to personal health. Other brands attempt counter FOMO, such as Nescafé's "Wake up to life" campaign.[2] Harnessing TV viewers' FOMO is also perceived to foster higher broadcast ratings. Real-time updates about status and major social events allow for a more engaging media consumption experience and faster dissemination of information.[2] Real-time tweets about the Super Bowl are considered to be correlated with higher TV ratings due to their appeal to FOMO and the prevalence of social media usage.[2] See also[edit] Hyperbolic discounting Kiasu Loss aversion Irrational exuberance Missed connections Murray's system of needs Opportunity cost Relative deprivation Self-determination theory Social media Status anxiety Social proof References[edit] ^ Anderson, Hephzibah (16 April 2011). "Never heard of Fomo? You're so missing out". The Guardian. Retrieved 6 June 2017. ^ a b c d e f g h "Fear of Missing Out (FOMO)" (PDF). J. Walter Thompson. March 2012. Archived from the original (PDF) on June 26, 2015. ^ a b c d e f g h Przybylski, Andrew K.; Murayama, Kou; DeHaan, Cody R.; Gladwell, Valerie (July 2013). "Motivational, emotional, and behavioral correlates of fear of missing out". Computers in Human Behavior. 29 (4): 1841–1848. doi:10.1016/j.chb.2013.02.014. ^ a b c d Wortham, J. (April 10, 2011). "Feel like a wall flower? Maybe it's your Facebook wall". The New York Times. ^ Shea, Michael (27 July 2015). "Living with FOMO". The Skinny. Retrieved 9 January 2016. ^ Alt, Dorit; Boniel-Nissim, Meyran (2018-06-20). "Parent–Adolescent Communication and Problematic Internet Use: The Mediating Role of Fear of Missing Out (FoMO)". Journal of Family Issues. 39 (13): 3391–3409. doi:10.1177/0192513x18783493. ISSN 0192-513X. S2CID 149746950. ^ Jonathan K. J. (1998). "Internet Addiction on Campus: The Vulnerability of College Students". CyberPsychology & Behavior. 1 (1): 11–17. doi:10.1089/cpb.1998.1.11. Archived from the original on 2014-05-13. ^ Song, Indeok; Larose, Robert; Eastin, Matthew S.; Lin, Carolyn A. (September 2004). "Internet Gratifications and Internet Addiction: On the Uses and Abuses of New Media". CyberPsychology & Behavior. 7 (4): 384–394. doi:10.1089/cpb.2004.7.384. PMID 15331025. ^ Herman, Dan (2000-05-01). "Introducing short-term brands: A new branding tool for a new consumer reality". Journal of Brand Management. 7 (5): 330–340. doi:10.1057/bm.2000.23. ISSN 1350-231X. S2CID 167311741. ^ Kozodoy, Peter (2017-10-09). "The Inventor of FOMO is Warning Leaders About a New, More Dangerous Threat". Inc.com. Retrieved 2017-10-10. ^ "Social Theory at HBS: McGinnis' Two FOs". The Harbus. 10 May 2004. Retrieved 30 March 2017. ^ Schreckinger, Ben (29 July 2014). "The Home of FOMO". Boston. Retrieved 30 March 2017. ^ Blair, Linda (6 October 2017). "How to beat 'fear of missing out' as the growth of social media sites feeds the trend - Independent.ie". Independent.ie. Retrieved 2017-10-10. ^ "FOMO's etymology". reagle.org. Retrieved 2017-10-10. ^ Tait, Amelia (2018-10-11). "Why do we experience the curse of conversation envy?". Metro. Retrieved 2020-05-31. ^ "Why FOMO at uni is totally OK to feel". Debut. 2016-10-11. Retrieved 2020-05-31. ^ Delmar, Niamh. 
"FOMO: Are you afraid of missing out?". The Irish Times. Retrieved 2020-05-31. ^ Close, James; Lloyd, Joanne (2021). Lifting the Lid on Loot-Boxes (PDF) (Report). GambleAware. Retrieved 2 April 2021. CS1 maint: discouraged parameter (link) ^ Morford, M. (August 4, 2010). "Oh my god you are so missing out". San Francisco Chronicle. ^ Burke, M.; Marlow, C. & Lento, T. (2010). Social network activity and social well-being. Postgraduate Medical Journal. 85. pp. 455–459. CiteSeerX 10.1.1.184.2702. doi:10.1145/1753326.1753613. ISBN 9781605589299. S2CID 207178564. ^ "The FoMo Health Factor". Psychology Today. Retrieved 2020-04-09. ^ Grohol, J. (February 28, 2015). "FOMO Addiction: The Fear of Missing Out". World of Psychology. Psych Central. ^ "Woods, H. C. and Scott, H. (2016) #Sleepyteens: social media use in adolescence is associated with poor sleep quality, anxiety, depression and low self-esteem. Journal of Adolescence, 51, pp. 41-49" (PDF). University of Glasgow. Retrieved 28 May 2020. ^ Amichai-Hamburger, Y. & Ben-Artzi, E. (2003), "Loneliness and internet use", Computers in Human Behavior, 19 (1): 71–80, doi:10.1016/S0747-5632(02)00014-6 ^ Deci, E.L. & Ryan, R.M. (1985). Intrinsic motivation and self-determination in human behavior. Plenum Press. ISBN 9780306420221. ^ "Why Snapchat Is The Leading Cause Of FOMO". The Odyssey Online. 2016-03-21. Retrieved 2017-12-06. ^ Krasnova, Hanna; Widjaja, Thomas; Wenninger, Helena; Buxmann, Peter (2013). "Envy on Facebook: A Hidden Threat to Users' Life Satisfaction? - Semantic Scholar". doi:10.7892/boris.47080. S2CID 15408147. Cite journal requires |journal= (help) ^ Djisseglo, Ayoko (2019-05-05). "FOMO: An Instagram Anxiety". Medium. Retrieved 2020-05-31. v t e Conformity Enforcement Proscription Enemy of the people Enemy of the state Ostracism Blacklisting Cancel culture Censorship Outlaw Civil death Vogelfrei Public enemy Group pressure Bandwagon effect Brainwashing Collectivism Consensus reality Deplatforming Dogma Emotional contagion Behavioral Crime Hysterical Suicide Fear of missing out Groupthink Hazing Herd mentality Indoctrination Invented tradition Memory conformity Milieu control Mobbing Nationalism Normalization Normative social influence Patriotism Peer pressure Pluralistic ignorance Propaganda Rally 'round the flag effect Scapegoating Shunning Social influence Socialization Spiral of silence Teasing Tyranny of the majority Untouchability Xeer Individual pressure Authoritarianism Personality Control freak Obsessive–compulsive personality disorder Conformity Compliance Communal reinforcement Countersignaling Herd behavior Internalization Obedience Social proof Experiments Asch conformity experiments Breaching experiment Milgram experiment Stanford prison experiment Anticonformity Alternative media Anti-authoritarianism Anti-social behaviour Auto-segregation Civil disobedience Cosmopolitanism Counterculture Culture jamming Deviance Devil's advocate Dissent Eccentricity Eclecticism Hermit Idiosyncrasy Individualism Pueblo clown Rebellion Red team Satire Shock value Counterconformists Cagot Damnatio memoriae Dissident Exile Homo sacer Nonperson Outcast Persona non grata Retrieved from "https://en.wikipedia.org/w/index.php?title=Fear_of_missing_out&oldid=1019659121" Categories: Advertising Anxiety Internet culture Social media Hidden categories: CS1 maint: discouraged parameter CS1 errors: missing periodical Articles with short description Short description matches Wikidata Navigation menu Personal tools Not logged in Talk Contributions 
en-wikipedia-org-9575 ---- Filter (software) - Wikipedia Filter (software) From Wikipedia, the free encyclopedia For Internet filtering software, see Content-control software. For video filtering software, see Filter (video). For other uses, see Email filtering. A filter is a computer program or subroutine to process a stream, producing another stream. Although a single filter can be used on its own, filters are frequently strung together to form a pipeline. Some operating systems such as Unix are rich with filter programs. Windows 7 and later are also rich with filters, as they include Windows PowerShell. In comparison, only a few filters are built into cmd.exe (the original command-line interface of Windows), though most of those have significant enhancements relative to the similar filter commands that were available in MS-DOS. OS X includes filters from its underlying Unix base but also has Automator, which allows filters (known as "Actions") to be strung together to form a pipeline. Unix In Unix and Unix-like operating systems, a filter is a program that gets most of its data from its standard input (the main input stream) and writes its main results to its standard output (the main output stream). Auxiliary input may come from command-line flags or configuration files, while auxiliary output may go to standard error. The command syntax for reading data from a device or file other than standard input is the input operator (<); similarly, the syntax for sending data to a device or file other than standard output is the output operator (>). To append data lines to an existing output file, one can use the append operator (>>). Filters may be strung together into a pipeline with the pipe operator ("|"), which signifies that the main output of the command to the left is passed as main input to the command on the right.
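A brief editorial illustration of these operators (a sketch added here for clarity, not part of the original article; it assumes an ordinary POSIX shell, and the file names are placeholders):
sort < unsorted.txt > sorted.txt      # read from a file instead of standard input, write to a new file
sort < unsorted.txt >> all-sorted.txt # append the sorted lines to an existing file
sort < unsorted.txt | uniq            # pipe the sorted lines into another filter as part of a pipeline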
The Unix philosophy encourages combining small, discrete tools to accomplish larger tasks. The classic filter in Unix is Ken Thompson's grep, which Doug McIlroy cites as what "ingrained the tools outlook irrevocably" in the operating system, with later tools imitating it.[1] grep at its simplest prints any lines containing a character string to its output. The following is an example: cut -d : -f 1 /etc/passwd | grep foo This finds all registered users that have "foo" as part of their username by using the cut command to take the first field (username) of each line of the Unix system password file and passing them all as input to grep, which searches its input for lines containing the character string "foo" and prints them on its output. Common Unix filter programs are: cat, cut, grep, head, sort, uniq, and tail. Programs like awk and sed can be used to build quite complex filters because they are fully programmable. Unix filters can also be used by data scientists to get a quick overview of a file-based dataset.[2] List of Unix filter programs awk cat comm cut expand compress fold grep head less more nl perl paste pr sed sh sort split strings tail tac tee tr uniq wc zcat
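As an editorial sketch (an illustrative addition, not part of the original article): because awk and sed behave like any other filter, they can be mixed freely with the fixed-function tools above in a single pipeline. Assuming the conventional /etc/passwd layout, in which the seventh colon-separated field is the login shell, the following reports how many accounts use each shell, most common first:
# count accounts per login shell, most common first (illustrative only)
cut -d : -f 7 /etc/passwd | sort | uniq -c | sort -rn
# the same report produced with a single programmable filter plus sort
awk -F : '{ count[$7]++ } END { for (shell in count) print count[shell], shell }' /etc/passwd | sort -rn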
DOS Two standard filters from the early days of DOS-based computers are find and sort. Examples: find "keyword" < inputfilename > outputfilename sort < inputfilename > outputfilename find /v "keyword" < inputfilename | sort > outputfilename (The second example simply sorts its input; unlike find, sort does not take a search keyword.) Such filters may be used in batch files (*.bat, *.cmd etc.). For use in the same command shell environment, there are many more filters available than those built into Windows. Some of these are freeware, some shareware and some are commercial programs. A number of these mimic the function and features of the filters in Unix. Some filtering programs have a graphical user interface (GUI) to enable users to design a customized filter to suit their special data processing and/or data mining requirements. Windows Windows Command Prompt inherited MS-DOS commands, improved some and added a few. For example, Windows Server 2003 features six command-line filters for modifying Active Directory that can be chained by piping: DSAdd, DSGet, DSMod, DSMove, DSRm and DSQuery.[3] Windows PowerShell adds an entire host of filters known as "cmdlets", which can be chained together with a pipe, except a few simple ones such as Clear-Host. The following example gets a list of files in the C:\Windows folder, gets the size of each and sorts the sizes in ascending order. It shows how three filters (Get-ChildItem, ForEach-Object and Sort-Object) are chained with pipes (Sort-Object sorts in ascending order by default): Get-ChildItem C:\Windows | ForEach-Object { $_.length } | Sort-Object References ^ McIlroy, M. D. (1987). A Research Unix reader: annotated excerpts from the Programmer's Manual, 1971–1986 (PDF) (Technical report). CSTR. Bell Labs. 139. ^ Data Analysis with the Unix Shell. Archived 2016-01-22 at the Wayback Machine. Bernd Zuther, comSysto GmbH, 2013. ^ Holme, Dan; Thomas, Orin (2004). Managing and maintaining a Microsoft Windows Server 2003 environment: exam 70-290. Redmond, WA: Microsoft Press. pp. 3-17–3-26. ISBN 9780735614376. External links http://www.webopedia.com/TERM/f/filter.html erambler-co-uk-3824 ---- Collaborations Workshop 2021: talks & panel session Date: 2021-04-05 Series: Collaborations Workshop 2021 Tags: [Technology] [Conference] [SSI] [Research] [Disability] [Equality, diversity & inclusion] Series: This post is part of a series on the SSI Collaborations Workshop in 2021. Collaborations Workshop 2021: collaborative ideas & hackday > Collaborations Workshop 2021: talks & panel session < Contents: Provocations; FAIR Research Software; Equality, Diversity & Inclusion: how to go about it; Equality, Diversity & Inclusion: disability issues; Lightning talks; Data & metadata; Learning & teaching/community; Wrapping up. I've just finished attending (online) the three days of this year's SSI Collaborations Workshop (CW for short), and once again it's been a brilliant experience, as well as mentally exhausting, so I thought I'd better get a summary down while it's still fresh in my mind. Collaborations Workshop is, as the name suggests, much more focused on facilitating collaborations than a typical conference, and has settled into a structure that starts off with longer keynotes and lectures, and progressively gets more interactive, culminating with a hack day on the third day. That's a lot to write about, so for this post I'll focus on the talks and panel session, and follow up with another post about the collaborative bits. I'll also probably need to come back and add in more links to bits and pieces once slides and the "official" summary of the event become available. Updates 2021-04-07: Added links to recordings of keynotes and panel sessions. Provocations The first day began with two keynotes on this year's main themes: FAIR Research Software and Diversity & Inclusion, and day 2 had a great panel session focused on disability.
All three were streamed live and the recordings remain available on YouTube: View the keynotes recording; Google-free alternative link View the panel session recording; Google-free alternative link FAIR Research Software Dr Michelle Barker, Director of the Research Software Alliance, spoke on the challenges to recognition of software as part of the scholarly record: software is not often cited. The FAIR4RS working group has been set up to investigate and create guidance on how the FAIR Principles for data can be adapted to research software as well; as they stand, the Principles are not ideally suited to software. This work will only be the beginning though, as we will also need metrics, training, career paths and much more. ReSA itself has 3 focus areas: people, policy and infrastructure. If you're interested in getting more involved in this, you can join the ReSA email list. Equality, Diversity & Inclusion: how to go about it Dr Chonnettia Jones, Vice President of Research, Michael Smith Foundation for Health Research, spoke extensively and persuasively on the need for Equality, Diversity & Inclusion (EDI) initiatives within research, as there is abundant robust evidence that all research outcomes are improved. She highlighted the difficulties current approaches to EDI have in effecting structural change and in changing not just individual behaviours but the cultures & practices that perpetuate inequity. While initiatives are often constructed around making up for individual deficits, a better framing is to start from an understanding of individuals having equal stature but different lived experiences. Commenting on the current focus on "research excellence", she pointed out that the hyper-competition this promotes is deeply unhealthy, suggesting instead that true excellence requires diversity, and we should focus on an inclusive excellence driven by inclusive leadership. Equality, Diversity & Inclusion: disability issues Day 2's EDI panel session brought together five disabled academics to discuss the problems of disability in research. Dr Becca Wilson, UKRI Innovation Fellow, Institute of Population Health Science, University of Liverpool (Chair) Phoenix C S Andrews (PhD Student, Information Studies, University of Sheffield and Freelance Writer) Dr Ella Gale (Research Associate and Machine Learning Subject Specialist, School of Chemistry, University of Bristol) Prof Robert Stevens (Professor and Head of Department of Computer Science, University of Manchester) Dr Robin Wilson (Freelance Data Scientist and SSI Fellow) NB. The discussion flowed quite freely, so the following summary mixes up input from all the panel members. Researchers are often assumed to be single-minded in following their research calling, and aptness for jobs is often partly judged on "time served", which disadvantages any disabled person who has been forced to take a career break. On top of this, disabled people are often time-poor because of the extra time needed to manage their condition, leaving them with less "output" to show for their time served on many common metrics. This can particularly affect early-career researchers, since resources for these are often restricted on a "years-since-PhD" criterion. Time poverty also makes funding with short deadlines that much harder to apply for.
Employers add more demands right from the start: new starters are typically expected to complete a health and safety form, generally a brief affair that will suddenly become an 80-page bureaucratic nightmare if you tick the box declaring a disability. Many employers claim to be inclusive yet utterly fail to understand the needs of their disabled staff. Wheelchairs are liberating for those who use them (despite the awful but common phrase “wheelchair-bound”) and yet employers will refuse to insure a wheelchair while travelling for work, classifying it as a “high value personal item” that the owner would take the same responsibility for as an expensive camera. Computers open up the world for blind people in a way that was never possible without them, but it’s not unusual for mandatory training to be inaccessible to screen readers. Some of these barriers can be overcome, but doing so takes yet more time that could and should be spent on more important work. What can we do about it? Academia works on patronage whether we like it or not, so be the person who supports people who are different to you rather than focusing on the one you “recognise yourself in” to mentor. As a manager, it’s important to ask each individual what they need and believe them: they are the expert in their own condition and their lived experience of it. Don’t assume that because someone else in your organisation with the same disability needs one set of accommodations, it’s invalid for your staff member to require something totally different. And remember: disability is unusual as a protected characteristic in that anyone can acquire it at any time without warning! Lightning talks Lightning talk sessions are always tricky to summarise, and while this doesn’t do them justice, here are a few highlights from my notes. Data & metadata Malin Sandstrom talked about a much-needed refinement of contributor role taxonomies for scientific computing Stephan Druskat showcased a project to crowdsource a corpus of research software for further analysis Learning & teaching/community Matthew Bluteau introduced the concept of the “coding dojo” as a way to enhance community of practice. A group of coders got together to practice & learn by working together to solve a problem and explaining their work as they go He described 2 models: a code jam, where people work in small groups, and the Randori method, where 2 people do pair programming while the rest observe. I’m excited to try this out! Steve Crouch talked about intermediate skills and helping people take the next step, which I’m also very interested in with the GLAM Data Science network Esther Plomp recounted experience of running multiple Carpentry workshops online, while Diego Alonso Alvarez discussed planned workshops on making research software more usable with GUIs Shoaib Sufi showcased the SSI’s new event organising guide Caroline Jay reported on a diary study into autonomy & agency in RSE during COVID Lopez, T., Jay, C., Wermelinger, M., & Sharp, H. (2021). How has the covid-19 pandemic affected working conditions for research software engineers? Unpublished manuscript. Wrapping up That’s not everything! But this post is getting pretty long so I’ll wrap up for now. I’ll try to follow up soon with a summary of the “collaborative” part of Collaborations Workshop: the idea-generating sessions and hackday! 
Comments You can comment on this post, "Collaborations Workshop 2021: talks & panel session", by: Replying to its tweet on Twitter or its toot on Mastodon Sending a Webmention from your own site to https://erambler.co.uk/blog/collabw21-part-1/ Using this button: Comments & reactions haven't loaded yet. You might have JavaScript disabled but that's cool 😎. me elsewhere :: keyoxide | keybase | mastodon | matrix | twitter | github | gitlab | orcid | pypi | linkedin © 2021 Jez Cope | Built by: Hugo | Theme: Mnemosyne Build status: Except where noted, this work is licensed under a Creative Commons Attribution 4.0 International License. erambler-co-uk-4392 ---- Beginner's guide to Twitter Part I: messages, followers and searching | eRambler eRambler Jez Cope's blog on becoming a research technologist Home About Blogroll Please note: this older content has been archived and is no longer fully linked into the site. Please go to the current home page for up-to-date content. Beginner's guide to Twitter Part I: messages, followers and searching Sunday 15 March 2009 Tagged with Howto Message Social media Social networking Tutorial Tweet Twitter Web 2.0 Twitter home page I’ve recently signed up to Twitter. It’s not a new thing; it’s been around for a few years and it’s probably safe to say that I’m way behind the curve on this one. For those who haven’t come across it yet, it’s a very, very simple social networking site which allows you to broadcast 140-character messages. However, in spite of this simplicity, it’s a very powerful tool, and can be quite off-putting for new users. Since I’m a bit techie and tend to pick these things up quite quickly, a few friends have suggested that I lay down some words on how to get to grips with Twitter. I’ve ended up breaking it into three to make it a bit more digestible: Twitter basics: messages, followers and searching; Confusing conventions: @s, #s and RTs; Useful tools to make your Twittering life easier. I’ll spread them out by publishing them over a period of three days. So, without further ado, here’s the first part of my guide to making this very cool tool work for you. How does it work? When I said it was simple, I wasn’t kidding. Once you’ve signed up on the Twitter website, you do one of three things: send and receive messages, follow people (more on what this means in a bit), or search through the archive of old messages. That’s it. Let’s have a look at those components in more detail. Messages The core of Twitter is the status update or tweet; that’s a brief message, broadcast to every other user, taking up no more than 140 characters (letters, digits, punctuation, spaces). By and large, this will be some form of answer to the question “What are you doing?” You can send as many of these as you like, whenever you like. You can even split a longer message across several tweets (manually), but if you need to do this, you might want to question whether another medium might be more appropriate. You can also send direct messages to specific users: these are completely private one-to-one communications. If you’re having a conversation publicly with another user and it’s starting to ramble on, think about switching to direct messages to avoid subjecting everyone else to a conversation that doesn’t concern them. You can only send direct messages to users who are following you: more on what this means next. Followers Wading through the tweets of every other twitterer on the planet is going to take some time. The answer to this problem is ‘following’. 
You’ll notice that, to begin with, your home page shows only your own tweets. No, Twitter isn’t broken: this page will only show the tweets of people you’re following. This hands control over what you read back to you: you don’t have to follow anyone you don’t want to. I can’t emphasise enough how important this is: don’t follow anyone whose tweets aren’t worth reading. By all means follow someone for a while before you make this decision, and change your mind all you want. Just remember that if you’re not interested in updates on userxyz’s cat at 90-second intervals, no-one says you have to follow them. Follow button You can follow someone by visiting their profile page, which will have the form “http://twitter.com/username”. This page lists their most recent tweets, newest first. Right at the top, underneath their picture, there’s a button marked “Follow”: click this and it’ll change to a message telling you that you’re now following them. To stop following someone, click this message and it’ll reveal a “Remove” button for you to press. Twitter will send them an email when you start following them, but not when you stop. Following info On the left of your home page, there are links entitled “Following” and “Followers” which take you to a list of people you follow and people who follow you, respectively. On your followers list, you’ll see a tick next to anyone you’re also following, and a follow button next to anyone you’re not. Following people who follow you is good for at least three reasons: It allows you to hold a conversation, and to receive direct messages from them; It's a great way to build your network; It's considered polite. That said, my previous advice still stands: you don’t have to follow anyone you don’t want to. So how do you find people to follow? You’ve got a few options here. The best way to get started is to follow people you know in real life: try searching for them. As I’ve already mentioned you can follow people who follow you. You can wade through the global list of tweets and follow people with similar interests (searching will help here: see the next section). You could have a look at the we follow directory to find people. Finally, you can explore your network by looking at your followers’ followers and so on. It’s worth reiterating at this point that all your tweets are visible, ultimately, to anyone on the network. If you’re not happy with this, you can restrict access, which means that only your followers can read your tweets. It’ll also mean that you have to give your approval before someone can follow you. This might work for you, but openness has it’s benefits: you’ll find it a lot more people will follow you if you keep your account open. You’ll get a lot more out of Twitter if you stay open and simply avoid saying anything that you don’t want the whole world to know. Search So, you’ve got to grips with sending and reading tweets, you’ve chosen a few people to follow and started to join in the global conversation that is Twitter. You’re already getting a lot out of this great tool. But what about all the tweets you’re missing? Perhaps you represent a company and want to know who’s talking about your brand. Maybe you’re going to attend a conference and want to connect with other delegates. Maybe you just want the answer to a question and want to see if someone’s already mentioned it. For these, and many more, problems, Twitter search is the answer. 
Try searching for a brand, a conference or anything else you’re interested in, and you’ll quickly and easily discover what twitterers the world over are saying about it. You might even want to follow some of them. Well, that’s it for today. Tomorrow I’ll be looking at some of the initially confusing but massively useful conventions that have grown up within Twitter: @replies, #hashtags and retweeting. Did you find this post useful? Is there something I’ve totally missed that you think should really be in there? Perhaps you just think I’m great (well, it might happen). I want to bring you really high quality stuff, and the only way I do that is if you (yes, you with the web browser) tell me how I’m doing. Please leave a comment below or link to me from your own blog (that’ll appear here as a comment too, with a link back to you: free publicity!). I’ll do my best to respond to feedback, correct inaccuracies in the text and write more about things that interest both me and you. Finally, if you find this post useful please tell your friends and colleagues. Thanks for stopping by! Hi, I’m Jez Cope and this is my blog, where I talk about technology in research and higher education, including: Research data management; e-Research; Learning; Teaching; Educational technology. Me elsewhere Twitter github LinkedIn Diigo Zotero Google+ eRambler by Jez Cope is licensed under a Creative Commons Attribution-ShareAlike 4.0 International license erambler-co-uk-5000 ---- eRambler eRambler home about series tags talks rdm resources a blog about research communication & higher education & open culture & technology & making & librarianship & stuff Intro to the fediverse Date: 2021-04-11 Tags: [Fediverse] [Social media] [Twitter] Wow, it turns out to be 10 years since I wrote this beginners guide to Twitter. Things have moved on a loooooong way since then. Far from being the interesting, disruptive technology it was back then, Twitter has become part of the mainstream, the establishment. Almost everyone and everything is on Twitter now, which has both pros and cons. So what’s the problem? It’s now possible to follow all sorts of useful information feeds, from live updates on transport delays to your favourite sports team’s play-by-play performance to an almost infinite number of cat pictures. Read more... Collaborations Workshop 2021: collaborative ideas & hackday Date: 2021-04-07 Series: Collaborations Workshop 2021 Tags: [Technology] [Conference] [SSI] [Research] [Disability] [Equality, diversity & inclusion] My last post covered the more “traditional” lectures-and-panel-sessions approach of the first half of the SSI Collaborations Workshop. The rest of the workshop was much more interactive, consisting of a discussion session, a Collaborative Ideas session, and a whole-day hackathon! The discussion session on day one had us choose a topic (from a list of topics proposed leading up to the workshop) and join a breakout room for that topic with the aim of producing a “speed blog” by then end of 90 minutes. Read more... Collaborations Workshop 2021: talks & panel session Date: 2021-04-05 Series: Collaborations Workshop 2021 Tags: [Technology] [Conference] [SSI] [Research] [Disability] [Equality, diversity & inclusion] I’ve just finished attending (online) the three days of this year’s SSI Collaborations Workshop (CW for short), and once again it’s been a brilliant experience, as well as mentally exhausting, so I thought I’d better get a summary down while it’s still fresh it my mind. 
Collaborations Workshop is, as the name suggests, much more focused on facilitating collaborations than a typical conference, and has settled into a structure that starts off with with longer keynotes and lectures, and progressively gets more interactive culminating with a hack day on the third day. Read more... Date: 2021-04-03 Tags: [Meta] [Design] I’ve decided to try switching this website back to using Hugo to manage the content and generate the static HTML pages. I’ve been on the Python-based Nikola for a few years now, but recently I’ve been finding it quite slow, and very confusing to understand how to do certain things. I used Hugo recently for the GLAM Data Science Network website and found it had come on a lot since the last time I was using it, so I thought I’d give it another go, and redesign this site to be a bit more minimal at the same time. The theme is still a work in progress so it’ll probably look a bit rough around the edges for a while, but I think I’m happy enough to publish it now. When I get round to it I might publish some more detailed thoughts on the design. Ideas for Accessible Communications Date: 2021-03-20 Tags: [Stuff] [Accessibility] [Ablism] The Disability Support Network at work recently ran a survey on “accessible communications”, to develop guidance on how to make communications (especially internal staff comms) more accessible to everyone. I grabbed a copy of my submission because I thought it would be useful to share more widely, so here it is. Please note that these are based on my own experiences only. I am in no way suggesting that these are the only things you would need to do to ensure your communications are fully accessible. Read more... Matrix self-hosting Date: 2021-03-12 Tags: [Technology] [Matrix] [Communication] [Self-hosting] [DWeb] I started running my own Matrix server a little while ago. Matrix is something rather cool, a chat system similar to IRC or Slack, but open and federated. Open in that the standard is available for anyone to view, but also the reference implementations of server and client are open source, along with many other clients and a couple of nascent alternative servers. Federated in that, like email, it doesn’t matter what server you sign up with, you can talk to users on your own or any other server. Read more... What do you miss least about pre-lockdown life? Date: 2021-02-26 Tags: [Stuff] [Reflection] [Pandemic] @JanetHughes on Twitter: What do you miss the least from pre-lockdown life? I absolutely do not miss wandering around the office looking for a meeting room for a confidential call or if I hadn’t managed to book a room in advance. Let’s never return to that joyless frustration, hey? 10:27 AM · Feb 3, 2021 After seeing Terence Eden taking Janet Hughes' tweet from earlier this month as a writing prompt, I thought I might do the same. Read more... Remarkable blogging Date: 2021-02-06 Tags: [Technology] [Writing] [Gadgets] And the handwritten blog saga continues, as I’ve just received my new reMarkable 2 tablet, which is designed for reading, writing and nothing else. It uses a super-responsive e-ink display and writing on it with a stylus is a dream. It has a slightly rough texture with just a bit of friction that makes my writing come out a lot more legibly than on a slippery glass touchscreen. If that was all there was to it, I might not have wasted my money, but it turns out that it runs on Linux and the makers have wisely decided not to lock it down but to give you full root mess. Read more... 
GLAM Data Science Network fellow travellers Date: 2021-02-03 Series: GLAM Data Science Network Tags: [Data science] [GLAM] [Librarianship] [Humanities] [Cultural heritage] Updates 2021-02-04 Thanks to Gene @dzshuniper@ausglam.space for suggesting ADHO and a better attribution for the opening quote (see comments below for details) See comments & webmentions for details. “If you want to go fast, go alone. If you want to go far, go together.” — African proverb, probably popularised in English by Kenyan church leader Rev. Samuel Kobia (original) This quote is a popular one in the Carpentries community, and I interpret it in this context to mean that a group of people working together is more sustainable than individuals pursuing the same goal independently. Read more... Date: 2021-01-26 Tags: [Font] [Writing] [Stuff] I’ve updated my blog theme to use the quasi-proportional fonts Iosevka Aile and Iosevka Etoile. I really like the aesthetic, as they look like fixed-width console fonts (I use the true fixed-width version of Iosevka in my terminal and text editor) but they’re actually proportional which makes them easier to read. https://typeof.net/Iosevka/ 1 of 9 Next Page me elsewhere :: keyoxide | keybase | mastodon | matrix | twitter | github | gitlab | orcid | pypi | linkedin © 2021 Jez Cope | Built by: Hugo | Theme: Mnemosyne Build status: Except where noted, this work is licensed under a Creative Commons Attribution 4.0 International License. erambler-co-uk-6760 ---- Collaborations Workshop 2021: collaborative ideas & hackday eRambler home about series tags talks rdm resources Collaborations Workshop 2021: collaborative ideas & hackday Date: 2021-04-07 Series: Collaborations Workshop 2021 Tags: [Technology] [Conference] [SSI] [Research] [Disability] [Equality, diversity & inclusion] Series This post is part of a series on the SSI Collaborations Workshop in 2021. > Collaborations Workshop 2021: collaborative ideas & hackday < Collaborations Workshop 2021: talks & panel session My last post covered the more “traditional” lectures-and-panel-sessions approach of the first half of the SSI Collaborations Workshop. The rest of the workshop was much more interactive, consisting of a discussion session, a Collaborative Ideas session, and a whole-day hackathon! The discussion session on day one had us choose a topic (from a list of topics proposed leading up to the workshop) and join a breakout room for that topic with the aim of producing a “speed blog” by then end of 90 minutes. Those speed blogs will be published on the SSI blog over the coming weeks, so I won’t go into that in more detail. The Collaborative Ideas session is a way of generating hackday ideas, by putting people together at random into small groups to each raise a topic of interest to them before discussing and coming up with a combined idea for a hackday project. Because of the serendipitous nature of the groupings, it’s a really good way of generating new ideas from unexpected combinations of individual interests. After that, all the ideas from the session, along with a few others proposed by various participants, were pitched as ideas for the hackday and people started to form teams. Not every idea pitched gets worked on during the hackday, but in the end 9 teams of roughly equal size formed to spend the third day working together. My team’s project: “AHA! 
An Arts & Humanities Adventure” There’s a lot of FOMO around choosing which team to join for an event like this: there were so many good ideas and I wanted to work on several of them! In the end I settled on a team developing an escape room concept to help Arts & Humanities scholars understand the benefits of working with research software engineers for their research. Five of us rapidly mapped out an example storyline for an escape room, got a website set up with GitHub and populated it with the first few stages of the game. We decided to focus on a story that would help the reader get to grips with what an API is and I’m amazed how much we managed to get done in less than a day’s work! You can try playing through the escape room (so far) yourself on the web, or take a look at the GitHub repository, which contains the source of the website along with a list of outstanding tasks to work on if you’re interested in contributing. I’m not sure yet whether this project has enough momentum to keep going, but it was a really valuable way both of getting to know and building trust with some new people and demonstrating the concept is worth more work. Other projects Here’s a brief rundown of the other projects worked on by teams on the day. Coding Confessions Everyone starts somewhere and everyone cuts corners from time to time. Real developers copy and paste! Fight imposter syndrome by looking through some of these confessions or contributing your own. https://coding-confessions.github.io/ CarpenPI A template to set up a Raspberry Pi with everything you need to run a Carpentries (https://carpentries.org/) data science/software engineering workshop in a remote location without internet access. https://github.com/CarpenPi/docs/wiki Research Dugnads A guide to running an event that is a coming together of a research group or team to share knowledge, pass on skills, tidy and review code, among other software and working best practices (based on the Norwegian concept of a dugnad, a form of “voluntary work done together with other people”) https://research-dugnads.github.io/dugnads-hq/ Collaborations Workshop ideas A meta-project to collect together pitches and ideas from previous Collaborations Workshop conferences and hackdays, to analyse patterns and revisit ideas whose time might now have come. https://github.com/robintw/CW-ideas howDescribedIs Integrate existing tools to improve the machine-readable metadata attached to open research projects by integrating projects like SOMEF, codemeta.json and HowFAIRIs (https://howfairis.readthedocs.io/en/latest/index.html). Complete with CI and badges! https://github.com/KnowledgeCaptureAndDiscovery/somef-github-action Software end-of-project plans Develop a template to plan and communicate what will happen when the fixed-term project funding for your research software ends. Will maintenance continue? When will the project sunset? Who owns the IP? https://github.com/elichad/software-twilight Habeas Corpus A corpus of machine readable data about software used in COVID-19 related research, based on the CORD19 dataset. https://github.com/softwaresaved/habeas-corpus Credit-all Extend the all-contributors GitHub bot (https://allcontributors.org/) to include rich information about research project contributions such as the CASRAI Contributor Roles Taxonomy (https://casrai.org/credit/) https://github.com/dokempf/credit-all I’m excited to see so many metadata-related projects! 
I plan to take a closer look at what the Habeas Corpus, Credit-all and howDescribedIs teams did when I get time. I also really want to try running a dugnad with my team or for the GLAM Data Science network. erambler-co-uk-7259 ---- Intro to the fediverse Date: 2021-04-11 Tags: [Fediverse] [Social media] [Twitter] Wow, it turns out to be 10 years since I wrote this beginner's guide to Twitter. Things have moved on a loooooong way since then. Far from being the interesting, disruptive technology it was back then, Twitter has become part of the mainstream, the establishment. Almost everyone and everything is on Twitter now, which has both pros and cons. So what's the problem? It's now possible to follow all sorts of useful information feeds, from live updates on transport delays to your favourite sports team's play-by-play performance to an almost infinite number of cat pictures. In my professional life it's almost guaranteed that anyone I meet will be on Twitter, meaning that I can contact them to follow up at a later date without having to exchange contact details (and they have options to block me if they don't like that). On the other hand, a medium where everyone's opinion is equally valid regardless of knowledge or life experience has turned some parts of the internet into a toxic swamp of hatred and vitriol. It's easier than ever to forget that we have more common ground with any random stranger than we have differences, and that's led to some truly awful acts and a poisonous political arena. Part of the problem here is that each of the social media platforms is controlled by a single entity with almost no accountability to anyone other than shareholders. Technological change has been so rapid that the regulatory regime has no idea how to handle them, leaving them largely free to operate how they want. This has led to a whole heap of nasty consequences that many other people have done a much better job of documenting than I could (Shoshana Zuboff's book The Age of Surveillance Capitalism is a good example). What I'm going to focus on instead are some possible alternatives. If you accept the above argument, one obvious solution is to break up the effective monopoly enjoyed by Facebook, Twitter et al. We need to be able to retain the wonderful affordances of social media but democratise control of it, so that it can never be dominated by a small number of overly powerful players. What's the solution? There's actually a thing that already exists, that almost everyone is familiar with and that already works like this. It's email.
There are a hundred thousand email servers, but my email can always find your inbox if I know your address, because that address identifies both you and the email service you use, and they communicate using the same protocol, Simple Mail Transfer Protocol (SMTP)[1]. I can't send a message to your Twitter from my Facebook though, because they're completely incompatible, like oil and water. Facebook has no idea how to talk to Twitter and vice versa (and the companies that control them have zero interest in such interoperability anyway). Just like email, a federated social media service like Mastodon allows you to use any compatible server, or even run your own, and follow accounts on your home server or anywhere else, even servers running different software, as long as they use the same ActivityPub protocol. There's no lock-in because you can move to another server any time you like, and interact with all the same people from your new home, just like changing your email address. Smaller servers mean that no one server ends up with enough power to take over and control everything, as the social media giants do with their own platforms. But at the same time, a small server with a small moderator team can enforce local policy much more easily and block accounts or whole servers that host trolls, nazis or other poisonous people. How do I try it? I have no problem with anyone choosing to continue to use what we're already calling "traditional" social media; frankly, Facebook and Twitter are still useful for me to keep in touch with a lot of my friends. However, I do think it's useful to know some of the alternatives, if only to make a more informed decision to stick with your current choices. Most of these services only ask for an email address when you sign up, and use of your real name vs a pseudonym is entirely optional, so there's not really any risk in signing up and giving one a try. That said, make sure you take sensible precautions like not reusing a password from another account.
Instead of… → Try…
Twitter, Facebook → Mastodon, Pleroma, Misskey
Slack, Discord, IRC → Matrix
WhatsApp, FB Messenger, Telegram → Also Matrix
Instagram, Flickr → PixelFed
YouTube → PeerTube
The web → Interplanetary File System (IPFS)
[1] Which, if you can believe it, was formalised nearly 40 years ago in 1982 and has only had fairly minor changes since then!
erinrwhite-com-2076 ---- Libraries – erin white library technology, UX, the web, bikes, #RVA Talk: Using light from the dumpster fire to illuminate a more just digital world This February I gave a lightning talk for the Richmond Design Group. My question: what if we use the light from the dumpster fire of 2020 to see an equitable, just digital world? How can we change our thinking to build the future web we need? Presentation is embedded here; text of talk is below.
[…] Podcast interview: Names, binaries and trans-affirming systems on Legacy Code Rocks! In February I was honored to be invited to join Scott Ford on his podcast Legacy Code Rocks!. I’m embedding the audio below. View the full episode transcript — thanks to trans-owned Deep South Transcription Services! I’ve pulled out some of the topics we discussed and heavily edited/rearranged them for clarity. Names in systems Legal […] Trans-inclusive design at A List Apart I am thrilled and terrified to say that I have an article on Trans-inclusive design out on A List Apart today. I have read A List Apart for years and have always seen it as The Site for folks who make websites, so it is an honor to be published there. Coming out as nonbinary at work This week, after 10 years of working at VCU Libraries, I have been letting my colleagues know that I’m nonbinary. Response from my boss, my team, and my colleagues has been so positive, and has made this process so incredibly easy. I didn’t really have a template for a coming-out message, so ended up writing […] What it means to stay Seven years ago last month I interviewed for my job at VCU. I started work a few months later, assuming I’d stick around for a couple of years then move on to my Next Academic Library Job. Instead I found myself signing closing papers on a house on my sixth work anniversary, having decided to […] Back-to-school mobile snapshot This week I took a look at mobile phone usage on the VCU Libraries website for the first couple weeks of class and compared that to similar time periods from the past couple years. 2015 Here’s some data from the first week of class through today. Note that mobile is 9.2% of web traffic. To round […] Recruiting web workers for your library In the past few years I’ve created a couple of part-time, then full-time, staff positions on the web team at VCU Libraries. We now have a web designer and a web developer who’ve both been with us for a while, but for a few years it was a revolving door of hires. So let’s just say I’ve hired lots […] Easier access for databases and research guides at VCU Libraries Today VCU Libraries launched a couple of new web tools that should make it easier for people to find or discover our library’s databases and research guides. This project’s goal was to help connect “hunters” to known databases and help “gatherers” explore new topic areas in databases and research guides1. Our web redesign task force […] Why this librarian supports the Ada Initiative This week the Ada Initiative is announcing a fundraising drive just for the library community. I’m pitching in, and I hope you will, too. The Ada Initiative’s mission is to increase the status and participation of women in open technology and culture. The organization holds AdaCamps, ally workshops for men, and impostor syndrome trainings; and […] A new look for search at VCU Libraries This week we launched a new design for VCU Libraries Search (our instance of Ex Libris’ Primo discovery system). The guiding design principles behind this project: Mental models: Bring elements of the search interface in line with other modern, non-library search systems that our users are used to. 
In our case, we looked to e-commerce websites […] erinrwhite-com-4563 ---- Talk: Using light from the dumpster fire to illuminate a more just digital world – erin white erinrwhite Published April 16, 2021 Skip to content erinrwhite in Libraries, Richmond | April 16, 2021 Talk: Using light from the dumpster fire to illuminate a more just digital world This February I gave a lightning talk for the Richmond Design Group. My question: what if we use the light from the dumpster fire of 2020 to see an equitable, just digital world? How can we change our thinking to build the future web we need? Presentation is embedded here; text of talk is below. Hi everybody, I’m Erin. Before I get started I want to say thank you to the RVA Design Group organizers. This is hard work and some folks have been doing it for YEARS. Thank you to the organizers of this group for doing this work and for inviting me to speak. This talk isn’t about 2020. This talk is about the future. But to understand the future, we gotta look back. The web in 1996 Travel with me to 1996. Twenty-five years ago! I want to transport us back to the mindset of the early web. The fundamental idea of hyperlinks, which we now take for granted, really twisted everyone’s noodles. So much of the promise of the early web was that with broad access to publish in hypertext, the opportunities were limitless. Technologists saw the web as an equalizing space where systems of oppression that exist in the real world wouldn’t matter, and that we’d all be equal and free from prejudice. Nice idea, right? You don’t need to’ve been around since 1996 to know that’s just not the way things have gone down. Pictured before you are some of the early web pioneers. Notice a pattern here? These early visions of the web, including Barlow’s declaration of independence of cyberspace, while inspiring and exciting, were crafted by the same types of folks who wrote the actual declaration of independence: the landed gentry, white men with privilege. Their vision for the web echoed the declaration of independence’s authors’ attempts to describe the world they envisioned. And what followed was the inevitable conflict with reality. We all now hold these truths to be self-evident: The systems humans build reflect humans’ biases and prejudices. We continue to struggle to diversify the technology industry. Knowledge is interest-driven. Inequality exists, online and off. Celebrating, rather than diminishing, folks’ intersecting identities is vital to human flourishing. The web we have known Profit first: monetization, ads, the funnel, dark patterns Can we?: Innovation for innovation’s sake Solutionism: code will save us Visual design: aesthetics over usability Lone genius: “hard” skills and rock star coders Short term thinking: move fast, break stuff Shipping: new features, forsaking infrastructure Let’s move forward quickly through the past 25 years or so of the web, of digital design. All of the web we know today has been shaped in some way by intersecting matrices of domination: colonialism, capitalism, white supremacy, patriarchy. (Thank you, bell hooks.) The digital worlds where we spend our time – and that we build!! – exist in this way. This is not an indictment of anyone’s individual work, so please don’t take it personally. What I’m talking about here is the digital milieu where we live our lives. The funnel drives everything. 
Folks who work in nonprofits and public entities often tie ourselves in knots to retrofit our use cases in order to use common web tools (google analytics, anyone?) In chasing innovation™ we often overlook important infrastructure work, and devalue work — like web accessibility, truly user-centered design, care work, documentation, customer support and even care for ourselves and our teams — that doesn’t drive the bottom line. We frequently write checks for our future selves to cash, knowing damn well that we’ll keep burying ourselves in technical debt. That’s some tough stuff for us to carry with us every day. The “move fast” mentality has resulted in explosive growth, but at what cost? And in creating urgency where it doesn’t need to exist, focusing on new things rather than repair, the end result is that we’re building a house of cards. And we’re exhausted. To zoom way out, this is another manifestation of late capitalism. Emphasis on LATE. Because…2020 happened. What 2020 taught us Hard times amplify existing inequalities Cutting corners mortgages our future Infrastructure is essential “Colorblind”/color-evasive policy doesn’t cut it Inclusive design is vital We have a duty to each other Technology is only one piece Together, we rise The past year has been awful for pretty much everybody. But what the light from this dumpster fire has illuminated is that things have actually been awful for a lot of people, for a long time. This year has shown us how perilous it is to avoid important infrastructure work and to pursue innovation over access. It’s also shown us that what is sometimes referred to as colorblindness — I use the term color-evasiveness because it is not ableist and it is more accurate — a color-evasive approach that assumes everyone’s needs are the same in fact leaves people out, especially folks who need the most support. We’ve learned that technology is a crucial tool and that it’s just one thing that keeps us connected to each other as humans. Finally, we’ve learned that if we work together we can actually make shit happen, despite a world that tells us individual action is meaningless. Like biscuits in a pan, when we connect, we rise together. Marginalized folks have been saying this shit for years. More of us than ever see these things now. And now we can’t, and shouldn’t, unsee it. The web we can build together Current state: – Profit first – Can we? – Solutionism – Aesthetics – “Hard” skills – Rockstar coders – Short term thinking – Shipping Future state: – People first: security, privacy, inclusion – Should we? – Holistic design – Accessibility – Soft skills – Teams – Long term thinking – Sustaining So let’s talk about the future. I told you this would be a talk about the future. Like many of y’all I have had a very hard time this year thinking about the future at all. It’s hard to make plans. It’s hard to know what the next few weeks, months, years will look like. And who will be there to see it with us. But sometimes, when I can think clearly about something besides just making it through every day, I wonder. What does a people-first digital world look like? Who’s been missing this whole time? Just because we can do something, does it mean we should? Will technology actually solve this problem? Are we even defining the problem correctly? What does it mean to design knowing that even “able-bodied” folks are only temporarily so? And that our products need to be used, by humans, in various contexts and emotional states? 
(There are also false binaries here: aesthetics vs. accessibility; abled and disabled; binaries are dangerous!) How can we nourish our collaborations with each other, with our teams, with our users? And focus on the wisdom of the folks in the room rather than assigning individuals as heroes? How can we build for maintenance and repair? How do we stop writing checks our future selves to cash – with interest? Some of this here, I am speaking of as a web user and a web creator. I’ve only ever worked in the public sector. When I talk with folks working in the private sector I always do some amount of translating. At the end of the day, we’re solving many of the same problems. But what can private-sector workers learn from folks who come from a public-sector organization? And, as we think about what we build online, how can we also apply that thinking to our real-life communities? What is our role in shaping the public conversation around the use of technologies? I offer a few ideas here, but don’t want them to limit your thinking. Consider the public sector Here’s a thread about public service. ⚖️🏛️ 💪🏼💻🇺🇸 — Dana Chisnell (she / her) (@danachis) February 5, 2021 I don’t have a ton of time left today. I wanted to talk about public service like the very excellent Dana Chisnell here. Like I said, I’ve worked in the public sector, in higher ed, for a long time. It’s my bread and butter. It’s weird, it’s hard, it’s great. There’s a lot of work to be done, and it ain’t happening at civic hackathons or from external contractors. The call needs to come from inside the house. Working in the public sector Government should be – inclusive of all people – responsive to needs of the people – effective in its duties & purpose — Dana Chisnell (she / her) (@danachis) February 5, 2021 I want you to consider for a minute how many folks are working in the public sector right now, and how technical expertise — especially in-house expertise — is something that is desperately needed. Pictured here are the old website and new website for the city of Richmond. I have a whole ‘nother talk about that new Richmond website. I FOIA’d the contracts for this website. There are 112 accessibility errors on the homepage alone. It’s been in development for 3 years and still isn’t in full production. Bottom line, good government work matters, and it’s hard to find. Important work is put out for the lowest bidder and often external agencies don’t get it right. What would it look like to have that expertise in-house? Influencing technology policy We also desperately need lawmakers and citizens who understand technology and ask important questions about ethics and human impact of systems decisions. Pictured here are some headlines as well as a contract from the City of Richmond. Y’all know we spent $1.5 million on a predictive policing system that will disproportionately harm citizens of color? And that earlier this month, City Council voted to allow Richmond and VCU PD’s to start sharing their data in that system? The surveillance state abides. Technology facilitates. I dare say these technologies are designed to bank on the fact that lawmakers don’t know what they’re looking at. My theory is, in addition to holding deep prejudices, lawmakers are also deeply baffled by technology. The hard questions aren’t being asked, or they’re coming too late, and they’re coming from citizens who have to put themselves in harm’s way to do so. Technophobia is another harmful element that’s emerged in the past decades. 
What would a world look like where technology is not a thing to shrug off as un-understandable, but is instead deftly co-designed to meet our needs, rather than licensed to our city for 1.5 million dollars? What if everyone knew that technology is not neutral? Closing This is some of the future I can see. I hope that it’s sparked new thoughts for you. Let’s envision a future together. What has the light illuminated for you? Thank you! erinrwhite Published April 16, 2021 erinrwhite-com-5053 ---- erin white – library technology, UX, the web, bikes, #RVA erinrwhite in Libraries, Richmond | April 16, 2021 Talk: Using light from the dumpster fire to illuminate a more just digital world
April 16, 2021 | Comment This car runs: Love letter to a 1997 Honda Accord Three years ago I sold my 1997 Honda Accord DX. Here’s the Craigslist ad love letter I wrote to it. 1997 Honda Accord DX – 4dr, automatic – This car runs. – $500 (Richmond, VA) 1997 Honda Accord DX 4 door 4 cylinders 206,193 miles Color: “Eucalyptus green pearl” aka the color and year that […] in Life, Richmond | April 1, 2021 Podcast interview: Names, binaries and trans-affirming systems on Legacy Code Rocks! In February I was honored to be invited to join Scott Ford on his podcast Legacy Code Rocks!. I’m embedding the audio below. View the full episode transcript — thanks to trans-owned Deep South Transcription Services! I’ve pulled out some of the topics we discussed and heavily edited/rearranged them for clarity. Names in systems Legal […] in Libraries | March 31, 2021 erambler-co-uk-3847 ---- eRambler eRambler Recent content on eRambler Intro to the fediverse Wow, it turns out to be 10 years since I wrote this beginners guide to Twitter. Things have moved on a loooooong way since then. Far from being the interesting, disruptive technology it was back then, Twitter has become part of the mainstream, the establishment. Almost everyone and everything is on Twitter now, which has both pros and cons. So what’s the problem? It’s now possible to follow all sorts of useful information feeds, from live updates on transport delays to your favourite sports team’s play-by-play performance to an almost infinite number of cat pictures.
In my professional life it’s almost guaranteed that anyone I meet will be on Twitter, meaning that I can contact them to follow up at a later date without having to exchange contact details (and they have options to block me if they don’t like that). On the other hand, a medium where everyone’s opinion is equally valid regardless of knowledge or life experience has turned some parts of the internet into a toxic swamp of hatred and vitriol. It’s easier than ever to forget that we have more common ground with any random stranger than we have differences, and that’s led to some truly awful acts and a poisonous political arena. Part of the problem here is that each of the social media platforms is controlled by a single entity with almost no accountability to anyone other than shareholders. Technological change has been so rapid that the regulatory regime has no idea how to handle these platforms, leaving them largely free to operate how they want. This has led to a whole heap of nasty consequences that many other people have done a much better job of documenting than I could (Shoshana Zuboff’s book The Age of Surveillance Capitalism is a good example). What I’m going to focus on instead are some possible alternatives. If you accept the above argument, one obvious solution is to break up the effective monopoly enjoyed by Facebook, Twitter et al. We need to be able to retain the wonderful affordances of social media but democratise control of it, so that it can never be dominated by a small number of overly powerful players. What’s the solution? There’s actually a thing that already exists, that almost everyone is familiar with and that already works like this. It’s email. There are a hundred thousand email servers, but my email can always find your inbox if I know your address because that address identifies both you and the email service you use, and they communicate using the same protocol, Simple Mail Transfer Protocol (SMTP)1. I can’t send a message to your Twitter from my Facebook though, because they’re completely incompatible, like oil and water. Facebook has no idea how to talk to Twitter and vice versa (and the companies that control them have zero interest in such interoperability anyway). Just like email, a federated social media service like Mastodon allows you to use any compatible server, or even run your own, and follow accounts on your home server or anywhere else, even servers running different software as long as they use the same ActivityPub protocol. There’s no lock-in because you can move to another server any time you like, and interact with all the same people from your new home, just like changing your email address. Smaller servers mean that no one server ends up with enough power to take over and control everything, as the social media giants do with their own platforms. But at the same time, a small server with a small moderator team can enforce local policy much more easily and block accounts or whole servers that host trolls, nazis or other poisonous people. How do I try it? I have no problem with anyone choosing to continue to use what we’re already calling “traditional” social media; frankly, Facebook and Twitter are still useful for me to keep in touch with a lot of my friends. However, I do think it’s useful to know some of the alternatives if only to make a more informed decision to stick with your current choices.
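To make the email analogy above a little more concrete, here is a minimal sketch of the discovery step fediverse servers share: a WebFinger lookup that turns a user@domain handle into an ActivityPub actor URL. It assumes the Python requests library and a made-up handle, and any Mastodon-compatible server exposes the same well-known endpoint.

```python
# Minimal sketch: resolving a fediverse handle to its ActivityPub actor via WebFinger.
# The handle "someone@example.social" is a placeholder; the /.well-known/webfinger
# endpoint and the "self" link with type application/activity+json are standard.
import requests

def find_actor(handle: str) -> str:
    """Return the ActivityPub actor URL for a user@domain handle."""
    user, domain = handle.lstrip("@").split("@", 1)
    resp = requests.get(
        f"https://{domain}/.well-known/webfinger",
        params={"resource": f"acct:{user}@{domain}"},
        timeout=10,
    )
    resp.raise_for_status()
    for link in resp.json().get("links", []):
        if link.get("rel") == "self" and link.get("type") == "application/activity+json":
            return link["href"]
    raise ValueError(f"No ActivityPub actor found for {handle}")

if __name__ == "__main__":
    print(find_actor("someone@example.social"))
```

Because every server answers this the same way, it doesn’t matter which one you sign up with: your home server can always find the account you want to follow, just as any mail server can find any inbox.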
Most of these services only ask for an email address when you sign up, and use of your real name vs a pseudonym is entirely optional, so there’s not really any risk in signing up and giving one a try. That said, make sure you take sensible precautions like not reusing a password from another account.
Instead of… → Try…
Twitter, Facebook → Mastodon, Pleroma, Misskey
Slack, Discord, IRC → Matrix
WhatsApp, FB Messenger, Telegram → also Matrix
Instagram, Flickr → PixelFed
YouTube → PeerTube
The web → Interplanetary File System (IPFS)
1. Which, if you can believe it, was formalised nearly 40 years ago in 1982 and has only had fairly minor changes since then! ↩︎ Collaborations Workshop 2021: collaborative ideas & hackday My last post covered the more “traditional” lectures-and-panel-sessions approach of the first half of the SSI Collaborations Workshop. The rest of the workshop was much more interactive, consisting of a discussion session, a Collaborative Ideas session, and a whole-day hackathon! The discussion session on day one had us choose a topic (from a list of topics proposed leading up to the workshop) and join a breakout room for that topic with the aim of producing a “speed blog” by the end of 90 minutes. Those speed blogs will be published on the SSI blog over the coming weeks, so I won’t go into that in more detail. The Collaborative Ideas session is a way of generating hackday ideas, by putting people together at random into small groups to each raise a topic of interest to them before discussing and coming up with a combined idea for a hackday project. Because of the serendipitous nature of the groupings, it’s a really good way of generating new ideas from unexpected combinations of individual interests. After that, all the ideas from the session, along with a few others proposed by various participants, were pitched as ideas for the hackday and people started to form teams. Not every idea pitched gets worked on during the hackday, but in the end 9 teams of roughly equal size formed to spend the third day working together. My team’s project: “AHA! An Arts & Humanities Adventure” There’s a lot of FOMO around choosing which team to join for an event like this: there were so many good ideas and I wanted to work on several of them! In the end I settled on a team developing an escape room concept to help Arts & Humanities scholars understand the benefits of working with research software engineers for their research. Five of us rapidly mapped out an example storyline for an escape room, got a website set up with GitHub and populated it with the first few stages of the game. We decided to focus on a story that would help the reader get to grips with what an API is and I’m amazed how much we managed to get done in less than a day’s work! You can try playing through the escape room (so far) yourself on the web, or take a look at the GitHub repository, which contains the source of the website along with a list of outstanding tasks to work on if you’re interested in contributing. I’m not sure yet whether this project has enough momentum to keep going, but it was a really valuable way both of getting to know and building trust with some new people and demonstrating the concept is worth more work. Other projects Here’s a brief rundown of the other projects worked on by teams on the day. Coding Confessions Everyone starts somewhere and everyone cuts corners from time to time. Real developers copy and paste! Fight imposter syndrome by looking through some of these confessions or contributing your own.
https://coding-confessions.github.io/ CarpenPI A template to set up a Raspberry Pi with everything you need to run a Carpentries (https://carpentries.org/) data science/software engineering workshop in a remote location without internet access. https://github.com/CarpenPi/docs/wiki Research Dugnads A guide to running an event that is a coming together of a research group or team to share knowledge, pass on skills, tidy and review code, among other software and working best practices (based on the Norwegian concept of a dugnad, a form of “voluntary work done together with other people”) https://research-dugnads.github.io/dugnads-hq/ Collaborations Workshop ideas A meta-project to collect together pitches and ideas from previous Collaborations Workshop conferences and hackdays, to analyse patterns and revisit ideas whose time might now have come. https://github.com/robintw/CW-ideas howDescribedIs Integrate existing tools to improve the machine-readable metadata attached to open research projects by integrating projects like SOMEF, codemeta.json and HowFAIRIs (https://howfairis.readthedocs.io/en/latest/index.html). Complete with CI and badges! https://github.com/KnowledgeCaptureAndDiscovery/somef-github-action Software end-of-project plans Develop a template to plan and communicate what will happen when the fixed-term project funding for your research software ends. Will maintenance continue? When will the project sunset? Who owns the IP? https://github.com/elichad/software-twilight Habeas Corpus A corpus of machine readable data about software used in COVID-19 related research, based on the CORD19 dataset. https://github.com/softwaresaved/habeas-corpus Credit-all Extend the all-contributors GitHub bot (https://allcontributors.org/) to include rich information about research project contributions such as the CASRAI Contributor Roles Taxonomy (https://casrai.org/credit/) https://github.com/dokempf/credit-all I’m excited to see so many metadata-related projects! I plan to take a closer look at what the Habeas Corpus, Credit-all and howDescribedIs teams did when I get time. I also really want to try running a dugnad with my team or for the GLAM Data Science network. Collaborations Workshop 2021: talks & panel session I’ve just finished attending (online) the three days of this year’s SSI Collaborations Workshop (CW for short), and once again it’s been a brilliant experience, as well as mentally exhausting, so I thought I’d better get a summary down while it’s still fresh it my mind. Collaborations Workshop is, as the name suggests, much more focused on facilitating collaborations than a typical conference, and has settled into a structure that starts off with with longer keynotes and lectures, and progressively gets more interactive culminating with a hack day on the third day. That’s a lot to write about, so for this post I’ll focus on the talks and panel session, and follow up with another post about the collaborative bits. I’ll also probably need to come back and add in more links to bits and pieces once slides and the “official” summary of the event become available. Updates 2021-04-07 Added links to recordings of keynotes and panel sessions Provocations The first day began with two keynotes on this year’s main themes: FAIR Research Software and Diversity & Inclusion, and day 2 had a great panel session focused on disability. 
All three were streamed live and the recordings remain available on Youtube: View the keynotes recording; Google-free alternative link View the panel session recording; Google-free alternative link FAIR Research Software Dr Michelle Barker, Director of the Research Software Alliance, spoke on the challenges to recognition of software as part of the scholarly record: software is not often cited. The FAIR4RS working group has been set up to investigate and create guidance on how the FAIR Principles for data can be adapted to research software as well; as they stand, the Principles are not ideally suited to software. This work will only be the beginning though, as we will also need metrics, training, career paths and much more. ReSA itself has 3 focus areas: people, policy and infrastructure. If you’re interested in getting more involved in this, you can join the ReSA email list. Equality, Diversity & Inclusion: how to go about it Dr Chonnettia Jones, Vice President of Research, Michael Smith Foundation for Health Research, spoke extensively and persuasively on the need for Equality, Diversity & Inclusion (EDI) initiatives within research, as there is abundant robust evidence that all research outcomes are improved. She highlighted the difficulty current approaches to EDI have in effecting structural change, changing not just individual behaviours but the cultures & practices that perpetuate iniquity. While initiatives are often constructed around making up for individual deficits, a better framing is to start from an understanding of individuals having equal stature but different lived experiences. Commenting on the current focus on “research excellence”, she pointed out that the hyper-competition this promotes is deeply unhealthy, suggesting instead that true excellence requires diversity, and we should focus on an inclusive excellence driven by inclusive leadership. Equality, Diversity & Inclusion: disability issues Day 2’s EDI panel session brought together five disabled academics to discuss the problems of disability in research.
Dr Becca Wilson, UKRI Innovation Fellow, Institute of Population Health Science, University of Liverpool (Chair)
Phoenix C S Andrews (PhD Student, Information Studies, University of Sheffield and Freelance Writer)
Dr Ella Gale (Research Associate and Machine Learning Subject Specialist, School of Chemistry, University of Bristol)
Prof Robert Stevens (Professor and Head of Department of Computer Science, University of Manchester)
Dr Robin Wilson (Freelance Data Scientist and SSI Fellow)
NB. The discussion flowed quite freely, so the following summary mixes up input from all the panel members. Researchers are often assumed to be single-minded in following their research calling, and aptness for jobs is often partly judged on “time served”, which disadvantages any disabled person who has been forced to take a career break. On top of this, disabled people are often time-poor because of the extra time needed to manage their condition, leaving them with less “output” to show for their time served on many common metrics. This can particularly affect early-career researchers, since resources for these are often restricted on a “years-since-PhD” criterion. Time poverty also makes funding with short deadlines that much harder to apply for.
Employers add more demands right from the start: new starters are typically expected to complete a health and safety form, generally a brief affair that will suddenly become an 80-page bureaucratic nightmare if you tick the box declaring a disability. Many employers claim to be inclusive yet utterly fail to understand the needs of their disabled staff. Wheelchairs are liberating for those who use them (despite the awful but common phrase “wheelchair-bound”) and yet employers will refuse to insure a wheelchair while travelling for work, classifying it as a “high value personal item” that the owner would take the same responsibility for as an expensive camera. Computers open up the world for blind people in a way that was never possible without them, but it’s not unusual for mandatory training to be inaccessible to screen readers. Some of these barriers can be overcome, but doing so takes yet more time that could and should be spent on more important work. What can we do about it? Academia works on patronage whether we like it or not, so be the person who supports people who are different to you rather than focusing on the one you “recognise yourself in” to mentor. As a manager, it’s important to ask each individual what they need and believe them: they are the expert in their own condition and their lived experience of it. Don’t assume that because someone else in your organisation with the same disability needs one set of accommodations, it’s invalid for your staff member to require something totally different. And remember: disability is unusual as a protected characteristic in that anyone can acquire it at any time without warning! Lightning talks Lightning talk sessions are always tricky to summarise, and while this doesn’t do them justice, here are a few highlights from my notes. Data & metadata Malin Sandstrom talked about a much-needed refinement of contributor role taxonomies for scientific computing Stephan Druskat showcased a project to crowdsource a corpus of research software for further analysis Learning & teaching/community Matthew Bluteau introduced the concept of the “coding dojo” as a way to enhance community of practice. A group of coders got together to practice & learn by working together to solve a problem and explaining their work as they go He described 2 models: a code jam, where people work in small groups, and the Randori method, where 2 people do pair programming while the rest observe. I’m excited to try this out! Steve Crouch talked about intermediate skills and helping people take the next step, which I’m also very interested in with the GLAM Data Science network Esther Plomp recounted experience of running multiple Carpentry workshops online, while Diego Alonso Alvarez discussed planned workshops on making research software more usable with GUIs Shoaib Sufi showcased the SSI’s new event organising guide Caroline Jay reported on a diary study into autonomy & agency in RSE during COVID Lopez, T., Jay, C., Wermelinger, M., & Sharp, H. (2021). How has the covid-19 pandemic affected working conditions for research software engineers? Unpublished manuscript. Wrapping up That’s not everything! But this post is getting pretty long so I’ll wrap up for now. I’ll try to follow up soon with a summary of the “collaborative” part of Collaborations Workshop: the idea-generating sessions and hackday! Time for a new look... I’ve decided to try switching this website back to using Hugo to manage the content and generate the static HTML pages. 
I’ve been on the Python-based Nikola for a few years now, but recently I’ve been finding it quite slow, and very confusing to understand how to do certain things. I used Hugo recently for the GLAM Data Science Network website and found it had come on a lot since the last time I was using it, so I thought I’d give it another go, and redesign this site to be a bit more minimal at the same time. The theme is still a work in progress so it’ll probably look a bit rough around the edges for a while, but I think I’m happy enough to publish it now. When I get round to it I might publish some more detailed thoughts on the design. Ideas for Accessible Communications The Disability Support Network at work recently ran a survey on “accessible communications”, to develop guidance on how to make communications (especially internal staff comms) more accessible to everyone. I grabbed a copy of my submission because I thought it would be useful to share more widely, so here it is. Please note that these are based on my own experiences only. I am in no way suggesting that these are the only things you would need to do to ensure your communications are fully accessible. They’re just some things to keep in mind. Policies/procedures/guidance can be stressful to use if anything is vague or inconsistent, or if it looks like there might be more information implied than is explicitly given (a common cause of this is use of jargon in e.g. HR policies). Emails relating to these policies have similar problems, made worse because they tend to be very brief. Online meetings can be very helpful, but can also be exhausting, especially if there are too many people, or not enough structure. Larger meetings & webinars without agendas (or where the agenda is ignored, or timings are allowed to drift without acknowledgement) are very stressful, as are those where there is not enough structure to ensure fair opportunities to contribute. Written reference documents and communications should: Be carefully checked for consistency and clarity Have all all key points explicitly stated Explicitly acknowledge the need for flexibility where it is necessary, rather than implying or hinting at it Clearly define jargon & acronyms where they are necessary to the point being made, and avoid them otherwise Include links to longer, more explicit versions where space is tight Provide clear bullet-point summaries with links to the details Online meetings should: Include sufficient break time (at least 10 minutes out of every hour) and not allow this to be compromised just because a speaker has misjudged the length of their talk Include initial “settling-in” time in agendas to avoid timing getting messed up from the start Ensure the agenda is stuck to, or that divergence from the agenda is acknowledged explicitly by the chair and updated timing briefly discussed to ensure everyone is clear Establish a norm for participation at the start of the meeting and stick to it e.g. 
ask people to raise hands when they have a point to make, or have specific time for round-robin contributions Ensure quiet/introverted people have space to contribute, but don’t force them to do so if they have nothing to add at the time Offer a text-based alternative to contributing verbally If appropriate, at the start of the meeting assign specific roles of: Gatekeeper: ensures everyone has a chance to contribute Timekeeper: ensures meeting runs to time Scribe: ensures a consistent record of the meeting Be chaired by someone with the confidence to enforce the above: offer training to all staff on chairing meetings to ensure everyone has the skills to run a meeting effectively Matrix self-hosting I started running my own Matrix server a little while ago. Matrix is something rather cool, a chat system similar to IRC or Slack, but open and federated. Open in that the standard is available for anyone to view, but also the reference implementations of server and client are open source, along with many other clients and a couple of nascent alternative servers. Federated in that, like email, it doesn’t matter what server you sign up with, you can talk to users on your own or any other server. I decided to host my own for three reasons. Firstly, to see if I could and to learn from it. Secondly, to try and rationalise the Cambrian explosion of Slack teams I was being added to in 2019. Thirdly, to take some control of the loss of access to historical messages in some communities that rely on Slack (especially the Carpentries and RSE communities). Since then, I’ve also added a fourth goal: taking advantage of various bridges to bring other messaging networks I use (such as Signal and Telegram) into a consistent UI. I’ve also found that my use of Matrix-only rooms has grown as more individuals & communities have adopted the platform. So, I really like Matrix and I use it daily. My problem now is whether to keep self-hosting. Synapse, the only full server implementation at the moment, is really heavy on memory, so I’ve ended up running it on a much bigger server than I thought I’d need, which seems overkill for a single-user instance. So now I have to make a decision about whether it’s worth keeping going, or shutting it down and going back to matrix.org, or setting up on one of the other servers that have sprung up in the last couple of years. There are a couple of other considerations here. Firstly, Synapse resource usage is entirely down to the size of the rooms joined by users of the homeserver, not directly the number of users. So if users have mostly overlapping interests, and thus keep to the same rooms, you can support quite a large community without significant extra resource usage. Secondly, there are a couple of alternative server implementations in development specifically addressing this issue for small servers: Dendrite and Conduit. Neither are quite ready for what I want yet, but are getting close, and when ready that will allow running small homeservers with much more sensible resource usage. So I could start opening up for other users, and at least justify the size of the server that way. I wouldn’t ever want to make it a paid-for service but perhaps people might be willing to make occasional donations towards running costs. That still leaves me with the question of whether I’m comfortable running a service that others may come to rely on, or being responsible for the safety of their information.
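One thing that makes the decision less fraught than it sounds is that every homeserver speaks the same client-server API, so scripts and clients don’t care where an account lives. As a rough illustration (the homeserver URL, access token and room ID below are placeholders, not my real setup), posting a message looks the same whether the server is matrix.org, a self-hosted Synapse, or eventually Dendrite or Conduit:

```python
# Hedged sketch: sending a message via the Matrix client-server API (r0 prefix,
# the stable version at the time of writing). All credentials below are placeholders.
import time
from urllib.parse import quote

import requests

HOMESERVER = "https://matrix.example.org"  # hypothetical homeserver
ACCESS_TOKEN = "placeholder_access_token"  # obtained via /login or copied from a client
ROOM_ID = "!abcdefg:example.org"           # a room this account has already joined

def send_text(body: str) -> str:
    """Send an m.room.message event to ROOM_ID and return the new event ID."""
    room = quote(ROOM_ID, safe="")          # room IDs contain ! and : so URL-encode them
    txn_id = str(int(time.time() * 1000))   # transaction ID makes the PUT idempotent
    url = f"{HOMESERVER}/_matrix/client/r0/rooms/{room}/send/m.room.message/{txn_id}"
    resp = requests.put(
        url,
        headers={"Authorization": f"Bearer {ACCESS_TOKEN}"},
        json={"msgtype": "m.text", "body": body},
        timeout=10,
    )
    resp.raise_for_status()
    return resp.json()["event_id"]

if __name__ == "__main__":
    print(send_text("Hello from a tiny federated client!"))
```

Moving the account to another homeserver only means changing the URL and token; the protocol, and therefore any tooling built on it, stays the same.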
I could also hold out for Dendrite or Conduit to mature enough that I’m ready to try them, which might not be more than a few months off. Hmm, seems like I’ve convinced myself to stick with it for now, and we’ll see how it goes. In the meantime, if you know me and you want to try it out let me know and I might risk setting you up with an account! What do you miss least about pre-lockdown life? @JanetHughes on Twitter: What do you miss the least from pre-lockdown life? I absolutely do not miss wandering around the office looking for a meeting room for a confidential call or if I hadn’t managed to book a room in advance. Let’s never return to that joyless frustration, hey? 10:27 AM · Feb 3, 2021 After seeing Terence Eden taking Janet Hughes' tweet from earlier this month as a writing prompt, I thought I might do the same. The first thing that leaps to my mind is commuting. At various points in my life I’ve spent between one and three hours a day travelling to and from work and I’ve never more than tolerated it at best. It steals time from your day, and societal norms dictate that it’s your leisure & self-care time that must be sacrificed. Longer commutes allow more time to get into a book or podcast, especially if not driving, but I’d rather have that time at home rather than trying to be comfortable in a train seat designed for some mythical average man shaped nothing like me! The other thing I don’t miss is the colds and flu! Before the pandemic, British culture encouraged working even when ill, which meant constantly coming into contact with people carrying low-grade viruses. I’m not immunocompromised but some allergies and residue of being asthmatic as a child meant that I would get sick 2-3 times a year. A pleasant side-effect of the COVID precautions we’re all taking is that I haven’t been sick for over 12 months now, which is amazing! Finally, I don’t miss having so little control over my environment. One of the things that working from home has made clear is that there are certain unavoidable aspects of working in my shared office that cause me sensory stress, and that are completely unrelated to my work. Working (or trying to work) next to a noisy automatic scanner; trying to find a light level that works for 6 different people doing different tasks; lacking somewhere quiet and still to eat lunch and recover from a morning of meetings or the constant vaguely-distracting bustle of a large shared office. It all takes energy. Although it’s partly been replaced by the new stress of living through a global pandemic, that old stress was a constant drain on my productivity and mood that had been growing throughout my career as I moved (ironically, given the common assumption that seniority leads to more privacy) into larger and larger open plan offices. Remarkable blogging And the handwritten blog saga continues, as I’ve just received my new reMarkable 2 tablet, which is designed for reading, writing and nothing else. It uses a super-responsive e-ink display and writing on it with a stylus is a dream. It has a slightly rough texture with just a bit of friction that makes my writing come out a lot more legibly than on a slippery glass touchscreen. If that was all there was to it, I might not have wasted my money, but it turns out that it runs on Linux and the makers have wisely decided not to lock it down but to give you full root mess. Yes, you read that right: root access. 
It presents as an ethernet device over USB, so you can SSH in with a password found in the settings and have full control over your own devices. What a novel concept. This fact alone has meant it’s built a small yet devoted community of users who have come up with some clever ways of extending its functionality. In fact, many of these are listed on this GitHub repository. Finally, from what I’ve seen so far, the handwriting recognition is impressive to say the least. This post was written on it and needed only a little editing. I think this is a device that will get a lot of use! GLAM Data Science Network fellow travellers Updates 2021-02-04 Thanks to Gene @dzshuniper@ausglam.space for suggesting ADHO and a better attribution for the opening quote (see comments below for details) See comments & webmentions for details. “If you want to go fast, go alone. If you want to go far, go together.” — African proverb, probably popularised in English by Kenyan church leader Rev. Samuel Kobia (original) This quote is a popular one in the Carpentries community, and I interpret it in this context to mean that a group of people working together is more sustainable than individuals pursuing the same goal independently. That’s something that speaks to me, and that I want to make sure is reflected in nurturing this new community for data science in galleries, archives, libraries & museums (GLAM). To succeed, this work needs to be complementary and collaborative, rather than competitive, so I want to acknowledge a range of other networks & organisations whose activities complement this. The rest of this article is an unavoidably incomplete list of other relevant organisations whose efforts should be acknowledged and potentially built on. And it should go without saying, but just in case: if the work I’m planning fits right into an existing initiative, then I’m happy to direct my resources there rather than duplicate effort. Inspirations & collaborators Groups with similar goals or undertaking similar activities, but focused on a different sector, geographic area or topic. I think we should make as much use of and contribution to these existing communities as possible since there will be significant overlap. code4lib Probably the closest existing community to what I want to build, but primarily based in the US, so timezones (and physical distance for in-person events) make it difficult to participate fully. This is a well-established community though, with regular events including an annual conference so there’s a lot to learn here. newCardigan Similar to code4lib but an Australian focus, so the timezone problem is even bigger! GLAM Labs Focused on supporting the people experimenting with and developing the infrastructure to enable scholars to access GLAM materials in new ways. In some ways, a GLAM data science network would be complementary to their work, by providing people not directly involved with building GLAM Labs with the skills to make best use of GLAM Labs infrastructure. UK Government data science community Another existing community with very similar intentions, but focused on UK Government sector. Clearly the British Library and a few national & regional museums & archives fall into this, but much of the rest of the GLAM sector does not. 
Artifical Intelligence for Libraries, Archives & Museums (AI4LAM) A multinational collaboration between several large libraries, archives and museums with a specific focus on the Artificial Intelligence (AI) subset of data science UK Reproducibility Network A network of researchers, primarily in HEIs, with an interest in improving the transparency and reliability of academic research. Mostly science-focused but with some overlap of goals around ethical and robust use of data. Museums Computer Group I’m less familiar with this than the others, but it seems to have a wider focus on technology generally, within the slightly narrower scope of museums specifically. Again, a lot of potential for collaboration. Training Several organisations and looser groups exist specifically to develop and deliver training that will be relevant to members of this network. The network also presents an opportunity for those who have done a workshop with one of these and want to know what the “next steps” are to continue their data science journey. The Carpentries, aka: Library Carpentry Data Carpentry Software Carpentry Data Science Training for Librarians (DST4L) The Programming Historian CDH Cultural Heritage Data School Supporters These misson-driven organisations have goals that align well with what I imagine for the GLAM DSN, but operate at a more strategic level. They work by providing expert guidance and policy advice, lobbying and supporting specific projects with funding and/or effort. In particular, the SSI runs a fellowship programme which is currently providing a small amount of funding to this project. Digital Preservation Coalition (DPC) Software Sustainability Institute (SSI) Research Data Alliance (RDA) Alliance of Digital Humanities Organizations (ADHO) … and its Libraries and Digital Humanities Special Interest Group (Lib&DH SIG) Professional bodies These organisations exist to promote the interests of professionals in particular fields, including supporting professional development. I hope they will provide communication channels to their various members at the least, and may be interested in supporting more directly, depending on their mission and goals. Society of Research Software Engineering Chartered Institute of Library and Information Professionals Archives & Records Association Museums Association Conclusion As I mentioned at the top of the page, this list cannot possibly be complete. This is a growing area and I’m not the only or first person to have this idea. If you can think of anything glaring that I’ve missed and you think should be on this list, leave a comment or tweet/toot at me! A new font for the blog I’ve updated my blog theme to use the quasi-proportional fonts Iosevka Aile and Iosevka Etoile. I really like the aesthetic, as they look like fixed-width console fonts (I use the true fixed-width version of Iosevka in my terminal and text editor) but they’re actually proportional which makes them easier to read. https://typeof.net/Iosevka/ Training a model to recognise my own handwriting If I’m going to train an algorithm to read my weird & awful writing, I’m going to need a decent-sized training set to work with. And since one of the main things I want to do with it is to blog “by hand” it makes sense to focus on that type of material for training. In other words, I need to write out a bunch of blog posts on paper, scan them and transcribe them as ground truth. 
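The bookkeeping side of that is simple enough to sketch. Something like the following (the folder names and CSV layout are illustrative assumptions on my part, not anything Transkribus requires) would pair each scanned page with its ground-truth transcription ready for training:

```python
# Hedged sketch: pairing scanned handwriting images with ground-truth transcriptions
# into a simple training manifest. Folder names and CSV format are illustrative only.
import csv
from pathlib import Path

SCANS = Path("scans")              # e.g. scans/blog-post-01_p1.png
TRANSCRIPTS = Path("transcripts")  # e.g. transcripts/blog-post-01_p1.txt

def build_manifest(out_file: str = "training_manifest.csv") -> int:
    """Write a CSV of (image, transcription) pairs and return how many were matched."""
    rows = []
    for image in sorted(SCANS.glob("*.png")):
        text_file = TRANSCRIPTS / (image.stem + ".txt")
        if not text_file.exists():
            print(f"No transcription yet for {image.name}, skipping")
            continue
        ground_truth = text_file.read_text(encoding="utf-8").strip()
        rows.append({"image": str(image), "text": ground_truth})
    with open(out_file, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["image", "text"])
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)

if __name__ == "__main__":
    print(f"{build_manifest()} page/transcription pairs written")
```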
The added bonus of this plan is that after transcribing, I also end up with some digital text I can use as an actual post — multitasking! So, by the time you read this, I will have already run it through a manual transcription process using Transkribus to add it to my training set, and copy-pasted it into emacs for posting. This is a fun little project because it means I can: Write more by hand with one of my several nice fountain pens, which I enjoy Learn more about the operational process some of my colleagues go through when digitising manuscripts Learn more about the underlying technology & maths, and how to tune the process Produce more lovely content! For you to read! Yay! Write in a way that forces me to put off editing until after a first draft is done and focus more on getting the whole of what I want to say down. That’s it for now — I’ll keep you posted as the project unfolds. Addendum Tee hee! I’m actually just enjoying the process of writing stuff by hand in long-form prose. It’ll be interesting to see how the accuracy turns out and if I need to be more careful about neatness. Will it be better or worse than the big but generic models used by Samsung Notes or OneNote. Maybe I should include some stylus-written text for comparison. Blogging by hand I wrote the following text on my tablet with a stylus, which was an interesting experience: So, thinking about ways to make writing fun again, what if I were to write some of them by hand? I mean I have a tablet with a pretty nice stylus, so maybe handwriting recognition could work. One major problem, of course, is that my handwriting is AWFUL! I guess I’ll just have to see whether the OCR is good enough to cope… It’s something I’ve been thinking about recently anyway: I enjoy writing with a proper fountain pen, so is there a way that I can have a smooth workflow to digitise handwritten text without just typing it back in by hand? That would probably be preferable to this, which actually seems to work quite well but does lead to my hand tensing up to properly control the stylus on the almost-frictionless glass screen. I’m surprised how well it worked! Here’s a sample of the original text: And here’s the result of converting that to text with the built-in handwriting recognition in Samsung Notes: Writing blog posts by hand So, thinking about ways to make writing fun again, what if I were to write some of chum by hand? I mean, I have a toldest winds a pretty nice stylus, so maybe handwriting recognition could work. One major problems, ofcourse, is that my , is AWFUL! Iguess I’ll just have to see whattime the Ocu is good enough to cope… It’s something I’ve hun tthinking about recently anyway: I enjoy wilting with a proper fountain pion, soischeme a way that I can have a smooch workflow to digitise handwritten text without just typing it back in by hand? That wouldprobally be preferableto this, which actually scams to work quito wall but doers load to my hand tensing up to properly couldthe stylus once almost-frictionlessg lass scream. It’s pretty good! It did require a fair bit of editing though, and I reckon we can do better with a model that’s properly trained on a large enough sample of my own handwriting. What I want from a GLAM/Cultural Heritage Data Science Network Introduction As I mentioned last year, I was awarded a Software Sustainability Institute Fellowship to pursue the project of setting up a Cultural Heritage/GLAM data science network. 
Obviously, the global pandemic has forced a re-think of many plans and this is no exception, so I’m coming back to reflect on it and make sure I’m clear about the core goals so that everything else still moves in the right direction. One of the main reasons I have for setting up a GLAM data science network is because it’s something I want. The advice to “scratch your own itch” is often given to people looking for an open project to start or contribute to, and the lack of a community of people with whom to learn & share ideas and practice is something that itches for me very much. The “motivation” section in my original draft project brief for this work said: Cultural heritage work, like all knowledge work, is increasingly data-based, or at least gives opportunities to make use of data day-to-day. The proper skills to use this data enable more effective working. Knowledge and experience thus gained improves understanding of and empathy with users also using such skills. But of course, I have my own reasons for wanting to do this too. In particular, I want to: Advocate for the value of ethical, sustainable data science across a wide range of roles within the British Library and the wider sector Advance the sector to make the best use of data and digital sources in the most ethical and sustainable way possible Understand how and why people use data from the British Library, and plan/deliver better services to support that Keep up to date with relevant developments in data science Learn from others' skills and experiences, and share my own in turn Those initial goals imply some further supporting goals: Build up the confidence of colleagues who might benefit from data science skills but don’t feel they are “technical” or “computer literate” enough Further to that, build up a base of colleagues with the confidence to share their skills & knowledge with others, whether through teaching, giving talks, writing or other channels Identify common awareness gaps (skills/knowledge that people don’t know they’re missing) and address them Develop a communal space (primarily online) in which people feel safe to ask questions Develop a body of professional practice and help colleagues to learn and contribute to the evolution of this, including practices of data ethics, software engineering, statistics, high performance computing, … Break down language barriers between data scientists and others I’ll expand on this separately as my planning develops, but here are a few specific activities that I’d like to be able to do to support this: Organise less-formal learning and sharing events to complement the more formal training already available within organisations and the wider sector, including “show and tell” sessions, panel discussions, code cafés, masterclasses, guest speakers, reading/study groups, co-working sessions, … Organise training to cover intermediate skills and knowledge currently missing from the available options, including the awareness gaps and professional practice mentioned above Collect together links to other relevant resources to support self-led learning Decisions to be made There are all sorts of open questions in my head about this right now, but here are some of the key ones. Is it GLAM or Cultural Heritage? When I first started planning this whole thing, I went with “Cultural Heritage”, since I was pretty transparently targeting my own organisation. The British Library is fairly unequivocally a CH organisation. 
But as I’ve gone along I’ve found myself gravitating more towards the term “GLAM” (which stands for Galleries, Libraries, Archives, Museums) as it covers a similar range of work but is clearer (when you spell out the acronym) about what kinds of work are included. What skills are relevant? This turns out to be surprisingly important, at least in terms of how the community is described, as they define the boundaries of the community and can be the difference between someone feeling welcome or excluded. For example, I think that some introductory statistics training would be immensely valuable for anyone working with data to understand what options are open to them and what limitations those options have, but is the word “statistics” offputting per se to those who’ve chosen a career in arts & humanities? I don’t know because I don’t have that background and perspective. Keep it internal to the BL, or open up early on? I originally planned to focus primarily on my own organisation to start with, feeling that it would be easier to organise events and build a network within a single organisation. However, the pandemic has changed my thinking significantly. Firstly, it’s now impossible to organise in-person events and that will continue for quite some time to come, so there is less need to focus on the logistics of getting people into the same room. Secondly, people within the sector are much more used to attending remote events, which can easily be opened up to multiple organisations in many countries, timezones allowing. It now makes more sense to focus primarily on online activities, which opens up the possibility of building a critical mass of active participants much more quickly by opening up to the wider sector. Conclusion This is the type of post that I could let run and run without ever actually publishing, but since it’s something I need feedback and opinions on from other people, I’d better ship it! I really want to know what you think about this, whether you feel it’s relevant to you and what would make it useful. Comments are open below, or you can contact me via Mastodon or Twitter. Writing About Not Writing Under Construction Grunge Sign by Nicolas Raymond — CC BY 2.0 Every year, around this time of year, I start doing two things. First, I start thinking I could really start to understand monads and write more than toy programs in Haskell. This is unlikely to ever actually happen unless and until I get a day job where I can justify writing useful programs in Haskell, but Advent of Code always gets me thinking otherwise. Second, I start mentally writing this same post. You know, the one about how the blogger in question hasn’t had much time to write but will be back soon? “Sorry I haven’t written much lately…” It’s about as cliché as a Geocities site with a permanent “Under construction” GIF. At some point, not long after the dawn of ~time~ the internet, most people realised that every website was permanently under construction and publishing something not ready to be published was just pointless. So I figured this year I’d actually finish writing it and publish it. After all, what’s the worst that could happen? If we’re getting all reflective about this, I could probably suggest some reasons why I’m not writing much: For a start, there’s a lot going on in both my world and The World right now, which doesn’t leave a lot of spare energy after getting up, eating, housework, working and a few other necessary activities. 
As a result, I’m easily distracted and I tend to let myself get dragged off in other directions before I even get to writing much of anything. If I do manage to focus on this blog in general, I’ll often end up working on some minor tweak to the theme or functionality. I mean, right now I’m wondering if I can do something clever in my text-editor (Emacs, since you’re asking) to streamline my writing & editing process so it’s more elegant, efficient, ergonomic and slightly closer to perfect in every way. It also makes me much more likely to self-censor, and to indulge my perfectionist tendencies to try and tweak the writing until it’s absolutely perfect, which of course never happens. I’ve got a whole heap of partly-written posts that are juuuust waiting for the right motivation for me to just finish them off. The only real solution is to accept that: I’m not going to write much and that’s probably OK What I do write won’t always be the work of carefully-researched, finely crafted genius that I want it to be, and that’s probably OK too Also to remember why I started writing and publishing stuff in the first place: to reflect and get my thoughts out onto a (virtual) page so that I can see them, figure out whether I agree with myself and learn; and to stimulate discussion and get other views on my (possibly uninformed, incorrect or half-formed) thoughts, also to learn. In other words, a thing I do for me. It’s easy to forget that and worry too much about whether anyone else wants to read my s—t. Will you notice any changes? Maybe? Maybe not? Who knows. But it’s a new year and that’s as good a time for a change as any. When is a persistent identifier not persistent? Or an identifier? I wrote a post on the problems with ISBNs as persistent identifiers (PIDs) for work, so check it out if that sounds interesting. IDCC20 reflections I’m just back from IDCC20, so here are a few reflections on this year’s conference. You can find all the available slides and links to shared notes on the conference programme. There’s also a list of all the posters and an overview of the Unconference. Skills for curation of diverse datasets Here in the UK and elsewhere, you’re unlikely to find many institutions claiming to apply a deep level of curation to every dataset/software package/etc deposited with them. There are so many different kinds of data and so few people in any one institution doing “curation” that it’s impossible to do this for everything. Absent the knowledge and skills required to fully evaluate an object, the best that can be done is usually to make a sense check on the metadata and flag up with the depositor the potential for high-level issues such as accidental disclosure of sensitive personal information. The Data Curation Network in the United States is aiming to address this issue by pooling expertise across multiple organisations. The pilot has been highly successful and they’re now looking to obtain funding to continue this work. The Swedish National Data Service is experimenting with a similar model, also with a lot of success. As well as sharing individual expertise, the DCN collaboration has also produced some excellent online quick-reference guides for curating common types of data. We had some further discussion as part of the Unconference on the final day about what it would look like to introduce this model in the UK. There was general agreement that this was a good idea and a way to make optimal use of sparse resources.
There were also very valid concerns that it would be difficult in the current financial climate for anyone to justify doing work for another organisation, apparently for free. In my mind there are two ways around this, which are not mutually exclusive by any stretch of the imagination. First is to Just Do It: form an informal network of curators around something simple like a mailing list, and give it a try. Second is for one or more trusted organisations to provide some coordination and structure. There are several candidates for this including DCC, Jisc, DPC and the British Library; we all have complementary strengths in this area so it’s my hope that we’ll be able to collaborate around it. In the meantime, I hope the discussion continues. Artificial intelligence, machine learning et al As you might expect at any tech-oriented conference there was a strong theme of AI running through many presentations, starting from the very first keynote from Francine Berman. Her talk, The Internet of Things: Utopia or Dystopia? used self-driving cars as a case study to unpack some of the ethical and privacy implications of AI. For example, driverless cars can potentially increase efficiency, both through route-planning and driving technique, but also by allowing fewer vehicles to be shared by more people. However, a shared vehicle is not a private space in the way your own car is: anything you say or do while in that space is potentially open to surveillance. Aside from this, there are some interesting ideas being discussed, particularly around the possibility of using machine learning to automate increasingly complex actions and workflows such as data curation and metadata enhancement. I didn’t get the impression anyone is doing this in the real world yet, but I’ve previously seen theoretical concepts discussed at IDCC make it into practice so watch this space! Playing games! Training is always a major IDCC theme, and this year two of the most popular conference submissions described games used to help teach digital curation concepts and skills. Mary Donaldson and Matt Mahon of the University of Glasgow presented their use of Lego to teach the concept of sufficient metadata. Participants build simple models before documenting the process and breaking them down again. Then everyone had to use someone else’s documentation to try and recreate the models, learning important lessons about assumptions and including sufficient detail. Kirsty Merrett and Zosia Beckles from the University of Bristol brought along their card game “Researchers, Impact and Publications (RIP)”, based on the popular “Cards Against Humanity”. RIP encourages players to examine some of the reasons for and against data sharing with plenty of humour thrown in. Both games were trialled by many of the attendees during Thursday’s Unconference. Summary I realised in Dublin that it’s 8 years since I attended my first IDCC, held at the University of Bristol in December 2011 while I was still working at the nearby University of Bath. While I haven’t been every year, I’ve been to every one held in Europe since then and it’s interesting to see what has and hasn’t changed. We’re no longer discussing data management plans, data scientists or various other things as abstract concepts that we’d like to encourage, but dealing with the real-world consequences of them. The conference has also grown over the years: this year was the biggest yet, boasting over 300 attendees. 
There has been especially big growth in attendees from North America, Australasia, Africa and the Middle East. That’s great for the diversity of the conference as it brings in more voices and viewpoints than ever. With more people around to interact with I have to work harder to manage my energy levels but I think that’s a small price to pay. Iosevka: a nice fixed-width font Iosevka is a nice, slender monospace font with a lot of configurable variations. Check it out: https://typeof.net/Iosevka/ Replacing comments with webmentions Just a quickie to say that I’ve replaced the comment section at the bottom of each post with webmentions, which allows you to comment by posting on your own site and linking here. It’s a fundamental part of the IndieWeb, which I’m slowly getting to grips with, having been a halfway member of it for years by virtue of having my own site on my own domain. I’d already got rid of Google Analytics to stop forcing that tracking on my visitors, and I wanted to get rid of Disqus too because I’m pretty sure the only way that is free for me is if they’re selling my data and yours to third parties. Webmention is a nice alternative because it relies only on open standards, has no tracking and allows people to control their own comments. While I’m currently using a third-party service to help, I can switch to self-hosted at any point in the future, completely transparently. Thanks to webmention.io, which handles incoming webmentions for me, and webmention.js, which displays them on the site, I can keep it all static and not have to implement any of this myself, which is nice. It’s a bit harder to comment because you have to be able to host your own content somewhere, but then almost no-one ever commented anyway, so it’s not like I’ll lose anything! Plus, if I get Bridgy set up right, you should be able to comment just by replying on Mastodon, Twitter or a few other places. A spot of web searching shows that I’m not the first to make the Disqus -> webmentions switch (yes, I’m putting these links in blatantly to test outgoing webmentions with Telegraph…): So long Disqus, hello webmention — Nicholas Hoizey Bye Disqus, hello Webmention! — Evert Pot Implementing Webmention on a static site — Deluvi Let’s see how this goes! Bridging Carpentries Slack channels to Matrix It looks like I’ve accidentally taken charge of bridging a bunch of The Carpentries Slack channels over to Matrix. Given this, it seems like a good idea to explain what that sentence means and reflect a little on my reasoning. I’m more than happy to discuss the pros and cons of this approach. If you just want to try chatting in Matrix, jump to the getting started section. What are Slack and Matrix? Slack (see also on Wikipedia), for those not familiar with it, is an online text chat platform with the feel of IRC (Internet Relay Chat), a modern look and feel, and both web and smartphone interfaces. By providing a free tier that meets many people’s needs on its own, Slack has become the communication platform of choice for thousands of online communities, private projects and more. One of the major disadvantages of using Slack’s free tier, as many community organisations do, is that as an incentive to upgrade to a paid service your chat history is limited to the most recent 10,000 messages across all channels. For a busy community like The Carpentries, this means that messages older than about 6-7 weeks are already inaccessible, rendering some of the quieter channels apparently empty.
As Slack is at pains to point out, that history isn’t gone, just archived and hidden from view unless you pay the low, low price of $1/user/month. That doesn’t seem too pricy, unless you’re a non-profit organisation with a lot of projects you want to fund and an active membership of several hundred worldwide, at which point it soon adds up. Slack does offer to waive the cost for registered non-profit organisations, but only for one community. The Carpentries is not an independent organisation, but one fiscally sponsored by Community Initiatives, which has already used its free quota of one elsewhere, rendering the Carpentries ineligible. Other umbrella organisations such as NumFocus (and, I expect, Mozilla) also run into this problem with Slack. So, we have a community which is slowly and inexorably losing its own history behind a paywall. For some people this is simply annoying, but from my perspective as a facilitator of the preservation of digital things the community is haemorrhaging an important record of its early history. Enter Matrix. Matrix is a chat platform similar to IRC, Slack or Discord. It’s divided into separate channels, and users can join one or more of these to take part in the conversation happening in those channels. What sets it apart from older technology like IRC and walled gardens like Slack & Discord is that it’s federated. Federation means simply that users on any server can communicate with users and channels on any other server. Usernames and channel addresses specify both the individual identifier and the server it calls home, just as your email address contains all the information needed for my email server to route messages to it. While users are currently tied to their home server, channels can be mirrored and synchronised across multiple servers, making the overall system much more resilient. Can’t connect to your favourite channel on server X? No problem: just connect via its alias on server Y and when X comes back online it will be resynchronised. The technology used is much more modern and secure than the aging IRC protocol, and there’s no vendor lock-in like there is with closed platforms like Slack and Discord. On top of that, Matrix channels can easily be “bridged” to channels/rooms on other platforms, including, yes, Slack, so that you can join on Matrix and transparently talk to people connected to the bridged room, or vice versa. So, to summarise: The current Carpentries Slack channels could be bridged to Matrix at no cost and with no disruption to existing users The history of those channels from that point on would be retained on matrix.org and accessible even when it’s no longer available on Slack If at some point in the future The Carpentries chose to invest in its own Matrix server, it could adopt and become the main Matrix home of these channels without disruption to users of either Matrix or (if it’s still in use at that point) Slack Matrix is an open protocol, with a reference server implementation and wide range of clients all available as free software, which aligns with the values of the Carpentries community On top of this: I’m fed up of having so many different Slack teams to switch between to see the channels in all of them, and prefer having all the channels I regularly visit in a single unified interface; I wanted to see how easy this would be and whether others would also be interested.
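(An aside for the programmatically inclined: because Matrix is an open protocol with open client libraries, it’s also easy to script. The snippet below is purely illustrative: a minimal sketch using the matrix-nio Python library, with an invented account, password and room alias, and no error handling; nothing like this is needed just to chat.)

import asyncio
from nio import AsyncClient

async def main():
    # Connect to the matrix.org homeserver as an (invented) existing account.
    client = AsyncClient("https://matrix.org", "@alice:matrix.org")
    await client.login("not-a-real-password")

    # Join a room by its alias; the response includes the canonical room ID.
    resp = await client.join("#some-bridged-channel:matrix.org")

    # Post a plain-text message into that room.
    await client.room_send(
        resp.room_id,
        message_type="m.room.message",
        content={"msgtype": "m.text", "body": "Hello from a little Matrix script!"},
    )
    await client.close()

asyncio.run(main())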
Given all this, I thought I’d go ahead and give it a try to see if it made things more manageable for me and to see what the reaction would be from the community. How can I get started? !!! reminder Please remember that, like any other Carpentries space, the Code of Conduct applies in all of these channels. First, sign up for a Matrix account. The quickest way to do this is on the Matrix “Try now” page, which will take you to the Riot Web client which for many is synonymous with Matrix. Other clients are also available for the adventurous. Second, join one of the channels. The links below will take you to a page that will let you connect via your preferred client. You’ll need to log in as they are set not to allow guest access, but, unlike Slack, you won’t need an invitation to be able to join. #general — the main open channel to discuss all things Carpentries #random — anything that would be considered offtopic elsewhere #welcome — join in and introduce yourself! That’s all there is to getting started with Matrix. To find all the bridged channels there’s a Matrix “community” that I’ve added them all to: Carpentries Matrix community. There’s a lot more, including how to bridge your favourite channels from Slack to Matrix, but this is all I’ve got time and space for here! If you want to know more, leave a comment below, or send me a message on Slack (jezcope) or maybe Matrix (@petrichor:matrix.org)! I’ve also made a separate channel for Matrix-Slack discussions: #matrix on Slack and Carpentries Matrix Discussion on Matrix MozFest19 first reflections Discussions of neurodiversity at #mozfest Photo by Jennifer Riggins The other weekend I had my first experience of Mozilla Festival, aka #mozfest. It was pretty awesome. I met quite a few people in real life that I’ve previously only known (/stalked) on Twitter, and caught up with others that I haven’t seen for a while. I had the honour of co-facilitating a workshop session on imposter syndrome and how to deal with it with the wonderful Yo Yehudi and Emmy Tsang. We all learned a lot and hope our participants did too; we’ll be putting together a summary blog post as soon as we can get our act together! I also attended a great session, led by Kiran Oliver (psst, they’re looking for a new challenge), on how to encourage and support a neurodiverse workforce. I was only there for the one day, and I really wish that I’d taken the plunge and committed to the whole weekend. There’s always next year though! To be honest, I’m just disappointed that I never had the courage to go sooner, Music for working Today1 the office conversation turned to blocking out background noise. (No, the irony is not lost on me.) Like many people I work in a large, open-plan office, and I’m not alone amongst my colleagues in sometimes needing to find a way to boost concentration by blocking out distractions. Not everyone is like this, but I find music does the trick for me. I also find that different types of music are better for different types of work, and I use this to try and manage my energy better. There are more distractions than auditory noise, and at times I really struggle with visual noise. Rather than have this post turn into a rant about the evils of open-plan offices, I’ll just mention that the scientific evidence doesn’t paint them in a good light2, or at least suggests that the benefits are more limited in scope than is commonly thought3, and move on to what I actually wanted to share: good music for working to. 
There are a number of genres that I find useful for working. Generally, these have in common a consistent tempo, a lack of lyrics, and enough variation to prevent boredom without distracting. Familiarity helps my concentration too, so I’ll often listen to a restricted set of albums for a while, gradually moving on by dropping one out and bringing in another. In my case this includes: Traditional dance music, generally from northern and western European traditions for me. This music has to be rhythmically consistent to allow social dancing, and while the melodies are typically simple repeated phrases, skilled musicians improvise around that to make something beautiful. I tend to go through phases of listening to particular traditions; I’m currently listening to a lot of French, Belgian and Scandinavian. Computer game soundtracks, which are specifically designed to enhance gameplay without distracting, making them perfect for other activities requiring a similar level of concentration. Chiptunes and other music incorporating it; partly overlapping with the previous category, chiptunes is music made by hacking the audio chips from (usually) old computers and games machines to become an instrument for new music. Because of the nature of the instrument, this will have millisecond-perfect rhythm and again makes for undistracting noise blocking with an extra helping of nostalgia! Purists would disagree with me, but I like artists that combine chiptunes with other instruments and effects to make something more complete-sounding. Retrowave/synthwave/outrun, synth-driven music that’s instantly familiar as the soundtrack to many 90s sci-fi and thriller movies. Atmospheric, almost dreamy, but rhythmic with a driving beat, it’s another genre that fits into the “pleasing but not too surprising” category for me. So where to find this stuff? One of the best resources I’ve found is Music for Programming, which provides carefully curated playlists of mostly electronic music designed to energise without distracting. They’re so well done that the tracks move seamlessly, one to the next, without ever getting boring. Spotify is an obvious option, and I do use it quite a lot. However, I’ve started trying to find ways to support artists more directly, and Bandcamp seems to be a good way of doing that. It’s really easy to browse by genre, or discover artists similar to what you’re currently hearing. You can listen for free as long as you don’t mind occasional nags to buy the music you’re hearing, but you can also buy tracks or albums. Music you’ve paid for is downloadable in several open, DRM-free formats for you to keep, and you know that a decent chunk of that cash is going directly to that artist. I also love noise generators; not exactly music, but a variety of pleasant background noises, some of which nicely obscure typical office noise. I particularly like mynoise.net, which has a cornucopia of different natural and synthetic noises. Each generator comes with a range of sliders allowing you to tweak the composition and frequency range, and will even animate them randomly for you to create a gently shifting soundscape. A much simpler, but still great, option is Noisli with its nice clean interface. Both offer apps for iOS and Android. For bonus points, you can always try combining one or more of the above. Adding in a noise generator allows me to listen to quieter music while still getting good environmental isolation when I need concentration.
Another favourite combo is to open both the cafe and rainfall generators from myNoise, made easier by the ability to pop out a mini-player then open up a second generator. I must be missing stuff though. What other musical genres should I try? What background sounds are nice to work to? Well, you know. The other day. Whatever. ↩︎ See e.g.: Lee, So Young, and Jay L. Brand. ‘Effects of Control over Office Workspace on Perceptions of the Work Environment and Work Outcomes’. Journal of Environmental Psychology 25, no. 3 (1 September 2005): 323–33. https://doi.org/10.1016/j.jenvp.2005.08.001. ↩︎ Open plan offices can actually work under certain conditions, The Conversation ↩︎ Working at the British Library: 6 months in It barely seems like it, but I’ve been at the British Library now for nearly 6 months. It always takes a long time to adjust and from experience I know it’ll be another year before I feel fully settled, but my team, department and other colleagues have really made me feel welcome and like I belong. One thing that hasn’t got old yet is the occasional thrill of remembering that I work at my national library now. Every now and then I’ll catch a glimpse of the collections at Boston Spa or step into one of the reading rooms and think “wow, I actually work here!” I also like having a national and international role to play, which means I get to travel a bit more than I used to. Budgets are still tight so there are limits, and I still prefer to be home more often than not, but there is more scope in this job than I’ve had previously for travelling to conferences, giving talks that change the way people think, and learning in different contexts. I’m learning a lot too, especially how to work with and manage people split across multiple sites, and the care and feeding of budgets. As well as missing my old team at Sheffield, I do also miss some of the direct contact I had with researchers in HE. I especially miss the teaching work, but also the higher-level influencing of more senior academics to change practices on a wider scale. Still, I get to use those influencing skills in different ways now, and I’m still involved with the Carpentries, which should let me keep my hand in with teaching. I still deal with my general tendency to try and do All The Things, and as before I’m slowly learning to recognise it, tame it and very occasionally turn it to my advantage. That also leads to feelings of imposterism that are only magnified by the knowledge that I now work at a national institution! It’s a constant struggle some days to believe that I’ve actually earned my place here through hard work. Even if I don’t always feel that I have, my colleagues here certainly have, so I should have more faith in their opinion of me. Finally, I couldn’t write this type of thing without mentioning the commute. I’ve gone from 90 minutes each way on a good day (up to twice that if the trains were disrupted) to 35 minutes each way along fairly open roads. I have less time to read, but much more time at home. On top of that, the library has implemented flexitime across all pay grades, with even senior managers strongly encouraged to make full use. Not only is this an important enabler of equality across the organisation, it relieves, for me personally, the pressure to work over my contracted hours and the guilt I’ve always felt at leaving work even 10 minutes early. If I work late, it’s now a choice I’m making based on business needs instead of guilt and in full knowledge that I’ll get that time back later.
So that’s where I am right now. I’m really enjoying the work and the culture, and I look forward to what the next 6 months will bring! RDA Plenary 13 reflection Photo by me I sit here writing this in the departure lounge at Philadelphia International Airport, waiting for my Aer Lingus flight back after a week at the 13th Research Data Alliance (RDA) Plenary (although I’m actually publishing this a week or so later at home). I’m pretty exhausted, partly because of the jet lag, and partly because it’s been a very full week with so much to take in. It’s my first time at an RDA Plenary, and it was quite a new experience for me! First off, it’s my first time outside Europe, and thus my first time crossing quite so many timezones. I’ve been waking at 5am and ready to drop by 8pm, but I’ve struggled on through! Secondly, it’s the biggest conference I’ve been to for a long time, both in number of attendees and number of parallel sessions. There’s been a lot of sustained input so I’ve been very glad to have a room in the conference hotel and be able to escape for a few minutes when I needed to recharge. Thirdly, it’s not really like any other conference I’ve been to: rather than having large numbers of presentations submitted by attendees, each session comprises lots of parallel meetings of RDA interest groups and working groups. It’s more community-oriented: an opportunity for groups to get together face to face and make plans or show off results. I found it pretty intense and struggled to take it all in, but incredibly valuable nonetheless. Lots of information to process (I took a lot of notes) and a few contacts to follow up on too, so overall I loved it! Using Pipfile in Binder Photo by Sear Greyson on Unsplash I recently attended a workshop, organised by the excellent team of the Turing Way project, on a tool called BinderHub. BinderHub, along with public hosting platform MyBinder, allows you to publish computational notebooks online as “binders” such that they’re not static but fully interactive. It’s able to do this by using a tool called repo2docker to capture the full computational environment and dependencies required to run the notebook. !!! aside “What is the Turing Way?” The Turing Way is, in its own words, “a lightly opinionated guide to reproducible data science.” The team is building an open textbook and running a number of workshops for scientists and research software engineers, and you should check out the project on Github. You could even contribute! The Binder process goes roughly like this: Do some work in a Jupyter Notebook or similar Put it into a public git repository Add some extra metadata describing the packages and versions your code relies on Go to mybinder.org and tell it where to find your repository Open the URL it generates for you Profit Other than step 5, which can take some time to build the binder, this is a remarkably quick process. It supports a number of different languages too, including built-in support for R, Python and Julia and the ability to configure pretty much any other language that will run on Linux. However, the Python support currently requires you to have either a requirements.txt or Conda-style environment.yml file to specify dependencies, and I commonly use a Pipfile for this instead. Pipfile allows you to specify a loose range of compatible versions for maximal convenience, but then locks in specific versions for maximal reproducibility. 
You can upgrade packages any time you want, but you’re fully in control of when that happens, and the locked versions are checked into version control so that everyone working on a project gets consistency. Since Pipfile is emerging as something of a standard, I thought I’d see if I could use that in a binder, and it turns out to be remarkably simple. The reference implementation of Pipfile is a tool called pipenv by the prolific Kenneth Reitz. All you need to use this in your binder is two files of one line each. requirements.txt tells repo2docker to build a Python-based binder, and contains a single line to install the pipenv package: pipenv Then postBuild is used by repo2docker to install all other dependencies using pipenv: pipenv install --system The --system flag tells pipenv to install packages globally (its default behaviour is to create a Python virtualenv). With these two files, the binder builds and runs as expected. You can see a complete example that I put together during the workshop here on Gitlab. What do you think I should write about? I’ve found it increasingly difficult to make time to blog, and it’s not so much not having the time — I’m pretty privileged in that regard — but finding the motivation. Thinking about what used to motivate me, one of the big things was writing things that other people wanted to read. Rather than try to guess, I thought I’d ask! Those who know what I'm about, what would you read about, if it was written by me? I'm trying to break through the blog-writers block and would love to know what other people would like to see my ill-considered opinions on.— Jez Cope (@jezcope) March 7, 2019 I’m still looking for ideas, so please tweet me or leave me a comment below. Below are a few thoughts that I’m planning to do something with. Something taking one of the more techy aspects of Open Research, breaking it down and explaining the benefits for non-techy folks?— Dr Beth 🏳️‍🌈 🐺 (@PhdGeek) March 7, 2019 Skills (both techy and non techy) that people need to most effectively support RDM— Kate O'Neill (@KateFONeill) March 7, 2019 Sometimes I forget that my background makes me well-qualified to take some of these technical aspects of the job and break them down for different audiences. There might be a whole series in this… Carrying on our conversation last week I'd love to hear more about how you've found moving from an HE lib to a national library and how you see the BL's role in RDM. Appreciate this might be a bit niche/me looking for more interesting things to cite :)— Rosie Higman (@RosieHLib) March 7, 2019 This is interesting, and something I’d like to reflect on; moving from one job to another always has lessons and it’s easy to miss them if you’re not paying attention. Another one for the pile. Life without admin rights to your computer— Mike Croucher (@walkingrandomly) March 7, 2019 This is so frustrating as an end user, but at the same time I get that endpoint security is difficult and there are massive risks associated with letting end users have admin rights. This is particularly important at the BL: as custodians of a nation’s cultural heritage, the risk for us is bigger than for many and for this reason we are now Cyber Essentials Plus certified. At some point I’d like to do some research and have a conversation with someone who knows a lot more about InfoSec to work out what the proper approach to this is, maybe involving VMs and a demilitarized zone on the network.
I’m always looking for more inspiration, so please leave a comment if you’ve got anything you’d like to read my thoughts on. If you’re not familiar with my writing, please take a minute or two to explore the blog; the tags page is probably a good place to get an overview. Ultimate Hacking Keyboard: first thoughts Following on from the excitement of having built a functioning keyboard myself, I got a parcel on Monday. Inside was something that I’ve been waiting for since September: an Ultimate Hacking Keyboard! Where the custom-built Laplace is small and quiet for travelling, the UHK is to be my main workhorse in the study at home. Here are my first impressions: Key switches I went with Kailh blue switches from the available options. In stark contrast to the quiet blacks on the Laplace, blues are NOISY! They have an extra piece of plastic inside the switch that causes an audible and tactile click when the switch activates. This makes them very satisfying to type on and should help as I train my fingers not to bottom out while typing, but does make them unsuitable for use in a shared office! Here are some animations showing how the main types of key switch vary. Layout This keyboard has what’s known as a 60% layout: no number pad, arrows or function keys. As with the more spartan Laplace, these “missing” keys are made up for with programmable layers. For example, the arrow keys are on the Mod layer on the I/J/K/L keys, so I can access them without moving from the home row. I actually find this preferable to having to move my hand to the right to reach them, and I really never used the number pad in any case. Split This is a split keyboard, which means that the left and right halves can be separated to place the hands further apart which eases strain across the shoulders. The UHK has a neat coiled cable joining the two which doesn’t get in the way. A cool design feature is that the two halves can be slotted back together and function perfectly well as a non-split keyboard too, held together by magnets. There are even electrical contacts so that when the two are joined you don’t need the linking cable. Programming The board is fully programmable, and this is achieved via a custom (open source) GUI tool which talks to the (open source) firmware on the board. You can have multiple keymaps, each of which has a separate Base, Mod, Fn and Mouse layer, and there’s an LED display that shows a short mnemonic for the currently active map. I already have a customised Dvorak layout for day-to-day use, plus a standard QWERTY for not-me to use and an alternative QWERTY which will be slowly tweaked for games that don’t work well with Dvorak. Mouse keys One cool feature that the designers have included in the firmware is the ability to emulate a mouse. There’s a separate layer that allows me to move the cursor, scroll and click without moving my hands from the keyboard. Palm rests Not much to say about the palm rests, other than they are solid wood, and chunky, and really add a little something. I have to say, I really like it so far! Overall it feels really well designed, with every little detail carefully thought out and excellent build quality and a really solid feeling. Custom-built keyboard I’m typing this post on a keyboard I made myself, and I’m rather excited about it! Why make my own keyboard? 
I wanted to learn a little bit about practical electronics, and I like to learn by doing I wanted to have the feeling of making something useful with my own hands I actually need a small, keyboard with good-quality switches now that I travel a fair bit for work and this lets me completely customise it to my needs Just because! While it is possible to make a keyboard completely from scratch, it makes much more sense to put together some premade parts. The parts you need are: PCB (printed circuit board): the backbone of the keyboard, to which all the other electrical components attach, this defines the possible physical locations for each key Switches: one for each key to complete a circuit whenever you press it Keycaps: switches are pretty ugly and pretty uncomfortable to press, so each one gets a cap; these are what you probably think of as the “keys” on your keyboard and come in almost limitless variety of designs (within the obvious size limitation) and are the easiest bit of personalisation Controller: the clever bit, which detects open and closed switches on the PCB and tells your computer what keys you pressed via a USB cable Firmware: the program that runs on the controller starts off as source code like any other program, and altering this can make the keyboard behave in loads of different ways, from different layouts to multiple layers accessed by holding a particular key, to macros and even emulating a mouse! In my case, I’ve gone for the following: PCB Laplace from keeb.io, a very compact 47-key (“40%") board, with no number pad, function keys or number row, but a lot of flexibility for key placement on the bottom row. One of my key design goals was small size so I can just pop it in my bag and have on my lap on the train. Controller Elite-C, designed specifically for keyboard builds to be physically compatible with the cheaper Pro Micro, with a more-robust USB port (the Pro Micro’s has a tendency to snap off), and made easier to program with a built-in reset button and better bootloader. Switches Gateron Black: Gateron is one of a number of manufacturers of mechanical switches compatible with the popular Cherry range. The black switch is linear (no click or bump at the activation point) and slightly heavier sprung than the more common red. Cherry also make a black switch but the Gateron version is slightly lighter and having tested a few I found them smoother too. My key goal here was to reduce noise, as the stronger spring will help me type accurately without hitting the bottom of the keystroke with an audible sound. Keycaps Blank grey PBT in DSA profile: this keyboard layout has a lot of non-standard sized keys, so blank keycaps meant that I wouldn’t be putting lots of keys out of their usual position; they’re also relatively cheap, fairly classy IMHO and a good placeholder until I end up getting some really cool caps on a group buy or something; oh, and it minimises the chance of someone else trying the keyboard and getting freaked out by the layout… Firmware QMK (Quantum Mechanical Keyboard), with a work-in-progress layout, based on Dvorak. QMK has a lot of features and allows you to fully program each and every key, with multiple layers accessed through several different routes. Because there are so few keys on this board, I’ll need to make good use of layers to make all the keys on a usual keyboard available. 
Dvorak Simplified Keyboard I’m grateful to the folks of the Leeds Hack Space, especially Nav & Mark, who patiently coached me in various soldering techniques and good practice, but also everyone else, who was so friendly and welcoming and interested in my project. I’m really pleased with the result, which is small, light and fully customisable. Playing with QMK firmware features will keep me occupied for quite a while! This isn’t the end though, as I’ll need a case to keep the dust out. I’m hoping to be able to 3D print this or mill it from wood with a CNC mill, for which I’ll need to head back to the Hack Space! Less, but better “Weniger, aber besser” — Dieter Rams {:.big-quote} I can barely believe it’s a full year since I published my intentions for 2018. A lot has happened since then. Principally: in November I started a new job as Data Services Lead at The British Library. One thing that hasn’t changed is my tendency to try to do too much, so this year I’m going to try and focus on a single intention, a translation of designer Dieter Rams' famous quote above: Less, but better. This chimes with a couple of other things I was toying with over the Christmas break, as they’re essentially other ways of saying the same thing: Take it steady One thing at a time I’m also going to keep in mind those touchstones from last year: What difference is this making? Am I looking after myself? Do I have evidence for this? I mainly forget to think about them, so I’ll be sticking up post-its everywhere to help me remember! How to extend Python with Rust: part 1 Python is great, but I find it useful to have an alternative language under my belt for occasions when no amount of Pythonic cleverness will make some bit of code run fast enough. One of my main reasons for wanting to learn Rust was to have something better than C for that. Not only does Rust have all sorts of advantages that make it a good choice for code that needs to run fast and correctly, it’s also got a couple of rather nice crates (libraries) that make interfacing with Python a lot nicer. Here’s a little tutorial to show you how easy it is to call a simple Rust function from Python. If you want to try it yourself, you’ll find the code on GitHub. !!! prerequisites I’m assuming for this tutorial that you’re already familiar with writing Python scripts and importing & using packages, and that you’re comfortable using the command line. You’ll also need to have installed Rust. The Rust bit The quickest way to get compiled code into Python is to use the built-in ctypes package. This is Python’s “Foreign Function Interface” or FFI: a means of calling functions outside the language you’re using to make the call. ctypes allows us to call arbitrary functions in a shared library1, as long as those functions conform to certain standard C language calling conventions. Thankfully, Rust tries hard to make it easy for us to build such a shared library. The first thing to do is to create a new project with cargo, the Rust build tool: $ cargo new rustfrompy Created library `rustfrompy` project $ tree . ├── Cargo.toml └── src └── lib.rs 1 directory, 2 files !!! aside I use the fairly common convention that text set in fixed-width font is either example code or commands to type in. For the latter, a $ precedes the command that you type (omit the $), and lines that don’t start with a $ are output from the previous command. I assume a basic familiarity with Unix-style command line, but I should probably put in some links to resources if you need to learn more!
We need to edit the Cargo.toml file and add a [lib] section: [package] name = "rustfrompy" version = "0.1.0" authors = ["Jez Cope <j.cope@erambler.co.uk>"] [dependencies] [lib] name = "rustfrompy" crate-type = ["cdylib"] This tells cargo that we want to make a C-compatible dynamic library (crate-type = ["cdylib"]) and what to call it, plus some standard metadata. We can then put our code in src/lib.rs. We’ll just use a simple toy function that adds two numbers together: #[no_mangle] pub extern "C" fn add(a: i64, b: i64) -> i64 { a + b } Notice the pub keyword, which instructs the compiler to make this function accessible to other modules; the extern "C" qualifier, which tells it to use the standard C calling convention; and the #[no_mangle] annotation, which tells it to use the standard C naming conventions for functions. If we don’t use #[no_mangle], then Rust will generate a new name for the function for its own nefarious purposes, and as a side effect we won’t know what to call it when we want to use it from Python. Being good developers, let’s also add a test: #[cfg(test)] mod test { use ::*; #[test] fn test_add() { assert_eq!(4, add(2, 2)); } } We can now run cargo test which will compile that code and run the test: $ cargo test Compiling rustfrompy v0.1.0 (file:///home/jez/Personal/Projects/rustfrompy) Finished dev [unoptimized + debuginfo] target(s) in 1.2 secs Running target/debug/deps/rustfrompy-3033caaa9f5f17aa running 1 test test test::test_add ... ok test result: ok. 1 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out Everything worked! Now just to build that shared library and we can try calling it from Python: $ cargo build Compiling rustfrompy v0.1.0 (file:///home/jez/Personal/Projects/rustfrompy) Finished dev [unoptimized + debuginfo] target(s) in 0.30 secs Notice that the build is unoptimized and includes debugging information: this is useful in development, but once we’re ready to use our code it will run much faster if we compile it with optimisations. Cargo makes this easy: $ cargo build --release Compiling rustfrompy v0.1.0 (file:///home/jez/Personal/Projects/rustfrompy) Finished release [optimized] target(s) in 0.30 secs The Python bit After all that, the Python bit is pretty short. First we import the ctypes package (which is included in all recent Python versions): from ctypes import cdll Cargo has tidied our shared library away into a folder, so we need to tell Python where to load it from. On Linux, it will be called lib<something>.so where the “something” is the crate name from Cargo.toml, “rustfrompy”: lib = cdll.LoadLibrary('target/release/librustfrompy.so') Finally we can call the function anywhere we want. Here it is in a pytest-style test: def test_rust_add(): assert lib.add(27, 15) == 42 If you have pytest installed (and you should!) you can run the whole test like this: $ pytest --verbose test.py ====================================== test session starts ====================================== platform linux -- Python 3.6.4, pytest-3.1.1, py-1.4.33, pluggy-0.4.0 -- /home/jez/.virtualenvs/datasci/bin/python cachedir: .cache rootdir: /home/jez/Personal/Projects/rustfrompy, inifile: collected 1 items test.py::test_rust_add PASSED It worked! I’ve put both the Rust and Python code on GitHub if you want to try it for yourself. Shortcomings OK, so that was a pretty simple example, and I glossed over a lot of things. For example, what would happen if we did lib.add(2.0, 2)? This causes Python to throw an error because our Rust function only accepts integers (64-bit signed integers, i64, to be precise), and we gave it a floating point number.
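As a taste of the fix described in the next paragraph, here is a minimal, hypothetical sketch (not taken from the original repository) of declaring the function’s signature on the Python side, so that ctypes knows exactly what to pass in and what to expect back:

from ctypes import cdll, c_int64

lib = cdll.LoadLibrary('target/release/librustfrompy.so')

# Declare the FFI signature: two 64-bit signed integers in, one 64-bit
# signed integer out. Without this, ctypes falls back to the platform's
# default c_int for both the arguments and the return value.
lib.add.argtypes = [c_int64, c_int64]
lib.add.restype = c_int64

assert lib.add(27, 15) == 42
# lib.add(2.0, 2) would still raise ctypes.ArgumentError, but now it is
# checked against the declared c_int64 types rather than a guess.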
ctypes can’t guess what type(s) a given function will work with, but it can at least tell us when we get it wrong. To fix this properly, we need to do some extra work, telling the ctypes library what the argument and return types for each function are (as in the sketch above). For a more complex library, there will probably be more housekeeping to do, such as translating return codes from functions into more Pythonic-style errors. For a small example like this there isn’t much of a problem, but the bigger your compiled library the more extra boilerplate is required on the Python side just to use all the functions. When you’re working with an existing library you don’t have much choice about this, but if you’re building it from scratch specifically to interface with Python, there’s a better way using the Python C API. You can call this directly in Rust, but there are a couple of Rust crates that make life much easier, and I’ll be taking a look at those in a future blog post. .so on Linux, .dylib on Mac and .dll on Windows ↩︎ New Year’s irresolution Photo by Andrew Hughes on Unsplash I’ve chosen not to make any specific resolutions this year; I’ve found that they just don’t work for me. Like many people, all I get is a sense of guilt when I inevitably fail to live up to the expectations I set myself at the start of the year. However, I have set a couple of what I’m referring to as “themes” for the year: touchstones that I’ll aim to refer to when setting priorities or just feeling a bit overwhelmed or lacking in direction. They are: Contribution Self-care Measurement I may do some blog posts expanding on these, but in the meantime, I’ve put together a handful of questions to help me think about priorities and get perspective when I’m doing (or avoiding doing) something. What difference is this making? I feel more motivated when I can figure out how I’m contributing to something bigger than myself. In society? In my organisation? To my friends & family? Am I looking after myself? I focus a lot on the expectations others have (or at least that I think others have) of me, but I can’t do anything well unless I’m generally happy and healthy. Is this making me happier and healthier? Is this building my capacity to look after myself, my family & friends and do my job? Is this worth the amount of time and energy I’m putting in? Do I have evidence for this? I don’t have to base decisions purely on feelings/opinions: I have the skills to obtain, analyse and interpret data. Is this fact or opinion? What are the facts? Am I overthinking this? Can I put a confidence interval on this? Build documents from code and data with Saga !!! tldr “TL;DR” I’ve made Saga, a thing for compiling documents by combining code and data with templates. What is it? Saga is a very simple command-line tool that reads in one or more data files, runs one or more scripts, then passes the results into a template to produce a final output document. It enables you to maintain a clean separation between data, logic and presentation and produce data-based documents that can easily be updated. That allows the flow of data through the document to be easily understood, a cornerstone of reproducible analysis. You run it like this: saga build -d data.yaml -d other_data.yaml \ -s analysis.py -t report.md.tmpl \ -O report.md Any scripts specified with -s will have access to the data in local variables, and any changes to local variables in a script will be retained when everything is passed to the template for rendering.
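To make the flow from data files through scripts to the final template concrete, here is a rough sketch of the idea in plain Python. This is not Saga’s actual implementation, just an illustration of the basic loop it wraps, using single files named as in the example command above:

import yaml
from mako.template import Template

env = {}

# Data files become variables available to the scripts and the template.
with open("data.yaml") as f:
    env.update(yaml.safe_load(f))   # assumes the YAML top level is a mapping

# Scripts run with those variables in scope; anything they define is kept.
with open("analysis.py") as f:
    exec(f.read(), env)

# Finally, render the template with the accumulated environment.
with open("report.md.tmpl") as f:
    public = {k: v for k, v in env.items() if not k.startswith("__")}
    print(Template(f.read()).render(**public))

Saga itself layers the command-line interface and support for multiple data files and scripts on top of something like this.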
For debugging, you can also do: saga dump -d data.yaml -d other_data.yaml -s analysis.py which will print out the full environment that would be passed to your template with saga build. Features Right now this is a really early version. It does the job but I have lots of ideas for features to add if I ever have time. At present it does the following: Reads data from one or more YAML files Transforms data with one or more Python scripts Renders a template in Mako format Works with any plain-text output format, including Markdown, LaTeX and HTML Use cases Write reproducible reports & papers based on machine-readable data Separate presentation from content in any document, e.g. your CV (example coming soon) Yours here? Get it! I haven’t released this on PyPI yet, but all the code is available on GitHub to try out. If you have pipenv installed (and if you use Python you should!), you can try it out in an isolated virtual environment by doing: git clone https://github.com/jezcope/sagadoc.git cd sagadoc pipenv install pipenv run saga or you can set up for development and run some tests: pipenv install --dev pipenv run pytest Why? Like a lot of people, I have to produce reports for work, often containing statistics computed from data. Although these generally aren’t academic research papers, I see no reason not to aim for a similar level of reproducibility: after all, if I’m telling other people to do it, I’d better take my own advice! A couple of times now I’ve done this by writing a template that holds the text of the report and placeholders for values, along with a Python script that reads in the data, calculates the statistics I want and completes the template. This is valuable for two main reasons: If anyone wants to know how I processed the data and calculated those statistics, it’s all there: no need to try and remember and reproduce a series of button clicks in Excel; If the data or calculations change, I just need to update the relevant part and run it again, and all the relevant parts of the document will be updated. This is particularly important if changing a single data value requires recalculation of dozens of tables, charts, etc. It also gives me the potential to factor out and reuse bits of code in the future, add tests and version control everything. Now that I’ve done this more than once (and it seems likely I’ll do it again) it makes sense to package that script up in a more portable form so I don’t have to write it over and over again (or, shock horror, copy & paste it!). It saves time, and gives others the possibility to make use of it. Prior art I’m not the first person to think of this, but I couldn’t find anything that did exactly what I needed. Several tools will let you interweave code and prose, including the results of evaluating each code snippet in the document: chief among these are Jupyter and Rmarkdown. There are also tools that let you write code in the order that makes most sense to read and then rearrange it into the right order to execute, so-called literate programming. The original tool for this is the venerable noweb. Sadly there is very little that combines both of these and allows you to insert the results of various calculations at arbitrary points in a document, independent of the order of either presenting or executing the code. The only two that I’m aware of are Dexy and org-mode. Unfortunately, Dexy currently only works on Legacy Python (/Python 2) and org-mode requires Emacs (which is fine but not exactly portable).
Rmarkdown comes close and supports a range of languages but the full feature set is only available with R. Actually, my ideal solution is org-mode without the Emacs dependency, because that’s the most flexible solution; maybe one day I’ll have both the time and skill to implement that. It’s also possible I might be able to figure out Dexy’s internals to add what I want to it, but until then Saga does the job! Future work There are lots of features that I’d still like to add when I have time: Some actual documentation! And examples! More data formats (e.g. CSV, JSON, TOML) More languages (e.g. R, Julia) Fetching remote data over HTTP Caching of intermediate results to speed up rebuilds For now, though, I’d love for you to try it out and let me know what you think! As ever, comment here, tweet me or start an issue on GitHub. Why try Rust for scientific computing? When you’re writing analysis code, Python (or R, or JavaScript, or …) is usually the right choice. These high-level languages are set up to make you as productive as possible, and common tasks like array manipulation have been well optimised. However, sometimes you just can’t get enough speed and need to turn to a lower-level compiled language. Often that will be C, C++ or Fortran, but I thought I’d do a short post on why I think you should consider Rust. One of my goals for 2017’s Advent of Code was to learn a modern, memory-safe, statically-typed language. I now know that there are quite a lot of options in this space, but two seem to stand out: Go & Rust. I gave both of them a try, and although I’ll probably go back to give Go a more thorough test at some point I found I got quite hooked on Rust. Both languages, though young, are definitely production-ready. Servo, the core of the new Firefox browser, is entirely written in Rust. In fact, Mozilla have been trying to rewrite the rendering core in C for nearly a decade, and switching to Rust let them get it done in just a couple of years. !!! tldr “TL;DR” - It’s fast: competitive with idiomatic C/C++, and no garbage-collection overhead - It’s harder to write buggy code, and compiler errors are actually helpful - It’s C-compatible: you can call into Rust code anywhere you’d call into C, call C/C++ from Rust, and incrementally replace C/C++ code with Rust - It has sensible modern syntax that makes your code clearer and more concise - Support for scientific computing is getting better all the time (matrix algebra libraries, built-in SIMD, safe concurrency) - It has a really friendly and active community - It’s production-ready: Servo, the new rendering core in Firefox, is built entirely in Rust Performance To start with, as a compiled language Rust executes much faster than a (pseudo-)interpreted language like Python or R; the price you pay for this is time spent compiling during development. However, having a compile step also allows the language to enforce certain guarantees, such as type-correctness and memory safety, which between them prevent whole classes of bugs from even being possible. Unlike Go (which, like many higher-level languages, uses a garbage collector), Rust handles memory safety at compile time through the concepts of ownership and borrowing. These can take some getting used to and were a big source of frustration when I was first figuring out the language, but ultimately contribute to Rust’s reliably-fast performance.
Performance can be unpredictable in a garbage-collected language because you can’t be sure when the GC is going to run and you need to understand it really well to stand a chance of optimising it if it becomes a problem. On the other hand, code that has the potential to be unsafe will result in compilation errors in Rust. There are a number of benchmarks (example) that show Rust’s performance on a par with idiomatic C & C++ code, something that very few languages can boast. Helpful error messages Because beginner Rust programmers often get compile errors, it’s really important that those errors are easy to interpret and fix, and Rust is great at this. Not only does it tell you what went wrong, but wherever possible it prints out your code annotated with arrows to show exactly where the error is, and makes specific suggestions for how to fix the error, which usually turn out to be correct. It also has a nice suite of warnings (things that don’t cause compilation to fail but may indicate bugs) that are just as informative, and this can be extended even further by using the clippy linting tool to further analyse your code. warning: unused variable: `y` --> hello.rs:3:9 | 3 | let y = x; | ^ | = note: #[warn(unused_variables)] on by default = note: to avoid this warning, consider using `_y` instead Easy to integrate with other languages If you’re like me, you’ll probably only use a low-level language for performance-critical code that you can call from a high-level language, and this is an area where Rust shines. Most programmers will turn to C, C++ or Fortran for this because they have a well-established ABI (Application Binary Interface) which can be understood by languages like Python and R¹. In Rust, it’s trivial to make a C-compatible shared library, and the standard library includes extra features for working with C types. That also means that existing C code can be incrementally ported to Rust: see remacs for an example. On top of this, there are projects like rust-cpython and PyO3 which provide macros and structures that wrap the Python C API to let you build Python modules in Rust with minimal glue code; rustr does a similar job for R. Nice language features Rust has some really nice features, which let you write efficient, concise and correct code. Several feel particularly comfortable as they remind me of similar things available in Haskell, including: Enums, a super-powered combination of C enums and unions (similar to Haskell’s algebraic data types) that enable some really nice code with no runtime cost Generics and traits that let you get more done with less code Pattern matching, a kind of case statement that lets you extract parts of structs, tuples & enums and do all sorts of other clever things Lazy computation based on an iterator pattern, for efficient processing of lists of things: you can do for item in list { ... } instead of the C-style use of an index², or you can use higher-order functions like map and filter Functions/closures as first-class citizens Scientific computing Although it’s a general-purpose language and not designed specifically for scientific computing, Rust’s support is improving all the time. There are some interesting matrix algebra libraries available, and built-in SIMD is incoming. The memory safety features also work to ensure thread safety, so it’s harder to write concurrency bugs. You should be able to use your favourite MPI implementation too, and there’s at least one attempt to portably wrap MPI in a more Rust-like way.
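Tying a few of the language features above together (an enum, pattern matching and an iterator chain), here's a small self-contained toy sketch; it doesn't depend on any particular crate and is purely illustrative:

```rust
// An enum ("sum type"), similar in spirit to Haskell's algebraic data types:
// a reading is either a value or explicitly missing, with no runtime overhead.
#[derive(Debug)]
enum Reading {
    Value(f64),
    Missing,
}

fn main() {
    let readings = vec![Reading::Value(1.5), Reading::Missing, Reading::Value(2.5)];

    // Pattern matching extracts the payload; iterator adapters replace the
    // C-style index loop and still compile down to a tight loop.
    let total: f64 = readings
        .iter()
        .filter_map(|r| match r {
            Reading::Value(v) => Some(*v),
            Reading::Missing => None,
        })
        .sum();

    println!("Sum of the readings that were present: {}", total);
}
```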
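And on the thread-safety claim, here's a minimal toy sketch using only the standard library (std::thread plus an mpsc channel) of the pattern the ownership rules make safe by construction: each worker owns its chunk of data outright, so there's simply no shared mutable state to race on.

```rust
use std::sync::mpsc;
use std::thread;

fn main() {
    let (tx, rx) = mpsc::channel();

    // Each spawned thread takes ownership of one chunk and one clone of the
    // sender; trying to share a chunk mutably across threads wouldn't compile.
    for chunk in vec![vec![1u64, 2, 3], vec![4, 5, 6], vec![7, 8, 9]] {
        let tx = tx.clone();
        thread::spawn(move || {
            let partial: u64 = chunk.iter().sum();
            tx.send(partial).unwrap();
        });
    }
    drop(tx); // drop the original sender so the receiver knows when all results are in

    let total: u64 = rx.iter().sum();
    println!("Parallel sum: {}", total);
}
```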
Active development and friendly community One of the things you notice straight away is how active and friendly the Rust community is. There are several IRC channels on irc.mozilla.org including #rust-beginners, which is a great place to get help. The compiler is under constant but carefully-managed development, so that new features are landing all the time but without breaking existing code. And the fabulous Cargo build tool and crates.io are enabling the rapid growth of a healthy ecosystem of open source libraries that you can use to write less code yourself. Summary So, next time you need a compiled language to speed up hotspots in your code, try Rust. I promise you won’t regret it! Julia actually allows you to call C and Fortran functions as a first-class language feature ↩︎ Actually, since C++11 there’s for (auto item : list) { ... } but still… ↩︎ Reflections on #aoc2017 Trees reflected in a lake Joshua Reddekopp on Unsplash It seems like ages ago, but way back in November I committed to completing Advent of Code. I managed it all, and it was fun! All of my code is available on GitHub if you’re interested in seeing what I did, and I managed to get out a blog post for every one with a bit more commentary, which you can see in the series list above. How did I approach it? I’ve not really done any serious programming challenges before. I don’t get to write a lot of code at the moment, so all I wanted from AoC was an excuse to do some proper problem-solving. I never really intended to take a polyglot approach, though I did think that I might use mainly Python with a bit of Haskell. In the end, though, I used: Python (×12); Haskell (×7); Rust (×4); Go; C++; Ruby; Julia; and Coconut. For the most part, my priorities were getting the right answer, followed by writing readable code. I didn’t specifically focus on performance but did try to avoid falling into traps that I knew about. What did I learn? I found Python the easiest to get on with: it’s the language I know best and although I can’t always remember exact method names and parameters I know what’s available and where to look to remind myself, as well as most of the common idioms and some performance traps to avoid. Python was therefore the language that let me focus most on solving the problem itself. C++ and Ruby were more challenging, and it was harder to write good idiomatic code but I can still remember quite a lot. Haskell I haven’t used since university, and just like back then I really enjoyed working out how to solve problems in a functional style while still being readable and efficient (not always something I achieved…). I learned a lot about core Haskell concepts like monads & functors, and I’m really amazed by the way the Haskell community and ecosystem has grown up in the last decade. I also wanted to learn at least one modern, memory-safe compiled language, so I tried both Go and Rust. Both seem like useful languages, but Rust really intrigued me with its conceptual similarities to both Haskell and C++ and its promise of memory safety without a garbage collector. I struggled a lot initially with the “borrow checker” (the component that enforces memory safety at compile time) but eventually started thinking in terms of ownership and lifetimes after which things became easier. The Rust community seems really vibrant and friendly too. What next? I really want to keep this up, so I’m going to look out some more programming challenges (Project Euler looks interesting). 
It turns out there’s a regular Code Dojo meetup in Leeds, so hopefully I’ll try that out too. I’d like to do more realistic data-science stuff, so I’ll be taking a closer look at stuff like Kaggle too, and figuring out how to do a bit more analysis at work. I’m also feeling motivated to find an open source project to contribute to and/or release a project of my own, so we’ll see if that goes anywhere! I’ve always found the advice to “scratch your own itch” difficult to follow because everything I think of myself has already been done better. Most of the projects I use enough to want to contribute to tend to be pretty well developed with big communities and any bugs that might be accessible to me will be picked off and fixed before I have a chance to get started. Maybe it’s time to get over myself and just reimplement something that already exists, just for the fun of it! The Halting Problem — Python — #adventofcode Day 25 Today’s challenge takes us back to a bit of computing history: a good old-fashioned Turing Machine. → Full code on GitHub !!! commentary Today’s challenge was a nice bit of nostalgia, taking me back to my university days learning about the theory of computing. Turing Machines are a classic bit of computing theory, and are provably able to compute any value that is possible to compute: a value is computable if and only if a Turing Machine can be written that computes it (though in practice anything non-trivial is mind-bendingly hard to write as a TM). A bit of a library-fest today, compared to other days! from collections import deque, namedtuple from collections.abc import Iterator from tqdm import tqdm import re import fileinput as fi These regular expressions are used to parse the input that defines the transition table for the machine. RE_ISTATE = re.compile(r'Begin in state (?P<state>\w+)\.') RE_RUNTIME = re.compile( r'Perform a diagnostic checksum after (?P<steps>\d+) steps.') RE_STATETRANS = re.compile( r"In state (?P<state>\w+):\n" r" If the current value is (?P<read0>\d+):\n" r" - Write the value (?P<write0>\d+)\.\n" r" - Move one slot to the (?P<move0>left|right).\n" r" - Continue with state (?P<next0>\w+).\n" r" If the current value is (?P<read1>\d+):\n" r" - Write the value (?P<write1>\d+)\.\n" r" - Move one slot to the (?P<move1>left|right).\n" r" - Continue with state (?P<next1>\w+).") MOVE = {'left': -1, 'right': 1} A namedtuple to provide some sugar when using a transition rule. Rule = namedtuple('Rule', 'write move next_state') The TuringMachine class does all the work. class TuringMachine: def __init__(self, program=None): self.tape = deque() self.transition_table = {} self.state = None self.runtime = 0 self.steps = 0 self.pos = 0 self.offset = 0 if program is not None: self.load(program) def __str__(self): return f"Current: {self.state}; steps: {self.steps} of {self.runtime}" Some jiggery-pokery to allow us to use self[pos] to reference an infinite tape. def __getitem__(self, i): i += self.offset if i < 0 or i >= len(self.tape): return 0 else: return self.tape[i] def __setitem__(self, i, x): i += self.offset if i >= 0 and i < len(self.tape): self.tape[i] = x elif i == -1: self.tape.appendleft(x) self.offset += 1 elif i == len(self.tape): self.tape.append(x) else: raise IndexError('Tried to set position off end of tape') Parse the program and set up the transition table.
def load(self, program): if isinstance(program, Iterator): program = ''.join(program) match = RE_ISTATE.search(program) self.state = match['state'] match = RE_RUNTIME.search(program) self.runtime = int(match['steps']) for match in RE_STATETRANS.finditer(program): self.transition_table[match['state']] = { int(match['read0']): Rule(write=int(match['write0']), move=MOVE[match['move0']], next_state=match['next0']), int(match['read1']): Rule(write=int(match['write1']), move=MOVE[match['move1']], next_state=match['next1']), } Run the program for the required number of steps (given by self.runtime). tqdm isn’t in the standard library but it should be: it shows a lovely text-mode progress bar as we go. def run(self): for _ in tqdm(range(self.runtime), desc="Running", unit="steps", unit_scale=True): read = self[self.pos] rule = self.transition_table[self.state][read] self[self.pos] = rule.write self.pos += rule.move self.state = rule.next_state Calculate the “diagnostic checksum” required for the answer. @property def checksum(self): return sum(self.tape) Aaand GO! machine = TuringMachine(fi.input()) machine.run() print("Checksum:", machine.checksum) Electromagnetic Moat — Rust — #adventofcode Day 24 Today’s challenge, the penultimate, requires us to build a bridge capable of reaching across to the CPU, our final destination. → Full code on GitHub !!! commentary We have a finite number of components that fit together in a restricted way from which to build a bridge, and we have to work out both the strongest and the longest bridge we can build. The most obvious way to do this is to recursively build every possible bridge and select the best, but that’s an O(n!) algorithm that could blow up quickly, so might as well go with a nice fast language! Might have to try this in Haskell too, because it’s the type of algorithm that lends itself naturally to a pure functional approach. I feel like I've applied some of the things I've learned in previous challenges I used Rust for, and spent less time mucking about with ownership, and made better use of various language features, including structs and iterators. I'm rather pleased with how my learning of this language is progressing. I'm definitely overusing `Option.unwrap` at the moment though: this is a lazy way to deal with `Option` results and will panic if the result is not what's expected. I'm not sure whether I need to be cloning the components `Vector` either, or whether I could just be passing iterators around. First, we import some bits of standard library and define some data types. The BridgeResult struct lets us use the same algorithm for both parts of the challenge and simply change the value used to calculate the maximum. 
use std::io; use std::fmt; use std::io::BufRead; #[derive(Debug, Copy, Clone, PartialEq, Eq, Hash)] struct Component(u8, u8); #[derive(Debug, Copy, Clone, Default)] struct BridgeResult { strength: u16, length: u16, } impl Component { fn from_str(s: &str) -> Component { let parts: Vec<&str> = s.split('/').collect(); assert!(parts.len() == 2); Component(parts[0].parse().unwrap(), parts[1].parse().unwrap()) } fn fits(self, port: u8) -> bool { self.0 == port || self.1 == port } fn other_end(self, port: u8) -> u8 { if self.0 == port { return self.1; } else if self.1 == port { return self.0; } else { panic!("{:?} doesn't fit port {}", self, port); } } fn strength(self) -> u16 { self.0 as u16 + self.1 as u16 } } impl fmt::Display for BridgeResult { fn fmt(&self, f: &mut fmt::Formatter) -> fmt::Result { write!(f, "(S: {}, L: {})", self.strength, self.length) } } best_bridge calculates the length and strength of the “best” bridge that can be built from the remaining components and fits the required port. Whether this is based on strength or length is given by the key parameter, which is passed to Iter.max_by_key. fn best_bridge<F>(port: u8, key: &F, components: &Vec<Component>) -> Option<BridgeResult> where F: Fn(&BridgeResult) -> u16 { if components.len() == 0 { return None; } components.iter() .filter(|c| c.fits(port)) .map(|c| { let b = best_bridge(c.other_end(port), key, &components.clone().into_iter() .filter(|x| x != c).collect()) .unwrap_or_default(); BridgeResult{strength: c.strength() + b.strength, length: 1 + b.length} }) .max_by_key(key) } Now all that remains is to read the input and calculate the result. I was rather pleasantly surprised to find that in spite of my pessimistic predictions about efficiency, when compiled with optimisations turned on this terminates in less than 1s on my laptop. fn main() { let stdin = io::stdin(); let components: Vec<_> = stdin.lock() .lines() .map(|l| Component::from_str(&l.unwrap())) .collect(); match best_bridge(0, &|b: &BridgeResult| b.strength, &components) { Some(b) => println!("Strongest bridge is {}", b), None => println!("No strongest bridge found") }; match best_bridge(0, &|b: &BridgeResult| b.length, &components) { Some(b) => println!("Longest bridge is {}", b), None => println!("No longest bridge found") }; } Coprocessor Conflagration — Haskell — #adventofcode Day 23 Today’s challenge requires us to understand why a coprocessor is working so hard to perform an apparently simple calculation. → Full code on GitHub !!! commentary Today’s problem is based on an assembly-like language very similar to day 18, so I went back and adapted my code from that, which works well for the first part. I’ve also incorporated some advice from /r/haskell, and cleaned up all warnings shown by the -Wall compiler flag and the hlint tool. Part 2 requires the algorithm to run with much larger inputs, and since some analysis shows that it's an `O(n^3)` algorithm it gets intractable pretty fast. There are several approaches to this. First up, if you have a fast enough processor and an efficient enough implementation I suspect that the simulation would probably terminate eventually, but that would likely still take hours: not good enough. I also thought about doing some peephole optimisations on the instructions, but the last time I did compiler optimisation was during my degree so I wasn't really sure where to start. What I ended up doing was actually analysing the input code by hand to figure out what it was doing, and then just doing that calculation in a sensible way.
I'd like to say I managed this on my own (and I like to think I would have) but I did get some tips on [/r/adventofcode](https://reddit.com/r/adventofcode). The majority of this code is simply a cleaned-up version of day 18, with some tweaks to accommodate the different instruction set: module Main where import qualified Data.Vector as V import qualified Data.Map.Strict as M import Control.Monad.State.Strict import Text.ParserCombinators.Parsec hiding (State) type Register = Char type Value = Int type Argument = Either Value Register data Instruction = Set Register Argument | Sub Register Argument | Mul Register Argument | Jnz Argument Argument deriving Show type Program = V.Vector Instruction data Result = Cont | Halt deriving (Eq, Show) type Registers = M.Map Char Int data Machine = Machine { dRegisters :: Registers , dPtr :: !Int , dMulCount :: !Int , dProgram :: Program } instance Show Machine where show d = show (dRegisters d) ++ " @" ++ show (dPtr d) ++ " ×" ++ show (dMulCount d) defaultMachine :: Machine defaultMachine = Machine M.empty 0 0 V.empty type MachineState = State Machine program :: GenParser Char st Program program = do instructions <- endBy instruction eol return $ V.fromList instructions where instruction = try (regOp "set" Set) <|> regOp "sub" Sub <|> regOp "mul" Mul <|> jump "jnz" Jnz regOp n c = do string n >> spaces val1 <- oneOf "abcdefgh" secondArg c val1 jump n c = do string n >> spaces val1 <- regOrVal secondArg c val1 secondArg c val1 = do spaces val2 <- regOrVal return $ c val1 val2 regOrVal = register <|> value register = do name <- lower return $ Right name value = do val <- many $ oneOf "-0123456789" return $ Left $ read val eol = char '\n' parseProgram :: String -> Either ParseError Program parseProgram = parse program "" getReg :: Char -> MachineState Int getReg r = do st <- get return $ M.findWithDefault 0 r (dRegisters st) putReg :: Char -> Int -> MachineState () putReg r v = do st <- get let current = dRegisters st new = M.insert r v current put $ st { dRegisters = new } modReg :: (Int -> Int -> Int) -> Char -> Argument -> MachineState () modReg op r v = do u <- getReg r v' <- getRegOrVal v putReg r (u `op` v') incPtr getRegOrVal :: Argument -> MachineState Int getRegOrVal = either return getReg addPtr :: Int -> MachineState () addPtr n = do st <- get put $ st { dPtr = n + dPtr st } incPtr :: MachineState () incPtr = addPtr 1 execInst :: Instruction -> MachineState () execInst (Set reg val) = do newVal <- getRegOrVal val putReg reg newVal incPtr execInst (Mul reg val) = do result <- modReg (*) reg val st <- get put $ st { dMulCount = 1 + dMulCount st } return result execInst (Sub reg val) = modReg (-) reg val execInst (Jnz val1 val2) = do test <- getRegOrVal val1 jump <- if test /= 0 then getRegOrVal val2 else return 1 addPtr jump execNext :: MachineState Result execNext = do st <- get let prog = dProgram st p = dPtr st if p >= length prog then return Halt else do execInst (prog V.!
p) return Cont runUntilTerm :: MachineState () runUntilTerm = do result <- execNext unless (result == Halt) runUntilTerm This implements the actual calculation: the number of non-primes between (for my input) 107900 and 124900: optimisedCalc :: Int -> Int -> Int -> Int optimisedCalc a b k = sum $ map (const 1) $ filter notPrime [a,a+k..b] where notPrime n = elem 0 $ map (mod n) [2..(floor $ sqrt (fromIntegral n :: Double))] main :: IO () main = do input <- getContents case parseProgram input of Right prog -> do let c = defaultMachine { dProgram = prog } (_, c') = runState runUntilTerm c putStrLn $ show (dMulCount c') ++ " multiplications made" putStrLn $ "Calculation result: " ++ show (optimisedCalc 107900 124900 17) Left e -> print e Sporifica Virus — Rust — #adventofcode Day 22 Today’s challenge has us helping to clean up (or spread, I can’t really tell) an infection of the “sporifica” virus. → Full code on GitHub !!! commentary I thought I’d have another play with Rust, as its Haskell-like features resonate with me at the moment. I struggled quite a lot with the Rust concepts of ownership and borrowing, and this is a cleaned-up version of the code based on some good advice from the folks on /r/rust. use std::io; use std::env; use std::io::BufRead; use std::collections::HashMap; #[derive(PartialEq, Clone, Copy, Debug)] enum Direction {Up, Right, Down, Left} #[derive(PartialEq, Clone, Copy, Debug)] enum Infection {Clean, Weakened, Infected, Flagged} use self::Direction::*; use self::Infection::*; type Grid = HashMap<(isize, isize), Infection>; fn turn_left(d: Direction) -> Direction { match d {Up => Left, Right => Up, Down => Right, Left => Down} } fn turn_right(d: Direction) -> Direction { match d {Up => Right, Right => Down, Down => Left, Left => Up} } fn turn_around(d: Direction) -> Direction { match d {Up => Down, Right => Left, Down => Up, Left => Right} } fn make_move(d: Direction, x: isize, y: isize) -> (isize, isize) { match d { Up => (x-1, y), Right => (x, y+1), Down => (x+1, y), Left => (x, y-1), } } fn basic_step(grid: &mut Grid, x: &mut isize, y: &mut isize, d: &mut Direction) -> usize { let mut infect = 0; let current = match grid.get(&(*x, *y)) { Some(v) => *v, None => Clean, }; if current == Infected { *d = turn_right(*d); } else { *d = turn_left(*d); infect = 1; }; grid.insert((*x, *y), match current { Clean => Infected, Infected => Clean, x => panic!("Unexpected infection state {:?}", x), }); let new_pos = make_move(*d, *x, *y); *x = new_pos.0; *y = new_pos.1; infect } fn nasty_step(grid: &mut Grid, x: &mut isize, y: &mut isize, d: &mut Direction) -> usize { let mut infect = 0; let new_state: Infection; let current = match grid.get(&(*x, *y)) { Some(v) => *v, None => Infection::Clean, }; match current { Clean => { *d = turn_left(*d); new_state = Weakened; }, Weakened => { new_state = Infected; infect = 1; }, Infected => { *d = turn_right(*d); new_state = Flagged; }, Flagged => { *d = turn_around(*d); new_state = Clean; } }; grid.insert((*x, *y), new_state); let new_pos = make_move(*d, *x, *y); *x = new_pos.0; *y = new_pos.1; infect } fn virus_infect<F>(mut grid: Grid, mut step: F, mut x: isize, mut y: isize, mut d: Direction, n: usize) -> usize where F: FnMut(&mut Grid, &mut isize, &mut isize, &mut Direction) -> usize, { (0..n).map(|_| step(&mut grid, &mut x, &mut y, &mut d)) .sum() } fn main() { let args: Vec<String> = env::args().collect(); let n_basic: usize = args[1].parse().unwrap(); let n_nasty: usize = args[2].parse().unwrap(); let stdin = io::stdin(); let lines: 
Vec<String> = stdin.lock() .lines() .map(|x| x.unwrap()) .collect(); let mut grid: Grid = HashMap::new(); let x0 = (lines.len() / 2) as isize; let y0 = (lines[0].len() / 2) as isize; for (i, line) in lines.iter().enumerate() { for (j, c) in line.chars().enumerate() { grid.insert((i as isize, j as isize), match c {'#' => Infected, _ => Clean}); } } let basic_steps = virus_infect(grid.clone(), basic_step, x0, y0, Up, n_basic); println!("Basic: infected {} times", basic_steps); let nasty_steps = virus_infect(grid, nasty_step, x0, y0, Up, n_nasty); println!("Nasty: infected {} times", nasty_steps); } Fractal Art — Python — #adventofcode Day 21 Today’s challenge asks us to assist an artist building fractal patterns from a rulebook. → Full code on GitHub !!! commentary Another fairly straightforward algorithm: the really tricky part was breaking the pattern up into chunks and rejoining it again. I could probably have done that more efficiently, and would have needed to if I had to go for a few more iterations and the grid grows with every iteration and gets big fast. Still behind on the blog posts… import fileinput as fi from math import sqrt from functools import reduce, partial import operator INITIAL_PATTERN = ((0, 1, 0), (0, 0, 1), (1, 1, 1)) DECODE = ['.', '#'] ENCODE = {'.': 0, '#': 1} concat = partial(reduce, operator.concat) def rotate(p): size = len(p) return tuple(tuple(p[i][j] for i in range(size)) for j in range(size - 1, -1, -1)) def flip(p): return tuple(p[i] for i in range(len(p) - 1, -1, -1)) def permutations(p): yield p yield flip(p) for _ in range(3): p = rotate(p) yield p yield flip(p) def print_pattern(p): print('-' * len(p)) for row in p: print(' '.join(DECODE[x] for x in row)) print('-' * len(p)) def build_pattern(s): return tuple(tuple(ENCODE[c] for c in row) for row in s.split('/')) def build_pattern_book(lines): book = {} for line in lines: source, target = line.strip().split(' => ') for rotation in permutations(build_pattern(source)): book[rotation] = build_pattern(target) return book def subdivide(pattern): size = 2 if len(pattern) % 2 == 0 else 3 n = len(pattern) // size return (tuple(tuple(pattern[i][j] for j in range(y * size, (y + 1) * size)) for i in range(x * size, (x + 1) * size)) for x in range(n) for y in range(n)) def rejoin(parts): n = int(sqrt(len(parts))) size = len(parts[0]) return tuple(concat(parts[i + k][j] for i in range(n)) for k in range(0, len(parts), n) for j in range(size)) def enhance_once(p, book): return rejoin(tuple(book[part] for part in subdivide(p))) def enhance(p, book, n, progress=None): for _ in range(n): p = enhance_once(p, book) return p book = build_pattern_book(fi.input()) intermediate_pattern = enhance(INITIAL_PATTERN, book, 5) print("After 5 iterations:", sum(sum(row) for row in intermediate_pattern)) final_pattern = enhance(intermediate_pattern, book, 13) print("After 18 iterations:", sum(sum(row) for row in final_pattern)) Particle Swarm — Python — #adventofcode Day 20 Today’s challenge finds us simulating the movements of particles in space. → Full code on GitHub !!! commentary Back to Python for this one, another relatively straightforward simulation, although it’s easier to calculate the answer to part 1 than to simulate. import fileinput as fi import numpy as np import re First we parse the input into 3 2D arrays: using numpy enables us to do efficient arithmetic across the whole set of particles in one go. 
PARTICLE_RE = re.compile(r'p=<(-?\d+),(-?\d+),(-?\d+)>, ' r'v=<(-?\d+),(-?\d+),(-?\d+)>, ' r'a=<(-?\d+),(-?\d+),(-?\d+)>') def parse_input(lines): x = [] v = [] a = [] for l in lines: m = PARTICLE_RE.match(l) x.append([int(x) for x in m.group(1, 2, 3)]) v.append([int(x) for x in m.group(4, 5, 6)]) a.append([int(x) for x in m.group(7, 8, 9)]) return (np.arange(len(x)), np.array(x), np.array(v), np.array(a)) i, x, v, a = parse_input(fi.input()) Now we can calculate which particle will be closest to the origin in the long-term: this is simply the particle with the smallest acceleration. It turns out that several have the same acceleration, so of these, the one we want is the one with the lowest starting velocity. This is only complicated slightly by the need to get the number of the particle rather than its other information, hence the need to use numpy.argmin. a_abs = np.sum(np.abs(a), axis=1) a_min = np.min(a_abs) a_i = np.squeeze(np.argwhere(a_abs == a_min)) closest = i[a_i[np.argmin(np.sum(np.abs(v[a_i]), axis=1))]] print("Closest: ", closest) Now we define functions to simulate collisions between particles. We have to use the return_index and return_counts options to numpy.unique to be able to get rid of all the duplicate positions (the standard usage is to keep one of each duplicate). def resolve_collisions(x, v, a): (_, i, c) = np.unique(x, return_index=True, return_counts=True, axis=0) i = i[c == 1] return x[i], v[i], a[i] The termination criterion for this loop is an interesting aspect: the most robust to my mind seems to be that eventually the particles will end up sorted in order of their initial acceleration in terms of distance from the origin, so you could check for this but that’s pretty computationally expensive. In the end, all that was needed was a bit of trial and error: terminating arbitrarily after 1,000 iterations seems to work! In fact, all the collisions are over after about 40 iterations for my input but there was always the possibility that two particles with very slightly different accelerations would eventually intersect much later. def simulate_collisions(x, v, a, iterations=1000): for _ in range(iterations): v += a x += v x, v, a = resolve_collisions(x, v, a) return len(x) print("Remaining particles: ", simulate_collisions(x, v, a)) A Series of Tubes — Rust — #adventofcode Day 19 Today’s challenge asks us to help a network packet find its way. → Full code on GitHub !!! commentary Today’s challenge was fairly straightforward, following an ASCII art path, so I thought I’d give Rust another try. I’m a bit behind on the blog posts, so I’m presenting the code below without any further commentary. I’m not really convinced this is good idiomatic Rust, and it was interesting turning a set of strings into a 2D array of characters because there are both u8 (byte) and char types to deal with. use std::io; use std::io::BufRead; const ALPHA: &'static str = "ABCDEFGHIJKLMNOPQRSTUVWXYZ"; fn change_direction(dia: &Vec<Vec<u8>>, x: usize, y: usize, dx: &mut i32, dy: &mut i32) { assert_eq!(dia[x][y], b'+'); if dx.abs() == 1 { *dx = 0; if y + 1 < dia[x].len() && (dia[x][y + 1] == b'-' || ALPHA.contains(dia[x][y + 1] as char)) { *dy = 1; } else if dia[x][y - 1] == b'-' || ALPHA.contains(dia[x][y - 1] as char) { *dy = -1; } else { panic!("Huh? 
{} {}", dia[x][y+1] as char, dia[x][y-1] as char); } } else { *dy = 0; if x + 1 < dia.len() && (dia[x + 1][y] == b'|' || ALPHA.contains(dia[x + 1][y] as char)) { *dx = 1; } else if dia[x - 1][y] == b'|' || ALPHA.contains(dia[x - 1][y] as char) { *dx = -1; } else { panic!("Huh?"); } } } fn follow_route(dia: Vec<Vec<u8>>) -> (String, i32) { let mut x: i32 = 0; let mut y: i32; let mut dx: i32 = 1; let mut dy: i32 = 0; let mut result = String::new(); let mut steps = 1; match dia[0].iter().position(|x| *x == b'|') { Some(i) => y = i as i32, None => panic!("Could not find '|' in first row"), } loop { x += dx; y += dy; match dia[x as usize][y as usize] { b'A'...b'Z' => result.push(dia[x as usize][y as usize] as char), b'+' => change_direction(&dia, x as usize, y as usize, &mut dx, &mut dy), b' ' => return (result, steps), _ => (), } steps += 1; } } fn main() { let stdin = io::stdin(); let lines: Vec<Vec<u8>> = stdin.lock().lines() .map(|l| l.unwrap().into_bytes()) .collect(); let result = follow_route(lines); println!("Route: {}", result.0); println!("Steps: {}", result.1); } Duet — Haskell — #adventofcode Day 18 Today’s challenge introduces a type of simplified assembly language that includes instructions for message-passing. First we have to simulate a single program (after humorously misinterpreting the snd and rcv instructions as “sound” and “recover”), but then we have to simulate two concurrent processes and the message passing between them. → Full code on GitHub !!! commentary Well, I really learned a lot from this one! I wanted to get to grips with more complex stuff in Haskell and this challenge seemed like an excellent opportunity to figure out a) parsing with the parsec library and b) using the State monad to keep the state of the simulator. As it turned out, that wasn't all I'd learned: I also ran into an interesting situation whereby lazy evaluation was creating an infinite loop where there shouldn't be one, so I also had to learn how to selectively force strict evaluation of values. I'm pretty sure this isn't the best Haskell in the world, but I'm proud of it. First we have to import a bunch of stuff to use later, but also notice the pragma on the first line which instructs the compiler to enable the BangPatterns language extension, which will be important later. {-# LANGUAGE BangPatterns #-} module Main where import qualified Data.Vector as V import qualified Data.Map.Strict as M import Data.List import Data.Either import Data.Maybe import Control.Monad.State.Strict import Control.Monad.Loops import Text.ParserCombinators.Parsec hiding (State) First up we define the types that will represent the program code itself. data DuetVal = Reg Char | Val Int deriving Show type DuetQueue = [Int] data DuetInstruction = Snd DuetVal | Rcv DuetVal | Jgz DuetVal DuetVal | Set DuetVal DuetVal | Add DuetVal DuetVal | Mul DuetVal DuetVal | Mod DuetVal DuetVal deriving Show type DuetProgram = V.Vector DuetInstruction Next we define the types to hold the machine state, which includes: registers, instruction pointer, send & receive buffers and the program code, plus a counter of the number of sends made (to provide the solution). 
type DuetRegisters = M.Map Char Int data Duet = Duet { dRegisters :: DuetRegisters , dPtr :: Int , dSendCount :: Int , dRcvBuf :: DuetQueue , dSndBuf :: DuetQueue , dProgram :: DuetProgram } instance Show Duet where show d = show (dRegisters d) ++ " @" ++ show (dPtr d) ++ " S" ++ show (dSndBuf d) ++ " R" ++ show (dRcvBuf d) defaultDuet = Duet M.empty 0 0 [] [] V.empty type DuetState = State Duet program is a parser built on the cool parsec library to turn the program text into a Haskell format that we can work with, a Vector of instructions. Yes, using a full-blown parser is overkill here (it would be much simpler just to split each line on whitespace), but I wanted to see how Parsec works. I’m using Vector here because we need random access to the instruction list, which is much more efficient with Vector: O(1) compared with the O(n) of the built-in Haskell list ([]) type. parseProgram applies the parser to a string and returns the result. program :: GenParser Char st DuetProgram program = do instructions <- endBy instruction eol return $ V.fromList instructions where instruction = try (oneArg "snd" Snd) <|> oneArg "rcv" Rcv <|> twoArg "set" Set <|> twoArg "add" Add <|> try (twoArg "mul" Mul) <|> twoArg "mod" Mod <|> twoArg "jgz" Jgz oneArg n c = do string n >> spaces val <- regOrVal return $ c val twoArg n c = do string n >> spaces val1 <- regOrVal spaces val2 <- regOrVal return $ c val1 val2 regOrVal = register <|> value register = do name <- lower return $ Reg name value = do val <- many $ oneOf "-0123456789" return $ Val $ read val eol = char '\n' parseProgram :: String -> Either ParseError DuetProgram parseProgram = parse program "" Next up we have some utility functions that sit in the DuetState monad we defined above and perform common manipulations on the state: getting/setting/updating registers, updating the instruction pointer and sending/receiving messages via the relevant queues. getReg :: Char -> DuetState Int getReg r = do st <- get return $ M.findWithDefault 0 r (dRegisters st) putReg :: Char -> Int -> DuetState () putReg r v = do st <- get let current = dRegisters st new = M.insert r v current put $ st { dRegisters = new } modReg :: (Int -> Int -> Int) -> Char -> DuetVal -> DuetState Bool modReg op r v = do u <- getReg r v' <- getRegOrVal v putReg r (u `op` v') incPtr return False getRegOrVal :: DuetVal -> DuetState Int getRegOrVal (Reg r) = getReg r getRegOrVal (Val v) = return v addPtr :: Int -> DuetState () addPtr n = do st <- get put $ st { dPtr = n + dPtr st } incPtr = addPtr 1 send :: Int -> DuetState () send v = do st <- get put $ st { dSndBuf = (dSndBuf st ++ [v]), dSendCount = dSendCount st + 1 } recv :: DuetState (Maybe Int) recv = do st <- get case dRcvBuf st of (x:xs) -> do put $ st { dRcvBuf = xs } return $ Just x [] -> return Nothing execInst implements the logic for each instruction. It returns False as long as the program can continue, but True if the program tries to receive from an empty buffer.
execInst :: DuetInstruction -> DuetState Bool execInst (Set (Reg reg) val) = do newVal <- getRegOrVal val putReg reg newVal incPtr return False execInst (Mul (Reg reg) val) = modReg (*) reg val execInst (Add (Reg reg) val) = modReg (+) reg val execInst (Mod (Reg reg) val) = modReg mod reg val execInst (Jgz val1 val2) = do st <- get test <- getRegOrVal val1 jump <- if test > 0 then getRegOrVal val2 else return 1 addPtr jump return False execInst (Snd val) = do v <- getRegOrVal val send v st <- get incPtr return False execInst (Rcv (Reg r)) = do st <- get v <- recv handle v where handle :: Maybe Int -> DuetState Bool handle (Just x) = putReg r x >> incPtr >> return False handle Nothing = return True execInst x = error $ "execInst not implemented yet for " ++ show x execNext looks up the next instruction and executes it. runUntilWait runs the program until execNext returns True to signal the wait state has been reached. execNext :: DuetState Bool execNext = do st <- get let prog = dProgram st p = dPtr st if p >= length prog then return True else execInst (prog V.! p) runUntilWait :: DuetState () runUntilWait = do waiting <- execNext unless waiting runUntilWait runTwoPrograms handles the concurrent running of two programs, by running first one and then the other to a wait state, then swapping each program’s send buffer to the other’s receive buffer before repeating. If you look carefully, you’ll see a “bang” (!) before the two arguments of the function: runTwoPrograms !d0 !d1. Haskell is a lazy language and usually doesn’t evaluate a computation until you ask for a result, instead carrying around a “thunk” or plan for how to carry out the computation. Sometimes that can be a problem because the amount of memory your program is using can explode unnecessarily as a long computation turns into a large thunk which isn’t evaluated until the very end. That’s not the problem here though. What happens here without the bangs is another side-effect of laziness. The exit condition of this recursive function is that a deadlock has been reached: both programs are waiting to receive, but neither has sent anything, so neither can ever continue. The check for this is (null $ dSndBuf d0') && (null $ dSndBuf d1'). As long as the first program has something in its send buffer, the test fails without ever evaluating the second part, which means the result d1' of running the second program is never needed. The function immediately goes to the recursive case and tries to continue the first program again, which immediately returns because it’s still waiting to receive. The same thing happens again, and the result is that instead of running the second program to obtain something for the first to receive, we get into an infinite loop trying and failing to continue the first program. The bang forces both d0 and d1 to be evaluated at the point we recurse, which forces the rest of the computation: running the second program and swapping the send/receive buffers. With that, the evaluation proceeds correctly and we terminate with a result instead of getting into an infinite loop! 
runTwoPrograms :: Duet -> Duet -> (Int, Int) runTwoPrograms !d0 !d1 | (null $ dSndBuf d0') && (null $ dSndBuf d1') = (dSendCount d0', dSendCount d1') | otherwise = runTwoPrograms d0'' d1'' where (_, d0') = runState runUntilWait d0 (_, d1') = runState runUntilWait d1 d0'' = d0' { dSndBuf = [], dRcvBuf = dSndBuf d1' } d1'' = d1' { dSndBuf = [], dRcvBuf = dSndBuf d0' } All that remains to be done now is to run the programs and see how many messages were sent before the deadlock. main = do prog <- fmap (fromRight V.empty . parseProgram) getContents let d0 = defaultDuet { dProgram = prog, dRegisters = M.fromList [('p', 0)] } d1 = defaultDuet { dProgram = prog, dRegisters = M.fromList [('p', 1)] } (send0, send1) = runTwoPrograms d0 d1 putStrLn $ "Program 0 sent " ++ show send0 ++ " messages" putStrLn $ "Program 1 sent " ++ show send1 ++ " messages" Spinlock — Rust/Python — #adventofcode Day 17 In today’s challenge we deal with a monstrous whirlwind of a program, eating up CPU and memory in equal measure. → Full code on GitHub (and Python driver script) !!! commentary One of the things I wanted from AoC was an opportunity to try out some popular languages that I don’t currently know, including the memory-safe, strongly-typed compiled languages Go and Rust. Realistically though, I’m likely to continue doing most of my programming in Python, and use one of these other languages when it has better tools or I need the extra speed. In which case, what I really want to know is how I can call functions written in Go or Rust from Python. I thought I'd try Rust first, as it seems to be designed to be C-compatible and that makes it easy to call from Python using [`ctypes`](https://docs.python.org/3.6/library/ctypes.html). Part 1 was another straightforward simulation: translate what the "spinlock" monster is doing into code and run it. It was pretty obvious from the story of this challenge and experience of the last few days that this was going to be another one where the simulation is too computationally expensive for part two, which turns out to be correct. So, first thing to do is to implement the meat of the solution in Rust. spinlock solves the first part of the problem by doing exactly what the monster does. Since we only have to go up to 2017 iterations, this is very tractable. The last number we insert is 2017, so we just return the number immediately after that. #[no_mangle] pub extern fn spinlock(n: usize, skip: usize) -> i32 { let mut buffer: Vec<i32> = Vec::with_capacity(n+1); buffer.push(0); buffer.push(1); let mut pos = 1; for i in 2..n+1 { pos = (pos + skip + 1) % buffer.len(); buffer.insert(pos, i as i32); } pos = (pos + 1) % buffer.len(); return buffer[pos]; } For the second part, we have to do 50 million iterations instead, which is a lot. Given that every time you insert an item in the list it has to move up all the elements after that position, I’m pretty sure the algorithm is O(n^2), so it’s going to take a lot longer than 10,000ish times the first part. Thankfully, we don’t need to build the whole list, just keep track of where 0 is and what number is immediately after it. There may be a closed-form solution to simply calculate the result, but I couldn’t think of it and this is good enough. 
#[no_mangle] pub extern fn spinlock0(n: usize, skip: usize) -> i32 { let mut pos = 1; let mut pos_0 = 0; let mut after_0 = 1; for i in 2..n+1 { pos = (pos + skip + 1) % i; if pos == pos_0 + 1 { after_0 = i; } if pos <= pos_0 { pos_0 += 1; } } return after_0 as i32; } Now it’s time to call this code from Python. Notice the #[no_mangle] pragmas and pub extern declarations for each function above, which are required to make sure the functions are exported in a C-compatible way. We can build this into a shared library like this: rustc --crate-type=cdylib -o spinlock.so 17-spinlock.rs The Python script is as simple as loading this library, reading the puzzle input from the command line and calling the functions. The ctypes module does a lot of magic so that we don’t have to worry about converting from Python types to native types and back again. import ctypes import sys lib = ctypes.cdll.LoadLibrary("./spinlock.so") skip = int(sys.argv[1]) print("Part 1:", lib.spinlock(2017, skip)) print("Part 2:", lib.spinlock0(50_000_000, skip)) This is a toy example as far as calling Rust from Python is concerned, but it’s worth noting that already we can play with the parameters to the two Rust functions without having to recompile. For more serious work, I’d probably be looking at something like PyO3 to make a proper Python module. Looks like there’s also a very early Rust numpy integration for integrating numerical stuff. You can also do the same thing from Julia, which has a ccall function built in: ccall((:spinlock, "./spinlock.so"), Int32, (UInt64, UInt64), 2017, 377) My next thing to try might be Haskell → Python though… Permutation Promenade — Julia — #adventofcode Day 16 Today’s challenge rather appeals to me as a folk dancer, because it describes a set of instructions for a dance and asks us to work out the positions of the dancing programs after each run through the dance. → Full code on GitHub !!! commentary So, part 1 is pretty straightforward: parse the set of instructions, interpret them and keep track of the dancer positions as you go. One time through the dance. However, part 2 asks for the positions after 1 billion (yes, that’s 1,000,000,000) times through the dance. In hindsight I should have immediately become suspicious, but I thought I’d at least try the brute force approach first because it was simpler to code. So I give it a try, and after waiting for a while, having a cup of tea etc. it still hasn't terminated. I try reducing the number of iterations to 1,000. Now it terminates, but takes about 6 seconds. A spot of arithmetic suggests that running the full version will take a little over 190 years. There must be a better way than that! I'm a little embarrassed that I didn't spot the solution immediately (blaming Julia) and tried again in Python to see if I could get it to terminate quicker. When that didn't work I had to think again. A little further investigation with a while loop shows that in fact the dance position repeats (in the case of my input) every 48 times. After that it becomes much quicker! Oh, and it was time for a new language, so I wasted some extra time working out the quirks of Julia. First, a function to evaluate a single move — for neatness, this dispatches to a dedicated function depending on the type of move, although this isn’t really necessary to solve the challenge. Ending a function name with a bang (!) is a Julia convention to indicate that it has side-effects.
function eval_move!(move, dancers) move_type = move[1] params = move[2:end] if move_type == 's' # spin eval_spin!(params, dancers) elseif move_type == 'x' # exchange eval_exchange!(params, dancers) elseif move_type == 'p' # partner swap eval_partner!(params, dancers) end end These take care of the individual moves. Parsing the parameters from a string every single time probably isn’t ideal, but as it turns out, that optimisation isn’t really necessary. Note the + 1 in eval_exchange!, which is necessary because Julia is one of those crazy languages where indexes start from 1 instead of 0. These actions are pretty nice to implement, because Julia has circshift as a builtin to rotate a list, and allows you to assign to list slices and swap values in place with a single statement. function eval_spin!(params, dancers) shift = parse(Int, params) dancers[1:end] = circshift(dancers, shift) end function eval_exchange!(params, dancers) i, j = map(x -> parse(Int, x) + 1, split(params, "/")) dancers[i], dancers[j] = dancers[j], dancers[i] end function eval_partner!(params, dancers) a, b = split(params, "/") ia = findfirst([x == a for x in dancers]) ib = findfirst([x == b for x in dancers]) dancers[ia], dancers[ib] = b, a end dance! takes a list of moves and takes the dancers once through the dance. function dance!(moves, dancers) for m in moves eval_move!(m, dancers) end end To solve part 1, we simply need to read the moves in, set up the initial positions of the dancers and run the dance through once. join is necessary to a) turn characters into length-1 strings, and b) convert the list of strings back into a single string to print out. moves = split(readchomp(STDIN), ",") dancers = collect(join(c) for c in 'a':'p') orig_dancers = copy(dancers) dance!(moves, dancers) println(join(dancers)) Part 2 requires a little more work. We run the dance through again and again until we get back to the initial position, saving the intermediate positions in a list. The list now contains every possible position available from that starting point, so we can find position 1 billion by taking 1,000,000,000 modulo the list length (plus 1 because 1-based indexing) and use that to index into the list to get the final position. dance_cycle = [orig_dancers] while dancers != orig_dancers push!(dance_cycle, copy(dancers)) dance!(moves, dancers) end println(join(dance_cycle[1_000_000_000 % length(dance_cycle) + 1])) This terminates on my laptop in about 1.6s: Brute force 0; Careful thought 1! Dueling Generators — Rust — #adventofcode Day 15 Today’s challenge introduces two pseudo-random number generators which are trying to agree on a series of numbers. We play the part of the “judge”, counting the number of times their numbers agree in the lowest 16 bits. → Full code on GitHub Ever since I used Go to solve day 3, I’ve had a hankering to try the other new kid on the memory-safe compiled language block, Rust. I found it a bit intimidating at first because the syntax wasn’t as close to the C/C++ I’m familiar with and there are quite a few concepts unique to Rust, like the use of traits. But I figured it out, so I can tick another language off my to-try list. I also implemented a version in Python for comparison: the Python version is more concise and easier to read but the Rust version runs about 10× faster. First we include the std::env “crate” which will let us get access to commandline arguments, and define some useful constants for later.
use std::env; const M: i64 = 2147483647; const MASK: i64 = 0b1111111111111111; const FACTOR_A: i64 = 16807; const FACTOR_B: i64 = 48271; gen_next generates the next number for a given generator’s sequence. gen_next_picky does the same, but for the “picky” generators, only returning values that meet their criteria. fn gen_next(factor: i64, current: i64) -> i64 { return (current * factor) % M; } fn gen_next_picky(factor: i64, current: i64, mult: i64) -> i64 { let mut next = gen_next(factor, current); while next % mult != 0 { next = gen_next(factor, next); } return next; } duel runs a single duel, and returns the number of times the generators agreed in the lowest 16 bits (found by doing a binary & with the mask defined above). Rust allows functions to be passed as parameters, so we use this to be able to run both versions of the duel using only this one function. fn duel<F, G>(n: i64, next_a: F, mut value_a: i64, next_b: G, mut value_b: i64) -> i64 where F: Fn(i64) -> i64, G: Fn(i64) -> i64, { let mut count = 0; for _ in 0..n { value_a = next_a(value_a); value_b = next_b(value_b); if (value_a & MASK) == (value_b & MASK) { count += 1; } } return count; } Finally, we read the start values from the command line and run the two duels. The expressions that begin |n| are closures (anonymous functions, often called lambdas in other languages) that we use to specify the generator functions for each duel. fn main() { let args: Vec<String> = env::args().collect(); let start_a: i64 = args[1].parse().unwrap(); let start_b: i64 = args[2].parse().unwrap(); println!( "Duel 1: {}", duel( 40000000, |n| gen_next(FACTOR_A, n), start_a, |n| gen_next(FACTOR_B, n), start_b, ) ); println!( "Duel 2: {}", duel( 5000000, |n| gen_next_picky(FACTOR_A, n, 4), start_a, |n| gen_next_picky(FACTOR_B, n, 8), start_b, ) ); } Disk Defragmentation — Haskell — #adventofcode Day 14 Today’s challenge has us helping a disk defragmentation program by identifying contiguous regions of used sectors on a 2D disk. → Full code on GitHub !!! commentary Wow, today’s challenge had a pretty steep learning curve. Day 14 was the first to directly reuse code from a previous day: the “knot hash” from day 10. I solved day 10 in Haskell, so I thought it would be easier to stick with Haskell for today as well. The first part was straightforward, but the second was pretty mind-bending in a pure functional language! I ended up solving it by implementing a flood fill algorithm. It's recursive, which is right in Haskell's wheelhouse, but I ended up using `Data.Sequence` instead of the standard list type as its API for indexing is better. I haven't tried it, but I think it will also be a little faster than a naive list-based version. It took a looong time to figure everything out, but I had a day off work to be able to concentrate on it! A lot more imports for this solution, as we’re exercising a lot more of the standard library. module Main where import Prelude hiding (length, filter, take) import Data.Char (ord) import Data.Sequence import Data.Foldable hiding (length) import Data.Ix (inRange) import Data.Function ((&)) import Data.Maybe (fromJust, mapMaybe, isJust) import qualified Data.Set as Set import Text.Printf (printf) import System.Environment (getArgs) Also we’ll extract the key bits from day 10 into a module and import that. import KnotHash Now we define a few data types to make the code a bit more readable.
Sector represents the state of a particular disk sector, either free, used (but unmarked) or used and marked as belonging to a given integer-labelled group. Grid is a 2D matrix of Sector, as a sequence of sequences. data Sector = Free | Used | Mark Int deriving (Eq) instance Show Sector where show Free = " ." show Used = " #" show (Mark i) = printf "%4d" i type GridRow = Seq Sector type Grid = Seq (GridRow) Some utility functions to make it easier to view the grids (which can be quite large): used for debugging but not in the finished solution. subGrid :: Int -> Grid -> Grid subGrid n = fmap (take n) . take n printRow :: GridRow -> IO () printRow row = do mapM_ (putStr . show) row putStr "\n" printGrid :: Grid -> IO () printGrid = mapM_ printRow makeKey generates the hash key for a given row. makeKey :: String -> Int -> String makeKey input n = input ++ "-" ++ show n stringToGridRow converts a binary string of ‘1’ and ‘0’ characters to a sequence of Sector values. stringToGridRow :: String -> GridRow stringToGridRow = fromList . map convert where convert x | x == '1' = Used | x == '0' = Free makeRow and makeGrid build up the grid to use based on the provided input string. makeRow :: String -> Int -> GridRow makeRow input n = stringToGridRow $ concatMap (printf "%08b") $ dense $ fullKnotHash 256 $ map ord $ makeKey input n makeGrid :: String -> Grid makeGrid input = fromList $ map (makeRow input) [0..127] Utility functions to count the number of used and free sectors, to give the solution to part 1. countEqual :: Sector -> Grid -> Int countEqual x = sum . fmap (length . filter (==x)) countUsed = countEqual Used countFree = countEqual Free Now the real meat begins! findUnmarked finds the location of the next used sector that we haven’t yet marked. It returns a Maybe value, which is Just (x, y) if there is still an unmarked block or Nothing if there’s nothing left to mark. findUnmarked :: Grid -> Maybe (Int, Int) findUnmarked g | y == Nothing = Nothing | otherwise = Just (fromJust x, fromJust y) where hasUnmarked row = isJust $ elemIndexL Used row x = findIndexL hasUnmarked g y = case x of Nothing -> Nothing Just x' -> elemIndexL Used $ index g x' floodFill implements a very simple recursive flood fill. It takes a target and replacement value and a starting location, and fills in the replacement value for every connected location that currently has the target value. We use it below to replace a connected used region with a marked region. floodFill :: Sector -> Sector -> (Int, Int) -> Grid -> Grid floodFill t r (x, y) g | inRange (0, length g - 1) x && inRange (0, length g - 1) y && elem == t = let newRow = update y r row newGrid = update x newRow g in newGrid & floodFill t r (x+1, y) & floodFill t r (x-1, y) & floodFill t r (x, y+1) & floodFill t r (x, y-1) | otherwise = g where row = g `index` x elem = row `index` y markNextGroup looks for an unmarked group and marks it if found. If no more groups are found it returns Nothing. markAllGroups then repeatedly applies markNextGroup until Nothing is returned. markNextGroup :: Int -> Grid -> Maybe Grid markNextGroup i g = case findUnmarked g of Nothing -> Nothing Just loc -> Just $ floodFill Used (Mark i) loc g markAllGroups :: Grid -> Grid markAllGroups g = markAllGroups' 1 g where markAllGroups' i g = case markNextGroup i g of Nothing -> g Just g' -> markAllGroups' (i+1) g' onlyMarks filters a grid row and returns a list of (possibly duplicated) group numbers in the row. onlyMarks :: GridRow -> [Int] onlyMarks = mapMaybe getMark .
toList where getMark Free = Nothing getMark Used = Nothing getMark (Mark i) = Just i Finally, countGroups puts all the group numbers into a set to get rid of duplicates and returns the size of the set, i.e. the total number of separate groups. countGroups :: Grid -> Int countGroups g = Set.size groupSet where groupSet = foldl' Set.union Set.empty $ fmap rowToSet g rowToSet = Set.fromList . toList . onlyMarks As always, every Haskell program needs a main function to drive the I/O and produce the actual result. main = do input <- fmap head getArgs let grid = makeGrid input used = countUsed grid marked = countGroups $ markAllGroups grid putStrLn $ "Used sectors: " ++ show used putStrLn $ "Groups: " ++ show marked Packet Scanners — Haskell — #adventofcode Day 13 Today’s challenge requires us to sneak past a firewall made up of a series of scanners. → Full code on GitHub !!! commentary I wasn’t really thinking straight when I solved this challenge. I got a solution without too much trouble, but I ended up simulating the step-by-step movement of the scanners. I finally realised that I could calculate whether or not a given scanner was safe at a given time directly with modular arithmetic, and it bugged me so much that I reimplemented the solution. Both are given below, the faster one first. First we introduce some standard library stuff and define some useful utilities. module Main where import qualified Data.Text as T import Data.Maybe (mapMaybe) strip :: String -> String strip = T.unpack . T.strip . T.pack splitOn :: String -> String -> [String] splitOn sep = map T.unpack . T.splitOn (T.pack sep) . T.pack parseScanner :: String -> (Int, Int) parseScanner s = (d, r) where [d, r] = map read $ splitOn ": " s traverseFW does all the hard work: it checks for each scanner whether or not it’s safe as we pass through, and returns a list of the severities of each time we’re caught. mapMaybe is like the standard map in many languages, but operates on a list of Haskell Maybe values, like a combined map and filter. If the value is Just x, x gets included in the returned list; if the value is Nothing, then it gets thrown away. traverseFW :: Int -> [(Int, Int)] -> [Int] traverseFW delay = mapMaybe caught where caught (d, r) = if (d + delay) `mod` (2*(r-1)) == 0 then Just (d * r) else Nothing Then the total severity of our passage through the firewall is simply the sum of each individual severity. severity :: [(Int, Int)] -> Int severity = sum . traverseFW 0 But we don’t want to know how badly we got caught, we want to know how long to wait before setting off to get through safely. findDelay tries traversing the firewall with increasing delay, and returns the delay for the first pass where we predict not getting caught. findDelay :: [(Int, Int)] -> Int findDelay scanners = head $ filter (null . flip traverseFW scanners) [0..] And finally, we put it all together and calculate and print the result. main = do scanners <- fmap (map parseScanner . 
lines) getContents putStrLn $ "Severity: " ++ (show $ severity scanners) putStrLn $ "Delay: " ++ (show $ findDelay scanners) I’m not generally bothered about performance for these challenges, but here I’ll note that my second attempt runs in a little under 2 seconds on my laptop: $ time ./13-packet-scanners-redux < 13-input.txt Severity: 1900 Delay: 3966414 ./13-packet-scanners-redux < 13-input.txt 1.73s user 0.02s system 99% cpu 1.754 total Compare that with the first, simulation-based one, which takes nearly a full minute: $ time ./13-packet-scanners < 13-input.txt Severity: 1900 Delay: 3966414 ./13-packet-scanners < 13-input.txt 57.63s user 0.27s system 100% cpu 57.902 total And for good measure, here’s the code. Notice the tick and tickOne functions, which together simulate moving all the scanners by one step; for this to work we have to track the full current state of each scanner, which is easier to read with a Haskell record-based custom data type. traverseFW is more complicated because it has to drive the simulation, but the rest of the code is mostly the same. module Main where import qualified Data.Text as T import Control.Monad (forM_) data Scanner = Scanner { depth :: Int , range :: Int , pos :: Int , dir :: Int } instance Show Scanner where show (Scanner d r p dir) = show d ++ "/" ++ show r ++ "/" ++ show p ++ "/" ++ show dir strip :: String -> String strip = T.unpack . T.strip . T.pack splitOn :: String -> String -> [String] splitOn sep str = map T.unpack $ T.splitOn (T.pack sep) $ T.pack str parseScanner :: String -> Scanner parseScanner s = Scanner d r 0 1 where [d, r] = map read $ splitOn ": " s tickOne :: Scanner -> Scanner tickOne (Scanner depth range pos dir) | pos <= 0 = Scanner depth range (pos+1) 1 | pos >= range - 1 = Scanner depth range (pos-1) (-1) | otherwise = Scanner depth range (pos+dir) dir tick :: [Scanner] -> [Scanner] tick = map tickOne traverseFW :: [Scanner] -> [(Int, Int)] traverseFW = traverseFW' 0 where traverseFW' _ [] = [] traverseFW' layer scanners@((Scanner depth range pos _):rest) -- | layer == depth && pos == 0 = (depth*range) + (traverseFW' (layer+1) $ tick rest) | layer == depth && pos == 0 = (depth,range) : (traverseFW' (layer+1) $ tick rest) | layer == depth && pos /= 0 = traverseFW' (layer+1) $ tick rest | otherwise = traverseFW' (layer+1) $ tick scanners severity :: [Scanner] -> Int severity = sum . map (uncurry (*)) . traverseFW empty :: [a] -> Bool empty [] = True empty _ = False findDelay :: [Scanner] -> Int findDelay scanners = delay where (delay, _) = head $ filter (empty . traverseFW . snd) $ zip [0..] $ iterate tick scanners main = do scanners <- fmap (map parseScanner . lines) getContents putStrLn $ "Severity: " ++ (show $ severity scanners) putStrLn $ "Delay: " ++ (show $ findDelay scanners) Digital Plumber — Python — #adventofcode Day 12 Today’s challenge has us helping a village of programs who are unable to communicate. We have a list of the the communication channels between their houses, and need to sort them out into groups such that we know that each program can communicate with others in its own group but not any others. Then we have to calculate the size of the group containing program 0 and the total number of groups. → Full code on GitHub !!! commentary This is one of those problems where I’m pretty sure that my algorithm isn’t close to being the most efficient, but it definitely works! For the sake of solving the challenge that’s all that matters, but it still bugs me. 
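For what it’s worth, the textbook cure for that inefficiency would be a disjoint-set (union-find) structure, which merges groups in effectively constant time instead of repeatedly rescanning the whole list. Here’s a rough, untested sketch of that approach; it assumes the input has already been parsed into a list of (program, neighbours) pairs rather than being read with fileinput as in the real solution below.

```python
# Sketch only: a union-find (disjoint-set) version of the grouping problem.
# Assumes `pairs` is a list of (program, [neighbours]) tuples parsed elsewhere.
parent = {}

def find(x):
    parent.setdefault(x, x)
    while parent[x] != x:
        parent[x] = parent[parent[x]]  # path halving keeps the trees shallow
        x = parent[x]
    return x

def union(a, b):
    parent[find(a)] = find(b)

for prog, neighbours in pairs:
    for other in neighbours:
        union(prog, other)

print("Number in group 0:", sum(1 for p in parent if find(p) == find(0)))
print("Number of groups:", len({find(p) for p in parent}))
```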
By now I’ve become used to using fileinput to transparently read data either from files given on the command-line or standard input if no arguments are given. import fileinput as fi First we make an initial pass through the input data, creating a group for each line representing the programs on that line (which can communicate with each other). We store this as a Python set. groups = [] for line in fi.input(): head, rest = line.split(' <-> ') group = set([int(head)]) group.update([int(x) for x in rest.split(', ')]) groups.append(group) Now we iterate through the groups, starting with the first, and merge any we find that overlap with our current group. i = 0 while i < len(groups): current = groups[i] Each pass through the groups brings more programs into the current group, so we have to go through and check their connections too. We make several merge passes, until we detect that no more merges took place. num_groups = len(groups) + 1 while num_groups > len(groups): j = i+1 num_groups = len(groups) This inner loop does the actual merging, and deletes each group as it’s merged in. while j < len(groups): if len(current & groups[j]) > 0: current.update(groups[j]) del groups[j] else: j += 1 i += 1 All that’s left to do now is to display the results. print("Number in group 0:", len([g for g in groups if 0 in g][0])) print("Number of groups:", len(groups)) Hex Ed — Python — #adventofcode Day 11 Today’s challenge is to help a program find its child process, which has become lost on a hexagonal grid. We need to follow the path taken by the child (given as input) and calculate the distance it is from home along with the furthest distance it has been at any point along the path. → Full code on GitHub !!! commentary I found this one quite interesting in that it was very quick to solve. In fact, I got lucky and my first quick implementation (max(abs(l)) below) gave the correct answer in spite of missing an obvious not-so-edge case. Thinking about it, there’s only a ⅓ chance that the first incorrect implementation would give the wrong answer! The code is shorter, so you get more words today. ☺ There are a number of different co-ordinate systems on a hexagonal grid (I discovered while reading up after solving it…). I intuitively went for the system known as ‘axial’ coordinates, where you pick two directions aligned to the grid as your x and y axes: note that these won’t be perpendicular. I chose ne/sw as the x axis and se/nw as y, but there are three other possible choices. That leads to the following definition for the directions, encoded as numpy arrays because that makes some of the code below neater. import numpy as np STEPS = {d: np.array(v) for d, v in [('ne', (1, 0)), ('se', (0, -1)), ('s', (-1, -1)), ('sw', (-1, 0)), ('nw', (0, 1)), ('n', (1, 1))]} hex_grid_distance, given a location l, calculates the number of steps needed to reach that location from the centre at (0, 0). Notice that we can’t simply use the Manhattan distance here because, for example, one step north takes us to (1, 1), which would give a Manhattan distance of 2.
Instead, we can see that moving in the n/s direction allows us to increment or decrement both coordinates at the same time: If the coordinates have the same sign: move n/s until one of them is zero, then move along the relevant ne or se axis back to the origin; in this case the number of steps is the greatest of the absolute values of the two coordinates If the coordinates have opposite signs: move independently along the ne and se axes to reduce each to 0; this time the number of steps is the sum of the absolute values of the two coordinates def hex_grid_distance(l): if sum(np.sign(l)) == 0: # i.e. opposite signs return sum(abs(l)) else: return max(abs(l)) Now we can read in the path followed by the child and follow it ourselves, tracking the maximum distance from home along the way. path = input().strip().split(',') location = np.array((0, 0)) max_distance = 0 for step in map(STEPS.get, path): location += step max_distance = max(max_distance, hex_grid_distance(location)) distance = hex_grid_distance(location) print("Child process is at", location, "which is", distance, "steps away") print("Greatest distance was", max_distance) Knot Hash — Haskell — #adventofcode Day 10 Today’s challenge asks us to help a group of programs implement a (highly questionable) hashing algorithm that involves repeatedly reversing parts of a list of numbers. → Full code on GitHub !!! commentary I went with Haskell again today, because it’s the weekend so I have a bit more time, and I really enjoyed yesterday’s Haskell implementation. Today gave me the opportunity to explore the standard library a bit more, as well as lending itself nicely to being decomposed into smaller parts to be combined using higher-order functions. You know the drill by now: import stuff we’ll use later. module Main where import Data.Char (ord) import Data.Bits (xor) import Data.Function ((&)) import Data.List (unfoldr) import Text.Printf (printf) import qualified Data.Text as T The worked example uses a concept of the “current position” as a pointer to a location in a static list. In Haskell it makes more sense to instead use the front of the list as the current position, and rotate the whole list as we progress to bring the right element to the front. rotate :: Int -> [Int] -> [Int] rotate 0 xs = xs rotate n xs = drop n' xs ++ take n' xs where n' = n `mod` length xs The simple version of the hash requires working through the input list, modifying the working list as we go, and incrementing a “skip” counter with each step. Converting this to a functional style, we simply zip up the input with an infinite list [0, 1, 2, 3, ...] to give the counter values. Notice that we also have to calculate how far to rotate the working list to get back to its original position. foldl lets us specify a function that returns a modified version of the working list and feeds the input list in one element at a time. simpleKnotHash :: Int -> [Int] -> [Int] simpleKnotHash size input = foldl step [0..size-1] input' & rotate (negate finalPos) where input' = zip input [0..] finalPos = sum $ zipWith (+) input [0..] reversePart xs n = (reverse $ take n xs) ++ drop n xs step xs (n, skip) = reversePart xs n & rotate (n+skip) The full version of the hash (part 2 of the challenge) starts the same way as the simple version, except making 64 passes instead of one: we can do this by using replicate to make a list of 64 copies, then collapse that into a single list with concat.
fullKnotHash :: Int -> [Int] -> [Int] fullKnotHash size input = simpleKnotHash size input' where input' = concat $ replicate 64 input The next step in calculating the full hash collapses the full 256-element “sparse” hash down into 16 elements by XORing groups of 16 together. unfoldr is a nice efficient way of doing this. dense :: [Int] -> [Int] dense = unfoldr dense' where dense' [] = Nothing dense' xs = Just (foldl1 xor $ take 16 xs, drop 16 xs) The final hash step is to convert the list of integers into a hexadecimal string. hexify :: [Int] -> String hexify = concatMap (printf "%02x") These two utility functions put together building blocks from the Data.Text module to parse the input string. Note that no arguments are given: the functions are defined purely by composing other functions using the . operator. In Haskell this is referred to as “point-free” style. strip :: String -> String strip = T.unpack . T.strip . T.pack parseInput :: String -> [Int] parseInput = map (read . T.unpack) . T.splitOn (T.singleton ',') . T.pack Now we can put it all together, including building the weird input for the “full” hash. main = do input <- fmap strip getContents let simpleInput = parseInput input asciiInput = map ord input ++ [17, 31, 73, 47, 23] (a:b:_) = simpleKnotHash 256 simpleInput print $ (a*b) putStrLn $ fullKnotHash 256 asciiInput & dense & hexify Stream Processing — Haskell — #adventofcode Day 9 In today’s challenge we come across a stream that we need to cross. But of course, because we’re stuck inside a computer, it’s not water but data flowing past. The stream is too dangerous to cross until we’ve removed all the garbage, and to prove we can do that we have to calculate a score for the valid data “groups” and the number of garbage characters to remove. → Full code on GitHub !!! commentary One of my goals for this process was to knock the rust off my functional programming skills in Haskell, and I haven’t done that for the whole of the first week. Processing strings character by character and acting according to which character shows up seems like a good choice for pattern-matching though, so here we go. I also wanted to take a bash at test-driven development in Haskell, so I loaded up the Test.Hspec module to give it a try. I did find keeping track of all the state in arguments a bit mind boggling, and I think it could have been improved through use of a data type using record syntax and the `State` monad, so that's something to look at for a future challenge. First import the extra bits we’ll need. module Main where import Test.Hspec import Data.Function ((&)) countGroups solves the first part of the problem, counting up the “score” of the valid data in the stream. countGroups' is an auxiliary function that holds some state in its arguments. We use pattern matching for the base case: [] represents the empty list in Haskell, which indicates we’ve finished the whole stream. Otherwise, we split the remaining stream into its first character and remainder, and use guards to decide how to interpret it. If skip is true, discard the character and carry on with skip set back to false. If we find a “!”, that tells us to skip the next character. Other characters mark groups or sets of garbage: groups increase the score when they close and garbage is discarded. We continue to progress the list by recursing with the remainder of the stream and any updated state.
countGroups :: String -> Int countGroups = countGroups' 0 0 False False where countGroups' score _ _ _ [] = score countGroups' score level garbage skip (c:rest) | skip = countGroups' score level garbage False rest | c == '!' = countGroups' score level garbage True rest | garbage = case c of '>' -> countGroups' score level False False rest _ -> countGroups' score level True False rest | otherwise = case c of '{' -> countGroups' score (level+1) False False rest '}' -> countGroups' (score+level) (level-1) False False rest ',' -> countGroups' score level False False rest '<' -> countGroups' score level True False rest c -> error $ "Garbage character found outside garbage: " ++ show c countGarbage works almost identically to countGroups, except it ignores groups and counts garbage. They are structured so similarly that it would probably make more sense to combine them to a single function that returns both counts. countGarbage :: String -> Int countGarbage = countGarbage' 0 False False where countGarbage' count _ _ [] = count countGarbage' count garbage skip (c:rest) | skip = countGarbage' count garbage False rest | c == '!' = countGarbage' count garbage True rest | garbage = case c of '>' -> countGarbage' count False False rest _ -> countGarbage' (count+1) True False rest | otherwise = case c of '<' -> countGarbage' count True False rest _ -> countGarbage' count False False rest Hspec gives us a domain-specific language heavily inspired by the rspec library for Ruby: the tests read almost like natural language. I built up these tests one-by-one, gradually implementing the appropriate bits of the functions above, a process known as Test-driven development. runTests = hspec $ do describe "countGroups" $ do it "counts valid groups" $ do countGroups "{}" `shouldBe` 1 countGroups "{{{}}}" `shouldBe` 6 countGroups "{{{},{},{{}}}}" `shouldBe` 16 countGroups "{{},{}}" `shouldBe` 5 it "ignores garbage" $ do countGroups "{<a>,<a>,<a>,<a>}" `shouldBe` 1 countGroups "{{<ab>},{<ab>},{<ab>},{<ab>}}" `shouldBe` 9 it "skips marked characters" $ do countGroups "{{<!!>},{<!!>},{<!!>},{<!!>}}" `shouldBe` 9 countGroups "{{<a!>},{<a!>},{<a!>},{<ab>}}" `shouldBe` 3 describe "countGarbage" $ do it "counts garbage characters" $ do countGarbage "<>" `shouldBe` 0 countGarbage "<random characters>" `shouldBe` 17 countGarbage "<<<<>" `shouldBe` 3 it "ignores non-garbage" $ do countGarbage "{{},{}}" `shouldBe` 0 countGarbage "{{<ab>},{<ab>},{<ab>},{<ab>}}" `shouldBe` 8 it "skips marked characters" $ do countGarbage "<{!>}>" `shouldBe` 2 countGarbage "<!!>" `shouldBe` 0 countGarbage "<!!!>" `shouldBe` 0 countGarbage "<{o\"i!a,<{i<a>" `shouldBe` 10 Finally, the main function reads in the challenge input and calculates the answers, printing them on standard output. main = do runTests repeat '=' & take 78 & putStrLn input <- getContents & fmap (filter (/='\n')) putStrLn $ "Found " ++ show (countGroups input) ++ " groups" putStrLn $ "Found " ++ show (countGarbage input) ++ " characters garbage" I Heard You Like Registers — Python — #adventofcode Day 8 Today’s challenge describes a simple instruction set for a CPU, incrementing and decrementing values in registers according to simple conditions. We have to interpret a stream of these instructions, and to prove that we’ve done so, give the highest value of any register, both at the end of the program and throughout the whole program. → Full code on GitHub !!! 
commentary This turned out to be a nice straightforward one to implement, as the instruction format was easily parsed by regular expression, and Python provides the eval function which made evaluating the conditions a doddle. Import various standard library bits that we’ll use later. import re import fileinput as fi from math import inf from collections import defaultdict We could just parse the instructions by splitting the string, but using a regular expression is a little bit more robust because it won’t match at all if given an invalid instruction. INSTRUCTION_RE = re.compile(r'(\w+) (inc|dec) (-?\d+) if (.+)\s*') def parse_instruction(instruction): match = INSTRUCTION_RE.match(instruction) return match.group(1, 2, 3, 4) Executing an instruction simply checks the condition and if it evaluates to True updates the relevant register. def exec_instruction(registers, instruction): name, op, value, cond = instruction value = int(value) if op == 'dec': value = -value if eval(cond, globals(), registers): registers[name] += value highest_value returns the maximum value found in any register. def highest_value(registers): return sorted(registers.items(), key=lambda x: x[1], reverse=True)[0][1] Finally, loop through all the instructions and carry them out, updating global_max as we go. We need to be able to deal with registers that haven’t been accessed before. Keeping the registers in a dictionary means that we can evaluate the conditions directly using eval above, passing it as the locals argument. The standard dict will raise an exception if we try to access a key that doesn’t exist, so instead we use collections.defaultdict, which allows us to specify what the default value for a non-existent key will be. New registers start at 0, so we use a simple lambda to define a function that always returns 0. global_max = -inf registers = defaultdict(lambda: 0) for i in map(parse_instruction, fi.input()): exec_instruction(registers, i) global_max = max(global_max, highest_value(registers)) print('Max value:', highest_value(registers)) print('All-time max:', global_max) Recursive Circus — Ruby — #adventofcode Day 7 Today’s challenge introduces a set of processes balancing precariously on top of each other. We find them stuck and unable to get down because one of the processes is the wrong size, unbalancing the whole circus. Our job is to figure out the root from the input and then find the correct weight for the single incorrect process. → Full code on GitHub !!! commentary So I didn’t really intend to take a full polyglot approach to Advent of Code, but it turns out to have been quite fun, so I made a shortlist of languages to try. Building a tree is a classic application for object-orientation using a class to represent tree nodes, and I’ve always liked the feel of Ruby’s class syntax, so I gave it a go. First make sure we have access to Set, which we’ll use later. require 'set' Now to define the CircusNode class, which represents nodes in the tree. attr :s automatically creates a function s that returns the value of the instance attribute @s class CircusNode attr :name, :weight def initialize(name, weight, children=nil) @name = name @weight = weight @children = children || [] end Add a << operator (the same syntax for adding items to a list) that adds a child to this node. def <<(c) @children << c @total_weight = nil end total_weight recursively calculates the weight of this node and everything above it. The @total_weight ||= blah idiom caches the value so we only calculate it once. 
def total_weight @total_weight ||= @weight + @children.map {|c| c.total_weight}.sum end balance_weight does the hard work of figuring out the proper weight for the incorrect node by recursively searching through the tree. def balance_weight(target=nil) by_weight = Hash.new{|h, k| h[k] = []} @children.each{|c| by_weight[c.total_weight] << c} if by_weight.size == 1 then if target return @weight - (total_weight - target) else raise ArgumentError, 'This tree seems balanced!' end else odd_one_out = by_weight.select {|k, v| v.length == 1}.first[1][0] child_target = by_weight.select {|k, v| v.length > 1}.first[0] return odd_one_out.balance_weight child_target end end A couple of utility functions for displaying trees finish off the class. def to_s "#{@name} (#{@weight})" end def print_tree(n=0) puts "#{' '*n}#{self} -> #{self.total_weight}" @children.each do |child| child.print_tree n+1 end end end build_circus takes input as a list of lists [name, weight, children]. We make two passes over this list, first creating all the nodes, then building the tree by adding children to parents. def build_circus(data) all_nodes = {} all_children = Set.new data.each do |name, weight, children| all_nodes[name] = CircusNode.new name, weight end data.each do |name, weight, children| children.each {|child| all_nodes[name] << all_nodes[child]} all_children.merge children end root_name = (all_nodes.keys.to_set - all_children).first return all_nodes[root_name] end Finally, build the tree and solve the problem! Note that we use String.to_sym to convert the node names to symbols (written in Ruby as :symbol), because they’re faster to work with in Hashes and Sets as we do above. data = readlines.map do |line| match = /(?<parent>\w+) \((?<weight>\d+)\)(?: -> (?<children>.*))?/.match line [match['parent'].to_sym, match['weight'].to_i, match['children'] ? match['children'].split(', ').map {|x| x.to_sym} : []] end root = build_circus data puts "Root node: #{root}" puts root.balance_weight Memory Reallocation — Python — #adventofcode Day 6 Today’s challenge asks us to follow a recipe for redistributing objects in memory that bears a striking resemblance to the rules of the African game Mancala. → Full code on GitHub !!! commentary When I was doing my MSci, one of our programming exercises was to write (in Haskell, IIRC) a program to play a Mancala variant called Oware, so this had a nice ring of nostalgia. Back to Python today: it's already become clear that it's by far my most fluent language, which makes sense as it's the only one I've used consistently since my schooldays. I'm a bit behind on the blog posts, so you get this one without any explanation, for now at least! import math def reallocate(mem): max_val = -math.inf size = len(mem) for i, x in enumerate(mem): if x > max_val: max_val = x max_index = i i = max_index mem[i] = 0 remaining = max_val while remaining > 0: i = (i + 1) % size mem[i] += 1 remaining -= 1 return mem def detect_cycle(mem): mem = list(mem) steps = 0 prev_states = {} while tuple(mem) not in prev_states: prev_states[tuple(mem)] = steps steps += 1 mem = reallocate(mem) return (steps, steps - prev_states[tuple(mem)]) initial_state = map(int, input().split()) print("Initial state is ", initial_state) steps, cycle = detect_cycle(initial_state) print("Steps to cycle: ", steps) print("Steps in cycle: ", cycle) A Maze of Twisty Trampolines — C++ — #adventofcode Day 5 Today’s challenge has us attempting to help the CPU escape from a maze of instructions. 
It’s not quite a Turing Machine, but it has that feeling of moving a read/write head up and down a tape acting on and changing the data found there. → Full code on GitHub !!! commentary I haven’t written anything in C++ for over a decade. It sounds like there have been lots of interesting developments in the language since then, with C++11, C++14 and the freshly finalised C++17 standards (built-in parallelism in the STL!). I won’t use any of those, but I thought I’d dust off my C++ and see what happened. Thankfully the Standard Template Library classes still did what I expected! As usual, we first include the parts of the standard library we’re going to use: iostream for input & output; vector for the container. We also declare that we’re using the std namespace, so that we don’t have to prepend vector and the other classes with std::. #include <iostream> #include <vector> using namespace std; steps_to_escape_part1 implements part 1 of the challenge: we read a location, move forward/backward by the number of steps given in that location, then add one to the location before repeating. The result is the number of steps we take before jumping outside the list. int steps_to_escape_part1(vector<int>& instructions) { int pos = 0, iterations = 0, new_pos; while (pos < instructions.size()) { new_pos = pos + instructions[pos]; instructions[pos]++; pos = new_pos; iterations++; } return iterations; } steps_to_escape_part2 solves part 2, which is very similar, except that an offset greater than 3 is decremented instead of incremented before moving on. int steps_to_escape_part2(vector<int>& instructions) { int pos = 0, iterations = 0, new_pos, offset; while (pos < instructions.size()) { offset = instructions[pos]; new_pos = pos + offset; instructions[pos] += offset >=3 ? -1 : 1; pos = new_pos; iterations++; } return iterations; } Finally we pull it all together and link it up to the input. int main() { vector<int> instructions1, instructions2; int n; The cin class lets us read data from standard input, which we then add to a vector of ints to give our list of instructions. while (true) { cin >> n; if (cin.eof()) break; instructions1.push_back(n); } Solving the problem modifies the input, so we need to take a copy to solve part 2 as well. Thankfully the STL makes this easy with iterators. instructions2.insert(instructions2.begin(), instructions1.begin(), instructions1.end()); Finally, compute the result and print it on standard output. cout << steps_to_escape_part1(instructions1) << endl; cout << steps_to_escape_part2(instructions2) << endl; return 0; } High Entropy Passphrases — Python — #adventofcode Day 4 Today’s challenge describes some simple rules supposedly intended to enforce the use of secure passwords. All we have to do is test a list of passphrase and identify which ones meet the rules. → Full code on GitHub !!! commentary Fearing that today might be as time-consuming as yesterday, I returned to Python and it’s hugely powerful “batteries-included” standard library. Thankfully this challenge was more straightforward, and I actually finished this before finishing day 3. First, let’s import two useful utilities. from fileinput import input from collections import Counter Part 1 requires simply that a passphrase contains no repeated words. No problem: we split the passphrase into words and count them, and check if any was present more than once. Counter is an amazingly useful class to have in a language’s standard library. 
All it does is count things: you add objects to it, and then it will tell you how many of a given object you have. We’re going to use it to count those potentially duplicated words. def is_valid(passphrase): counter = Counter(passphrase.split()) return counter.most_common(1)[0][1] == 1 Part 2 requires that no word in the passphrase be an anagram of any other word. Since we don’t need to do anything else with the words afterwards, we can check for anagrams by sorting the letters in each word: “leaf” and “flea” both become “aefl” and can be compared directly. Then we count as before. def is_valid_ana(passphrase): counter = Counter(''.join(sorted(word)) for word in passphrase.split()) return counter.most_common(1)[0][1] == 1 Finally we pull everything together. sum(map(boolean_func, list)) is a common idiom in Python for counting the number of times a condition (checked by boolean_func) is true. In Python, True and False can be treated as the numbers 1 and 0 respectively, so that summing a list of Boolean values gives you the number of True values in the list. lines = list(input()) print(sum(map(is_valid, lines))) print(sum(map(is_valid_ana, lines))) Spiral Memory — Go — #adventofcode Day 3 Today’s challenge requires us to perform some calculations on an “experimental memory layout”, with cells moving outwards from the centre of a square spiral (squiral?). → Full code on GitHub !!! commentary I’ve been wanting to try my hand at Go, the memory-safe, statically typed compiled language from Google for a while. Today’s challenge seemed a bit more mathematical in nature, meaning that I wouldn’t need too many advanced language features or knowledge of a standard library, so I thought I’d give it a “go”. It might have been my imagination, but it was impressive how quickly the compiled program chomped through 60 different input values while I was debugging. I actually spent far too long on this problem because my brain led me down a blind alley trying to do the wrong calculation, but I got there in the end! The solution is a bit difficult to explain without diagrams, which I don't really have time to draw right now, but fear not because several other people have. First take a look at [the challenge itself which explains the spiral memory concept](http://adventofcode.com/2017/day/3). Then look at the [nice diagrams that Phil Tooley made with Python](http://acceleratedscience.co.uk/blog/adventofcode-day-3-spiral-memory/) and hopefully you'll be able to see what's going on! It's interesting to note that this challenge also admits of an algorithmic solution instead of the mathematical one: you can model the memory as an infinite grid using a suitable data structure and literally move around it in a spiral. In hindsight this is a much better way of solving the challenge quickly because it's easier and less error-prone to code. I'm quite pleased with my maths-ing though, and it's much quicker than the algorithmic version! First some Go boilerplate: we have to define the package we’re in (main, because it’s an executable we’re producing) and import the libraries we’ll use. package main import ( "fmt" "math" "os" ) Weirdly, Go doesn’t seem to have these basic mathematics functions for integers in its standard library (please someone correct me if I’m wrong!) so I’ll define them instead of mucking about with data types. Go doesn’t do any implicit type conversion, even between numeric types, and the math builtin package only operates on float64 values. 
func abs(n int) int { if n < 0 { return -n } return n } func min(x, y int) int { if x < y { return x } return y } func max(x, y int) int { if x > y { return x } return y } This does the heavy lifting for part one: converting from a position on the spiral to a column and row in the grid. (0, 0) is the centre of the spiral. This actually does a bit more than is necessary to calculate the distance as required for part 1, but we’ll use it again for part 2. func spiral_to_xy(n int) (int, int) { if n == 1 { return 0, 0 } r := int(math.Floor((math.Sqrt(float64(n-1)) + 1) / 2)) n_r := n - (2*r-1)*(2*r-1) o := ((n_r - 1) % (2 * r)) - r + 1 sector := (n_r - 1) / (2 * r) switch sector { case 0: return r, o case 1: return -o, r case 2: return -r, -o case 3: return o, -r } return 0, 0 } Now use spiral_to_xy to calculate the Manhattan distance that the value at location n in the spiral memory must be carried to reach the “access port” at 0. func distance(n int) int { x, y := spiral_to_xy(n) return abs(x) + abs(y) } This function does the opposite of spiral_to_xy, translating a grid position back to its position on the spiral. This is the one that took me far too long to figure out because I had a brain bug and tried to calculate the value s (which sector or quarter of the spiral we’re looking at) in a way that was never going to work! Fortunately I came to my senses. func xy_to_spiral(x, y int) int { if x == 0 && y == 0 { return 1 } r := max(abs(x), abs(y)) var s, o, n int if x+y > 0 && x-y >= 0 { s = 0 } else if x-y < 0 && x+y >= 0 { s = 1 } else if x+y < 0 && x-y <= 0 { s = 2 } else { s = 3 } switch s { case 0: o = y case 1: o = -x case 2: o = -y case 3: o = x } n = o + r*(2*s+1) + (2*r-1)*(2*r-1) return n } This is a utility function that uses xy_to_spiral to fetch the value at a given (x, y) location, and returns zero if we haven’t filled that location yet. func get_spiral(mem []int, x, y int) int { n := xy_to_spiral(x, y) - 1 if n < len(mem) { return mem[n] } return 0 } Finally we solve part 2 of the problem, which involves going round the spiral writing values into it that are the sum of some values already written. The result is the first of these sums that is greater than or equal to the given input value. func stress_test(input int) int { mem := make([]int, 1) n := 0 mem[0] = 1 for mem[n] < input { n++ x, y := spiral_to_xy(n + 1) mem = append(mem, get_spiral(mem, x+1, y)+ get_spiral(mem, x+1, y+1)+ get_spiral(mem, x, y+1)+ get_spiral(mem, x-1, y+1)+ get_spiral(mem, x-1, y)+ get_spiral(mem, x-1, y-1)+ get_spiral(mem, x, y-1)+ get_spiral(mem, x+1, y-1)) } return mem[n] } Now the last part of the program puts it all together, reading the input value from a command-line argument and printing the results of the two parts of the challenge: func main() { var n int fmt.Sscanf(os.Args[1], "%d", &n) fmt.Printf("Input is %d\n", n) fmt.Printf("Distance is %d\n", distance(n)) fmt.Printf("Stress test result is %d\n", stress_test(n)) } Corruption Checksum — Python — #adventofcode Day 2 Today’s challenge is to calculate a rather contrived “checksum” over a grid of numbers. → Full code on GitHub !!! commentary Today I went back to plain Python, and I didn’t do formal tests because only one test case was given for each part of the problem. I just got stuck in. I did write part 2 out as nested `for` loops as an intermediate step to working out the generator expression. I think that expanded version may have been more readable.
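Reconstructed from memory, so treat it as a sketch rather than the exact code I ran, that intermediate version looked something like this (with `sheet` being the parsed list of rows that appears below):

```python
# Rough reconstruction of the intermediate nested-loop version of part 2.
# `sheet` is assumed to be the list of rows of ints parsed as shown below.
def rowsum_div_loops(row):
    row = sorted(row)
    total = 0
    for i, x in enumerate(row):
        for y in row[i+1:]:   # only compare against the larger elements
            if y % x == 0:
                total += y // x
    return total

print(sum(rowsum_div_loops(row) for row in sheet))
```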
Having got that far, I couldn't then work out how to finally eliminate the need for an auxiliary function entirely without either sorting the same elements multiple times or sorting each row as it's read. First we read in the input, split it and convert it to numbers. fileinput.input() returns an iterator over the lines in all the files passed as command-line arguments, or over standard input if no files are given. from fileinput import input sheet = [[int(x) for x in l.split()] for l in input()] Part 1 of the challenge calls for finding the difference between the largest and smallest number in each row, and then summing those differences: print(sum(max(x) - min(x) for x in sheet)) Part 2 is a bit more involved: for each row we have to find the unique pair of elements that divide into each other without remainder, then sum the result of those divisions. We can make it a little easier by sorting each row; then we can take each number in turn and compare it only with the numbers after it (which are guaranteed to be larger). Doing this ensures we only make each comparison once. def rowsum_div(row): row = sorted(row) return sum(y // x for i, x in enumerate(row) for y in row[i+1:] if y % x == 0) print(sum(map(rowsum_div, sheet))) We can make this code shorter (if not easier to read) by sorting each row as it’s read: sheet = [sorted(int(x) for x in l.split()) for l in input()] Then we can just use the first and last elements in each row for part 1, as we know those are the smallest and largest respectively in the sorted row: print(sum(x[-1] - x[0] for x in sheet)) Part 2 then becomes a sum over a single generator expression: print(sum(y // x for row in sheet for i, x in enumerate(row) for y in row[i+1:] if y % x == 0)) Very satisfying! Inverse Captcha — Coconut — #adventofcode Day 1 Well, December’s here at last, and with it Day 1 of Advent of Code. … It goes on to explain that you may only leave by solving a captcha to prove you’re not a human. Apparently, you only get one millisecond to solve the captcha: too fast for a normal human, but it feels like hours to you. … As well as posting solutions here when I can, I’ll be putting them all on https://github.com/jezcope/aoc2017 too. !!! commentary After doing some challenges from last year in Haskell for a warm up, I felt inspired to try out the functional-ish Python dialect, Coconut. Now that I’ve done it, it feels a bit of an odd language, neither fish nor fowl. It’ll look familiar to any Pythonista, but is loaded with features normally associated with functional languages, like pattern matching, destructuring assignment, partial application and function composition. That makes it quite fun to work with, as it works similarly to Haskell, but because it's restricted by the basic rules of Python syntax everything feels a bit more like hard work than it should. The accumulator approach feels clunky, but it's necessary to allow [tail call elimination](https://en.wikipedia.org/wiki/Tail_call), which Coconut will do and I wanted to see in action. Lo and behold, if you take a look at the [compiled Python version](https://github.com/jezcope/aoc2017/blob/86c8100824bda1b35e5db6e02d4b80890be7a022/01-inverse-captcha.py#L675) you'll see that my recursive implementation has been turned into a non-recursive `while` loop. Then again, maybe I'm just jealous of Phil Tooley's [one-liner solution in Python](https://github.com/ptooley/aocGolf/blob/1380d78194f1258748ccfc18880cfd575baf5d37/2017.py#L8). 
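For comparison, and before the Coconut version itself, here’s roughly what the same two functions look like in ordinary Python; this is a quick sketch rather than anything I actually submitted.

```python
# Plain-Python sketch of the two captcha variants, for comparison with the
# Coconut solution that follows.
def inverse_captcha(s):
    # Sum digits that match the next digit, wrapping around to the start.
    return sum(int(a) for a, b in zip(s, s[1:] + s[0]) if a == b)

def inverse_captcha_halfway(s):
    # Sum digits that match the digit halfway around the (even-length) string.
    half = len(s) // 2
    return sum(int(a) for a, b in zip(s, s[half:] + s[:half]) if a == b)

assert inverse_captcha("91212129") == 9
assert inverse_captcha_halfway("12131415") == 4
```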
import sys def inverse_captcha_(s, acc=0): case reiterable(s): match (|d, d|) :: rest: return inverse_captcha_((|d|) :: rest, acc + int(d)) match (|d0, d1|) :: rest: return inverse_captcha_((|d1|) :: rest, acc) return acc def inverse_captcha(s) = inverse_captcha_(s :: s[0]) def inverse_captcha_1_(s0, s1, acc=0): case (reiterable(s0), reiterable(s1)): match ((|d0|) :: rest0, (|d0|) :: rest1): return inverse_captcha_1_(rest0, rest1, acc + int(d0)) match ((|d0|) :: rest0, (|d1|) :: rest1): return inverse_captcha_1_(rest0, rest1, acc) return acc def inverse_captcha_1(s) = inverse_captcha_1_(s, s$[len(s)//2:] :: s) def test_inverse_captcha(): assert "1111" |> inverse_captcha == 4 assert "1122" |> inverse_captcha == 3 assert "1234" |> inverse_captcha == 0 assert "91212129" |> inverse_captcha == 9 def test_inverse_captcha_1(): assert "1212" |> inverse_captcha_1 == 6 assert "1221" |> inverse_captcha_1 == 0 assert "123425" |> inverse_captcha_1 == 4 assert "123123" |> inverse_captcha_1 == 12 assert "12131415" |> inverse_captcha_1 == 4 if __name__ == "__main__": sys.argv[1] |> inverse_captcha |> print sys.argv[1] |> inverse_captcha_1 |> print Advent of Code 2017: introduction It’s a common lament of mine that I don’t get to write a lot of code in my day-to-day job. I like the feeling of making something from nothing, and I often look for excuses to write bits of code, both at work and outside it. Advent of Code is a daily series of programming challenges for the month of December, and is about to start its third annual incarnation. I discovered it too late to take part in any serious way last year, but I’m going to give it a try this year. There are no restrictions on programming language (so of course some people delight in using esoteric languages like Brainf**k), but I think I’ll probably stick with Python for the most part. That said, I miss my Haskell days and I’m intrigued by new kids on the block Go and Rust, so I might end up throwing in a few of those on some of the simpler challenges. I’d like to focus a bit more on how I solve the puzzles. They generally come in two parts, with the second part only being revealed after successful completion of the first part. With that in mind, test-driven development makes a lot of sense, because I can verify that I haven’t broken the solution to the first part in modifying to solve the second. I may also take a literate programming approach with org-mode or Jupyter notebooks to document my solutions a bit more, and of course that will make it easier to publish solutions here so I’ll do that as much as I can make time for. On that note, here are some solutions for 2016 that I’ve done recently as a warmup. 
Day 1: Python Day 1 instructions import numpy as np import pytest as t import sys TURN = { 'L': np.array([[0, 1], [-1, 0]]), 'R': np.array([[0, -1], [1, 0]]) } ORIGIN = np.array([0, 0]) NORTH = np.array([0, 1]) class Santa: def __init__(self, location, heading): self.location = np.array(location) self.heading = np.array(heading) self.visited = [(0,0)] def execute_one(self, instruction): start_loc = self.location.copy() self.heading = self.heading @ TURN[instruction[0]] self.location += self.heading * int(instruction[1:]) self.mark(start_loc, self.location) def execute_many(self, instructions): for i in instructions.split(','): self.execute_one(i.strip()) def distance_from_start(self): return sum(abs(self.location)) def mark(self, start, end): for x in range(min(start[0], end[0]), max(start[0], end[0])+1): for y in range(min(start[1], end[1]), max(start[1], end[1])+1): if any((x, y) != start): self.visited.append((x, y)) def find_first_crossing(self): for i in range(1, len(self.visited)): for j in range(i): if self.visited[i] == self.visited[j]: return self.visited[i] def distance_to_first_crossing(self): crossing = self.find_first_crossing() if crossing is not None: return abs(crossing[0]) + abs(crossing[1]) def __str__(self): return f'Santa @ {self.location}, heading {self.heading}' def test_execute_one(): s = Santa(ORIGIN, NORTH) s.execute_one('L1') assert all(s.location == np.array([-1, 0])) assert all(s.heading == np.array([-1, 0])) s.execute_one('L3') assert all(s.location == np.array([-1, -3])) assert all(s.heading == np.array([0, -1])) s.execute_one('R3') assert all(s.location == np.array([-4, -3])) assert all(s.heading == np.array([-1, 0])) s.execute_one('R100') assert all(s.location == np.array([-4, 97])) assert all(s.heading == np.array([0, 1])) def test_execute_many(): s = Santa(ORIGIN, NORTH) s.execute_many('L1, L3, R3') assert all(s.location == np.array([-4, -3])) assert all(s.heading == np.array([-1, 0])) def test_distance(): assert Santa(ORIGIN, NORTH).distance_from_start() == 0 assert Santa((10, 10), NORTH).distance_from_start() == 20 assert Santa((-17, 10), NORTH).distance_from_start() == 27 def test_turn_left(): east = NORTH @ TURN['L'] south = east @ TURN['L'] west = south @ TURN['L'] assert all(east == np.array([-1, 0])) assert all(south == np.array([0, -1])) assert all(west == np.array([1, 0])) def test_turn_right(): west = NORTH @ TURN['R'] south = west @ TURN['R'] east = south @ TURN['R'] assert all(east == np.array([-1, 0])) assert all(south == np.array([0, -1])) assert all(west == np.array([1, 0])) if __name__ == '__main__': instructions = sys.stdin.read() santa = Santa(ORIGIN, NORTH) santa.execute_many(instructions) print(santa) print('Distance from start:', santa.distance_from_start()) print('Distance to target: ', santa.distance_to_first_crossing()) Day 2: Haskell Day 2 instructions module Main where data Pos = Pos Int Int deriving (Show) -- Magrittr-style pipe operator (|>) :: a -> (a -> b) -> b x |> f = f x swapPos :: Pos -> Pos swapPos (Pos x y) = Pos y x clamp :: Int -> Int -> Int -> Int clamp lower upper x | x < lower = lower | x > upper = upper | otherwise = x clampH :: Pos -> Pos clampH (Pos x y) = Pos x' y' where y' = clamp 0 4 y r = abs (2 - y') x' = clamp r (4-r) x clampV :: Pos -> Pos clampV = swapPos . clampH . swapPos buttonForPos :: Pos -> String buttonForPos (Pos x y) = [buttons !! y !! 
x] where buttons = [" D ", " ABC ", "56789", " 234 ", " 1 "] decodeChar :: Pos -> Char -> Pos decodeChar (Pos x y) 'R' = clampH $ Pos (x+1) y decodeChar (Pos x y) 'L' = clampH $ Pos (x-1) y decodeChar (Pos x y) 'U' = clampV $ Pos x (y+1) decodeChar (Pos x y) 'D' = clampV $ Pos x (y-1) decodeLine :: Pos -> String -> Pos decodeLine p "" = p decodeLine p (c:cs) = decodeLine (decodeChar p c) cs makeCode :: String -> String makeCode instructions = lines instructions -- split into lines |> scanl decodeLine (Pos 1 1) -- decode to positions |> tail -- drop start position |> concatMap buttonForPos -- convert to buttons main = do input <- getContents putStrLn $ makeCode input Research Data Management Forum 18, Manchester !!! intro "" Monday 20 and Tuesday 21 November 2017 I’m at the Research Data Management Forum in Manchester. I thought I’d use this as an opportunity to try liveblogging, so during the event some notes should appear in the box below (you may have to manually refresh your browser tab periodically to get the latest version). I've not done this before, so if the blog stops updating then it's probably because I've stopped updating it to focus on the conference instead! This was made possible using GitHub's cool [Gist](https://gist.github.com) tool. Draft content policy I thought it was about time I had some sort of content policy on here so this is a first draft. It will eventually wind up as a separate page. Feedback welcome! !!! aside “Content policy” This blog’s primary purpose is as a reflective learning tool for my own development; my aim in writing any given post is mainly to expose and develop my own thinking on a topic. My reasons for making a public blog rather than a private journal are: 1. If I'm lucky, someone smarter than me will provide feedback that will help me and my readers to learn more 2. If I'm extra lucky, someone else might learn from the material as well Each post, therefore, represents the state of my thinking at the time I wrote it, or perhaps a deliberate provocation or exaggeration; either way, if you don't know me personally please don't judge me based entirely on my past words. This is a request though, not an attempt to excuse bad behaviour on my part. I accept full responsibility for any consequences of my words, whether intended or not. I will not remove comments or ban individuals for disagreeing with me, only for behaving offensively or disrespectfully. I will do my best to be fair and balanced and explain decisions that I take, but I reserve the right to take those decisions without making any explanation at all if it seems likely to further inflame a situation. If I end up responding to anything simply with a link to this policy, that's probably all the explanation you're going to get. It should go without saying, but the opinions presented in this blog are my own and not those of my employer or anyone else I might at times represent. Learning to live with anxiety !!! intro "" This is a post that I’ve been writing for months, and writing in my head for years. For some it will explain aspects of my personality that you might have wondered about. For some it will just be another person banging on self-indulgently about so-called “mental health issues”. Hopefully, for some it will demystify some stuff and show that you’re not alone and things do get better. For as long as I can remember I’ve been a worrier. I’ve also suffered from bouts of what I now recognise as depression, on and off since my school days. 
It’s only relatively recently that I’ve come to the realisation that these two might be connected and that my ‘worrying’ might in fact be outside the normal range of healthy human behaviour and might more accurately be described as chronic anxiety. You probably won’t have noticed it, but it’s been there. More recently I’ve begun feeling like I’m getting on top of it and feeling “normal” for the first time in my life. Things I’ve found that help include: getting out of the house more and socialising with friends; and getting a range of exercise, outdoors and away from the city (rock climbing is mentally and physically engaging and open water swimming is indescribably joyful). But mostly it’s the cognitive behavioural therapy (CBT) and the antidepressants. Before I go any further, a word about drugs (“don’t do drugs, kids”): I’m on the lowest available dose of a common antidepressant. This isn’t because it stops me being sad all the time (I’m not) or because it makes all my problems go away (it really doesn’t). It’s because the scientific evidence points to a combination of CBT and antidepressants as being the single most effective treatment for generalised anxiety disorder. The reason for this is simple: CBT isn’t easy, because it asks you to challenge habits and beliefs you’ve held your whole life. In the short term there is going to be more anxiety and some antidepressants are also effective at blunting the effect of this additional anxiety. In short, CBT is what makes you better, and the drugs just make it a little bit more effective. A lot of people have misconceptions about what it means to be ‘in therapy’. I suspect a lot of these are derived from the psychoanalysis we often see portrayed in (primarily US) film and TV. The problem with that type of navel-gazing therapy is that you can spend years doing it, finally reach some sort of breakthrough insight, and still have no idea what the supposed insight means for your actual life. CBT is different in that rather than addressing feelings directly it focuses on habits in your thoughts (cognitive) and actions (behavioural) with feeling better as an outcome (therapy). CBT and related forms of therapy now have decades of clinical evidence showing that they really work. CBT uses a wide range of techniques to identify, challenge and reduce various common unhelpful thoughts and behaviours. By choosing and practicing these, you can break bad mental habits that you’ve been carrying around, often for decades. For me this means giving fair weight to my successes as well as my failings, allowing flexibility into the rigid rules that I have always, subconsciously, lived by, and being a bit kinder to myself when I make mistakes. It’s not been easy and I have to remind myself to practice this every day, but it’s really helped. !!! aside “More info” If you live in the UK, you might not be aware that you can get CBT and other psychological therapies on the NHS through a scheme called IAPT (improving access to psychological therapies). You can self-refer so you don’t need to see a doctor first, but you might want to anyway if you think medication might help. They also have a progression of treatments, so you might be offered a course of “guided self-help” and then progressed to CBT or another talking therapy if need be. This is what happened to me, and it did help a bit but it was CBT that helped me the most. Becoming a librarian What is a librarian? Is it someone who has a masters degree in librarianship and information science?
Is it someone who looks after information for other people? Is it simply someone who works in a library? I’ve been grappling with this question a lot lately because I’ve worked in academic libraries for about 3 years now and I never really thought that’s something that might happen. People keep referring to me as “a librarian” but there’s some imposter feelings here because all the librarians around me have much more experience, have skills in areas like cataloguing and collection management and, generally, have a librarian masters degree. So I’ve been thinking about what it actually means to me to be a librarian or not. NB. some of these may be tongue-in-cheek Ways in which I am a librarian: I work in a library I help people to access and organise information I have a cat I like gin Ways in which I am not a librarian: I don’t have a librarianship qualification I don’t work with books 😉 I don’t knit (though I can probably remember how if pressed) I don’t shush people or wear my hair in a bun (I can confirm that this is also true of every librarian I know) Ways in which I am a shambrarian: I like beer I have more IT experience and qualification than librarianship At the end of the day, I still don’t know how I feel about this or, for that matter, how important it is. I’m probably going to accept whatever title people around me choose to bestow, though any label will chafe at times! Lean Libraries: applying agile practices to library services Kanban board Jeff Lasovski (via Wikimedia Commons) I’ve been working with our IT services at work quite closely for the last year as product owner for our new research data portal, ORDA. That’s been a fascinating process for me as I’ve been able to see first-hand some of the agile techniques that I’ve been reading about from time-to-time on the web over the last few years. They’re in the process of adopting a specific set of practices going under the name “Scrum”, which is fun because it uses some novel terminology that sounds pretty weird to non-IT folks, like “scrum master”, “sprint” and “product backlog”. On my small project we’ve had great success with the short cycle times and been able to build trust with our stakeholders by showing concrete progress on a regular basis. Modern librarianship is increasingly fluid, particularly in research services, and I think that to handle that fluidity it’s absolutely vital that we are able to work in a more agile way. I’m excited about the possibilities of some of these ideas. However, Scrum as implemented by our IT services doesn’t seem something that transfers directly to the work that we do: it’s too specialised for software development to adapt directly. What I intend to try is to steal some of the individual practices on an experimental basis and simply see what works and what doesn’t. The Lean concepts currently popular in IT were originally developed in manufacturing: if they can be translated from the production of physical goods to IT, I don’t see why we can’t make the ostensibly smaller step of translating them to a different type of knowledge work. I’ve therefore started reading around this subject to try and get as many ideas as possible. I’m generally pretty rubbish at taking notes from books, so I’m going to try and record and reflect on any insights I make on this blog. The framework for trying some of these out is clearly a Plan-Do-Check-Act continuous improvement cycle, so I’ll aim to reflect on that process too. 
I’m sure there will have been people implementing Lean in libraries already, so I’m hoping to be able to discover and learn from them instead of starting from scratch. Wish me luck! Mozilla Global Sprint 2017 Photo by Lena Bell on Unsplash Every year, the Mozilla Foundation runs a two-day Global Sprint, giving people around the world 50 hours to work on projects supporting and promoting open culture and tech. Though much of the work during the sprint is, of course, technical software development work, there are always tasks suited to a wide range of different skill sets and experience levels. The participants include writers, designers, teachers, information professionals and many others. This year, for the first time, the University of Sheffield hosted a site, providing a space for local researchers, developers and others to get out of their offices, work on #mozsprint and link up with others around the world. The Sheffield site was organised by the Research Software Engineering group in collaboration with the University Library. Our site was only small compared to others, but we still had people working on several different projects. My reason for taking part in the sprint was to contribute to the international effort on the Library Carpentry project. A team spread across four continents worked throughout the whole sprint to review and develop our lesson material. As there were no other Library Carpentry volunteers at the Sheffield site, I chose to do some urgent work around improving the presentation of our workshops and lessons on the web and related workflows. It was a really nice subproject to work on, requiring not only cleaning up and normalising the metadata we hold on workshops and lessons, but also digesting and formalising our current ad hoc process of lesson development. The largest group were solar physicists from the School of Maths and Statistics, working on the SunPy project, an open source environment for solar data analysis. They pushed loads of bug fixes and documentation improvements, and also mentored a new contributor through their first additions to the project. Anna Krystalli from Research Software Engineering worked on the EchoBurst project, which is building a web browser extension to help people break out of their online echo chambers. It does this by using natural language processing techniques to highlight well-written, logically sound articles that disagree with the reader’s stated views on particular topics of interest. Anna was part of an effort to begin extending this technology to online videos. We had a couple of individuals simply taking the opportunity to break out of their normal work environments to work or learn, including a couple of members of library staff who showed up for a couple of hours to learn how to use git on a new project! IDCC 2017 reflection For most of the last few years I've been lucky enough to attend the International Digital Curation Conference (IDCC). One of the main audiences attending is people who, like me, work on research data management at universities around the world and it's begun to feel like a sort of "home" conference to me. This year, IDCC was held at the Royal College of Surgeons in the beautiful city of Edinburgh.
For the last couple of years, my overall impression has been that, as a community, we're moving away from the "first-order" problem of trying to convince people (from PhD students to senior academics) to take RDM seriously and into a rich set of "second-order" problems around how to do things better and widen support to more people. This year has been no exception. Here are a few of my observations and takeaway points. Everyone has a repository now Only last year, the most common question you'd get asked by strangers in the coffee break would be "Do you have a data repository?" Now the question is more likely to be "What are you using for your data repository?", along with more subtle questions about specific components of systems and how they interact. Integrating active storage and archival systems Now that more institutions have data worth preserving, there is more interest in (and in many cases experience of) setting up more seamless integrations between active and archival storage. There are lessons here we can learn. Freezing in amber vs actively maintaining assets There seemed to be an interesting debate going on throughout the conference around the aim of preservation: should we be faithfully preserving the bits and bytes provided without trying to interpret them, or should we take a more active approach by, for example, migrating obsolete formats to newer alternatives? If the former, should we attempt to preserve the software required to access the data as well? If the latter, how much effort do we invest and how do we ensure nothing is lost or altered in the migration? Demonstrating Data Science instead of debating what it is The phrase "Data Science" was once again one of the most commonly uttered phrases of the conference. However, there is now less abstract discussion about what, exactly, is meant by this "data science" thing; this has largely been replaced by concrete demonstrations. This change was exemplified perfectly by the keynote by data scientist Alice Daish, who spent a riveting 40 minutes or so enthusing about all the cool stuff she does with data at the British Museum. Recognition of software as an issue Even as recently as last year, I struggled to drum up much interest in discussing software sustainability and preservation at events like this; the interest was there, but there were higher priorities. So I was completely taken by surprise when we ended up with 30+ people in the Software Preservation Birds of a Feather (BoF) session, and when very little input was needed from me as chair to keep a productive discussion going for a full 90 minutes. Unashamed promotion of openness As a community we seem to have nearly overthrown our collective embarrassment about the phrase "open data" (although maybe this is just me). We've always known it was a good thing, but I know I've been a bit of an apologist in the past, feeling that I had to "soften the blow" when asking researchers to be more open. Now I feel more confident in leading with the benefits of openness, and it felt like that's a change reflected in the community more widely. Becoming more involved in the conference This year, I took a decision to try and do more to contribute to the conference itself, and I felt like this was pretty successful both in making that contribution and building up my own profile a bit. I presented a paper on one of my current passions, Library Carpentry; it felt really good to be able to share my enthusiasm.
I presented a poster on our work integrating our data repository and digital preservation platform; this gave me more of a structure for networking during breaks, as I was able to stand by the poster and start discussions with anyone who seemed interested. I chaired a parallel session: a first for me, and a different challenge from presenting or simply attending the talks. And finally, I proposed and chaired the Software Preservation BoF session (blog post forthcoming). Renewed excitement It's weird, and possibly all in my imagination, but there seemed to be more energy at this conference than at the previous couple I've been to. More people seemed to be excited about the work we're all doing, recent achievements and the possibilities for the future. Introducing PyRefine: OpenRefine meets Python I’m knocking the rust off my programming skills by attempting to write a pure-Python interpreter for OpenRefine “scripts”. OpenRefine is a great tool for exploring and cleaning datasets prior to analysing them. It also records an undo history of all actions that you can export as a sort of script in JSON format. One thing that bugs me though is that, having spent some time interactively cleaning up your dataset, you then need to fire up OpenRefine again and do some interactive mouse-clicky stuff to apply that cleaning routine to another dataset. You can at least re-import the JSON undo history to make that as quick as possible, but there’s no getting around the fact that there’s no quick way to do it from a cold start. There is a project, BatchRefine, that extends the OpenRefine server to accept batch requests over an HTTP API, but that isn’t useful when you can’t or don’t want to keep a full Java stack running in the background the whole time. My concept is this: you use OpenRefine to explore the data interactively and design a cleaning process, but then export the process to JSON and integrate it into your analysis in Python. That way it can be repeated ad nauseam without having to fire up a full Java stack. I’m taking some inspiration from the great talk “So you want to be a wizard?” by Julia Evans (@b0rk), who recommends trying experiments as a way to learn. She gives these Rules of Programming Experiments: “it doesn’t have to be good; it doesn’t have to work; you have to learn something” In that spirit, my main priorities are: to see if this can be done; to see how far I can get implementing it; and to learn something. If it also turns out to be a useful thing, well, that’s a bonus. Some of the interesting possible challenges here: Implement all core operations; there are quite a lot of these, some of which will be fun (i.e. non-trivial) to implement Implement (a subset of?) GREL, the General Refine Expression Language; I guess my undergrad course on implementing parsers and compilers will come in handy after all! Generate clean, sane Python code from the JSON rather than merely executing it; more than anything, this would be a nice educational tool for users of OpenRefine who want to see how to do equivalent things in Python Selectively optimise key parts of the process; this will involve profiling the code to identify bottlenecks as well as tweaking the actual code to go faster Potentially handle contributions to the code from other people; I’d be really happy if this happened but I’m realistic… If you’re interested, the project is called PyRefine and it’s on github. Constructive criticism, issues & pull requests all welcome!
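To make the concept a bit more concrete, here is a minimal sketch of the sort of thing PyRefine aims to do: load an exported OpenRefine JSON history and replay a couple of operations against a pandas DataFrame. This is an illustration rather than the actual PyRefine code, and the operation and field names ("core/column-rename" and so on) are from memory, so check them against a real OpenRefine export before relying on them.

import json
import pandas as pd

def apply_history(df: pd.DataFrame, history_path: str) -> pd.DataFrame:
    """Replay a (simplified) OpenRefine undo history against a DataFrame."""
    with open(history_path) as f:
        operations = json.load(f)  # assumed to be a list of operation dicts

    for op in operations:
        kind = op.get("op")
        if kind == "core/column-rename":
            # field names assumed; verify against a real export
            df = df.rename(columns={op["oldColumnName"]: op["newColumnName"]})
        elif kind == "core/column-removal":
            df = df.drop(columns=[op["columnName"]])
        else:
            raise NotImplementedError(f"Operation not implemented yet: {kind}")
    return df

# Usage: apply a recipe designed interactively in OpenRefine to a fresh dataset
# cleaned = apply_history(pd.read_csv("new-data.csv"), "cleaning-history.json")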
Implementing Yesterbox in emacs with mu4e I’ve been meaning to give Yesterbox a try for a while. The general idea is that each day you only deal with email that arrived yesterday or earlier. This forms your inbox for the day, hence “yesterbox”. Once you’ve emptied your yesterbox, or at least got through some minimum number (10 is recommended), then you can look at emails from today. Even then you only really want to be dealing with things that are absolutely urgent. Anything else can wait til tomorrow. The motivation for doing this is to get away from the feeling that we are King Canute, trying to hold back the tide. I find that when I’m processing my inbox toward zero there’s always a temptation to keep skipping to the new stuff that’s just come in. Hiding away the new email until I’ve dealt with the old is a very interesting idea. I use mu4e in emacs for reading my email, and handily the mu search syntax is very flexible so you’d think it would be easy to create a yesterbox filter: maildir:"/INBOX" date:..1d Unfortunately, 1d is interpreted as “24 hours ago from right now” so this filter misses everything that was sent yesterday but less than 24 hours ago. There was a feature request raised on the mu github repository to implement an additional date filter syntax but it seems to have died a death for now. In the meantime, the answer to this is to remember that my workplace observes fairly standard office hours, so that anything sent more than 9 hours ago is unlikely to have been sent today. The following does the trick: maildir:"/INBOX" date:..9h In my mu4e bookmarks list, that looks like this:

(setq mu4e-bookmarks
      '(("flag:unread AND NOT flag:trashed" "Unread messages" ?u)
        ("flag:flagged maildir:/archive" "Starred messages" ?s)
        ("date:today..now" "Today's messages" ?t)
        ("date:7d..now" "Last 7 days" ?w)
        ("maildir:\"/Mailing lists.*\" (flag:unread OR flag:flagged)" "Unread in mailing lists" ?M)
        ("maildir:\"/INBOX\" date:..9h" "Yesterbox" ?y))) ;; <- this is the new one

Rewarding good practice in research From opensource.com on Flickr Whenever I’m involved in a discussion about how to encourage researchers to adopt new practices, eventually someone will come out with some variant of the following phrase: “That’s all very well, but researchers will never do XYZ until it’s made a criterion in hiring and promotion decisions.” With all the discussion of carrots and sticks I can see where this attitude comes from, and strongly empathise with it, but it raises two main problems: It’s unfair and more than a little insulting to anyone to be lumped into one homogeneous group; and Taking all the different possible XYZs into account, that’s an awful lot of hoops to expect anyone to jump through. Firstly, “researchers” are as diverse as the rest of us in terms of what gets them out of bed in the morning. Some of us want prestige; some want to contribute to a greater good; some want to create new things; some just enjoy the work. One thing I’d argue we all have in common is this: nothing is more offputting than feeling like you’re being strongarmed into something you don’t want to do. If we rely on simplistic metrics, people will focus on those and miss the point. At best people will disengage and at worst they will actively game the system. I’ve got to do these ten things to get my next payrise, and still retain my sanity? Ok, what’s the least I can get away with and still tick them off? You see it with students taking poorly-designed assessments and grown-ups are no different.
We do need to wield carrots as well as sticks, but the whole point is that these practices are beneficial in and of themselves. The carrots are already there if we articulate them properly and clear the roadblocks (don’t you enjoy mixed metaphors?). Creating artificial benefits will just dilute the value of the real ones. Secondly, I’ve heard a similar argument made for all of the following practices and more: Research data management Open Access publishing Public engagement New media (e.g. blogging) Software management and sharing Some researchers devote every waking hour to their work, whether it’s in the lab, writing grant applications, attending conferences, authoring papers, teaching, and so on and so on. It’s hard to see how someone with all this in their schedule can find time to exercise any of these new skills, let alone learn them in the first place. And what about the people who sensibly restrict the hours taken by work to spend more time doing things they enjoy? Yes, all of the above practices are valuable, both for the individual and the community, but they’re all new (to most) and hence require more effort up front to learn. We have to accept that it’s inevitably going to take time for all of them to become “business as usual”. I think if the hiring/promotion/tenure process has any role in this, it’s in asking whether the researcher can build a coherent narrative as to why they’ve chosen to focus their efforts in this area or that. You’re not on Twitter but your data is being used by 200 research groups across the world? Great! You didn’t have time to tidy up your source code for github but your work is directly impacting government policy? Brilliant! We still need to convince more people to do more of these beneficial things, so how? Call me naïve, but maybe we should stick to making rational arguments, calming fears and providing low-risk opportunities to learn new skills. Acting (compassionately) like a stuck record can help. And maybe we’ll need to scale back our expectations in other areas (journal impact factors, anyone?) to make space for the new stuff. Software Carpentry: SC Test; does your software do what you meant? “The single most important rule of testing is to do it.” — Brian Kernighan and Rob Pike, The Practice of Programming (quote taken from SC Test page) One of the trickiest aspects of developing software is making sure that it actually does what it’s supposed to. Sometimes failures are obvious: you get completely unreasonable output or even (shock!) a comprehensible error message. But failures are often more subtle. Would you notice if your result was out by a few percent, or consistently ignored the first row of your input data? The solution to this is testing: take some simple example input with a known output, run the code and compare the actual output with the expected one. Implement a new feature, test and repeat. Sounds easy, doesn’t it? But then you implement a new bit of code. You test it and everything seems to work fine, except that your new feature required changes to existing code and those changes broke something else. So in fact you need to test everything, and do it every time you make a change. Further than that, you probably want to test that all your separate bits of code work together properly (integration testing) as well as testing the individual bits separately (unit testing). In fact, splitting your tests up like that is a good way of holding on to your sanity.
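As a tiny, concrete illustration of that cycle, here is the sort of thing a unit test looks like using Python's pytest framework (the function and the example values are made up for the purpose of the sketch):

def mean(values):
    """The code under test: the arithmetic mean of a list of numbers."""
    return sum(values) / len(values)

def test_mean_of_simple_list():
    # simple example input with a known expected output
    assert mean([1, 2, 3, 4]) == 2.5

def test_mean_of_single_value():
    assert mean([42]) == 42

Running pytest from the command line discovers every function whose name starts with test_, runs the lot and reports any assertion that fails, so the whole suite can be re-run with a single command after every change.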
This is actually a lot less scary than it sounds, because there are plenty of tools now to automate that testing: you just type a simple test command and everything is verified. There are even tools that run your tests automatically whenever you check the code into version control (a process known as continuous integration or CI), and some will go further and automatically deploy code that passes the tests (continuous deployment). The big problems with testing are that it’s tedious, your code seems to work without it and no-one tells you off for not doing it. At the time when the Software Carpentry competition was being run, the idea of testing wasn’t new, but the tools to help were in their infancy. “Existing tools are obscure, hard to use, expensive, don’t actually provide much help, or all three.” The SC Test category asked entrants “to design a tool, or set of tools, which will help programmers construct and maintain black box and glass box tests of software components at all levels, including functions, modules, and classes, and whole programs.” The SC Test category is interesting in that the competition administrators clearly found it difficult to specify what they wanted to see in an entry. In fact, the whole category was reopened with a refined set of rules and expectations. Ultimately, it’s difficult to tell whether this category made a significant difference. Where the tools to write tests used to be very sparse and difficult to use, they are now plentiful, with several options available for most programming languages. With this proliferation, several tried-and-tested methodologies have emerged which are consistent across many different tools, so while things still aren’t perfect they are much better. In recent years there has been a culture shift in the wider software development community towards both testing in general and test-first development, where the tests for a new feature are written first, and then the implementation is coded incrementally until all tests pass. The current challenge is to transfer this culture shift to the academic research community! Tools for collaborative markdown editing Photo by Alan Cleaver I really love Markdown1. I love its simplicity; its readability; its plain-text nature. I love that it can be written and read with nothing more complicated than a text-editor. I love how nicely it plays with version control systems. I love how easy it is to convert to different formats with Pandoc and how it’s become effectively the native text format for a wide range of blogging platforms. One frustration I’ve had recently, then, is that it’s surprisingly difficult to collaborate on a Markdown document. There are various solutions that almost work but at best feel somehow inelegant, especially when compared with rock solid products like Google Docs. Finally, though, we’re starting to see some real possibilities. Here are some of the things I’ve tried, but I’d be keen to hear about other options. 1. Just suck it up To be honest, Google Docs isn’t that bad. In fact it works really well, and has almost no learning curve for anyone who’s ever used Word (i.e. practically anyone who’s used a computer since the 90s). When I’m working with non-technical colleagues there’s nothing I’d rather use. It still feels a bit uncomfortable though, especially the vendor lock-in. You can export a Google Doc to Word, ODT or PDF, but you need to use Google Docs to do that. Plus as soon as I start working in a word processor I get tempted to muck around with formatting. 2.
Git(hub) The obvious solution to most techies is to set up a GitHub repo, commit the document and go from there. This works very well for bigger documents written over a longer time, but seems a bit heavyweight for a simple one-page proposal, especially over short timescales. Who wants to muck around with pull requests and merging changes for a document that’s going to take 2 days to write tops? This type of project doesn’t need a bug tracker or a wiki or a public homepage anyway. Even without GitHub in the equation, using git for such a trivial use case seems clunky. 3. Markdown in Etherpad/Google Docs Etherpad is a great tool for collaborative editing, but suffers from two key problems: no syntax highlighting or preview for markdown (it’s just treated as simple text); and you need to find a server to host it or do it yourself. However, there’s nothing to stop you editing markdown with it. You can do the same thing in Google Docs, in fact, and I have. Editing a fundamentally plain-text format in a word processor just feels weird though. 4. Overleaf/Authorea Overleaf and Authorea are two products developed to support academic editing. Authorea has built-in markdown support but lacks proper simultaneous editing. Overleaf has great simultaneous editing but only supports markdown by wrapping a bunch of LaTeX boilerplate around it. Both OK but unsatisfactory. 5. StackEdit Now we’re starting to get somewhere. StackEdit has both Markdown syntax highlighting and near-realtime preview, as well as integrating with Google Drive and Dropbox for file synchronisation. 6. HackMD HackMD is one that I only came across recently, but it looks like it does exactly what I’m after: a simple markdown-aware editor with live preview that also permits simultaneous editing. I’m a little circumspect simply because I know simultaneous editing is difficult to get right, but it certainly shows promise. 7. Classeur I discovered Classeur literally today: it’s developed by the same team as StackEdit (which is now apparently no longer in development), and is currently in beta, but it looks to offer two killer features: real-time collaboration, including commenting, and pandoc-powered export to loads of different formats. Anything else? Those are the options I’ve come up with so far, but they can’t be the only ones. Is there anything I’ve missed? Other plain-text formats are available. I’m also a big fan of org-mode. ↩︎ Software Carpentry: SC Track; hunt those bugs! This competition will be an opportunity for the next wave of developers to show their skills to the world — and to companies like ours. — Dick Hardt, ActiveState (quote taken from SC Track page) All code contains bugs, and all projects have features that users would like but which aren’t yet implemented. Open source projects tend to get more of these as their user communities grow and start requesting improvements to the product. As your open source project grows, it becomes harder and harder to keep track of and prioritise all of these potential chunks of work. What do you do? The answer, as ever, is to make a to-do list. Different projects have used different solutions, including mailing lists, forums and wikis, but fairly quickly a whole separate class of software evolved: the bug tracker, which includes such well-known examples as Bugzilla, Redmine and the mighty JIRA. Bug trackers are built entirely around such requests for improvement, and typically track them through workflow stages (planning, in progress, fixed, etc.)
with scope for the community to discuss and add various bits of metadata. In this way, it becomes easier both to prioritise problems against each other and to use the hive mind to find solutions. Unfortunately most bug trackers are big, complicated beasts, more suited to large projects with dozens of developers and hundreds or thousands of users. Clearly a project of this size is more difficult to manage and requires a certain feature set, but the result is that the average bug tracker is non-trivial to set up for a small single-developer project. The SC Track category asked entrants to propose a better bug tracking system. In particular, the judges were looking for something easy to set up and configure without compromising on functionality. The winning entry was a bug-tracker called Roundup, proposed by Ka-Ping Yee. Here we have another tool which is still in active use and development today. Given that there is now a huge range of options available in this area, including the mighty github, this is no small achievement. These days, of course, github has become something of a de facto standard for open source project management. Although ostensibly a version control hosting platform, each github repository also comes with a built-in issue tracker, which is also well-integrated with the “pull request” workflow system that allows contributors to submit bug fixes and features themselves. Github’s competitors, such as GitLab and Bitbucket, also include similar features. Not everyone wants to work in this way though, so it’s good to see that there is still a healthy ecosystem of open source bug trackers, and that Software Carpentry is still having an impact. Software Carpentry: SC Config; write once, compile anywhere Nine years ago, when I first released Python to the world, I distributed it with a Makefile for BSD Unix. The most frequent questions and suggestions I received in response to these early distributions were about building it on different Unix platforms. Someone pointed me to autoconf, which allowed me to create a configure script that figured out platform idiosyncrasies. Unfortunately, autoconf is painful to use – its grouping, quoting and commenting conventions don’t match those of the target language, which makes scripts hard to write and even harder to debug. I hope that this competition comes up with a better solution — it would make porting Python to new platforms a lot easier! — Guido van Rossum, Technical Director, Python Consortium (quote taken from SC Config page) On to the next Software Carpentry competition category, then. One of the challenges of writing open source software is that you have to make it run on a wide range of systems over which you have no control. You don’t know what operating system any given user might be using or what libraries they have installed, or even what versions of those libraries. This means that whatever build system you use, you can’t just send the Makefile (or whatever) to someone else and expect everything to go off without a hitch. For a very long time, it’s been common practice for source packages to include a configure script that, when executed, runs a bunch of tests to see what it has to work with and sets up the Makefile accordingly. Writing these scripts by hand is a nightmare, so tools like autoconf and automake evolved to make things a little easier. They did, and if the tests you want to use are already implemented they work very well indeed.
Unfortunately they’re built on an unholy combination of shell scripting and the archaic Gnu M4 macro language. That means if you want to write new tests you need to understand both of these as well as the architecture of the tools themselves — not an easy task for the average self-taught research programmer. SC Conf, then, called for a re-engineering of the autoconf concept, to make it easier for researchers to make their code available in a portable, platform-independent format. The second round configuration tool winner was SapCat, “a tool to help make software portable”. Unfortunately, this one seems not to have gone anywhere, and I could only find the original proposal on the Internet Archive. There were a lot of good ideas in this category about making catalogues and databases of system quirks to avoid having to rerun the same expensive tests again the way a standard ./configure script does. I think one reason none of these ideas survived is that they were overly ambitious, imagining a grand architecture where their tool would provide some overarching source of truth. This is in stark contrast to the way most Unix-like systems work, where each tool does one very specific job well and tools are easy to combine in various ways. In the end though, I think Moore’s Law won out here, making it easier to do the brute-force checks each time than to try anything clever to save time — a good example of avoiding unnecessary optimisation. Add to that the evolution of the generic pkg-config tool from earlier package-specific tools like gtk-config, and it’s now much easier to check for particular versions and features of common packages. On top of that, much of the day-to-day coding of a modern researcher happens in interpreted languages like Python and R, which give you a fully-functioning pre-configured environment with a lot less compiling to do. As a side note, Tom Tromey, another of the shortlisted entrants in this category, is still a major contributor to the open source world. He still seems to be involved in the automake project, contributes a lot of code to the emacs community too and blogs sporadically at The Cliffs of Inanity. Semantic linefeeds: one clause per line I’ve started using “semantic linefeeds” when writing content, a concept I discovered on Brandon Rhodes' blog and one described in that article far better than I could. It turns out this is a very old idea, promoted way back in the day by Brian W Kernighan, contributor to the original Unix system, co-creator of the AWK and AMPL programming languages and co-author of a lot of seminal programming textbooks including “The C Programming Language”. The basic idea is that you break lines at natural gaps between clauses and phrases, rather than simply after the last word before you hit 80 characters. Keeping line lengths strictly to 80 characters isn’t really necessary in these days of wide aspect ratios for screens. Breaking lines at points that make semantic sense in the sentence is really helpful for editing, especially in the context of version control, because it isolates changes to the clause in which they occur rather than just the nearest 80-character block. I also like it because it makes my crappy prose feel just a little bit more like poetry. ☺ Software Carpentry: SC Build; or making a better make Software tools often grow incrementally from small beginnings into elaborate artefacts. Each increment makes sense, but the final edifice is a mess.
make is an excellent example: a simple tool that has grown into a complex domain-specific programming language. I look forward to seeing the improvements we will get from designing the tool afresh, as a whole… — Simon Peyton-Jones, Microsoft Research (quote taken from SC Build page) Most people who have had to compile an existing software tool will have come across the venerable make tool (which usually these days means GNU Make). It allows the developer to write a declarative set of rules specifying how the final software should be built from its component parts, mostly source code, allowing the build itself to be carried out by simply typing make at the command line and hitting Enter. Given a set of rules, make will work out all the dependencies between components and ensure everything is built in the right order and nothing that is up-to-date is rebuilt. Great in principle but make is notoriously difficult for beginners to learn, as much of the logic for how builds are actually carried out is hidden beneath the surface. This also makes it difficult to debug problems when building large projects. For these reasons, the SC Build category called for a replacement build tool engineered from the ground up to solve these problems. The second round winner, ScCons, is a Python-based make-like build tool written by Steven Knight. While I could find no evidence of any of the other shortlisted entries, this project (now renamed SCons) continues in active use and development to this day. I actually use this one myself from time to time and to be honest I prefer it in many cases to trendy new tools like rake or grunt and the behemoth that is Apache Ant. Its Python-based SConstruct file syntax is remarkably intuitive and scales nicely from very simple builds up to big and complicated projects, with good dependency tracking to avoid unnecessary recompiling. It has a lot of built-in rules for performing common build & compile tasks, but it’s trivial to add your own, either by combining existing building blocks or by writing a new builder with the full power of Python. A minimal SConstruct file looks like this: Program('hello.c') Couldn’t be simpler! And you have the full power of Python syntax to keep your build file simple and readable. It’s interesting that all the entries in this category apart from one chose to use a Python-derived syntax for describing build steps. Python was clearly already a language of choice for flexible multi-purpose computing. The exception is the entry that chose to use XML instead, which I think is a horrible idea (oh how I used to love XML!) but has been used to great effect in the Java world by tools like Ant and Maven. What happened to the original Software Carpentry? “Software Carpentry was originally a competition to design new software tools, not a training course. The fact that you didn’t know that tells you how well it worked.” When I read this in a recent post on Greg Wilson’s blog, I took it as a challenge. I actually do remember the competition, although looking at the dates it was long over by the time I found it. I believe it did have impact; in fact, I still occasionally use one of the tools it produced, so Greg’s comment got me thinking: what happened to the other competition entries? Working out what happened will need a bit of digging, as most of the relevant information is now only available on the Internet Archive. It certainly seems that by November 2008 the domain name had been allowed to lapse and had been replaced with a holding page by the registrar.
There were four categories in the competition, each representing a category of tool that the organisers thought could be improved: SC Build: a build tool to replace make SC Conf: a configuration management tool to replace autoconf and automake SC Track: a bug tracking tool SC Test: an easy to use testing framework I’m hoping to be able to show that this work had a lot more impact than Greg is admitting here. I’ll keep you posted on what I find! Changing static site generators: Nanoc → Hugo I’ve decided to move the site over to a different static site generator, Hugo. I’ve been using Nanoc for a long time and it’s worked very well, but lately it’s been taking longer and longer to compile the site and throwing weird errors that I can’t get to the bottom of. At the time I started using Nanoc, static site generators were in their infancy. There weren’t the huge number of feature-loaded options that there are now, so I chose one and I built a whole load of blogging-related functionality myself. I did it in ways that made sense at the time but no longer work well with Nanoc’s latest versions. So it’s time to move to something that has blogging baked-in from the beginning and I’m taking the opportunity to overhaul the look and feel too. Again, when I started there weren’t many pre-existing themes so I built the whole thing myself and though I’m happy with the work I did on it, it never quite felt polished enough. Now I’ve got the opportunity to adapt one of the many well-designed themes already out there, so I’ve taken one from the Hugo themes gallery and tweaked the colours to my satisfaction. Hugo also has various features that I’ve wanted to implement in Nanoc but never quite got round to. The nicest one is proper handling of draft posts and future dates, but I keep finding others. There’s a lot of old content that isn’t quite compatible with the way Hugo does things so I’ve taken the old Nanoc-compiled content and frozen it to make sure that old links still work. I could probably fiddle with it for years without doing much so it’s probably time to go ahead and publish it. I’m still not completely happy with my choice of theme but one of the joys of Hugo is that I can change that whenever I want. Let me know what you think! License Except where otherwise stated, all content on eRambler by Jez Cope is licensed under a Creative Commons Attribution-ShareAlike 4.0 International license. RDM Resources I occasionally get asked for resources to help someone learn more about research data management (RDM) as a discipline (i.e. for those providing RDM support rather than simply wanting to manage their own data). I’ve therefore collected a few resources together on this page. If you’re lucky I might even update it from time to time! First, a caveat: this is very focussed on UK Higher Education, though much of it will still be relevant for people outside that narrow demographic. My general recommendation would be to start with the Digital Curation Centre (DCC) website and follow links out from there. I also have a slowly growing list of RDM links on Diigo, and there’s an RDM section in my list of blogs and feeds too.
Mailing lists Jiscmail is a popular list server run for the benefit of further and higher education in the UK; the following lists are particularly relevant: RESEARCH-DATAMAN DATA-PUBLICATION DIGITAL-PRESERVATION LIS-RESEARCHSUPPORT The Research Data Alliance have a number of Interest Groups and Working Groups that discuss issues by email Events International Digital Curation Conference — major annual conference Research Data Management Forum — roughly every six months, places are limited! RDA Plenary — also every 6 months, but only about 1 in every 3 in Europe Books In no particular order: Martin, Victoria. Demystifying eResearch: A Primer for Librarians. Libraries Unlimited, 2014. Borgman, Christine L. Big Data, Little Data, No Data: Scholarship in the Networked World. Cambridge, Massachusetts: The MIT Press, 2015. Corti, Louise, Veerle Van den Eynden, and Libby Bishop. Managing and Sharing Research Data. Thousand Oaks, CA: SAGE Publications Ltd, 2014. Pryor, Graham, ed. Managing Research Data. Facet Publishing, 2012. Pryor, Graham, Sarah Jones, and Angus Whyte, eds. Delivering Research Data Management Services: Fundamentals of Good Practice. Facet Publishing, 2013. Ray, Joyce M., ed. Research Data Management: Practical Strategies for Information Professionals. West Lafayette, Indiana: Purdue University Press, 2014. Reports ‘Ten Recommendations for Libraries to Get Started with Research Data Management’. LIBER, 24 August 2012. http://libereurope.eu/news/ten-recommendations-for-libraries-to-get-started-with-research-data-management/. ‘Science as an Open Enterprise’. Royal Society, 2 June 2012. https://royalsociety.org/policy/projects/science-public-enterprise/Report/. Auckland, Mary. ‘Re-Skilling for Research’. RLUK, January 2012. http://www.rluk.ac.uk/wp-content/uploads/2014/02/RLUK-Re-skilling.pdf. Journals International Journal of Digital Curation (IJDC) Journal of eScience Librarianship (JeSLib) Fairphone 2: initial thoughts on the original ethical smartphone I’ve had my eye on the Fairphone 2 for a while now, and when my current phone, an aging Samsung Galaxy S4, started playing up I decided it was time to take the plunge. A few people have asked for my thoughts on the Fairphone so here are a few notes. Why I bought it The thing that sparked my interest, and the main reason for buying the phone really, was the ethical stance of the manufacturer. The small Dutch company have gone to great lengths to ensure that both labour and materials are sourced as responsibly as possible. They regularly inspect the factories where the parts are made and assembled to ensure fair treatment of the workers and they source all the raw materials carefully to minimise the environmental impact and the use of conflict minerals. Another side to this ethical stance is a focus on longevity of the phone itself. This is not a product with an intentionally limited lifespan. Instead, it’s designed to be modular and as repairable as possible, by the owner themselves. Spares are available for all of the parts that commonly fail in phones (including screen and camera), and at the time of writing the Fairphone 2 is the only phone to receive 10/10 for repairability from iFixit. There are plans to allow hardware upgrades, including an expansion port on the back so that NFC or wireless charging could be added with a new case, for example. What I like So far, the killer feature for me is the dual SIM card slots.
I have both a personal and a work phone, and the latter was always getting left at home or in the office or running out of charge. Now I have both SIMs in the one phone: I can receive calls on either number, turn them on and off independently and choose which account to use when sending a text or making a call. The OS is very close to “standard” Android, which is nice, and I really don’t miss all the extra bloatware that came with the Galaxy S4. It also has twice the storage of that phone, which is hardly unique but is still nice to have. Overall, it seems like a solid, reliable phone, though it’s not going to outperform anything else at the same price point. It certainly feels nice and snappy for everything I want to use it for. I’m no mobile gamer, but there is that distant promise of upgradability on the horizon if you are. What I don’t like I only have two bugbears so far. Once or twice it’s locked up and become unresponsive, requiring a “manual reset” (removing and replacing the battery) to get going again. It also lacks NFC, which isn’t really a deal breaker, but I was just starting to make occasional use of it on the S4 (mostly experimenting with my Yubikey NEO) and it would have been nice to try out Android Pay when it finally arrives in the UK. Overall It’s definitely a serious contender if you’re looking for a new smartphone and aren’t bothered about serious mobile gaming. You do pay a premium for the ethical sourcing and modularity, but I feel that’s worth it for me. I’m looking forward to seeing how it works out as a phone. Wiring my web I’m a nut for automating repetitive tasks, so I was dead pleased a few years ago when I discovered that IFTTT let me plug different bits of the web together. I now use it for tasks such as: Syndicating blog posts to social media Creating scheduled/repeating todo items from a Google Calendar Making a note to revisit an article I’ve starred in Feedly I’d probably only be half-joking if I said that I spend more time automating things than I save not having to do said things manually. Thankfully it’s also a great opportunity to learn, and recently I’ve been thinking about reimplementing some of my IFTTT workflows myself to get to grips with how it all works. There are some interesting open source projects designed to offer a lot of this functionality, such as Huginn, but I decided to go for a simpler option for two reasons: I want to spend my time learning about the APIs of the services I use and how to wire them together, rather than learning how to use another big framework; and I only have a small Amazon EC2 server to play with and a heavy Ruby on Rails app like Huginn (plus web server) needs more memory than I have. Instead I’ve gone old-school with a little collection of individual scripts to do particular jobs. I’m using the built-in scheduling functionality of systemd, which is already part of a modern Linux operating system, to get them to run periodically. It also means I can vary the language I use to write each one depending on the needs of the job at hand and what I want to learn/feel like at the time. Currently it’s all done in Python, but I want to have a go at Lisp sometime, and there are some interesting new languages like Go and Julia that I’d like to get my teeth into as well. You can see my code on github as it develops: https://github.com/jezcope/web-plumbing. Comments and contributions are welcome (if not expected) and let me know if you find any of the code useful.
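To give a flavour of what one of those little scripts might look like (this isn't the real web-plumbing code; the feed URL and file names are placeholders), here's a minimal Python sketch using the feedparser library that appends any new blog posts to a plain-text to-do list, ready to be run periodically by a systemd timer:

import pathlib
import feedparser  # third-party: pip install feedparser

FEED_URL = "https://example.org/index.xml"     # placeholder feed address
TODO_FILE = pathlib.Path.home() / "todo.txt"   # placeholder to-do list
SEEN_FILE = pathlib.Path.home() / ".seen-posts"

def main():
    # remember which posts we've already dealt with between runs
    seen = set(SEEN_FILE.read_text().splitlines()) if SEEN_FILE.exists() else set()
    feed = feedparser.parse(FEED_URL)
    with TODO_FILE.open("a") as todo:
        for entry in feed.entries:
            if entry.link not in seen:
                todo.write(f"Share new post: {entry.title} {entry.link}\n")
                seen.add(entry.link)
    SEEN_FILE.write_text("\n".join(sorted(seen)) + "\n")

if __name__ == "__main__":
    main()

A user-level systemd timer (systemctl --user enable --now some-job.timer, with whatever unit names you choose) then takes the place of IFTTT's scheduling.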
Image credit: xkcd #1319, Automation Data is like water, and language is like clothing I admit it: I’m a grammar nerd. I know the difference between ‘who’ and ‘whom’, and I’m proud. I used to be pretty militant, but these days I’m more relaxed. I still take joy in the mechanics of the language, but I also believe that English is defined by its usage, not by a set of arbitrary rules. I’m just as happy to abuse it as to use it, although I still think it’s important to know what rules you’re breaking and why. My approach now boils down to this: language is like clothing. You (probably) wouldn’t show up to a job interview in your pyjamas1, but neither are you going to wear a tuxedo or ballgown to the pub. Getting commas and semicolons in the right place is like getting your shirt buttons done up right. Getting it wrong doesn’t mean you’re an idiot. Everyone will know what you meant. It will affect how you’re perceived, though, and that will affect how your message is perceived. And there are former rules2 that some still enforce that are nonetheless dropping out of regular usage. There was a time when everyone in an office job wore formal clothing. Then it became acceptable just to have a blouse, or a shirt and tie. Then the tie became optional and now there are many professions where perfectly well-respected and competent people are expected to show up wearing nothing smarter than jeans and a t-shirt. One such rule IMHO is that ‘data’ is a plural and should take pronouns like ‘they’ and ‘these’. The origin of the word ‘data’ is in the Latin plural of ‘datum’, and that idea has clung on for a considerable period. But we don’t speak Latin and the English language continues to evolve: ‘agenda’ also began life as a Latin plural, but we don’t use the word ‘agendum’ any more. It’s common everyday usage to refer to data with singular pronouns like ‘it’ and ‘this’, and it’s very rare to see someone referring to a single datum (as opposed to ‘data point’ or something). If you want to get technical, I tend to think of data as a mass noun, like ‘water’ or ‘information’. It’s uncountable: talking about ‘a water’ or ‘an information’ doesn’t make much sense, but it uses singular pronouns, as in ‘this information’. If you’re interested, the Oxford English Dictionary also takes this position, while Chambers leaves the choice of singular or plural noun up to you. There is absolutely nothing wrong, in my book, with referring to data in the plural as many people still do. But it’s no longer a rule and for me it’s weakened further from guideline to preference. It’s like wearing a bow-tie to work. There’s nothing wrong with it and some people really make it work, but it’s increasingly outdated and even a little eccentric. or maybe you’d totally rock it. ↩︎ Like not starting a sentence with a conjunction… ↩︎ #IDCC16 day 2: new ideas Well, I did a great job of blogging the conference for a couple of days, but then I was hit by the bug that’s been going round and didn’t have a lot of energy for anything other than paying attention and making notes during the day! I’ve now got round to reviewing my notes so here are a few reflections on day 2. Day 2 was the day of many parallel talks! So many great and inspiring ideas to take in! Here are a few of my take-home points. Big science and the long tail The first parallel session had examples of practical data management in the real world. 
Jian Qin & Brian Dobreski (School of Information Studies, Syracuse University) worked on reproducibility with one of the research groups involved with the recent gravitational wave discovery. “Reproducibility” for this work (as with much of physics) mostly equates to computational reproducibility: tracking the provenance of the code and its input and output is key. They also found that in practice the scientists' focus was on making the big discovery, and ensuring reproducibility was seen as secondary. This goes some way to explaining why current workflows and tools don’t really capture enough metadata. Milena Golshan & Ashley Sands (Center for Knowledge Infrastructures, UCLA) investigated the use of Software-as-a-Service (SaaS, such as Google Drive, Dropbox or more specialised tools) as a way of meeting the needs of long-tail science research such as ocean science. This research is characterised by small teams, diverse data, dynamic local development of tools, local practices and difficulty disseminating data. This results in a need for researchers to be generalists, as opposed to “big science” research areas, where they can afford to specialise much more deeply. Such generalists tend to develop their own isolated workflows, which can differ greatly even within a single lab. Long-tail research also often suffers from a lack of dedicated IT support. They found that use of SaaS could help to meet these challenges, but with a high cost required to cover the needed guarantees of security and stability. Education & training This session focussed on the professional development of library staff. Eleanor Mattern (University of Pittsburgh) described the immersive training introduced to improve librarians' understanding of the data needs of their subject areas as part of their RDM service delivery model. The participants each conducted a “disciplinary deep dive”, shadowing researchers and then reporting back to the group on their discoveries with a presentation and discussion. Liz Lyon (also University of Pittsburgh, formerly UKOLN/DCC) gave a systematic breakdown of the skills, knowledge and experience required in different data-related roles, obtained from an analysis of job adverts. She identified distinct roles of data analyst, data engineer and data journalist, and, as well as each role’s distinctive skills, pinpointed common requirements of all three: Python, R, SQL and Excel. This work follows on from an earlier phase which identified an allied set of roles: data archivist, data librarian and data steward. Data sharing and reuse This session gave an overview of several specific workflow tools designed for researchers. Marisa Strong (University of California Curation Centre/California Digital Library) presented Dash, a highly modular tool for manual data curation and deposit by researchers. It’s built on their flexible backend, Stash, and though it’s currently optimised to deposit in their Merritt data repository it could easily be hooked up to other repositories. It captures DataCite metadata and a few other fields, and is integrated with ORCID to uniquely identify people. In a different vein, Eleni Castro (Institute for Quantitative Social Science, Harvard University) discussed some of the ways that Harvard’s Dataverse repository is streamlining deposit by enabling automation. It provides a number of standardised endpoints such as OAI-PMH for metadata harvest and SWORD for deposit, as well as custom APIs for discovery and deposit.
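To show what those standard endpoints make possible, here is a minimal sketch of harvesting Dublin Core metadata over OAI-PMH with Python's requests library. The verb and metadataPrefix parameters are part of the OAI-PMH standard itself, but the base URL below is an unverified guess at the Dataverse endpoint, so treat it as a placeholder.

import requests
import xml.etree.ElementTree as ET

# Placeholder endpoint; check the repository's documentation for the real one.
BASE_URL = "https://dataverse.harvard.edu/oai"

response = requests.get(
    BASE_URL,
    params={"verb": "ListRecords", "metadataPrefix": "oai_dc"},
    timeout=30,
)
response.raise_for_status()

# OAI-PMH responses are XML; as a quick demo, print each record identifier.
root = ET.fromstring(response.content)
for identifier in root.iter("{http://www.openarchives.org/OAI/2.0/}identifier"):
    print(identifier.text)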
Interesting use cases include: An addon for the Open Science Framework to deposit in Dataverse via SWORD An R package to enable automatic deposit of simulation and analysis results Integration with publisher workflows such as Open Journal Systems A growing set of visualisations for deposited data In the future they’re also looking to integrate with DMPtool to capture data management plans and with Archivematica for digital preservation. Andrew Treloar (Australian National Data Service) gave us some reflections on the ANDS “applications programme”, a series of 25 small funded projects intended to address the fourth of their strategic transformations, single use → reusable. He observed that essentially these projects worked because they were able to throw money at a problem until they found a solution: not very sustainable. Some of them stuck to a traditional “waterfall” approach to project management, resulting in “the right solution 2 years late”. Every researcher’s needs are “special” and communities are still constrained by old ways of working. The conclusions from this programme were that: “Good enough” is fine most of the time Adopt/Adapt/Augment is better than Build Existing toolkits let you focus on the 10% functionality that’s missing Successful projects involved research champions who can: 1) articulate their community’s requirements; and 2) promote project outcomes Summary All in all, it was a really exciting conference, and I’ve come home with loads of new ideas and plans to develop our services at Sheffield. I noticed a continuation of some of the trends I spotted at last year’s IDCC, especially an increasing focus on “second-order” problems: we’re no longer spending most of our energy just convincing researchers to take data management seriously and are able to spend more time helping them to do it better and get value out of it. There’s also a shift in emphasis (identified by closing speaker Cliff Lynch) from sharing to reuse, and making sure that data is not just available but valuable. #IDCC16 Day 1: Open Data The main conference opened today with an inspiring keynote by Barend Mons, Professor in Biosemantics, Leiden University Medical Center. The talk had plenty of great stuff, but two points stood out for me. First, Prof Mons described a newly discovered link between Huntington’s Disease and a previously unconsidered gene. No-one had previously recognised this link, but on mining the literature, an indirect link was identified in more than 10% of the roughly 1 million scientific claims analysed. This is knowledge for which we already had more than enough evidence, but which could never have been discovered without such a wide-ranging computational study. Second, he described a number of behaviours which should be considered “malpractice” in science: Relying on supplementary data in articles for data sharing: the majority of this is trash (paywalled, embedded in bitmap images, missing) Using the Journal Impact Factor to evaluate science and ignoring altmetrics Not writing data stewardship plans for projects (he prefers this term to “data management plan”) Obstructing tenure for data experts by assuming that all highly-skilled scientists must have a long publication record A second plenary talk from Andrew Sallons of the Centre for Open Science introduced a number of interesting-looking bits and bobs, including the Transparency & Openness Promotion (TOP) Guidelines which set out a pathway to help funders, publishers and institutions move towards more open science.
The rest of the day was taken up with a panel on open data, a poster session, some demos and a birds-of-a-feather session on sharing sensitive/confidential data. There was a great range of posters, but a few that stood out to me were: Lessons learned about ISO 16363 (“Audit and certification of trustworthy digital repositories”) certification from the British Library Two separate posters (from the Universities of Toronto and Colorado) about disciplinary RDM information & training for liaison librarians A template for sharing psychology data developed by a psychologist-turned-information researcher from Carnegie Mellon University More to follow, but for now it’s time for the conference dinner! #IDCC16 Day 0: business models for research data management I’m at the International Digital Curation Conference 2016 (#IDCC16) in Amsterdam this week. It’s always a good opportunity to pick up some new ideas and catch up with colleagues from around the world, and I always come back full of new possibilities. I’ll try and do some more reflective posts after the conference but I thought I’d do some quick reactions while everything is still fresh. Monday and Thursday are pre- and post-conference workshop days, and today I attended Developing Research Data Management Services. Joy Davidson and Jonathan Rans from the Digital Curation Centre (DCC) introduced us to the Business Model Canvas, a template for designing a business model on a single sheet of paper. The model prompts you to think about all of the key facets of a sustainable, profitable business, and can easily be adapted to the task of building a service model within a larger institution. The DCC used it as part of the Collaboration to Clarify Curation Costs (4C) project, whose output, the Curation Costs Exchange, is also worth a look. It was a really useful exercise to be able to work through the whole process for an aspect of research data management (my table focused on training & guidance provision), both because of the ideas that came up and also the experience of putting the framework into practice. It seems like a really valuable tool and I look forward to seeing how it might help us with our RDM service development. Tomorrow the conference proper begins, with a range of keynotes, panel sessions and birds-of-a-feather meetings so hopefully more then! About me I help people in Higher Education communicate and collaborate more effectively using technology. I currently work at the University of Sheffield focusing on research data management policy, practice, training and advocacy. In my free time, I like to: run; play the accordion; morris dance; climb; cook; read (fiction and non-fiction); write. Better Science Through Better Data #scidata17 Better Science through Better Doughnuts (image credit: Jez Cope) Update: fixed the link to the slides so it works now! Last week I had the honour of giving my first ever keynote talk, at an event entitled Better Science Through Better Data hosted jointly by Springer Nature and the Wellcome Trust. It was nerve-wracking but exciting and seemed to go down fairly well. I even got accidentally awarded a PhD in the programme — if only it was that easy! The slides for the talk, “Supporting Open Research: The role of an academic library”, are available online (doi:10.15131/shef.data.5537269), and the whole event was video’d for posterity and viewable online. I got some good questions too, mainly from the clever online question system.
I didn’t get to answer all of them, so I’m thinking of doing a blog post or two to address a few more. There were loads of other great presentations as well, both keynotes and 7-minute lightning talks, so I’d encourage you to take a look at at least some of it. I’ll pick out a few of my highlights. Dr Aled Edwards (University of Toronto) There’s a major problem with science funding that I hadn’t really thought about before. The available funding pool for research is divided up into pots by country, and often by funding body within a country. Each of these pots has robust processes to award funding to the most important problems and most capable researchers. The problem comes because there is no coordination between these pots, so researchers all over the world end up getting funded to research the most popular problems, leading to a lot of duplication of effort. Industry funding suffers from a similar problem, particularly the pharmaceutical industry. Because there is no sharing of data or negative results, multiple companies spend billions researching the same dead ends chasing after the same drugs. This is where the astronomical costs of drug development come from. Dr Edwards presented one alternative, modelled by a company called M4K Pharma. The idea is to use existing IP laws to try and give academic researchers a reasonable, morally-justifiable and sustainable profit on drugs they develop, in contrast to the current model where basic research is funded by governments while large corporations hoover up as much profit as they possibly can. This new model would develop drugs all the way to human trial within academia, then license the resulting drugs to companies to manufacture with a price cap to keep the medicines affordable to all who need them. Core to this effort is openness with data, materials and methodology, and Dr Edwards presented several examples of how this approach benefited academic researchers, industry and patients compared with a closed, competitive focus. Dr Kirstie Whitaker (Alan Turing Institute) This was a brilliant presentation, presenting a practical how-to guide to doing reproducible research, from one researcher to another. I suggest you take a look at her slides yourself: Showing your working: a how-to guide to reproducible research. Dr Whitaker briefly addressed a number of common barriers to reproducible research: Is not considered for promotion: so it should be! Held to higher standards than others: reviewers should be discouraged from nitpicking just because the data/code/whatever is available (true unbiased peer review of these would be great though) Publication bias towards novel findings: it is morally wrong to not publish reproductions, replications etc. so we need to address the common taboo on doing so Plead the 5th: if you share, people may find flaws, but if you don’t they can’t — if you’re worried about this you should ask yourself why! Support additional users: some (much?) of the burden should reasonably fall on the reuser, not the sharer Takes time: this is only true if you hack it together after the fact; if you do it from the start, the whole process will be quicker!
- Requires additional skills: important to provide training, but also to judge PhD students on their ability to do this, not just on their thesis & papers

The rest of the presentation, the “how-to” guide of the title, was a well-chosen and passionately delivered set of recommendations, but the thing that really stuck out for me is how good Dr Whitaker is at making the point that you only have to do one of these things to improve the quality of your research. It’s easy to get the impression at the moment that you have to be fully, perfectly open or not at all, but it’s actually OK to get there one step at a time, or even not to go all the way at all! Anyway, I think this is a slide deck that speaks for itself, so I won’t say any more!

Lightning talk highlights

There was plenty of good stuff in the lightning talks, which were constrained to 7 minutes each, but a few of the things that stood out for me were, in no particular order:

- Code Ocean — share and run code in the cloud
- dat project — peer-to-peer data synchronisation tool. Can automate metadata creation, data syncing, versioning. Set up a secure data sharing network that keeps the data in sync but off the cloud
- Berlin Institute of Health — open science course for students. Pre-print paper. Course materials
- InterMine — taking the pain out of data cleaning & analysis
- Nix/NixOS as a component of a reproducible paper
- BoneJ (ImageJ plugin for bone analysis) — developed by a scientist, used a lot, now has a Wellcome-funded RSE to develop the next version
- ESASky — amazing live, online archive of masses of astronomical data

Coda

I really enjoyed the event (and the food was excellent too). My thanks go out to:

- The programme committee for asking me to come and give my take — I hope I did it justice!
- The organising team who did a brilliant job of keeping everything running smoothly before and during the event
- The University of Sheffield for letting me get away with doing things like this!

Blog platform switch

I’ve just switched my blog over to the Nikola static site generator. Hopefully you won’t notice a thing, but there might be a few weird spectres around til I get all the kinks ironed out. I’ve made the switch for a couple of main reasons:

- Nikola supports Jupyter notebooks as a source format for blog posts, which will be useful to include code snippets
- It’s written in Python, a language which I actually know, so I’m more likely to be able to fix things that break, customise it and potentially contribute to the open source project (by contrast, Hugo is written in Go, which I’m not really familiar with)

Chat rooms vs Twitter: how I communicate now

CC0, Pixabay

This time last year, Brad Colbow published a comic in his “The Brads” series entitled “The long slow death of Twitter”. It really encapsulates the way I’ve been feeling about Twitter for a while now. Go ahead and take a look. I’ll still be here when you come back. According to my Twitter profile, I joined in February 2009 as user #20,049,102. It was nearing its 3rd birthday and, though there were clearly a lot of people already signed up at that point, it was still relatively quiet, especially in the UK. I was a lonely PhD student just starting to get interested in educational technology, and one thing that Twitter had in great supply was (and still is) people pushing back the boundaries of what tech can do in different contexts.
Somewhere along the way Twitter got really noisy, partly because more people (especially commercial companies) are using it more to talk about stuff that doesn’t interest me, and partly because I now follow 1,200+ people and find I get several tweets a second at peak times, which no-one could be expected to handle. More recently I’ve found my attention drawn to more focussed communities instead of that big old shouting match. I find I’m much more comfortable discussing things and asking questions in small focussed communities because I know who might be interested in what. If I come across an article about a cool new Python library, I’ll geek out about it with my research software engineer friends; if I want advice on an aspect of my emacs setup, I’ll ask a bunch of emacs users. I feel like I’m talking to people who want to hear what I’m saying. Next to that experience, Twitter just feels like standing on a street corner shouting. IRC channels (mostly on Freenode), and similar things like Slack and gitter form the bulk of this for me, along with a growing number of WhatsApp group chats. Although online chat is theoretically a synchronous medium, I find that I can treat it more as “semi-synchronous”: I can have real-time conversations as they arise, but I can also close them and tune back in later to catch up if I want. Now I come to think about it, this is how I used to treat Twitter before the 1,200 follows happened. I also find I visit a handful of forums regularly, mostly of the Reddit link-sharing or StackExchange Q&A type. /r/buildapc was invaluable when I was building my latest box, /r/EarthPorn (very much not NSFW) is just beautiful. I suppose the risk of all this is that I end up reinforcing my own echo chamber. I’m not sure how to deal with that, but I certainly can’t deal with it while also suffering from information overload. Not just certifiable… A couple of months ago, I went to Oxford for an intensive, 2-day course run by Software Carpentry and Data Carpentry for prospective new instructors. I’ve now had confirmation that I’ve completed the checkout procedure so it’s official: I’m now a certified Data Carpentry instructor! As far as I’m aware, the certification process is now combined, so I’m also approved to teach Software Carpentry material too. And of course there’s Library Carpentry too… SSI Fellowship 2020 I’m honoured and excited to be named one of this year’s Software Sustainability Institute Fellows. There’s not much to write about yet because it’s only just started, but I’m looking forward to sharing more with you. In the meantime, you can take a look at the 2020 fellowship announcement and get an idea of my plans from my application video: Talks Here is a selection of talks that I’ve given. Intro to the fediverse Wow, it turns out to be 10 years since I wrote this beginners guide to Twitter. Things have moved on a loooooong way since then. Far from being the interesting, disruptive technology it was back then, Twitter has become part of the mainstream, the establishment. Almost everyone and everything is on Twitter now, which has both pros and cons. So what’s the problem?
It’s now possible to follow all sorts of useful information feeds, from live updates on transport delays to your favourite sports team’s play-by-play performance to an almost infinite number of cat pictures. In my professional life it’s almost guaranteed that anyone I meet will be on Twitter, meaning that I can contact them to follow up at a later date without having to exchange contact details (and they have options to block me if they don’t like that). On the other hand, a medium where everyone’s opinion is equally valid regardless of knowledge or life experience has turned some parts of the internet into a toxic swamp of hatred and vitriol. It’s easier than ever to forget that we have more common ground with any random stranger than we have similarities, and that’s led to some truly awful acts and a poisonous political arena. Part of the problem here is that each of the social media platforms is controlled by a single entity with almost no accountability to anyone other than shareholders. Technological change has been so rapid that the regulatory regime has no idea how to handle them, leaving them largely free to operate how they want. This has led to a whole heap of nasty consequences that many other people have done a much better job of documenting than I could (Shoshana Zuboff’s book The Age of Surveillance Capitalism is a good example). What I’m going to focus on instead are some possible alternatives. If you accept the above argument, one obvious solution is to break up the effective monopoly enjoyed by Facebook, Twitter et al. We need to be able to retain the wonderful affordances of social media but democratise control of it, so that it can never be dominated by a small number of overly powerful players. What’s the solution? There’s actually a thing that already exists, that almost everyone is familiar with and that already works like this. It’s email. There are a hundred thousand email servers, but my email can always find your inbox if I know your address because that address identifies both you and the email service you use, and they communicate using the same protocol, Simple Mail Transfer Protocol (SMTP)1. I can’t send a message to your Twitter from my Facebook though, because they’re completely incompatible, like oil and water. Facebook has no idea how to talk to Twitter and vice versa (and the companies that control them have zero interest in such interoperability anyway). Just like email, a federated social media service like Mastodon allows you to use any compatible server, or even run your own, and follow accounts on your home server or anywhere else, even servers running different software as long as they use the same ActivityPub protocol. There’s no lock-in because you can move to another server any time you like, and interact with all the same people from your new home, just like changing your email address. Smaller servers mean that no one server ends up with enough power to take over and control everything, as the social media giants do with their own platforms. But at the same time, a small server with a small moderator team can enforce local policy much more easily and block accounts or whole servers that host trolls, nazis or other poisonous people. How do I try it? I have no problem with anyone for choosing to continue to use what we’re already calling “traditional” social media; frankly, Facebook and Twitter are still useful for me to keep in touch with a lot of my friends. 
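To make the email analogy concrete, here is a minimal sketch of the lookup a federated server performs to find someone on another server — the cross-server step that closed platforms simply don’t have. It assumes the Python requests library and a made-up address, alice@example.social; Mastodon and other ActivityPub servers answer exactly this kind of WebFinger query.

```python
# Minimal sketch: how one fediverse server finds another user's ActivityPub
# profile, much as SMTP servers route mail by the domain in an email address.
# Assumes the `requests` library; alice@example.social is a made-up address.
import requests

user, domain = "alice", "example.social"

# Step 1: WebFinger lookup on the user's home server
webfinger = requests.get(
    f"https://{domain}/.well-known/webfinger",
    params={"resource": f"acct:{user}@{domain}"},
    timeout=10,
).json()

# Step 2: pick out the ActivityPub actor document from the returned links
actor_url = next(
    link["href"]
    for link in webfinger["links"]
    if link.get("rel") == "self" and link.get("type") == "application/activity+json"
)

# Step 3: fetch the actor document itself (profile, inbox, outbox, ...)
actor = requests.get(
    actor_url, headers={"Accept": "application/activity+json"}, timeout=10
).json()
print(actor.get("preferredUsername"), actor.get("inbox"))
```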
However, I do think it’s useful to know some of the alternatives if only to make a more informed decision to stick with your current choices. Most of these services only ask for an email address when you sign up, and use of your real name vs a pseudonym is entirely optional, so there’s not really any risk in signing up and giving one a try. That said, make sure you take sensible precautions like not reusing a password from another account.

Instead of…                          Try…
Twitter, Facebook                    Mastodon, Pleroma, Misskey
Slack, Discord, IRC                  Matrix
WhatsApp, FB Messenger, Telegram     Also Matrix
Instagram, Flickr                    PixelFed
YouTube                              PeerTube
The web                              Interplanetary File System (IPFS)

1. Which, if you can believe it, was formalised nearly 40 years ago in 1982 and has only had fairly minor changes since then! ↩︎

Collaborations Workshop 2021: collaborative ideas & hackday

My last post covered the more “traditional” lectures-and-panel-sessions approach of the first half of the SSI Collaborations Workshop. The rest of the workshop was much more interactive, consisting of a discussion session, a Collaborative Ideas session, and a whole-day hackathon! The discussion session on day one had us choose a topic (from a list of topics proposed leading up to the workshop) and join a breakout room for that topic with the aim of producing a “speed blog” by the end of 90 minutes. Those speed blogs will be published on the SSI blog over the coming weeks, so I won’t go into that in more detail. The Collaborative Ideas session is a way of generating hackday ideas, by putting people together at random into small groups to each raise a topic of interest to them before discussing and coming up with a combined idea for a hackday project. Because of the serendipitous nature of the groupings, it’s a really good way of generating new ideas from unexpected combinations of individual interests. After that, all the ideas from the session, along with a few others proposed by various participants, were pitched as ideas for the hackday and people started to form teams. Not every idea pitched gets worked on during the hackday, but in the end 9 teams of roughly equal size formed to spend the third day working together.

My team’s project: “AHA! An Arts & Humanities Adventure”

There’s a lot of FOMO around choosing which team to join for an event like this: there were so many good ideas and I wanted to work on several of them! In the end I settled on a team developing an escape room concept to help Arts & Humanities scholars understand the benefits of working with research software engineers for their research. Five of us rapidly mapped out an example storyline for an escape room, got a website set up with GitHub and populated it with the first few stages of the game. We decided to focus on a story that would help the reader get to grips with what an API is, and I’m amazed how much we managed to get done in less than a day’s work! You can try playing through the escape room (so far) yourself on the web, or take a look at the GitHub repository, which contains the source of the website along with a list of outstanding tasks to work on if you’re interested in contributing. I’m not sure yet whether this project has enough momentum to keep going, but it was a really valuable way both of getting to know and building trust with some new people and demonstrating the concept is worth more work.

Other projects

Here’s a brief rundown of the other projects worked on by teams on the day.
Coding Confessions: Everyone starts somewhere and everyone cuts corners from time to time. Real developers copy and paste! Fight imposter syndrome by looking through some of these confessions or contributing your own. https://coding-confessions.github.io/

CarpenPI: A template to set up a Raspberry Pi with everything you need to run a Carpentries (https://carpentries.org/) data science/software engineering workshop in a remote location without internet access. https://github.com/CarpenPi/docs/wiki

Research Dugnads: A guide to running an event that is a coming together of a research group or team to share knowledge, pass on skills, tidy and review code, among other software and working best practices (based on the Norwegian concept of a dugnad, a form of “voluntary work done together with other people”). https://research-dugnads.github.io/dugnads-hq/

Collaborations Workshop ideas: A meta-project to collect together pitches and ideas from previous Collaborations Workshop conferences and hackdays, to analyse patterns and revisit ideas whose time might now have come. https://github.com/robintw/CW-ideas

howDescribedIs: Improve the machine-readable metadata attached to open research projects by integrating existing tools like SOMEF, codemeta.json and HowFAIRIs (https://howfairis.readthedocs.io/en/latest/index.html). Complete with CI and badges! https://github.com/KnowledgeCaptureAndDiscovery/somef-github-action

Software end-of-project plans: Develop a template to plan and communicate what will happen when the fixed-term project funding for your research software ends. Will maintenance continue? When will the project sunset? Who owns the IP? https://github.com/elichad/software-twilight

Habeas Corpus: A corpus of machine-readable data about software used in COVID-19 related research, based on the CORD19 dataset. https://github.com/softwaresaved/habeas-corpus

Credit-all: Extend the all-contributors GitHub bot (https://allcontributors.org/) to include rich information about research project contributions, such as the CASRAI Contributor Roles Taxonomy (https://casrai.org/credit/). https://github.com/dokempf/credit-all

I’m excited to see so many metadata-related projects! I plan to take a closer look at what the Habeas Corpus, Credit-all and howDescribedIs teams did when I get time. I also really want to try running a dugnad with my team or for the GLAM Data Science network.

Collaborations Workshop 2021: talks & panel session

I’ve just finished attending (online) the three days of this year’s SSI Collaborations Workshop (CW for short), and once again it’s been a brilliant experience, as well as mentally exhausting, so I thought I’d better get a summary down while it’s still fresh in my mind. Collaborations Workshop is, as the name suggests, much more focused on facilitating collaborations than a typical conference, and has settled into a structure that starts off with longer keynotes and lectures, and progressively gets more interactive, culminating with a hack day on the third day. That’s a lot to write about, so for this post I’ll focus on the talks and panel session, and follow up with another post about the collaborative bits. I’ll also probably need to come back and add in more links to bits and pieces once slides and the “official” summary of the event become available.
Updates

2021-04-07: Added links to recordings of keynotes and panel sessions

Provocations

The first day began with two keynotes on this year’s main themes: FAIR Research Software and Diversity & Inclusion, and day 2 had a great panel session focused on disability. All three were streamed live and the recordings remain available on YouTube:

- View the keynotes recording; Google-free alternative link
- View the panel session recording; Google-free alternative link

FAIR Research Software

Dr Michelle Barker, Director of the Research Software Alliance, spoke on the challenges to recognition of software as part of the scholarly record: software is not often cited. The FAIR4RS working group has been set up to investigate and create guidance on how the FAIR Principles for data can be adapted to research software as well; as they stand, the Principles are not ideally suited to software. This work will only be the beginning though, as we will also need metrics, training, career paths and much more. ReSA itself has 3 focus areas: people, policy and infrastructure. If you’re interested in getting more involved in this, you can join the ReSA email list.

Equality, Diversity & Inclusion: how to go about it

Dr Chonnettia Jones, Vice President of Research, Michael Smith Foundation for Health Research, spoke extensively and persuasively on the need for Equality, Diversity & Inclusion (EDI) initiatives within research, as there is abundant robust evidence that all research outcomes are improved. She highlighted the difficulties current approaches to EDI have in effecting structural change, and in changing not just individual behaviours but the cultures & practices that perpetuate iniquity. While initiatives are often constructed around making up for individual deficits, a better framing is to start from an understanding of individuals having equal stature but different lived experiences. Commenting on the current focus on “research excellence”, she pointed out that the hyper-competition this promotes is deeply unhealthy, suggesting instead that true excellence requires diversity, and we should focus on an inclusive excellence driven by inclusive leadership.

Equality, Diversity & Inclusion: disability issues

Day 2’s EDI panel session brought together five disabled academics to discuss the problems of disability in research:

- Dr Becca Wilson, UKRI Innovation Fellow, Institute of Population Health Science, University of Liverpool (Chair)
- Phoenix C S Andrews (PhD Student, Information Studies, University of Sheffield and Freelance Writer)
- Dr Ella Gale (Research Associate and Machine Learning Subject Specialist, School of Chemistry, University of Bristol)
- Prof Robert Stevens (Professor and Head of Department of Computer Science, University of Manchester)
- Dr Robin Wilson (Freelance Data Scientist and SSI Fellow)

NB. The discussion flowed quite freely, so the following summary mixes up input from all the panel members.

Researchers are often assumed to be single-minded in following their research calling, and aptness for jobs is often partly judged on “time served”, which disadvantages any disabled person who has been forced to take a career break. On top of this, disabled people are often time-poor because of the extra time needed to manage their condition, leaving them with less “output” to show for their time served on many common metrics. This can particularly affect early-career researchers, since resources for these are often restricted on a “years-since-PhD” criterion.
Time poverty also makes funding with short deadlines that much harder to apply for. Employers add more demands right from the start: new starters are typically expected to complete a health and safety form, generally a brief affair that will suddenly become an 80-page bureaucratic nightmare if you tick the box declaring a disability. Many employers claim to be inclusive yet utterly fail to understand the needs of their disabled staff. Wheelchairs are liberating for those who use them (despite the awful but common phrase “wheelchair-bound”) and yet employers will refuse to insure a wheelchair while travelling for work, classifying it as a “high value personal item” that the owner would take the same responsibility for as an expensive camera. Computers open up the world for blind people in a way that was never possible without them, but it’s not unusual for mandatory training to be inaccessible to screen readers. Some of these barriers can be overcome, but doing so takes yet more time that could and should be spent on more important work. What can we do about it? Academia works on patronage whether we like it or not, so be the person who supports people who are different to you rather than focusing on the one you “recognise yourself in” to mentor. As a manager, it’s important to ask each individual what they need and believe them: they are the expert in their own condition and their lived experience of it. Don’t assume that because someone else in your organisation with the same disability needs one set of accommodations, it’s invalid for your staff member to require something totally different. And remember: disability is unusual as a protected characteristic in that anyone can acquire it at any time without warning! Lightning talks Lightning talk sessions are always tricky to summarise, and while this doesn’t do them justice, here are a few highlights from my notes. Data & metadata Malin Sandstrom talked about a much-needed refinement of contributor role taxonomies for scientific computing Stephan Druskat showcased a project to crowdsource a corpus of research software for further analysis Learning & teaching/community Matthew Bluteau introduced the concept of the “coding dojo” as a way to enhance community of practice. A group of coders got together to practice & learn by working together to solve a problem and explaining their work as they go He described 2 models: a code jam, where people work in small groups, and the Randori method, where 2 people do pair programming while the rest observe. I’m excited to try this out! Steve Crouch talked about intermediate skills and helping people take the next step, which I’m also very interested in with the GLAM Data Science network Esther Plomp recounted experience of running multiple Carpentry workshops online, while Diego Alonso Alvarez discussed planned workshops on making research software more usable with GUIs Shoaib Sufi showcased the SSI’s new event organising guide Caroline Jay reported on a diary study into autonomy & agency in RSE during COVID Lopez, T., Jay, C., Wermelinger, M., & Sharp, H. (2021). How has the covid-19 pandemic affected working conditions for research software engineers? Unpublished manuscript. Wrapping up That’s not everything! But this post is getting pretty long so I’ll wrap up for now. I’ll try to follow up soon with a summary of the “collaborative” part of Collaborations Workshop: the idea-generating sessions and hackday! Time for a new look... 
I’ve decided to try switching this website back to using Hugo to manage the content and generate the static HTML pages. I’ve been on the Python-based Nikola for a few years now, but recently I’ve been finding it quite slow, and very confusing to understand how to do certain things. I used Hugo recently for the GLAM Data Science Network website and found it had come on a lot since the last time I was using it, so I thought I’d give it another go, and redesign this site to be a bit more minimal at the same time. The theme is still a work in progress so it’ll probably look a bit rough around the edges for a while, but I think I’m happy enough to publish it now. When I get round to it I might publish some more detailed thoughts on the design.

Ideas for Accessible Communications

The Disability Support Network at work recently ran a survey on “accessible communications”, to develop guidance on how to make communications (especially internal staff comms) more accessible to everyone. I grabbed a copy of my submission because I thought it would be useful to share more widely, so here it is. Please note that these are based on my own experiences only. I am in no way suggesting that these are the only things you would need to do to ensure your communications are fully accessible. They’re just some things to keep in mind.

Policies/procedures/guidance can be stressful to use if anything is vague or inconsistent, or if it looks like there might be more information implied than is explicitly given (a common cause of this is use of jargon in e.g. HR policies). Emails relating to these policies have similar problems, made worse because they tend to be very brief. Online meetings can be very helpful, but can also be exhausting, especially if there are too many people, or not enough structure. Larger meetings & webinars without agendas (or where the agenda is ignored, or timings are allowed to drift without acknowledgement) are very stressful, as are those where there is not enough structure to ensure fair opportunities to contribute.

Written reference documents and communications should:

- Be carefully checked for consistency and clarity
- Have all key points explicitly stated
- Explicitly acknowledge the need for flexibility where it is necessary, rather than implying or hinting at it
- Clearly define jargon & acronyms where they are necessary to the point being made, and avoid them otherwise
- Include links to longer, more explicit versions where space is tight
- Provide clear bullet-point summaries with links to the details

Online meetings should:

- Include sufficient break time (at least 10 minutes out of every hour) and not allow this to be compromised just because a speaker has misjudged the length of their talk
- Include initial “settling-in” time in agendas to avoid timing getting messed up from the start
- Ensure the agenda is stuck to, or that divergence from the agenda is acknowledged explicitly by the chair and updated timing briefly discussed to ensure everyone is clear
- Establish a norm for participation at the start of the meeting and stick to it, e.g.
  ask people to raise hands when they have a point to make, or have specific time for round-robin contributions
- Ensure quiet/introverted people have space to contribute, but don’t force them to do so if they have nothing to add at the time
- Offer a text-based alternative to contributing verbally
- If appropriate, at the start of the meeting assign specific roles of:
  - Gatekeeper: ensures everyone has a chance to contribute
  - Timekeeper: ensures the meeting runs to time
  - Scribe: ensures a consistent record of the meeting
- Be chaired by someone with the confidence to enforce the above: offer training to all staff on chairing meetings to ensure everyone has the skills to run a meeting effectively

Matrix self-hosting

I started running my own Matrix server a little while ago. Matrix is something rather cool, a chat system similar to IRC or Slack, but open and federated. Open in that the standard is available for anyone to view, but also the reference implementations of server and client are open source, along with many other clients and a couple of nascent alternative servers. Federated in that, like email, it doesn’t matter what server you sign up with, you can talk to users on your own or any other server. I decided to host my own for three reasons. Firstly, to see if I could and to learn from it. Secondly, to try and rationalise the Cambrian explosion of Slack teams I was being added to in 2019. Thirdly, to take some control of the loss of access to historical messages in some communities that rely on Slack (especially the Carpentries and RSE communities). Since then, I’ve also added a fourth goal: taking advantage of various bridges to bring the other messaging networks I use (such as Signal and Telegram) into a consistent UI. I’ve also found that my use of Matrix-only rooms has grown as more individuals & communities have adopted the platform. So, I really like Matrix and I use it daily. My problem now is whether to keep self-hosting. Synapse, the only full server implementation at the moment, is really heavy on memory, so I’ve ended up running it on a much bigger server than I thought I’d need, which seems overkill for a single-user instance. So now I have to make a decision about whether it’s worth keeping going, or shutting it down and going back to matrix.org, or setting up on one of the other servers that have sprung up in the last couple of years. There are a couple of other considerations here. Firstly, Synapse resource usage is entirely down to the size of the rooms joined by users of the homeserver, not directly the number of users. So if users have mostly overlapping interests, and thus keep to the same rooms, you can support quite a large community without significant extra resource usage. Secondly, there are a couple of alternative server implementations in development specifically addressing this issue for small servers: Dendrite and Conduit. Neither is quite ready for what I want yet, but both are getting close, and when ready they will allow running small homeservers with much more sensible resource usage. So I could start opening up for other users, and at least justify the size of the server that way. I wouldn’t ever want to make it a paid-for service but perhaps people might be willing to make occasional donations towards running costs. That still leaves me with the question of whether I’m comfortable running a service that others may come to rely on, or being responsible for the safety of their information.
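As an aside, one nice thing about Matrix being an ordinary HTTP-based protocol is that you can poke at any homeserver from the outside, which is handy when weighing up matrix.org against self-hosting. A minimal sketch, assuming the Python requests library and using example.org as a placeholder domain:

```python
# Minimal sketch: check a Matrix homeserver from the outside.
# Assumes the `requests` library; example.org is a placeholder domain.
import requests

domain = "example.org"

# Many domains delegate their Matrix traffic to a homeserver hosted elsewhere
# via a .well-known file (e.g. example.org -> matrix.example.org).
try:
    well_known = requests.get(
        f"https://{domain}/.well-known/matrix/client", timeout=10
    ).json()
    base_url = well_known["m.homeserver"]["base_url"].rstrip("/")
except Exception:
    base_url = f"https://{domain}"  # no delegation: talk to the domain directly

# Every spec-compliant homeserver (Synapse, Dendrite, Conduit, ...) answers this.
versions = requests.get(f"{base_url}/_matrix/client/versions", timeout=10).json()
print(f"{domain} is served by {base_url}")
print("Supported client-server spec versions:", versions["versions"])
```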
I could also hold out for Dendrite or Conduit to mature enough that I’m ready to try them, which might not be more than a few months off. Hmm, seems like I’ve convinced myself to stick with it for now, and we’ll see how it goes. In the meantime, if you know me and you want to try it out let me know and I might risk setting you up with an account!

What do you miss least about pre-lockdown life?

@JanetHughes on Twitter: What do you miss the least from pre-lockdown life? I absolutely do not miss wandering around the office looking for a meeting room for a confidential call or if I hadn’t managed to book a room in advance. Let’s never return to that joyless frustration, hey? 10:27 AM · Feb 3, 2021

After seeing Terence Eden taking Janet Hughes' tweet from earlier this month as a writing prompt, I thought I might do the same. The first thing that leaps to my mind is commuting. At various points in my life I’ve spent between one and three hours a day travelling to and from work and I’ve never more than tolerated it at best. It steals time from your day, and societal norms dictate that it’s your leisure & self-care time that must be sacrificed. Longer commutes allow more time to get into a book or podcast, especially if not driving, but I’d rather have that time at home than trying to be comfortable in a train seat designed for some mythical average man shaped nothing like me! The other thing I don’t miss is the colds and flu! Before the pandemic, British culture encouraged working even when ill, which meant constantly coming into contact with people carrying low-grade viruses. I’m not immunocompromised, but some allergies and the residue of being asthmatic as a child meant that I would get sick 2-3 times a year. A pleasant side-effect of the COVID precautions we’re all taking is that I haven’t been sick for over 12 months now, which is amazing! Finally, I don’t miss having so little control over my environment. One of the things that working from home has made clear is that there are certain unavoidable aspects of working in my shared office that cause me sensory stress, and that are completely unrelated to my work. Working (or trying to work) next to a noisy automatic scanner; trying to find a light level that works for 6 different people doing different tasks; lacking somewhere quiet and still to eat lunch and recover from a morning of meetings or the constant vaguely-distracting bustle of a large shared office. It all takes energy. Although it’s partly been replaced by the new stress of living through a global pandemic, that old stress was a constant drain on my productivity and mood that had been growing throughout my career as I moved (ironically, given the common assumption that seniority leads to more privacy) into larger and larger open plan offices.

Remarkable blogging

And the handwritten blog saga continues, as I’ve just received my new reMarkable 2 tablet, which is designed for reading, writing and nothing else. It uses a super-responsive e-ink display and writing on it with a stylus is a dream. It has a slightly rough texture with just a bit of friction that makes my writing come out a lot more legibly than on a slippery glass touchscreen. If that was all there was to it, I might not have wasted my money, but it turns out that it runs on Linux and the makers have wisely decided not to lock it down but to give you full root access. Yes, you read that right: root access.
It presents as an ethernet device over USB, so you can SSH in with a password found in the settings and have full control over your own devices. What a novel concept. This fact alone has meant it’s built a small yet devoted community of users who have come up with some clever ways of extending its functionality. In fact, many of these are listed on this GitHub repository. Finally, from what I’ve seen so far, the handwriting recognition is impressive to say the least. This post was written on it and needed only a little editing. I think this is a device that will get a lot of use! GLAM Data Science Network fellow travellers Updates 2021-02-04 Thanks to Gene @dzshuniper@ausglam.space for suggesting ADHO and a better attribution for the opening quote (see comments below for details) See comments & webmentions for details. “If you want to go fast, go alone. If you want to go far, go together.” — African proverb, probably popularised in English by Kenyan church leader Rev. Samuel Kobia (original) This quote is a popular one in the Carpentries community, and I interpret it in this context to mean that a group of people working together is more sustainable than individuals pursuing the same goal independently. That’s something that speaks to me, and that I want to make sure is reflected in nurturing this new community for data science in galleries, archives, libraries & museums (GLAM). To succeed, this work needs to be complementary and collaborative, rather than competitive, so I want to acknowledge a range of other networks & organisations whose activities complement this. The rest of this article is an unavoidably incomplete list of other relevant organisations whose efforts should be acknowledged and potentially built on. And it should go without saying, but just in case: if the work I’m planning fits right into an existing initiative, then I’m happy to direct my resources there rather than duplicate effort. Inspirations & collaborators Groups with similar goals or undertaking similar activities, but focused on a different sector, geographic area or topic. I think we should make as much use of and contribution to these existing communities as possible since there will be significant overlap. code4lib Probably the closest existing community to what I want to build, but primarily based in the US, so timezones (and physical distance for in-person events) make it difficult to participate fully. This is a well-established community though, with regular events including an annual conference so there’s a lot to learn here. newCardigan Similar to code4lib but an Australian focus, so the timezone problem is even bigger! GLAM Labs Focused on supporting the people experimenting with and developing the infrastructure to enable scholars to access GLAM materials in new ways. In some ways, a GLAM data science network would be complementary to their work, by providing people not directly involved with building GLAM Labs with the skills to make best use of GLAM Labs infrastructure. UK Government data science community Another existing community with very similar intentions, but focused on UK Government sector. Clearly the British Library and a few national & regional museums & archives fall into this, but much of the rest of the GLAM sector does not. 
Artificial Intelligence for Libraries, Archives & Museums (AI4LAM): A multinational collaboration between several large libraries, archives and museums with a specific focus on the Artificial Intelligence (AI) subset of data science.

UK Reproducibility Network: A network of researchers, primarily in HEIs, with an interest in improving the transparency and reliability of academic research. Mostly science-focused but with some overlap of goals around ethical and robust use of data.

Museums Computer Group: I’m less familiar with this than the others, but it seems to have a wider focus on technology generally, within the slightly narrower scope of museums specifically. Again, a lot of potential for collaboration.

Training

Several organisations and looser groups exist specifically to develop and deliver training that will be relevant to members of this network. The network also presents an opportunity for those who have done a workshop with one of these and want to know what the “next steps” are to continue their data science journey.

- The Carpentries, aka: Library Carpentry, Data Carpentry, Software Carpentry
- Data Science Training for Librarians (DST4L)
- The Programming Historian
- CDH Cultural Heritage Data School

Supporters

These mission-driven organisations have goals that align well with what I imagine for the GLAM DSN, but operate at a more strategic level. They work by providing expert guidance and policy advice, lobbying and supporting specific projects with funding and/or effort. In particular, the SSI runs a fellowship programme which is currently providing a small amount of funding to this project.

- Digital Preservation Coalition (DPC)
- Software Sustainability Institute (SSI)
- Research Data Alliance (RDA)
- Alliance of Digital Humanities Organizations (ADHO) … and its Libraries and Digital Humanities Special Interest Group (Lib&DH SIG)

Professional bodies

These organisations exist to promote the interests of professionals in particular fields, including supporting professional development. I hope they will provide communication channels to their various members at the least, and may be interested in supporting more directly, depending on their mission and goals.

- Society of Research Software Engineering
- Chartered Institute of Library and Information Professionals
- Archives & Records Association
- Museums Association

Conclusion

As I mentioned at the top of the page, this list cannot possibly be complete. This is a growing area and I’m not the only or first person to have this idea. If you can think of anything glaring that I’ve missed and you think should be on this list, leave a comment or tweet/toot at me!

A new font for the blog

I’ve updated my blog theme to use the quasi-proportional fonts Iosevka Aile and Iosevka Etoile. I really like the aesthetic, as they look like fixed-width console fonts (I use the true fixed-width version of Iosevka in my terminal and text editor) but they’re actually proportional, which makes them easier to read. https://typeof.net/Iosevka/

Training a model to recognise my own handwriting

If I’m going to train an algorithm to read my weird & awful writing, I’m going to need a decent-sized training set to work with. And since one of the main things I want to do with it is to blog “by hand” it makes sense to focus on that type of material for training. In other words, I need to write out a bunch of blog posts on paper, scan them and transcribe them as ground truth.
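To give a feel for what that ground truth might look like once assembled (this layout is my own illustration, not something Transkribus prescribes), here is a rough sketch that pairs each scanned page with its transcription in a simple manifest; the folder names and the post-001.png/post-001.txt convention are hypothetical.

```python
# Sketch only: pair scanned page images with their transcriptions into a
# simple CSV manifest for training. The scans/ and transcripts/ folders and
# their naming convention (post-001.png <-> post-001.txt) are hypothetical.
import csv
from pathlib import Path

scans = Path("scans")              # post-001.png, post-002.png, ...
transcripts = Path("transcripts")  # post-001.txt, post-002.txt, ...

with open("ground-truth.csv", "w", newline="") as out:
    writer = csv.writer(out)
    writer.writerow(["image", "text"])
    for image in sorted(scans.glob("*.png")):
        text_file = transcripts / (image.stem + ".txt")
        if not text_file.exists():
            print(f"No transcription yet for {image.name}, skipping")
            continue
        writer.writerow([str(image), text_file.read_text(encoding="utf-8").strip()])
```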
The added bonus of this plan is that after transcribing, I also end up with some digital text I can use as an actual post — multitasking! So, by the time you read this, I will have already run it through a manual transcription process using Transkribus to add it to my training set, and copy-pasted it into emacs for posting. This is a fun little project because it means I can: Write more by hand with one of my several nice fountain pens, which I enjoy Learn more about the operational process some of my colleagues go through when digitising manuscripts Learn more about the underlying technology & maths, and how to tune the process Produce more lovely content! For you to read! Yay! Write in a way that forces me to put off editing until after a first draft is done and focus more on getting the whole of what I want to say down. That’s it for now — I’ll keep you posted as the project unfolds. Addendum Tee hee! I’m actually just enjoying the process of writing stuff by hand in long-form prose. It’ll be interesting to see how the accuracy turns out and if I need to be more careful about neatness. Will it be better or worse than the big but generic models used by Samsung Notes or OneNote. Maybe I should include some stylus-written text for comparison. Blogging by hand I wrote the following text on my tablet with a stylus, which was an interesting experience: So, thinking about ways to make writing fun again, what if I were to write some of them by hand? I mean I have a tablet with a pretty nice stylus, so maybe handwriting recognition could work. One major problem, of course, is that my handwriting is AWFUL! I guess I’ll just have to see whether the OCR is good enough to cope… It’s something I’ve been thinking about recently anyway: I enjoy writing with a proper fountain pen, so is there a way that I can have a smooth workflow to digitise handwritten text without just typing it back in by hand? That would probably be preferable to this, which actually seems to work quite well but does lead to my hand tensing up to properly control the stylus on the almost-frictionless glass screen. I’m surprised how well it worked! Here’s a sample of the original text: And here’s the result of converting that to text with the built-in handwriting recognition in Samsung Notes: Writing blog posts by hand So, thinking about ways to make writing fun again, what if I were to write some of chum by hand? I mean, I have a toldest winds a pretty nice stylus, so maybe handwriting recognition could work. One major problems, ofcourse, is that my , is AWFUL! Iguess I’ll just have to see whattime the Ocu is good enough to cope… It’s something I’ve hun tthinking about recently anyway: I enjoy wilting with a proper fountain pion, soischeme a way that I can have a smooch workflow to digitise handwritten text without just typing it back in by hand? That wouldprobally be preferableto this, which actually scams to work quito wall but doers load to my hand tensing up to properly couldthe stylus once almost-frictionlessg lass scream. It’s pretty good! It did require a fair bit of editing though, and I reckon we can do better with a model that’s properly trained on a large enough sample of my own handwriting. What I want from a GLAM/Cultural Heritage Data Science Network Introduction As I mentioned last year, I was awarded a Software Sustainability Institute Fellowship to pursue the project of setting up a Cultural Heritage/GLAM data science network. 
Obviously, the global pandemic has forced a re-think of many plans and this is no exception, so I’m coming back to reflect on it and make sure I’m clear about the core goals so that everything else still moves in the right direction. One of the main reasons I have for setting up a GLAM data science network is because it’s something I want. The advice to “scratch your own itch” is often given to people looking for an open project to start or contribute to, and the lack of a community of people with whom to learn & share ideas and practice is something that itches for me very much. The “motivation” section in my original draft project brief for this work said: Cultural heritage work, like all knowledge work, is increasingly data-based, or at least gives opportunities to make use of data day-to-day. The proper skills to use this data enable more effective working. Knowledge and experience thus gained improves understanding of and empathy with users also using such skills. But of course, I have my own reasons for wanting to do this too. In particular, I want to: Advocate for the value of ethical, sustainable data science across a wide range of roles within the British Library and the wider sector Advance the sector to make the best use of data and digital sources in the most ethical and sustainable way possible Understand how and why people use data from the British Library, and plan/deliver better services to support that Keep up to date with relevant developments in data science Learn from others' skills and experiences, and share my own in turn Those initial goals imply some further supporting goals: Build up the confidence of colleagues who might benefit from data science skills but don’t feel they are “technical” or “computer literate” enough Further to that, build up a base of colleagues with the confidence to share their skills & knowledge with others, whether through teaching, giving talks, writing or other channels Identify common awareness gaps (skills/knowledge that people don’t know they’re missing) and address them Develop a communal space (primarily online) in which people feel safe to ask questions Develop a body of professional practice and help colleagues to learn and contribute to the evolution of this, including practices of data ethics, software engineering, statistics, high performance computing, … Break down language barriers between data scientists and others I’ll expand on this separately as my planning develops, but here are a few specific activities that I’d like to be able to do to support this: Organise less-formal learning and sharing events to complement the more formal training already available within organisations and the wider sector, including “show and tell” sessions, panel discussions, code cafés, masterclasses, guest speakers, reading/study groups, co-working sessions, … Organise training to cover intermediate skills and knowledge currently missing from the available options, including the awareness gaps and professional practice mentioned above Collect together links to other relevant resources to support self-led learning Decisions to be made There are all sorts of open questions in my head about this right now, but here are some of the key ones. Is it GLAM or Cultural Heritage? When I first started planning this whole thing, I went with “Cultural Heritage”, since I was pretty transparently targeting my own organisation. The British Library is fairly unequivocally a CH organisation. 
But as I’ve gone along I’ve found myself gravitating more towards the term “GLAM” (which stands for Galleries, Libraries, Archives, Museums) as it covers a similar range of work but is clearer (when you spell out the acronym) about what kinds of work are included. What skills are relevant? This turns out to be surprisingly important, at least in terms of how the community is described, as they define the boundaries of the community and can be the difference between someone feeling welcome or excluded. For example, I think that some introductory statistics training would be immensely valuable for anyone working with data to understand what options are open to them and what limitations those options have, but is the word “statistics” offputting per se to those who’ve chosen a career in arts & humanities? I don’t know because I don’t have that background and perspective. Keep it internal to the BL, or open up early on? I originally planned to focus primarily on my own organisation to start with, feeling that it would be easier to organise events and build a network within a single organisation. However, the pandemic has changed my thinking significantly. Firstly, it’s now impossible to organise in-person events and that will continue for quite some time to come, so there is less need to focus on the logistics of getting people into the same room. Secondly, people within the sector are much more used to attending remote events, which can easily be opened up to multiple organisations in many countries, timezones allowing. It now makes more sense to focus primarily on online activities, which opens up the possibility of building a critical mass of active participants much more quickly by opening up to the wider sector. Conclusion This is the type of post that I could let run and run without ever actually publishing, but since it’s something I need feedback and opinions on from other people, I’d better ship it! I really want to know what you think about this, whether you feel it’s relevant to you and what would make it useful. Comments are open below, or you can contact me via Mastodon or Twitter. Writing About Not Writing Under Construction Grunge Sign by Nicolas Raymond — CC BY 2.0 Every year, around this time of year, I start doing two things. First, I start thinking I could really start to understand monads and write more than toy programs in Haskell. This is unlikely to ever actually happen unless and until I get a day job where I can justify writing useful programs in Haskell, but Advent of Code always gets me thinking otherwise. Second, I start mentally writing this same post. You know, the one about how the blogger in question hasn’t had much time to write but will be back soon? “Sorry I haven’t written much lately…” It’s about as cliché as a Geocities site with a permanent “Under construction” GIF. At some point, not long after the dawn of ~time~ the internet, most people realised that every website was permanently under construction and publishing something not ready to be published was just pointless. So I figured this year I’d actually finish writing it and publish it. After all, what’s the worst that could happen? If we’re getting all reflective about this, I could probably suggest some reasons why I’m not writing much: For a start, there’s a lot going on in both my world and The World right now, which doesn’t leave a lot of spare energy after getting up, eating, housework, working and a few other necessary activities. 
As a result, I’m easily distracted and I tend to let myself get dragged off in other directions before I even get to writing much of anything. If I do manage to focus on this blog in general, I’ll often end up working on some minor tweak to the theme or functionality. I mean, right now I’m wondering if I can do something clever in my text-editor (Emacs, since you’re asking) to streamline my writing & editing process so it’s more elegant, efficient, ergonomic and slightly closer to perfect in every way. It also makes me much more likely to self-censor, and to indulge my perfectionist tendencies to try and tweak the writing until it’s absolutely perfect, which of course never happens. I’ve got a whole heap of partly-written posts that are juuuust waiting for the right motivation for me to just finish them off. The only real solution is to accept that: I’m not going to write much and that’s probably OK What I do write won’t always be the work of carefully-researched, finely crafted genius that I want it to be, and that’s probably OK too Also to remember why I started writing and publishing stuff in the first place: to reflect and get my thoughts out onto a (virtual) page so that I can see them, figure out whether I agree with myself and learn; and to stimulate discussion and get other views on my (possibly uninformed, incorrect or half-formed) thoughts, also to learn. In other words, a thing I do for me. It’s easy to forget that and worry too much about whether anyone else wants to read my s—t. Will you notice any changes? Maybe? Maybe not? Who knows. But it’s a new year and that’s as good a time for a change as any. When is a persistent identifier not persistent? Or an identifier? I wrote a post on the problems with ISBNs as persistent identifiers (PIDS) for work, so check it out if that sounds interesting. IDCC20 reflections I’m just back from IDCC20, so here are a few reflections on this year’s conference. You can find all the available slides and links to shared notes on the conference programme. There’s also a list of all the posters and an overview of the Unconference Skills for curation of diverse datasets Here in the UK and elsewhere, you’re unlikely to find many institutions claiming to apply a deep level of curation to every dataset/software package/etc deposited with them. There are so many different kinds of data and so few people in any one institution doing “curation” that it’s impossible to do this for everything. Absent the knowledge and skills required to fully evaluate an object the best that can be done is usually to make a sense check on the metadata and flag up with the depositor potential for high-level issues such as accidental disclosure of sensitive personal information. The Data Curation Network in the United States is aiming to address this issue by pooling expertise across multiple organisations. The pilot has been highly successful and they’re now looking to obtain funding to continue this work. The Swedish National Data Service is experimenting with a similar model, also with a lot of success. As well as sharing individual expertise, the DCN collaboration has also produced some excellent online quick-reference guides for curating common types of data. We had some further discussion as part of the Unconference on the final day about what it would look like to introduce this model in the UK. There was general agreement that this was a good idea and a way to make optimal use of sparse resources. 
There were also very valid concerns that it would be difficult in the current financial climate for anyone to justify doing work for another organisation, apparently for free. In my mind there are two ways around this, which are not mutually exclusive by any stretch of the imagination. First is to Just Do It: form an informal network of curators around something simple like a mailing list, and give it a try. Second is for one or more trusted organisations to provide some coordination and structure. There are several candidates for this including DCC, Jisc, DPC and the British Library; we all have complementary strengths in this area so it’s my hope that we’ll be able to collaborate around it. In the meantime, I hope the discussion continues. Artificial intelligence, machine learning et al As you might expect at any tech-oriented conference there was a strong theme of AI running through many presentations, starting from the very first keynote from Francine Berman. Her talk, The Internet of Things: Utopia or Dystopia? used self-driving cars as a case study to unpack some of the ethical and privacy implications of AI. For example, driverless cars can potentially increase efficiency, both through route-planning and driving technique, but also by allowing fewer vehicles to be shared by more people. However, a shared vehicle is not a private space in the way your own car is: anything you say or do while in that space is potentially open to surveillance. Aside from this, there are some interesting ideas being discussed, particularly around the possibility of using machine learning to automate increasingly complex actions and workflows such as data curation and metadata enhancement. I didn’t get the impression anyone is doing this in the real world yet, but I’ve previously seen theoretical concepts discussed at IDCC make it into practice so watch this space! Playing games! Training is always a major IDCC theme, and this year two of the most popular conference submissions described games used to help teach digital curation concepts and skills. Mary Donaldson and Matt Mahon of the University of Glasgow presented their use of Lego to teach the concept of sufficient metadata. Participants build simple models before documenting the process and breaking them down again. Then everyone had to use someone else’s documentation to try and recreate the models, learning important lessons about assumptions and including sufficient detail. Kirsty Merrett and Zosia Beckles from the University of Bristol brought along their card game “Researchers, Impact and Publications (RIP)”, based on the popular “Cards Against Humanity”. RIP encourages players to examine some of the reasons for and against data sharing with plenty of humour thrown in. Both games were trialled by many of the attendees during Thursday’s Unconference. Summary I realised in Dublin that it’s 8 years since I attended my first IDCC, held at the University of Bristol in December 2011 while I was still working at the nearby University of Bath. While I haven’t been every year, I’ve been to every one held in Europe since then and it’s interesting to see what has and hasn’t changed. We’re no longer discussing data management plans, data scientists or various other things as abstract concepts that we’d like to encourage, but dealing with the real-world consequences of them. The conference has also grown over the years: this year was the biggest yet, boasting over 300 attendees. 
There has been especially big growth in attendees from North America, Australasia, Africa and the Middle East. That’s great for the diversity of the conference as it brings in more voices and viewpoints than ever. With more people around to interact with I have to work harder to manage my energy levels but I think that’s a small price to pay. Iosevka: a nice fixed-width font Iosevka is a nice, slender monospace font with a lot of configurable variations. Check it out: https://typeof.net/Iosevka/ Replacing comments with webmentions Just a quickie to say that I’ve replaced the comment section at the bottom of each post with webmentions, which allows you to comment by posting on your own site and linking here. It’s a fundamental part of the IndieWeb, which I’m slowly getting to grips with, having been a halfway member of it for years by virtue of having my own site on my own domain. I’d already got rid of Google Analytics to stop forcing that tracking on my visitors, and I wanted to get rid of Disqus too because I’m pretty sure the only way that it’s free for me is if they’re selling my data and yours to third parties. Webmention is a nice alternative because it relies only on open standards, has no tracking and allows people to control their own comments. While I’m currently using a third-party service to help, I can switch to self-hosted at any point in the future, completely transparently. Thanks to webmention.io, which handles incoming webmentions for me, and webmention.js, which displays them on the site, I can keep it all static and not have to implement any of this myself, which is nice. It’s a bit harder to comment because you have to be able to host your own content somewhere, but then almost no-one ever commented anyway, so it’s not like I’ll lose anything! Plus, if I get Bridgy set up right, you should be able to comment just by replying on Mastodon, Twitter or a few other places. A spot of web searching shows that I’m not the first to make the Disqus -> webmentions switch (yes, I’m putting these links in blatantly to test outgoing webmentions with Telegraph…): So long Disqus, hello webmention — Nicholas Hoizey Bye Disqus, hello Webmention! — Evert Pot Implementing Webmention on a static site — Deluvi Let’s see how this goes! Bridging Carpentries Slack channels to Matrix It looks like I’ve accidentally taken charge of bridging a bunch of The Carpentries Slack channels over to Matrix. Given this, it seems like a good idea to explain what that sentence means and reflect a little on my reasoning. I’m more than happy to discuss the pros and cons of this approach. If you just want to try chatting in Matrix, jump to the getting started section. What are Slack and Matrix? Slack (see also on Wikipedia), for those not familiar with it, is an online text chat platform with the feel of IRC (Internet Relay Chat), a modern look and feel and both web and smartphone interfaces. By providing a free tier that meets many people's needs on its own, Slack has become the communication platform of choice for thousands of online communities, private projects and more. One of the major disadvantages of using Slack’s free tier, as many community organisations do, is that as an incentive to upgrade to a paid service your chat history is limited to the most recent 10,000 messages across all channels. For a busy community like The Carpentries, this means that messages older than about 6-7 weeks are already inaccessible, rendering some of the quieter channels apparently empty.
As Slack is at pains to point out, that history isn’t gone, just archived and hidden from view unless you pay the low, low price of $1/user/month. That doesn’t seem too pricy, unless you’re a non-profit organisation with a lot of projects you want to fund and an active membership of several hundred worldwide, at which point it soon adds up. Slack does offer to waive the cost for registered non-profit organisations, but only for one community. The Carpentries is not an independent organisation, but one fiscally sponsored by Community Initiatives, which has already used its free quota of one elsewhere, rendering the Carpentries ineligible. Other umbrella organisations such as NumFocus (and, I expect, Mozilla) also run into this problem with Slack. So, we have a community which is slowly and inexorably losing its own history behind a paywall. For some people this is simply annoying, but from my perspective as a facilitator of the preservation of digital things the community is haemorrhaging an important record of its early history. Enter Matrix. Matrix is a chat platform similar to IRC, Slack or Discord. It’s divided into separate channels, and users can join one or more of these to take part in the conversation happening in those channels. What sets it apart from older technology like IRC and walled gardens like Slack & Discord is that it’s federated. Federation means simply that users on any server can communicate with users and channels on any other server. Usernames and channel addresses specify both the individual identifier and the server it calls home, just as your email address contains all the information needed for my email server to route messages to it. While users are currently tied to their home server, channels can be mirrored and synchronised across multiple servers, making the overall system much more resilient. Can’t connect to your favourite channel on server X? No problem: just connect via its alias on server Y and when X comes back online it will be resynchronised. The technology used is much more modern and secure than the ageing IRC protocol, and there’s no vendor lock-in like there is with closed platforms like Slack and Discord. On top of that, Matrix channels can easily be “bridged” to channels/rooms on other platforms, including, yes, Slack, so that you can join on Matrix and transparently talk to people connected to the bridged room, or vice versa. So, to summarise: The current Carpentries Slack channels could be bridged to Matrix at no cost and with no disruption to existing users The history of those channels from that point on would be retained on matrix.org and accessible even when it’s no longer available on Slack If at some point in the future The Carpentries chose to invest in its own Matrix server, it could adopt and become the main Matrix home of these channels without disruption to users of either Matrix or (if it’s still in use at that point) Slack Matrix is an open protocol, with a reference server implementation and a wide range of clients all available as free software, which aligns with the values of the Carpentries community On top of this: I’m fed up of having so many different Slack teams to switch between to see the channels in all of them, and prefer having all the channels I regularly visit in a single unified interface; I wanted to see how easy this would be and whether others would also be interested.
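To make the addressing scheme concrete, a Matrix user ID and a room alias look something like the following (the particular names here are just placeholders):

```
@jez:matrix.org        <- user "jez", whose home server is matrix.org
#general:matrix.org    <- room alias "general", published by matrix.org
```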
Given all this, I thought I’d go ahead and give it a try to see if it made things more manageable for me and to see what the reaction would be from the community. How can I get started? !!! reminder Please remember that, like any other Carpentries space, the Code of Conduct applies in all of these channels. First, sign up for a Matrix account. The quickest way to do this is on the Matrix “Try now” page, which will take you to the Riot Web client which for many is synonymous with Matrix. Other clients are also available for the adventurous. Second, join one of the channels. The links below will take you to a page that will let you connect via your preferred client. You’ll need to log in as they are set not to allow guest access, but, unlike Slack, you won’t need an invitation to be able to join. #general — the main open channel to discuss all things Carpentries #random — anything that would be considered offtopic elsewhere #welcome — join in and introduce yourself! That’s all there is to getting started with Matrix. To find all the bridged channels there’s a Matrix “community” that I’ve added them all to: Carpentries Matrix community. There’s a lot more, including how to bridge your favourite channels from Slack to Matrix, but this is all I’ve got time and space for here! If you want to know more, leave a comment below, or send me a message on Slack (jezcope) or maybe Matrix (@petrichor:matrix.org)! I’ve also made a separate channel for Matrix-Slack discussions: #matrix on Slack and Carpentries Matrix Discussion on Matrix MozFest19 first reflections Discussions of neurodiversity at #mozfest Photo by Jennifer Riggins The other weekend I had my first experience of Mozilla Festival, aka #mozfest. It was pretty awesome. I met quite a few people in real life that I’ve previously only known (/stalked) on Twitter, and caught up with others that I haven’t seen for a while. I had the honour of co-facilitating a workshop session on imposter syndrome and how to deal with it with the wonderful Yo Yehudi and Emmy Tsang. We all learned a lot and hope our participants did too; we’ll be putting together a summary blog post as soon as we can get our act together! I also attended a great session, led by Kiran Oliver (psst, they’re looking for a new challenge), on how to encourage and support a neurodiverse workforce. I was only there for the one day, and I really wish that I’d taken the plunge and committed to the whole weekend. There’s always next year though! To be honest, I’m just disappointed that I never had the courage to go sooner, Music for working Today1 the office conversation turned to blocking out background noise. (No, the irony is not lost on me.) Like many people I work in a large, open-plan office, and I’m not alone amongst my colleagues in sometimes needing to find a way to boost concentration by blocking out distractions. Not everyone is like this, but I find music does the trick for me. I also find that different types of music are better for different types of work, and I use this to try and manage my energy better. There are more distractions than auditory noise, and at times I really struggle with visual noise. Rather than have this post turn into a rant about the evils of open-plan offices, I’ll just mention that the scientific evidence doesn’t paint them in a good light2, or at least suggests that the benefits are more limited in scope than is commonly thought3, and move on to what I actually wanted to share: good music for working to. 
There are a number of genres that I find useful for working. Generally, these have in common a consistent tempo, a lack of lyrics, and enough variation to prevent boredom without distracting. Familiarity helps my concentration too so I’ll often listen to a restricted set of albums for a while, gradually moving on by dropping one out and bringing in another. In my case this includes: Traditional dance music, generally from northern and western European traditions for me. This music has to be rhythmically consistent to allow social dancing, and while the melodies are typically simple repeated phrases, skilled musicians improvise around that to make something beautiful. I tend to go through phases of listening to particular traditions; I’m currently listening to a lot of French, Belgian and Scandinavian. Computer game soundtracks, which are specifically designed to enhance gameplay without distracting, making them perfect for other activities requiring a similar level of concentration. Chiptunes and other music incorporating it; partly overlapping with the previous category, chiptunes is music made by hacking the audio chips from (usually) old computers and games machines to become an instrument for new music. Because of the nature of the instrument, this will have millisecond-perfect rhythm and again makes for undistracting noise blocking with an extra helping of nostalgia! Purists would disagree with me, but I like artists that combine chiptunes with other instruments and effects to make something more complete-sounding. Retrowave/synthwave/outrun, synth-driven music that’s instantly familiar as the soundtrack to many 90s sci-fi and thriller movies. Atmospheric, almost dreamy, but rhythmic with a driving beat, it’s another genre that fits into the “pleasing but not too surprising” category for me. So where to find this stuff? One of the best resources I’ve found is Music for Programming which provides carefully curated playlists of mostly electronic music designed to energise without distracting. They’re so well done that the tracks move seamlessly, one to the next, without ever getting boring. Spotify is an obvious option, and I do use it quite a lot. However, I’ve started trying to find ways to support artists more directly, and Bandcamp seems to be a good way of doing that. It’s really easy to browse by genre, or discover artists similar to what you’re currently hearing. You can listen for free as long as you don’t mind occasional nags to buy the music you’re hearing, but you can also buy tracks or albums. Music you’ve paid for is downloadable in several open, DRM-free formats for you to keep, and you know that a decent chunk of that cash is going directly to that artist. I also love noise generators; not exactly music, but a variety of pleasant background noises, some of which nicely obscure typical office noise. I particularly like mynoise.net, which has a cornucopia of different natural and synthetic noises. Each generator comes with a range of sliders allowing you to tweak the composition and frequency range, and will even animate them randomly for you to create a gently shifting soundscape. A much simpler, but still great, option is Noisli with it’s nice clean interface. Both offer apps for iOS and Android. For bonus points, you can always try combining one or more of the above. Adding in a noise generator allows me to listen to quieter music while still getting good environmental isolation when I need concentration. 
Another favourite combo is to open both the cafe and rainfall generators from myNoise, made easier by the ability to pop out a mini-player then open up a second generator. I must be missing stuff though. What other musical genres should I try? What background sounds are nice to work to? Well, you know. The other day. Whatever. ↩︎ See e.g.: Lee, So Young, and Jay L. Brand. ‘Effects of Control over Office Workspace on Perceptions of the Work Environment and Work Outcomes’. Journal of Environmental Psychology 25, no. 3 (1 September 2005): 323–33. https://doi.org/10.1016/j.jenvp.2005.08.001. ↩︎ Open plan offices can actually work under certain conditions, The Conversation ↩︎ Working at the British Library: 6 months in It barely seems like it, but I’ve been at the British Library now for nearly 6 months. It always takes a long time to adjust and from experience I know it’ll be another year before I feel fully settled, but my team, department and other colleagues have really made me feel welcome and like I belong. One thing that hasn’t got old yet is the occasional thrill of remembering that I work at my national library now. Every now and then I’ll catch a glimpse of the collections at Boston Spa or step into one of the reading rooms and think “wow, I actually work here!” I also like having a national and international role to play, which means I get to travel a bit more than I used to. Budgets are still tight so there are limits, and I still prefer to be home more often than not, but there is more scope in this job than I’ve had previously for travelling to conferences, giving talks that change the way people think, and learning in different contexts. I’m learning a lot too, especially how to work with and manage people split across multiple sites, and the care and feeding of budgets. As well as missing my old team at Sheffield, I do also miss some of the direct contact I had with researchers in HE. I especially miss the teaching work, but also the higher-level influencing of more senior academics to change practices on a wider scale. Still, I get to use those influencing skills in different ways now, and I’m still involved with the Carpentries, which should let me keep my hand in with teaching. I still deal with my general tendency to try and do All The Things, and as before I’m slowly learning to recognise it, tame it and very occasionally turn it to my advantage. That also leads to feelings of imposterism that are only magnified by the knowledge that I now work at a national institution! It’s a constant struggle some days to believe that I’ve actually earned my place here through hard work. Even if I don’t always feel that I have, my colleagues here certainly have, so I should have more faith in their opinion of me. Finally, I couldn’t write this type of thing without mentioning the commute. I’ve gone from 90 minutes each way on a good day (up to twice that if the trains were disrupted) to 35 minutes each way along fairly open roads. I have less time to read, but much more time at home. On top of that, the library has implemented flexitime across all pay grades, with even senior managers strongly encouraged to make full use of it. Not only is this an important enabler of equality across the organisation, it relieves for me personally the pressure to work over my contracted hours and the guilt I’ve always felt at leaving work even 10 minutes early. If I work late, it’s now a choice I’m making based on business needs instead of guilt and in full knowledge that I’ll get that time back later.
So that’s where I am right now. I’m really enjoying the work and the culture, and I look forward to what the next 6 months will bring! RDA Plenary 13 reflection Photo by me I sit here writing this in the departure lounge at Philadelphia International Airport, waiting for my Aer Lingus flight back after a week at the 13th Research Data Alliance (RDA) Plenary (although I’m actually publishing this a week or so later at home). I’m pretty exhausted, partly because of the jet lag, and partly because it’s been a very full week with so much to take in. It’s my first time at an RDA Plenary, and it was quite a new experience for me! First off, it’s my first time outside Europe, and thus my first time crossing quite so many timezones. I’ve been waking at 5am and ready to drop by 8pm, but I’ve struggled on through! Secondly, it’s the biggest conference I’ve been to for a long time, both in number of attendees and number of parallel sessions. There’s been a lot of sustained input so I’ve been very glad to have a room in the conference hotel and be able to escape for a few minutes when I needed to recharge. Thirdly, it’s not really like any other conference I’ve been to: rather than having large numbers of presentations submitted by attendees, each session comprises lots of parallel meetings of RDA interest groups and working groups. It’s more community-oriented: an opportunity for groups to get together face to face and make plans or show off results. I found it pretty intense and struggled to take it all in, but incredibly valuable nonetheless. Lots of information to process (I took a lot of notes) and a few contacts to follow up on too, so overall I loved it! Using Pipfile in Binder Photo by Sear Greyson on Unsplash I recently attended a workshop, organised by the excellent team of the Turing Way project, on a tool called BinderHub. BinderHub, along with public hosting platform MyBinder, allows you to publish computational notebooks online as “binders” such that they’re not static but fully interactive. It’s able to do this by using a tool called repo2docker to capture the full computational environment and dependencies required to run the notebook. !!! aside “What is the Turing Way?” The Turing Way is, in its own words, “a lightly opinionated guide to reproducible data science.” The team is building an open textbook and running a number of workshops for scientists and research software engineers, and you should check out the project on Github. You could even contribute! The Binder process goes roughly like this: Do some work in a Jupyter Notebook or similar Put it into a public git repository Add some extra metadata describing the packages and versions your code relies on Go to mybinder.org and tell it where to find your repository Open the URL it generates for you Profit Other than step 5, which can take some time to build the binder, this is a remarkably quick process. It supports a number of different languages too, including built-in support for R, Python and Julia and the ability to configure pretty much any other language that will run on Linux. However, the Python support currently requires you to have either a requirements.txt or Conda-style environment.yml file to specify dependencies, and I commonly use a Pipfile for this instead. Pipfile allows you to specify a loose range of compatible versions for maximal convenience, but then locks in specific versions for maximal reproducibility. 
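For anyone who hasn’t come across one, here’s a minimal sketch of what a Pipfile looks like — the package names and version constraints below are purely illustrative, not taken from any particular project:

```toml
[[source]]
name = "pypi"
url = "https://pypi.org/simple"
verify_ssl = true

[packages]
# Loose constraints go here; `pipenv lock` pins exact versions in Pipfile.lock
pandas = ">=0.24"
matplotlib = "*"

[requires]
python_version = "3.6"
```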
You can upgrade packages any time you want, but you’re fully in control of when that happens, and the locked versions are checked into version control so that everyone working on a project gets consistency. Since Pipfile is emerging as something of a standard, I thought I’d see if I could use that in a binder, and it turns out to be remarkably simple. The reference implementation of Pipfile is a tool called pipenv by the prolific Kenneth Reitz. All you need to use this in your binder is two files of one line each. requirements.txt tells repo2docker to build a Python-based binder, and contains a single line to install the pipenv package: pipenv Then postBuild is used by repo2docker to install all other dependencies using pipenv: pipenv install --system The --system flag tells pipenv to install packages globally (its default behaviour is to create a Python virtualenv). With these two files, the binder builds and runs as expected. You can see a complete example that I put together during the workshop here on Gitlab. What do you think I should write about? I’ve found it increasingly difficult to make time to blog, and it’s not so much not having the time — I’m pretty privileged in that regard — but finding the motivation. Thinking about what used to motivate me, one of the big things was writing things that other people wanted to read. Rather than try to guess, I thought I’d ask! Those who know what I'm about, what would you read about, if it was written by me? I'm trying to break through the blog-writers block and would love to know what other people would like to see my ill-considered opinions on.— Jez Cope (@jezcope) March 7, 2019 I’m still looking for ideas, so please tweet me or leave me a comment below. Below are a few thoughts that I’m planning to do something with. Something taking one of the more techy aspects of Open Research, breaking it down and explaining the benefits for non-techy folks?— Dr Beth 🏳️‍🌈 🐺 (@PhdGeek) March 7, 2019 Skills (both techy and non techy) that people need to most effectively support RDM— Kate O'Neill (@KateFONeill) March 7, 2019 Sometimes I forget that my background makes me well-qualified to take some of these technical aspects of the job and break them down for different audiences. There might be a whole series in this… Carrying on our conversation last week I'd love to hear more about how you've found moving from an HE lib to a national library and how you see the BL's role in RDM. Appreciate this might be a bit niche/me looking for more interesting things to cite :)— Rosie Higman (@RosieHLib) March 7, 2019 This is interesting, and something I’d like to reflect on; moving from one job to another always has lessons and it’s easy to miss them if you’re not paying attention. Another one for the pile. Life without admin rights to your computer— Mike Croucher (@walkingrandomly) March 7, 2019 This is so frustrating as an end user, but at the same time I get that endpoint security is difficult and there are massive risks associated with letting end users have admin rights. This is particularly important at the BL: as custodians of the nation’s cultural heritage, the risk for us is bigger than for many and for this reason we are now Cyber Essentials Plus certified. At some point I’d like to do some research and have a conversation with someone who knows a lot more about InfoSec to work out what the proper approach to this is, maybe involving VMs and a demilitarized zone on the network.
I’m always looking for more inspiration, so please leave a comment if you’ve got anything you’d like to read my thoughts on. If you’re not familiar with my writing, please take a minute or two to explore the blog; the tags page is probably a good place to get an overview. Ultimate Hacking Keyboard: first thoughts Following on from the excitement of having built a functioning keyboard myself, I got a parcel on Monday. Inside was something that I’ve been waiting for since September: an Ultimate Hacking Keyboard! Where the custom-built Laplace is small and quiet for travelling, the UHK is to be my main workhorse in the study at home. Here are my first impressions: Key switches I went with Kailh blue switches from the available options. In stark contrast to the quiet blacks on the Laplace, blues are NOISY! They have an extra piece of plastic inside the switch that causes an audible and tactile click when the switch activates. This makes them very satisfying to type on and should help as I train my fingers not to bottom out while typing, but does make them unsuitable for use in a shared office! Here are some animations showing how the main types of key switch vary. Layout This keyboard has what’s known as a 60% layout: no number pad, arrows or function keys. As with the more spartan Laplace, these “missing” keys are made up for with programmable layers. For example, the arrow keys are on the Mod layer on the I/J/K/L keys, so I can access them without moving from the home row. I actually find this preferable to having to move my hand to the right to reach them, and I really never used the number pad in any case. Split This is a split keyboard, which means that the left and right halves can be separated to place the hands further apart which eases strain across the shoulders. The UHK has a neat coiled cable joining the two which doesn’t get in the way. A cool design feature is that the two halves can be slotted back together and function perfectly well as a non-split keyboard too, held together by magnets. There are even electrical contacts so that when the two are joined you don’t need the linking cable. Programming The board is fully programmable, and this is achieved via a custom (open source) GUI tool which talks to the (open source) firmware on the board. You can have multiple keymaps, each of which has a separate Base, Mod, Fn and Mouse layer, and there’s an LED display that shows a short mnemonic for the currently active map. I already have a customised Dvorak layout for day-to-day use, plus a standard QWERTY for not-me to use and an alternative QWERTY which will be slowly tweaked for games that don’t work well with Dvorak. Mouse keys One cool feature that the designers have included in the firmware is the ability to emulate a mouse. There’s a separate layer that allows me to move the cursor, scroll and click without moving my hands from the keyboard. Palm rests Not much to say about the palm rests, other than they are solid wood, and chunky, and really add a little something. I have to say, I really like it so far! Overall it feels really well designed, with every little detail carefully thought out and excellent build quality and a really solid feeling. Custom-built keyboard I’m typing this post on a keyboard I made myself, and I’m rather excited about it! Why make my own keyboard? 
I wanted to learn a little bit about practical electronics, and I like to learn by doing I wanted to have the feeling of making something useful with my own hands I actually need a small, keyboard with good-quality switches now that I travel a fair bit for work and this lets me completely customise it to my needs Just because! While it is possible to make a keyboard completely from scratch, it makes much more sense to put together some premade parts. The parts you need are: PCB (printed circuit board): the backbone of the keyboard, to which all the other electrical components attach, this defines the possible physical locations for each key Switches: one for each key to complete a circuit whenever you press it Keycaps: switches are pretty ugly and pretty uncomfortable to press, so each one gets a cap; these are what you probably think of as the “keys” on your keyboard and come in almost limitless variety of designs (within the obvious size limitation) and are the easiest bit of personalisation Controller: the clever bit, which detects open and closed switches on the PCB and tells your computer what keys you pressed via a USB cable Firmware: the program that runs on the controller starts off as source code like any other program, and altering this can make the keyboard behave in loads of different ways, from different layouts to multiple layers accessed by holding a particular key, to macros and even emulating a mouse! In my case, I’ve gone for the following: PCB Laplace from keeb.io, a very compact 47-key (“40%") board, with no number pad, function keys or number row, but a lot of flexibility for key placement on the bottom row. One of my key design goals was small size so I can just pop it in my bag and have on my lap on the train. Controller Elite-C, designed specifically for keyboard builds to be physically compatible with the cheaper Pro Micro, with a more-robust USB port (the Pro Micro’s has a tendency to snap off), and made easier to program with a built-in reset button and better bootloader. Switches Gateron Black: Gateron is one of a number of manufacturers of mechanical switches compatible with the popular Cherry range. The black switch is linear (no click or bump at the activation point) and slightly heavier sprung than the more common red. Cherry also make a black switch but the Gateron version is slightly lighter and having tested a few I found them smoother too. My key goal here was to reduce noise, as the stronger spring will help me type accurately without hitting the bottom of the keystroke with an audible sound. Keycaps Blank grey PBT in DSA profile: this keyboard layout has a lot of non-standard sized keys, so blank keycaps meant that I wouldn’t be putting lots of keys out of their usual position; they’re also relatively cheap, fairly classy IMHO and a good placeholder until I end up getting some really cool caps on a group buy or something; oh, and it minimises the chance of someone else trying the keyboard and getting freaked out by the layout… Firmware QMK (Quantum Mechanical Keyboard), with a work-in-progress layout, based on Dvorak. QMK has a lot of features and allows you to fully program each and every key, with multiple layers accessed through several different routes. Because there are so few keys on this board, I’ll need to make good use of layers to make all the keys on a usual keyboard available. 
Dvorak Simplified Keyboard I’m grateful to the folks of the Leeds Hack Space, especially Nav & Mark who patiently coached me in various soldering techniques and good practice, but also everyone else who was so friendly and welcoming and interested in my project. I’m really pleased with the result, which is small, light and fully customisable. Playing with QMK firmware features will keep me occupied for quite a while! This isn’t the end though, as I’ll need a case to keep the dust out. I’m hoping to be able to 3D print this or mill it from wood with a CNC mill, for which I’ll need to head back to the Hack Space! Less, but better “Weniger, aber besser” — Dieter Rams {:.big-quote} I can barely believe it’s a full year since I published my intentions for 2018. A lot has happened since then. Principally: in November I started a new job as Data Services Lead at The British Library. One thing that hasn’t changed is my tendency to try to do too much, so this year I’m going to try and focus on a single intention, a translation of designer Dieter Rams' famous quote above: Less, but better. This chimes with a couple of other things I was toying with over the Christmas break, as they’re essentially other ways of saying the same thing: Take it steady One thing at a time I’m also going to keep in mind those touchstones from last year: What difference is this making? Am I looking after myself? Do I have evidence for this? I mainly forget to think about them, so I’ll be sticking up post-its everywhere to help me remember! How to extend Python with Rust: part 1 Python is great, but I find it useful to have an alternative language under my belt for occasions when no amount of Pythonic cleverness will make some bit of code run fast enough. One of my main reasons for wanting to learn Rust was to have something better than C for that. Not only does Rust have all sorts of advantages that make it a good choice for code that needs to run fast and correctly, it’s also got a couple of rather nice crates (libraries) that make interfacing with Python a lot nicer. Here’s a little tutorial to show you how easy it is to call a simple Rust function from Python. If you want to try it yourself, you’ll find the code on GitHub. !!! prerequisites I’m assuming for this tutorial that you’re already familiar with writing Python scripts and importing & using packages, and that you’re comfortable using the command line. You’ll also need to have installed Rust. The Rust bit The quickest way to get compiled code into Python is to use the built-in ctypes package. This is Python’s “Foreign Function Interface” or FFI: a means of calling functions outside the language you’re using to make the call. ctypes allows us to call arbitrary functions in a shared library1, as long as those functions conform to certain standard C language calling conventions. Thankfully, Rust tries hard to make it easy for us to build such a shared library. The first thing to do is to create a new project with cargo, the Rust build tool: $ cargo new rustfrompy Created library `rustfrompy` project $ tree . ├── Cargo.toml └── src └── lib.rs 1 directory, 2 files !!! aside I use the fairly common convention that text set in fixed-width font is either example code or commands to type in. For the latter, a $ precedes the command that you type (omit the $), and lines that don’t start with a $ are output from the previous command. I assume a basic familiarity with the Unix-style command line, but I should probably put in some links to resources if you need to learn more!
We need to edit the Cargo.toml file and add a [lib] section: [package] name = "rustfrompy" version = "0.1.0" authors = ["Jez Cope <j.cope@erambler.co.uk>"] [dependencies] [lib] name = "rustfrompy" crate-type = ["cdylib"] This tells cargo that we want to make a C-compatible dynamic library (crate-type = ["cdylib"]) and what to call it, plus some standard metadata. We can then put our code in src/lib.rs. We’ll just use a simple toy function that adds two numbers together: #[no_mangle] pub fn add(a: i64, b: i64) -> i64 { a + b } Notice the pub keyword, which instructs the compiler to make this function accessible to other modules, and the #[no_mangle] annotation, which tells it to use the standard C naming conventions for functions. If we don’t do this, then Rust will generate a new name for the function for its own nefarious purposes, and as a side effect we won’t know what to call it when we want to use it from Python. Being good developers, let’s also add a test: #[cfg(test)] mod test { use ::*; #[test] fn test_add() { assert_eq!(4, add(2, 2)); } } We can now run cargo test which will compile that code and run the test: $ cargo test Compiling rustfrompy v0.1.0 (file:///home/jez/Personal/Projects/rustfrompy) Finished dev [unoptimized + debuginfo] target(s) in 1.2 secs Running target/debug/deps/rustfrompy-3033caaa9f5f17aa running 1 test test test::test_add ... ok test result: ok. 1 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out Everything worked! Now just to build that shared library and we can try calling it from Python: $ cargo build Compiling rustfrompy v0.1.0 (file:///home/jez/Personal/Projects/rustfrompy) Finished dev [unoptimized + debuginfo] target(s) in 0.30 secs Notice that the build is unoptimized and includes debugging information: this is useful in development, but once we’re ready to use our code it will run much faster if we compile it with optimisations. Cargo makes this easy: $ cargo build --release Compiling rustfrompy v0.1.0 (file:///home/jez/Personal/Projects/rustfrompy) Finished release [optimized] target(s) in 0.30 secs The Python bit After all that, the Python bit is pretty short. First we import the ctypes package (which is included in all recent Python versions): from ctypes import cdll Cargo has tidied our shared library away into a folder, so we need to tell Python where to load it from. On Linux, it will be called lib<something>.so where the “something” is the crate name from Cargo.toml, “rustfrompy”: lib = cdll.LoadLibrary('target/release/librustfrompy.so') Finally we can call the function anywhere we want. Here it is in a pytest-style test: def test_rust_add(): assert lib.add(27, 15) == 42 If you have pytest installed (and you should!) you can run the whole test like this: $ pytest --verbose test.py ====================================== test session starts ====================================== platform linux -- Python 3.6.4, pytest-3.1.1, py-1.4.33, pluggy-0.4.0 -- /home/jez/.virtualenvs/datasci/bin/python cachedir: .cache rootdir: /home/jez/Personal/Projects/rustfrompy, inifile: collected 1 items test.py::test_rust_add PASSED It worked! I’ve put both the Rust and Python code on github if you want to try it for yourself. Shortcomings Ok, so that was a pretty simple example, and I glossed over a lot of things. For example, what would happen if we did lib.add(2.0, 2)? This causes Python to throw an error because our Rust function only accepts integers (64-bit signed integers, i64, to be precise), and we gave it a floating point number. 
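Here’s one way to catch that kind of mismatch early — this is a sketch of my own rather than part of the tutorial repository, but it uses only standard ctypes features:

```python
from ctypes import cdll, c_int64

lib = cdll.LoadLibrary('target/release/librustfrompy.so')

# Declare the signature of the Rust `add` function so ctypes can check
# and convert arguments for us instead of guessing
lib.add.argtypes = [c_int64, c_int64]
lib.add.restype = c_int64

print(lib.add(27, 15))   # 42
# lib.add(2.0, 2) now fails immediately with ctypes.ArgumentError
```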
ctypes can’t guess what type(s) a given function will work with, but it can at least tell us when we get it wrong. To fix this properly, we need to do some extra work along those lines, telling the ctypes library what the argument and return types for each function are. For a more complex library, there will probably be more housekeeping to do, such as translating return codes from functions into more Pythonic-style errors. For a small example like this there isn’t much of a problem, but the bigger your compiled library the more extra boilerplate is required on the Python side just to use all the functions. When you’re working with an existing library you don’t have much choice about this, but if you’re building it from scratch specifically to interface with Python, there’s a better way using the Python C API. You can call this directly in Rust, but there are a couple of Rust crates that make life much easier, and I’ll be taking a look at those in a future blog post. .so on Linux, .dylib on Mac and .dll on Windows ↩︎ New Year's irresolution Photo by Andrew Hughes on Unsplash I’ve chosen not to make any specific resolutions this year; I’ve found that they just don’t work for me. Like many people, all I get is a sense of guilt when I inevitably fail to live up to the expectations I set myself at the start of the year. However, I have set a couple of what I’m referring to as “themes” for the year: touchstones that I’ll aim to refer to when setting priorities or just feeling a bit overwhelmed or lacking in direction. They are: Contribution Self-care Measurement I may do some blog posts expanding on these, but in the meantime, I’ve put together a handful of questions to help me think about priorities and get perspective when I’m doing (or avoiding doing) something. What difference is this making? I feel more motivated when I can figure out how I’m contributing to something bigger than myself. In society? In my organisation? To my friends & family? Am I looking after myself? I focus a lot on the expectations others have (or at least that I think they have) of me, but I can’t do anything well unless I’m generally happy and healthy. Is this making me happier and healthier? Is this building my capacity to look after myself, my family & friends and do my job? Is this worth the amount of time and energy I’m putting in? Do I have evidence for this? I don’t have to base decisions purely on feelings/opinions: I have the skills to obtain, analyse and interpret data. Is this fact or opinion? What are the facts? Am I overthinking this? Can I put a confidence interval on this? Build documents from code and data with Saga !!! tldr “TL;DR” I’ve made Saga, a thing for compiling documents by combining code and data with templates. What is it? Saga is a very simple command-line tool that reads in one or more data files, runs one or more scripts, then passes the results into a template to produce a final output document. It enables you to maintain a clean separation between data, logic and presentation and produce data-based documents that can easily be updated. That allows the flow of data through the document to be easily understood, a cornerstone of reproducible analysis. You run it like this: saga build -d data.yaml -d other_data.yaml \ -s analysis.py -t report.md.tmpl \ -O report.md Any scripts specified with -s will have access to the data in local variables, and any changes to local variables in a script will be retained when everything is passed to the template for rendering.
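To make that concrete, here’s a deliberately tiny, made-up example — none of these file contents come from the real sagadoc repository, and I’m assuming (as the description above suggests) that the top-level keys in the data files become local variable names. Given a data file data.yaml:

```yaml
# Invented data, purely for illustration
quarters: [12, 18, 9, 21]
```

and a script analysis.py that sees that data as local variables:

```python
# `quarters` comes from data.yaml; `mean` will be visible to the template
mean = sum(quarters) / len(quarters)
```

a Mako template report.md.tmpl can then drop the results into the prose:

```
Mean sales per quarter: ${round(mean, 1)}
```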
For debugging, you can also do: saga dump -d data.yaml -d other_data.yaml -s analysis.py which will print out the full environment that would be passed to your template with saga build. Features Right now this is a really early version. It does the job but I have lots of ideas for features to add if I ever have time. At present it does the following: Reads data from one or more YAML files Transforms data with one or more Python scripts Renders a template in Mako format Works with any plain-text output format, including Markdown, LaTeX and HTML Use cases Write reproducible reports & papers based on machine-readable data Separate presentation from content in any document, e.g. your CV (example coming soon) Yours here? Get it! I haven’t released this on PyPI yet, but all the code is available on GitHub to try out. If you have pipenv installed (and if you use Python you should!), you can try it out in an isolated virtual environment by doing: git clone https://github.com/jezcope/sagadoc.git cd sagadoc pipenv install pipenv run saga or you can set up for development and run some tests: pipenv install --dev pipenv run pytest Why? Like a lot of people, I have to produce reports for work, often containing statistics computed from data. Although these generally aren’t academic research papers, I see no reason not to aim for a similar level of reproducibility: after all, if I’m telling other people to do it, I’d better take my own advice! A couple of times now I’ve done this by writing a template that holds the text of the report and placeholders for values, along with a Python script that reads in the data, calculates the statistics I want and completes the template. This is valuable for two main reasons: If anyone wants to know how I processed the data and calculated those statistics, it’s all there: no need to try and remember and reproduce a series of button clicks in Excel; If the data or calculations change, I just need to update the relevant part and run it again, and all the relevant parts of the document will be updated. This is particularly important if changing a single data value requires recalculation of dozens of tables, charts, etc. It also gives me the potential to factor out and reuse bits of code in the future, add tests and version control everything. Now that I’ve done this more than once (and it seems likely I’ll do it again) it makes sense to package that script up in a more portable form so I don’t have to write it over and over again (or, shock horror, copy & paste it!). It saves time, and gives others the possibility to make use of it. Prior art I’m not the first person to think of this, but I couldn’t find anything that did exactly what I needed. Several tools will let you interweave code and prose, including the results of evaluating each code snippet in the document: chief among these are Jupyter and Rmarkdown. There are also tools that let you write code in the order that makes most sense to read and then rearrange it into the right order to execute, so-called literate programming. The original tool for this is the venerable noweb. Sadly there is very little that combines both of these and allows you to insert the results of various calculations at arbitrary points in a document, independent of the order of either presenting or executing the code. The only two that I’m aware of are Dexy and org-mode. Unfortunately, Dexy currently only works on Legacy Python (/Python 2) and org-mode requires emacs (which is fine but not exactly portable).
Rmarkdown comes close and supports a range of languages but the full feature set is only available with R. Actually, my ideal solution is org-mode without the emacs dependency, because that’s the most flexible solution; maybe one day I’ll have both the time and skill to implement that. It’s also possible I might be able to figure out Dexy’s internals to add what I want to it, but until then Saga does the job! Future work There are lots of features that I’d still like to add when I have time: Some actual documentation! And examples! More data formats (e.g. CSV, JSON, TOML) More languages (e.g. R, Julia) Fetching remote data over http Caching of intermediate results to speed up rebuilds For now, though, I’d love for you to try it out and let me know what you think! As ever, comment here, tweet me or start an issue on GitHub. Why try Rust for scientific computing? When you’re writing analysis code, Python (or R, or JavaScript, or …) is usually the right choice. These high-level languages are set up to make you as productive as possible, and common tasks like array manipulation have been well optimised. However, sometimes you just can’t get enough speed and need to turn to a lower-level compiled language. Often that will be C, C++ or Fortran, but I thought I’d do a short post on why I think you should consider Rust. One of my goals for 2017’s Advent of Code was to learn a modern, memory-safe, statically-typed language. I now know that there are quite a lot of options in this space, but two seem to stand out: Go & Rust. I gave both of them a try, and although I’ll probably go back to give Go a more thorough test at some point I found I got quite hooked on Rust. Both languages, though young, are definitely production-ready. Servo, the core of the new Firefox browser, is entirely written in Rust. In fact, Mozilla have been trying to rewrite the rendering core in C for nearly a decade, and switching to Rust let them get it done in just a couple of years. !!! tldr “TL;DR” - It’s fast: competitive with idiomatic C/C++, and no garbage-collection overhead - It’s harder to write buggy code, and compiler errors are actually helpful - It’s C-compatible: you can call into Rust code anywhere you’d call into C, call C/C++ from Rust, and incrementally replace C/C++ code with Rust - It has sensible modern syntax that makes your code clearer and more concise - Support for scientific computing are getting better all the time (matrix algebra libraries, built-in SIMD, safe concurrency) - It has a really friendly and active community - It’s production-ready: Servo, the new rendering core in Firefox, is built entirely in Rust Performance To start with, as a compiled language Rust executes much faster than a (pseudo-)interpreted language like Python or R; the price you pay for this is time spent compiling during development. However, having a compile step also allows the language to enforce certain guarantees, such as type-correctness and memory safety, which between them prevent whole classes of bugs from even being possible. Unlike Go (which, like many higher-level languages, uses a garbage collector), Rust handles memory safety at compile time through the concepts of ownership and borrowing. These can take some getting used to and were a big source of frustration when I was first figuring out the language, but ultimately contribute to Rust’s reliably-fast performance. 
Performance can be unpredictable in a garbage-collected language because you can’t be sure when the GC is going to run and you need to understand it really well to stand a chance of optimising it if becomes a problem. On the other hand, code that has the potential to be unsafe will result in compilation errors in Rust. There are a number of benchmarks (example) that show Rust’s performance on a par with idiomatic C & C++ code, something that very few languages can boast. Helpful error messages Because beginner Rust programmers often get compile errors, it’s really important that those errors are easy to interpret and fix, and Rust is great at this. Not only does it tell you what went wrong, but wherever possible it prints out your code annotated with arrows to show exactly where the error is, and makes specific suggestions how to fix the error which usually turn out to be correct. It also has a nice suite of warnings (things that don’t cause compilation to fail but may indicate bugs) that are just as informative, and this can be extended even further by using the clippy linting tool to further analyse your code. warning: unused variable: `y` --> hello.rs:3:9 | 3 | let y = x; | ^ | = note: #[warn(unused_variables)] on by default = note: to avoid this warning, consider using `_y` instead Easy to integrate with other languages If you’re like me, you’ll probably only use a low-level language for performance-critical code that you can call from a high-level language, and this is an area where Rust shines. Most programmers will turn to C, C++ or Fortran for this because they have a well established ABI (Application Binary Interface) which can be understood by languages like Python and R1. In Rust, it’s trivial to make a C-compatible shared library, and the standard library includes extra features for working with C types. That also means that existing C code can be incrementally ported to Rust: see remacs for an example. On top of this, there are projects like rust-cpython and PyO3 which provide macros and structures that wrap the Python C API to let you build Python modules in Rust with minimal glue code; rustr does a similar job for R. Nice language features Rust has some really nice features, which let you write efficient, concise and correct code. Several feel particularly comfortable as they remind me of similar things available in Haskell, including: Enums, a super-powered combination of C enums and unions (similar to Haskell’s algebraic data types) that enable some really nice code with no runtime cost Generics and traits that let you get more done with less code Pattern matching, a kind of case statement that lets you extract parts of structs, tuples & enums and do all sorts of other clever things Lazy computation based on an iterator pattern, for efficient processing of lists of things: you can do for item in list { ... } instead of the C-style use of an index2, or you can use higher-order functions like map and filter Functions/closures as first-class citizens Scientific computing Although it’s a general-purpose language and not designed specifically for scientific computing, Rust’s support is improving all the time. There are some interesting matrix algebra libraries available, and built-in SIMD is incoming. The memory safety features also work to ensure thread safety, so it’s harder to write concurrency bugs. You should be able to use your favourite MPI implementation too, and there’s at least one attempt to portably wrap MPI in a more Rust-like way. 
Active development and friendly community One of the things you notice straight away is how active and friendly the Rust community is. There are several IRC channels on irc.mozilla.org including #rust-beginners, which is a great place to get help. The compiler is under constant but carefully-managed development, so that new features are landing all the time but without breaking existing code. And the fabulous Cargo build tool and crates.io are enabling the rapid growth of a healthy ecosystem of open source libraries that you can use to write less code yourself. Summary So, next time you need a compiled language to speed up hotspots in your code, try Rust. I promise you won’t regret it! Julia actually allows you to call C and Fortran functions as a first-class language feature ↩︎ Actually, since C++11 there’s for (auto item : list) { ... } but still… ↩︎ Reflections on #aoc2017 Trees reflected in a lake Joshua Reddekopp on Unsplash It seems like ages ago, but way back in November I committed to completing Advent of Code. I managed it all, and it was fun! All of my code is available on GitHub if you’re interested in seeing what I did, and I managed to get out a blog post for every one with a bit more commentary, which you can see in the series list above. How did I approach it? I’ve not really done any serious programming challenges before. I don’t get to write a lot of code at the moment, so all I wanted from AoC was an excuse to do some proper problem-solving. I never really intended to take a polyglot approach, though I did think that I might use mainly Python with a bit of Haskell. In the end, though, I used: Python (×12); Haskell (×7); Rust (×4); Go; C++; Ruby; Julia; and Coconut. For the most part, my priorities were getting the right answer, followed by writing readable code. I didn’t specifically focus on performance but did try to avoid falling into traps that I knew about. What did I learn? I found Python the easiest to get on with: it’s the language I know best and although I can’t always remember exact method names and parameters I know what’s available and where to look to remind myself, as well as most of the common idioms and some performance traps to avoid. Python was therefore the language that let me focus most on solving the problem itself. C++ and Ruby were more challenging, and it was harder to write good idiomatic code but I can still remember quite a lot. Haskell I haven’t used since university, and just like back then I really enjoyed working out how to solve problems in a functional style while still being readable and efficient (not always something I achieved…). I learned a lot about core Haskell concepts like monads & functors, and I’m really amazed by the way the Haskell community and ecosystem has grown up in the last decade. I also wanted to learn at least one modern, memory-safe compiled language, so I tried both Go and Rust. Both seem like useful languages, but Rust really intrigued me with its conceptual similarities to both Haskell and C++ and its promise of memory safety without a garbage collector. I struggled a lot initially with the “borrow checker” (the component that enforces memory safety at compile time) but eventually started thinking in terms of ownership and lifetimes after which things became easier. The Rust community seems really vibrant and friendly too. What next? I really want to keep this up, so I’m going to look out some more programming challenges (Project Euler looks interesting). 
It turns out there’s a regular Code Dojo meetup in Leeds, so hopefully I’ll try that out too. I’d like to do more realistic data-science stuff, so I’ll be taking a closer look at stuff like Kaggle too, and figuring out how to do a bit more analysis at work. I’m also feeling motivated to find an open source project to contribute to and/or release a project of my own, so we’ll see if that goes anywhere! I’ve always found the advice to “scratch your own itch” difficult to follow because everything I think of myself has already been done better. Most of the projects I use enough to want to contribute to tend to be pretty well developed with big communities and any bugs that might be accessible to me will be picked off and fixed before I have a chance to get started. Maybe it’s time to get over myself and just reimplement something that already exists, just for the fun of it! The Halting Problem — Python — #adventofcode Day 25 Today’s challenge, takes us back to a bit of computing history: a good old-fashioned Turing Machine. → Full code on GitHub !!! commentary Today’s challenge was a nice bit of nostalgia, taking me back to my university days learning about the theory of computing. Turing Machines are a classic bit of computing theory, and are provably able to compute any value that is possible to compute: a value is computable if and only if a Turing Machine can be written that computes it (though in practice anything non-trivial is mind-bendingly hard to write as a TM). A bit of a library-fest today, compared to other days! from collections import deque, namedtuple from collections.abc import Iterator from tqdm import tqdm import re import fileinput as fi These regular expressions are used to parse the input that defines the transition table for the machine. RE_ISTATE = re.compile(r'Begin in state (?P<state>\w+)\.') RE_RUNTIME = re.compile( r'Perform a diagnostic checksum after (?P<steps>\d+) steps.') RE_STATETRANS = re.compile( r"In state (?P<state>\w+):\n" r" If the current value is (?P<read0>\d+):\n" r" - Write the value (?P<write0>\d+)\.\n" r" - Move one slot to the (?P<move0>left|right).\n" r" - Continue with state (?P<next0>\w+).\n" r" If the current value is (?P<read1>\d+):\n" r" - Write the value (?P<write1>\d+)\.\n" r" - Move one slot to the (?P<move1>left|right).\n" r" - Continue with state (?P<next1>\w+).") MOVE = {'left': -1, 'right': 1} A namedtuple to provide some sugar when using a transition rule. Rule = namedtuple('Rule', 'write move next_state') The TuringMachine class does all the work. class TuringMachine: def __init__(self, program=None): self.tape = deque() self.transition_table = {} self.state = None self.runtime = 0 self.steps = 0 self.pos = 0 self.offset = 0 if program is not None: self.load(program) def __str__(self): return f"Current: {self.state}; steps: {self.steps} of {self.runtime}" Some jiggery-pokery to allow us to use self[pos] to reference an infinite tape. def __getitem__(self, i): i += self.offset if i < 0 or i >= len(self.tape): return 0 else: return self.tape[i] def __setitem__(self, i, x): i += self.offset if i >= 0 and i < len(self.tape): self.tape[i] = x elif i == -1: self.tape.appendleft(x) self.offset += 1 elif i == len(self.tape): self.tape.append(x) else: raise IndexError('Tried to set position off end of tape') Parse the program and set up the transtion table. 
def load(self, program): if isinstance(program, Iterator): program = ''.join(program) match = RE_ISTATE.search(program) self.state = match['state'] match = RE_RUNTIME.search(program) self.runtime = int(match['steps']) for match in RE_STATETRANS.finditer(program): self.transition_table[match['state']] = { int(match['read0']): Rule(write=int(match['write0']), move=MOVE[match['move0']], next_state=match['next0']), int(match['read1']): Rule(write=int(match['write1']), move=MOVE[match['move1']], next_state=match['next1']), } Run the program for the required number of steps (given by self.runtime). tqdm isn’t in the standard library but it should be: it shows a lovely text-mode progress bar as we go. def run(self): for _ in tqdm(range(self.runtime), desc="Running", unit="steps", unit_scale=True): read = self[self.pos] rule = self.transition_table[self.state][read] self[self.pos] = rule.write self.pos += rule.move self.state = rule.next_state Calculate the “diagnostic checksum” required for the answer. @property def checksum(self): return sum(self.tape) Aaand GO! machine = TuringMachine(fi.input()) machine.run() print("Checksum:", machine.checksum) Electromagnetic Moat — Rust — #adventofcode Day 24 Today’s challenge, the penultimate, requires us to build a bridge capable of reaching across to the CPU, our final destination. → Full code on GitHub !!! commentary We have a finite number of components that fit together in a restricted way from which to build a bridge, and we have to work out both the strongest and the longest bridge we can build. The most obvious way to do this is to recursively build every possible bridge and select the best, but that’s an O(n!) algorithm that could blow up quickly, so might as well go with a nice fast language! Might have to try this in Haskell too, because it’s the type of algorithm that lends itself naturally to a pure functional approach. I feel like I've applied some of the things I've learned in previous challenges I used Rust for, and spent less time mucking about with ownership, and made better use of various language features, including structs and iterators. I'm rather pleased with how my learning of this language is progressing. I'm definitely overusing `Option.unwrap` at the moment though: this is a lazy way to deal with `Option` results and will panic if the result is not what's expected. I'm not sure whether I need to be cloning the components `Vector` either, or whether I could just be passing iterators around. First, we import some bits of standard library and define some data types. The BridgeResult struct lets us use the same algorithm for both parts of the challenge and simply change the value used to calculate the maximum. 
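For context, the puzzle input is simply one component per line, each giving its two port sizes separated by a slash; a made-up handful of lines in that format might be:

```
0/2
2/2
2/3
3/4
0/1
10/1
```

Component::from_str below parses exactly these.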
use std::io; use std::fmt; use std::io::BufRead; #[derive(Debug, Copy, Clone, PartialEq, Eq, Hash)] struct Component(u8, u8); #[derive(Debug, Copy, Clone, Default)] struct BridgeResult { strength: u16, length: u16, } impl Component { fn from_str(s: &str) -> Component { let parts: Vec<&str> = s.split('/').collect(); assert!(parts.len() == 2); Component(parts[0].parse().unwrap(), parts[1].parse().unwrap()) } fn fits(self, port: u8) -> bool { self.0 == port || self.1 == port } fn other_end(self, port: u8) -> u8 { if self.0 == port { return self.1; } else if self.1 == port { return self.0; } else { panic!("{} doesn't fit port {}", self, port); } } fn strength(self) -> u16 { self.0 as u16 + self.1 as u16 } } impl fmt::Display for BridgeResult { fn fmt(&self, f: &mut fmt::Formatter) -> fmt::Result { write!(f, "(S: {}, L: {})", self.strength, self.length) } } best_bridge calculates the length and strength of the “best” bridge that can be built from the remaining components and fits the required port. Whether this is based on strength or length is given by the key parameter, which is passed to Iter.max_by_key. fn best_bridge<F>(port: u8, key: &F, components: &Vec<Component>) -> Option<BridgeResult> where F: Fn(&BridgeResult) -> u16 { if components.len() == 0 { return None; } components.iter() .filter(|c| c.fits(port)) .map(|c| { let b = best_bridge(c.other_end(port), key, &components.clone().into_iter() .filter(|x| x != c).collect()) .unwrap_or_default(); BridgeResult{strength: c.strength() + b.strength, length: 1 + b.length} }) .max_by_key(key) } Now all that remains is to read the input and calculate the result. I was rather pleasantly surprised to find that in spite of my pessimistic predictions about efficiency, when compiled with optimisations turned on this terminates in less than 1s on my laptop. fn main() { let stdin = io::stdin(); let components: Vec<_> = stdin.lock() .lines() .map(|l| Component::from_str(&l.unwrap())) .collect(); match best_bridge(0, &|b: &BridgeResult| b.strength, &components) { Some(b) => println!("Strongest bridge is {}", b), None => println!("No strongest bridge found") }; match best_bridge(0, &|b: &BridgeResult| b.length, &components) { Some(b) => println!("Longest bridge is {}", b), None => println!("No longest bridge found") }; } Coprocessor Conflagration — Haskell — #adventofcode Day 23 Today’s challenge requires us to understand why a coprocessor is working so hard to perform an apparently simple calculation. → Full code on GitHub !!! commentary Today’s problem is based on an assembly-like language very similar to day 18, so I went back and adapted my code from that, which works well for the first part. I’ve also incorporated some advice from /r/haskell, and cleaned up all warnings shown by the -Wall compiler flag and the hlint tool. Part 2 requires the algorithm to run with much larger inputs, and since some analysis shows that it's an `O(n^3)` algorithm it gets intractible pretty fast. There are several approaches to this. First up, if you have a fast enough processor and an efficient enough implementation I suspect that the simulation would probably terminate eventually, but that would likely still take hours: not good enough. I also thought about doing some peephole optimisations on the instructions, but the last time I did compiler optimisation was my degree so I wasn't really sure where to start. What I ended up doing was actually analysing the input code by hand to figure out what it was doing, and then just doing that calculation in a sensible way. 
I'd like to say I managed this on my own (and I ike to think I would have) but I did get some tips on [/r/adventofcode](https://reddit.com/r/adventofcode). The majority of this code is simply a cleaned-up version of day 18, with some tweaks to accommodate the different instruction set: module Main where import qualified Data.Vector as V import qualified Data.Map.Strict as M import Control.Monad.State.Strict import Text.ParserCombinators.Parsec hiding (State) type Register = Char type Value = Int type Argument = Either Value Register data Instruction = Set Register Argument | Sub Register Argument | Mul Register Argument | Jnz Argument Argument deriving Show type Program = V.Vector Instruction data Result = Cont | Halt deriving (Eq, Show) type Registers = M.Map Char Int data Machine = Machine { dRegisters :: Registers , dPtr :: !Int , dMulCount :: !Int , dProgram :: Program } instance Show Machine where show d = show (dRegisters d) ++ " @" ++ show (dPtr d) ++ " ×" ++ show (dMulCount d) defaultMachine :: Machine defaultMachine = Machine M.empty 0 0 V.empty type MachineState = State Machine program :: GenParser Char st Program program = do instructions <- endBy instruction eol return $ V.fromList instructions where instruction = try (regOp "set" Set) <|> regOp "sub" Sub <|> regOp "mul" Mul <|> jump "jnz" Jnz regOp n c = do string n >> spaces val1 <- oneOf "abcdefgh" secondArg c val1 jump n c = do string n >> spaces val1 <- regOrVal secondArg c val1 secondArg c val1 = do spaces val2 <- regOrVal return $ c val1 val2 regOrVal = register <|> value register = do name <- lower return $ Right name value = do val <- many $ oneOf "-0123456789" return $ Left $ read val eol = char '\n' parseProgram :: String -> Either ParseError Program parseProgram = parse program "" getReg :: Char -> MachineState Int getReg r = do st <- get return $ M.findWithDefault 0 r (dRegisters st) putReg :: Char -> Int -> MachineState () putReg r v = do st <- get let current = dRegisters st new = M.insert r v current put $ st { dRegisters = new } modReg :: (Int -> Int -> Int) -> Char -> Argument -> MachineState () modReg op r v = do u <- getReg r v' <- getRegOrVal v putReg r (u `op` v') incPtr getRegOrVal :: Argument -> MachineState Int getRegOrVal = either return getReg addPtr :: Int -> MachineState () addPtr n = do st <- get put $ st { dPtr = n + dPtr st } incPtr :: MachineState () incPtr = addPtr 1 execInst :: Instruction -> MachineState () execInst (Set reg val) = do newVal <- getRegOrVal val putReg reg newVal incPtr execInst (Mul reg val) = do result <- modReg (*) reg val st <- get put $ st { dMulCount = 1 + dMulCount st } return result execInst (Sub reg val) = modReg (-) reg val execInst (Jnz val1 val2) = do test <- getRegOrVal val1 jump <- if test /= 0 then getRegOrVal val2 else return 1 addPtr jump execNext :: MachineState Result execNext = do st <- get let prog = dProgram st p = dPtr st if p >= length prog then return Halt else do execInst (prog V.! 
p) return Cont runUntilTerm :: MachineState () runUntilTerm = do result <- execNext unless (result == Halt) runUntilTerm This implements the actual calculation: the number of non-primes between (for my input) 107900 and 124900: optimisedCalc :: Int -> Int -> Int -> Int optimisedCalc a b k = sum $ map (const 1) $ filter notPrime [a,a+k..b] where notPrime n = elem 0 $ map (mod n) [2..(floor $ sqrt (fromIntegral n :: Double))] main :: IO () main = do input <- getContents case parseProgram input of Right prog -> do let c = defaultMachine { dProgram = prog } (_, c') = runState runUntilTerm c putStrLn $ show (dMulCount c') ++ " multiplications made" putStrLn $ "Calculation result: " ++ show (optimisedCalc 107900 124900 17) Left e -> print e Sporifica Virus — Rust — #adventofcode Day 22 Today’s challenge has us helping to clean up (or spread, I can’t really tell) an infection of the “sporifica” virus. → Full code on GitHub !!! commentary I thought I’d have another play with Rust, as its Haskell-like features resonate with me at the moment. I struggled quite a lot with the Rust concepts of ownership and borrowing, and this is a cleaned-up version of the code based on some good advice from the folks on /r/rust. use std::io; use std::env; use std::io::BufRead; use std::collections::HashMap; #[derive(PartialEq, Clone, Copy, Debug)] enum Direction {Up, Right, Down, Left} #[derive(PartialEq, Clone, Copy, Debug)] enum Infection {Clean, Weakened, Infected, Flagged} use self::Direction::*; use self::Infection::*; type Grid = HashMap<(isize, isize), Infection>; fn turn_left(d: Direction) -> Direction { match d {Up => Left, Right => Up, Down => Right, Left => Down} } fn turn_right(d: Direction) -> Direction { match d {Up => Right, Right => Down, Down => Left, Left => Up} } fn turn_around(d: Direction) -> Direction { match d {Up => Down, Right => Left, Down => Up, Left => Right} } fn make_move(d: Direction, x: isize, y: isize) -> (isize, isize) { match d { Up => (x-1, y), Right => (x, y+1), Down => (x+1, y), Left => (x, y-1), } } fn basic_step(grid: &mut Grid, x: &mut isize, y: &mut isize, d: &mut Direction) -> usize { let mut infect = 0; let current = match grid.get(&(*x, *y)) { Some(v) => *v, None => Clean, }; if current == Infected { *d = turn_right(*d); } else { *d = turn_left(*d); infect = 1; }; grid.insert((*x, *y), match current { Clean => Infected, Infected => Clean, x => panic!("Unexpected infection state {:?}", x), }); let new_pos = make_move(*d, *x, *y); *x = new_pos.0; *y = new_pos.1; infect } fn nasty_step(grid: &mut Grid, x: &mut isize, y: &mut isize, d: &mut Direction) -> usize { let mut infect = 0; let new_state: Infection; let current = match grid.get(&(*x, *y)) { Some(v) => *v, None => Infection::Clean, }; match current { Clean => { *d = turn_left(*d); new_state = Weakened; }, Weakened => { new_state = Infected; infect = 1; }, Infected => { *d = turn_right(*d); new_state = Flagged; }, Flagged => { *d = turn_around(*d); new_state = Clean; } }; grid.insert((*x, *y), new_state); let new_pos = make_move(*d, *x, *y); *x = new_pos.0; *y = new_pos.1; infect } fn virus_infect<F>(mut grid: Grid, mut step: F, mut x: isize, mut y: isize, mut d: Direction, n: usize) -> usize where F: FnMut(&mut Grid, &mut isize, &mut isize, &mut Direction) -> usize, { (0..n).map(|_| step(&mut grid, &mut x, &mut y, &mut d)) .sum() } fn main() { let args: Vec<String> = env::args().collect(); let n_basic: usize = args[1].parse().unwrap(); let n_nasty: usize = args[2].parse().unwrap(); let stdin = io::stdin(); let lines: 
Vec<String> = stdin.lock() .lines() .map(|x| x.unwrap()) .collect(); let mut grid: Grid = HashMap::new(); let x0 = (lines.len() / 2) as isize; let y0 = (lines[0].len() / 2) as isize; for (i, line) in lines.iter().enumerate() { for (j, c) in line.chars().enumerate() { grid.insert((i as isize, j as isize), match c {'#' => Infected, _ => Clean}); } } let basic_steps = virus_infect(grid.clone(), basic_step, x0, y0, Up, n_basic); println!("Basic: infected {} times", basic_steps); let nasty_steps = virus_infect(grid, nasty_step, x0, y0, Up, n_nasty); println!("Nasty: infected {} times", nasty_steps); } Fractal Art — Python — #adventofcode Day 21 Today’s challenge asks us to assist an artist building fractal patterns from a rulebook. → Full code on GitHub !!! commentary Another fairly straightforward algorithm: the really tricky part was breaking the pattern up into chunks and rejoining it again. I could probably have done that more efficiently, and would have needed to if I had to go for a few more iterations and the grid grows with every iteration and gets big fast. Still behind on the blog posts… import fileinput as fi from math import sqrt from functools import reduce, partial import operator INITIAL_PATTERN = ((0, 1, 0), (0, 0, 1), (1, 1, 1)) DECODE = ['.', '#'] ENCODE = {'.': 0, '#': 1} concat = partial(reduce, operator.concat) def rotate(p): size = len(p) return tuple(tuple(p[i][j] for i in range(size)) for j in range(size - 1, -1, -1)) def flip(p): return tuple(p[i] for i in range(len(p) - 1, -1, -1)) def permutations(p): yield p yield flip(p) for _ in range(3): p = rotate(p) yield p yield flip(p) def print_pattern(p): print('-' * len(p)) for row in p: print(' '.join(DECODE[x] for x in row)) print('-' * len(p)) def build_pattern(s): return tuple(tuple(ENCODE[c] for c in row) for row in s.split('/')) def build_pattern_book(lines): book = {} for line in lines: source, target = line.strip().split(' => ') for rotation in permutations(build_pattern(source)): book[rotation] = build_pattern(target) return book def subdivide(pattern): size = 2 if len(pattern) % 2 == 0 else 3 n = len(pattern) // size return (tuple(tuple(pattern[i][j] for j in range(y * size, (y + 1) * size)) for i in range(x * size, (x + 1) * size)) for x in range(n) for y in range(n)) def rejoin(parts): n = int(sqrt(len(parts))) size = len(parts[0]) return tuple(concat(parts[i + k][j] for i in range(n)) for k in range(0, len(parts), n) for j in range(size)) def enhance_once(p, book): return rejoin(tuple(book[part] for part in subdivide(p))) def enhance(p, book, n, progress=None): for _ in range(n): p = enhance_once(p, book) return p book = build_pattern_book(fi.input()) intermediate_pattern = enhance(INITIAL_PATTERN, book, 5) print("After 5 iterations:", sum(sum(row) for row in intermediate_pattern)) final_pattern = enhance(intermediate_pattern, book, 13) print("After 18 iterations:", sum(sum(row) for row in final_pattern)) Particle Swarm — Python — #adventofcode Day 20 Today’s challenge finds us simulating the movements of particles in space. → Full code on GitHub !!! commentary Back to Python for this one, another relatively straightforward simulation, although it’s easier to calculate the answer to part 1 than to simulate. import fileinput as fi import numpy as np import re First we parse the input into 3 2D arrays: using numpy enables us to do efficient arithmetic across the whole set of particles in one go. 
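Each particle occupies one line of the input, giving its position, velocity and acceleration as three comma-separated integer triples; an illustrative line (made-up values) looks like:

```
p=<-317,1413,1507>, v=<19,-102,-108>, a=<1,-3,-3>
```

The regular expression below captures those nine signed integers.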
PARTICLE_RE = re.compile(r'p=<(-?\d+),(-?\d+),(-?\d+)>, ' r'v=<(-?\d+),(-?\d+),(-?\d+)>, ' r'a=<(-?\d+),(-?\d+),(-?\d+)>') def parse_input(lines): x = [] v = [] a = [] for l in lines: m = PARTICLE_RE.match(l) x.append([int(x) for x in m.group(1, 2, 3)]) v.append([int(x) for x in m.group(4, 5, 6)]) a.append([int(x) for x in m.group(7, 8, 9)]) return (np.arange(len(x)), np.array(x), np.array(v), np.array(a)) i, x, v, a = parse_input(fi.input()) Now we can calculate which particle will be closest to the origin in the long-term: this is simply the particle with the smallest acceleration. It turns out that several have the same acceleration, so of these, the one we want is the one with the lowest starting velocity. This is only complicated slightly by the need to get the number of the particle rather than its other information, hence the need to use numpy.argmin. a_abs = np.sum(np.abs(a), axis=1) a_min = np.min(a_abs) a_i = np.squeeze(np.argwhere(a_abs == a_min)) closest = i[a_i[np.argmin(np.sum(np.abs(v[a_i]), axis=1))]] print("Closest: ", closest) Now we define functions to simulate collisions between particles. We have to use the return_index and return_counts options to numpy.unique to be able to get rid of all the duplicate positions (the standard usage is to keep one of each duplicate). def resolve_collisions(x, v, a): (_, i, c) = np.unique(x, return_index=True, return_counts=True, axis=0) i = i[c == 1] return x[i], v[i], a[i] The termination criterion for this loop is an interesting aspect: the most robust to my mind seems to be that eventually the particles will end up sorted in order of their initial acceleration in terms of distance from the origin, so you could check for this but that’s pretty computationally expensive. In the end, all that was needed was a bit of trial and error: terminating arbitrarily after 1,000 iterations seems to work! In fact, all the collisions are over after about 40 iterations for my input but there was always the possibility that two particles with very slightly different accelerations would eventually intersect much later. def simulate_collisions(x, v, a, iterations=1000): for _ in range(iterations): v += a x += v x, v, a = resolve_collisions(x, v, a) return len(x) print("Remaining particles: ", simulate_collisions(x, v, a)) A Series of Tubes — Rust — #adventofcode Day 19 Today’s challenge asks us to help a network packet find its way. → Full code on GitHub !!! commentary Today’s challenge was fairly straightforward, following an ASCII art path, so I thought I’d give Rust another try. I’m a bit behind on the blog posts, so I’m presenting the code below without any further commentary. I’m not really convinced this is good idiomatic Rust, and it was interesting turning a set of strings into a 2D array of characters because there are both u8 (byte) and char types to deal with. use std::io; use std::io::BufRead; const ALPHA: &'static str = "ABCDEFGHIJKLMNOPQRSTUVWXYZ"; fn change_direction(dia: &Vec<Vec<u8>>, x: usize, y: usize, dx: &mut i32, dy: &mut i32) { assert_eq!(dia[x][y], b'+'); if dx.abs() == 1 { *dx = 0; if y + 1 < dia[x].len() && (dia[x][y + 1] == b'-' || ALPHA.contains(dia[x][y + 1] as char)) { *dy = 1; } else if dia[x][y - 1] == b'-' || ALPHA.contains(dia[x][y - 1] as char) { *dy = -1; } else { panic!("Huh? 
{} {}", dia[x][y+1] as char, dia[x][y-1] as char); } } else { *dy = 0; if x + 1 < dia.len() && (dia[x + 1][y] == b'|' || ALPHA.contains(dia[x + 1][y] as char)) { *dx = 1; } else if dia[x - 1][y] == b'|' || ALPHA.contains(dia[x - 1][y] as char) { *dx = -1; } else { panic!("Huh?"); } } } fn follow_route(dia: Vec<Vec<u8>>) -> (String, i32) { let mut x: i32 = 0; let mut y: i32; let mut dx: i32 = 1; let mut dy: i32 = 0; let mut result = String::new(); let mut steps = 1; match dia[0].iter().position(|x| *x == b'|') { Some(i) => y = i as i32, None => panic!("Could not find '|' in first row"), } loop { x += dx; y += dy; match dia[x as usize][y as usize] { b'A'...b'Z' => result.push(dia[x as usize][y as usize] as char), b'+' => change_direction(&dia, x as usize, y as usize, &mut dx, &mut dy), b' ' => return (result, steps), _ => (), } steps += 1; } } fn main() { let stdin = io::stdin(); let lines: Vec<Vec<u8>> = stdin.lock().lines() .map(|l| l.unwrap().into_bytes()) .collect(); let result = follow_route(lines); println!("Route: {}", result.0); println!("Steps: {}", result.1); } Duet — Haskell — #adventofcode Day 18 Today’s challenge introduces a type of simplified assembly language that includes instructions for message-passing. First we have to simulate a single program (after humorously misinterpreting the snd and rcv instructions as “sound” and “recover”), but then we have to simulate two concurrent processes and the message passing between them. → Full code on GitHub !!! commentary Well, I really learned a lot from this one! I wanted to get to grips with more complex stuff in Haskell and this challenge seemed like an excellent opportunity to figure out a) parsing with the parsec library and b) using the State monad to keep the state of the simulator. As it turned out, that wasn't all I'd learned: I also ran into an interesting situation whereby lazy evaluation was creating an infinite loop where there shouldn't be one, so I also had to learn how to selectively force strict evaluation of values. I'm pretty sure this isn't the best Haskell in the world, but I'm proud of it. First we have to import a bunch of stuff to use later, but also notice the pragma on the first line which instructs the compiler to enable the BangPatterns language extension, which will be important later. {-# LANGUAGE BangPatterns #-} module Main where import qualified Data.Vector as V import qualified Data.Map.Strict as M import Data.List import Data.Either import Data.Maybe import Control.Monad.State.Strict import Control.Monad.Loops import Text.ParserCombinators.Parsec hiding (State) First up we define the types that will represent the program code itself. data DuetVal = Reg Char | Val Int deriving Show type DuetQueue = [Int] data DuetInstruction = Snd DuetVal | Rcv DuetVal | Jgz DuetVal DuetVal | Set DuetVal DuetVal | Add DuetVal DuetVal | Mul DuetVal DuetVal | Mod DuetVal DuetVal deriving Show type DuetProgram = V.Vector DuetInstruction Next we define the types to hold the machine state, which includes: registers, instruction pointer, send & receive buffers and the program code, plus a counter of the number of sends made (to provide the solution). 
type DuetRegisters = M.Map Char Int data Duet = Duet { dRegisters :: DuetRegisters , dPtr :: Int , dSendCount :: Int , dRcvBuf :: DuetQueue , dSndBuf :: DuetQueue , dProgram :: DuetProgram } instance Show Duet where show d = show (dRegisters d) ++ " @" ++ show (dPtr d) ++ " S" ++ show (dSndBuf d) ++ " R" ++ show (dRcvBuf d) defaultDuet = Duet M.empty 0 0 [] [] V.empty type DuetState = State Duet program is a parser built on the cool parsec library to turn the program text into a Haskell format that we can work with, a Vector of instructions. Yes, using a full-blown parser is overkill here (it would be much simpler just to split each line on whitespace, but I wanted to see how Parsec works. I’m using Vector here because we need random access to the instruction list, which is much more efficient with Vector: O(1) compared with the O(n) of the built in Haskell list ([]) type. parseProgram applies the parser to a string and returns the result. program :: GenParser Char st DuetProgram program = do instructions <- endBy instruction eol return $ V.fromList instructions where instruction = try (oneArg "snd" Snd) <|> oneArg "rcv" Rcv <|> twoArg "set" Set <|> twoArg "add" Add <|> try (twoArg "mul" Mul) <|> twoArg "mod" Mod <|> twoArg "jgz" Jgz oneArg n c = do string n >> spaces val <- regOrVal return $ c val twoArg n c = do string n >> spaces val1 <- regOrVal spaces val2 <- regOrVal return $ c val1 val2 regOrVal = register <|> value register = do name <- lower return $ Reg name value = do val <- many $ oneOf "-0123456789" return $ Val $ read val eol = char '\n' parseProgram :: String -> Either ParseError DuetProgram parseProgram = parse program "" Next up we have some utility functions that sit in the DuetState monad we defined above and perform common manipulations on the state: getting/setting/updating registers, updating the instruction pointer and sending/receiving messages via the relevant queues. getReg :: Char -> DuetState Int getReg r = do st <- get return $ M.findWithDefault 0 r (dRegisters st) putReg :: Char -> Int -> DuetState () putReg r v = do st <- get let current = dRegisters st new = M.insert r v current put $ st { dRegisters = new } modReg :: (Int -> Int -> Int) -> Char -> DuetVal -> DuetState Bool modReg op r v = do u <- getReg r v' <- getRegOrVal v putReg r (u `op` v') incPtr return False getRegOrVal :: DuetVal -> DuetState Int getRegOrVal (Reg r) = getReg r getRegOrVal (Val v) = return v addPtr :: Int -> DuetState () addPtr n = do st <- get put $ st { dPtr = n + dPtr st } incPtr = addPtr 1 send :: Int -> DuetState () send v = do st <- get put $ st { dSndBuf = (dSndBuf st ++ [v]), dSendCount = dSendCount st + 1 } recv :: DuetState (Maybe Int) recv = do st <- get case dRcvBuf st of (x:xs) -> do put $ st { dRcvBuf = xs } return $ Just x [] -> return Nothing execInst implements the logic for each instruction. It returns False as long as the program can continue, but True if the program tries to receive from an empty buffer. 
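To make that concrete, a program in this little assembly language is just a list of lines like the following (a small illustrative example in the same format the parser above accepts):

```
set a 1
add a 2
mul a a
mod a 5
snd a
set a 0
rcv a
jgz a -1
```

execInst pattern-matches on the corresponding DuetInstruction values one at a time: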
execInst :: DuetInstruction -> DuetState Bool execInst (Set (Reg reg) val) = do newVal <- getRegOrVal val putReg reg newVal incPtr return False execInst (Mul (Reg reg) val) = modReg (*) reg val execInst (Add (Reg reg) val) = modReg (+) reg val execInst (Mod (Reg reg) val) = modReg mod reg val execInst (Jgz val1 val2) = do st <- get test <- getRegOrVal val1 jump <- if test > 0 then getRegOrVal val2 else return 1 addPtr jump return False execInst (Snd val) = do v <- getRegOrVal val send v st <- get incPtr return False execInst (Rcv (Reg r)) = do st <- get v <- recv handle v where handle :: Maybe Int -> DuetState Bool handle (Just x) = putReg r x >> incPtr >> return False handle Nothing = return True execInst x = error $ "execInst not implemented yet for " ++ show x execNext looks up the next instruction and executes it. runUntilWait runs the program until execNext returns True to signal the wait state has been reached. execNext :: DuetState Bool execNext = do st <- get let prog = dProgram st p = dPtr st if p >= length prog then return True else execInst (prog V.! p) runUntilWait :: DuetState () runUntilWait = do waiting <- execNext unless waiting runUntilWait runTwoPrograms handles the concurrent running of two programs, by running first one and then the other to a wait state, then swapping each program’s send buffer to the other’s receive buffer before repeating. If you look carefully, you’ll see a “bang” (!) before the two arguments of the function: runTwoPrograms !d0 !d1. Haskell is a lazy language and usually doesn’t evaluate a computation until you ask for a result, instead carrying around a “thunk” or plan for how to carry out the computation. Sometimes that can be a problem because the amount of memory your program is using can explode unnecessarily as a long computation turns into a large thunk which isn’t evaluated until the very end. That’s not the problem here though. What happens here without the bangs is another side-effect of laziness. The exit condition of this recursive function is that a deadlock has been reached: both programs are waiting to receive, but neither has sent anything, so neither can ever continue. The check for this is (null $ dSndBuf d0') && (null $ dSndBuf d1'). As long as the first program has something in its send buffer, the test fails without ever evaluating the second part, which means the result d1' of running the second program is never needed. The function immediately goes to the recursive case and tries to continue the first program again, which immediately returns because it’s still waiting to receive. The same thing happens again, and the result is that instead of running the second program to obtain something for the first to receive, we get into an infinite loop trying and failing to continue the first program. The bang forces both d0 and d1 to be evaluated at the point we recurse, which forces the rest of the computation: running the second program and swapping the send/receive buffers. With that, the evaluation proceeds correctly and we terminate with a result instead of getting into an infinite loop! 
runTwoPrograms :: Duet -> Duet -> (Int, Int) runTwoPrograms !d0 !d1 | (null $ dSndBuf d0') && (null $ dSndBuf d1') = (dSendCount d0', dSendCount d1') | otherwise = runTwoPrograms d0'' d1'' where (_, d0') = runState runUntilWait d0 (_, d1') = runState runUntilWait d1 d0'' = d0' { dSndBuf = [], dRcvBuf = dSndBuf d1' } d1'' = d1' { dSndBuf = [], dRcvBuf = dSndBuf d0' } All that remains to be done now is to run the programs and see how many messages were sent before the deadlock. main = do prog <- fmap (fromRight V.empty . parseProgram) getContents let d0 = defaultDuet { dProgram = prog, dRegisters = M.fromList [('p', 0)] } d1 = defaultDuet { dProgram = prog, dRegisters = M.fromList [('p', 1)] } (send0, send1) = runTwoPrograms d0 d1 putStrLn $ "Program 0 sent " ++ show send0 ++ " messages" putStrLn $ "Program 1 sent " ++ show send1 ++ " messages" Spinlock — Rust/Python — #adventofcode Day 17 In today’s challenge we deal with a monstrous whirlwind of a program, eating up CPU and memory in equal measure. → Full code on GitHub (and Python driver script) !!! commentary One of the things I wanted from AoC was an opportunity to try out some popular languages that I don’t currently know, including the memory-safe, strongly-typed compiled languages Go and Rust. Realistically though, I’m likely to continue doing most of my programming in Python, and use one of these other languages when it has better tools or I need the extra speed. In which case, what I really want to know is how I can call functions written in Go or Rust from Python. I thought I'd try Rust first, as it seems to be designed to be C-compatible and that makes it easy to call from Python using [`ctypes`](https://docs.python.org/3.6/library/ctypes.html). Part 1 was another straightforward simulation: translate what the "spinlock" monster is doing into code and run it. It was pretty obvious from the story of this challenge and experience of the last few days that this was going to be another one where the simulation is too computationally expensive for part two, which turns out to be correct. So, first thing to do is to implement the meat of the solution in Rust. spinlock solves the first part of the problem by doing exactly what the monster does. Since we only have to go up to 2017 iterations, this is very tractable. The last number we insert is 2017, so we just return the number immediately after that. #[no_mangle] pub extern fn spinlock(n: usize, skip: usize) -> i32 { let mut buffer: Vec<i32> = Vec::with_capacity(n+1); buffer.push(0); buffer.push(1); let mut pos = 1; for i in 2..n+1 { pos = (pos + skip + 1) % buffer.len(); buffer.insert(pos, i as i32); } pos = (pos + 1) % buffer.len(); return buffer[pos]; } For the second part, we have to do 50 million iterations instead, which is a lot. Given that every time you insert an item in the list it has to move up all the elements after that position, I’m pretty sure the algorithm is O(n^2), so it’s going to take a lot longer than 10,000ish times the first part. Thankfully, we don’t need to build the whole list, just keep track of where 0 is and what number is immediately after it. There may be a closed-form solution to simply calculate the result, but I couldn’t think of it and this is good enough. 
#[no_mangle] pub extern fn spinlock0(n: usize, skip: usize) -> i32 { let mut pos = 1; let mut pos_0 = 0; let mut after_0 = 1; for i in 2..n+1 { pos = (pos + skip + 1) % i; if pos == pos_0 + 1 { after_0 = i; } if pos <= pos_0 { pos_0 += 1; } } return after_0 as i32; } Now it’s time to call this code from Python. Notice the #[no_mangle] pragmas and pub extern declarations for each function above, which are required to make sure the functions are exported in a C-compatible way. We can build this into a shared library like this: rustc --crate-type=cdylib -o spinlock.so 17-spinlock.rs The Python script is as simple as loading this library, reading the puzzle input from the command line and calling the functions. The ctypes module does a lot of magic so that we don’t have to worry about converting from Python types to native types and back again. import ctypes import sys lib = ctypes.cdll.LoadLibrary("./spinlock.so") skip = int(sys.argv[1]) print("Part 1:", lib.spinlock(2017, skip)) print("Part 2:", lib.spinlock0(50_000_000, skip)) This is a toy example as far as calling Rust from Python is concerned, but it’s worth noting that already we can play with the parameters to the two Rust functions without having to recompile. For more serious work, I’d probably be looking at something like PyO3 to make a proper Python module. Looks like there’s also a very early Rust numpy integration for integrating numerical stuff. You can also do the same thing from Julia, which has a ccall function built in: ccall((:spinlock, "./spinlock.so"), Int32, (UInt64, UInt64), 2017, 377) My next thing to try might be Haskell → Python though… Permutation Promenade — Julia — #adventofcode Day 16 Today’s challenge rather appeals to me as a folk dancer, because it describes a set of instructions for a dance and asks us to work out the positions of the dancing programs after each run through the dance. → Full code on GitHub !!! commentary So, part 1 is pretty straightforward: parse the set of instructions, interpret them and keep track of the dancer positions as you go. One time through the dance. However, part 2 asks for the positions after 1 billion (yes, that’s 1,000,000,000) times through the dance. In hindsight I should have immediately become suspicious, but I thought I’d at least try the brute force approach first because it was simpler to code. So I give it a try, and after waiting for a while, having a cup of tea etc., it still hasn't terminated. I try reducing the number of iterations to 1,000. Now it terminates, but takes about 6 seconds. A spot of arithmetic suggests that running the full version will take a little over 190 years. There must be a better way than that! I'm a little embarrassed that I didn't spot the solution immediately (blaming Julia) and tried again in Python to see if I could get it to terminate quicker. When that didn't work I had to think again. A little further investigation with a while loop shows that in fact the dance position repeats (in the case of my input) every 48 times. After that it becomes much quicker! Oh, and it was time for a new language, so I wasted some extra time working out the quirks of Julia. First, a function to evaluate a single move — for neatness, this dispatches to a dedicated function depending on the type of move, although this isn’t really necessary to solve the challenge. Ending a function name with a bang (!) is a Julia convention to indicate that it has side-effects.
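For reference, the dance itself arrives as a single comma-separated line of moves; a tiny illustrative example in the same format would be:

```
s1,x3/4,pe/b
```

that is, a spin of 1, an exchange of the programs at positions 3 and 4, and a partner swap of programs e and b. eval_move! dispatches on that first character: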
function eval_move!(move, dancers) move_type = move[1] params = move[2:end] if move_type == 's' # spin eval_spin!(params, dancers) elseif move_type == 'x' # exchange eval_exchange!(params, dancers) elseif move_type == 'p' # partner swap eval_partner!(params, dancers) end end These take care of the individual moves. Parsing the parameters from a string every single time probably isn’t ideal, but as it turns out, that optimisation isn’t really necessary. Note the + 1 in eval_exchange!, which is necessary because Julia is one of those crazy languages where indexes start from 1 instead of 0. These actions are pretty nice to implement, because Julia has circshift as a builtin to rotate a list, and allows you to assign to list slices and swap values in place with a single statement. function eval_spin!(params, dancers) shift = parse(Int, params) dancers[1:end] = circshift(dancers, shift) end function eval_exchange!(params, dancers) i, j = map(x -> parse(Int, x) + 1, split(params, "/")) dancers[i], dancers[j] = dancers[j], dancers[i] end function eval_partner!(params, dancers) a, b = split(params, "/") ia = findfirst([x == a for x in dancers]) ib = findfirst([x == b for x in dancers]) dancers[ia], dancers[ib] = b, a end dance! takes a list of moves and takes the dancers once through the dance. function dance!(moves, dancers) for m in moves eval_move!(m, dancers) end end To solve part 1, we simply need to read the moves in, set up the initial positions of the dancers and run the dance through once. join is necessary to a) turn characters into length-1 strings, and b) convert the list of strings back into a single string to print out. moves = split(readchomp(STDIN), ",") dancers = collect(join(c) for c in 'a':'p') orig_dancers = copy(dancers) dance!(moves, dancers) println(join(dancers)) Part 2 requires a little more work. We run the dance through again and again until we get back to the initial position, saving the intermediate positions in a list. The list now contains every possible position available from that starting point, so we can find position 1 billion by taking 1,000,000,000 modulo the list length (plus 1 because 1-based indexing) and use that to index into the list to get the final position. dance_cycle = [orig_dancers] while dancers != orig_dancers push!(dance_cycle, copy(dancers)) dance!(moves, dancers) end println(join(dance_cycle[1_000_000_000 % length(dance_cycle) + 1])) This terminates on my laptop in about 1.6s: Brute force 0; Careful thought 1! Dueling Generators — Rust — #adventofcode Day 15 Today’s challenge introduces two pseudo-random number generators which are trying to agree on a series of numbers. We play the part of the “judge”, counting the number of times their numbers agree in the lowest 16 bits. → Full code on GitHub Ever since I used Go to solve day 3, I’ve had a hankering to try the other new kid on the memory-safe compiled language block, Rust. I found it a bit intimidating at first because the syntax wasn’t as close to the C/C++ I’m familiar with as I’d expected, and there are quite a few concepts unique to Rust, like the use of traits. But I figured it out, so I can tick another language off my to-try list. I also implemented a version in Python for comparison: the Python version is more concise and easier to read but the Rust version runs about 10× faster. First we include the std::env “crate” which will let us get access to commandline arguments, and define some useful constants for later.
use std::env; const M: i64 = 2147483647; const MASK: i64 = 0b1111111111111111; const FACTOR_A: i64 = 16807; const FACTOR_B: i64 = 48271; gen_next generates the next number for a given generator’s sequence. gen_next_picky does the same, but for the “picky” generators, only returning values that meet their criteria. fn gen_next(factor: i64, current: i64) -> i64 { return (current * factor) % M; } fn gen_next_picky(factor: i64, current: i64, mult: i64) -> i64 { let mut next = gen_next(factor, current); while next % mult != 0 { next = gen_next(factor, next); } return next; } duel runs a single duel, and returns the number of times the generators agreed in the lowest 16 bits (found by doing a binary & with the mask defined above). Rust allows functions to be passed as parameters, so we use this to be able to run both versions of the duel using only this one function. fn duel<F, G>(n: i64, next_a: F, mut value_a: i64, next_b: G, mut value_b: i64) -> i64 where F: Fn(i64) -> i64, G: Fn(i64) -> i64, { let mut count = 0; for _ in 0..n { value_a = next_a(value_a); value_b = next_b(value_b); if (value_a & MASK) == (value_b & MASK) { count += 1; } } return count; } Finally, we read the start values from the command line and run the two duels. The expressions that begin |n| are closures (anonymous functions, often called lambdas in other languages) that we use to specify the generator functions for each duel. fn main() { let args: Vec<String> = env::args().collect(); let start_a: i64 = args[1].parse().unwrap(); let start_b: i64 = args[2].parse().unwrap(); println!( "Duel 1: {}", duel( 40000000, |n| gen_next(FACTOR_A, n), start_a, |n| gen_next(FACTOR_B, n), start_b, ) ); println!( "Duel 2: {}", duel( 5000000, |n| gen_next_picky(FACTOR_A, n, 4), start_a, |n| gen_next_picky(FACTOR_B, n, 8), start_b, ) ); } Disk Defragmentation — Haskell — #adventofcode Day 14 Today’s challenge has us helping a disk defragmentation program by identifying contiguous regions of used sectors on a 2D disk. → Full code on GitHub !!! commentary Wow, today’s challenge had a pretty steep learning curve. Day 14 was the first to directly reuse code from a previous day: the “knot hash” from day 10. I solved day 10 in Haskell, so I thought it would be easier to stick with Haskell for today as well. The first part was straightforward, but the second was pretty mind-bending in a pure functional language! I ended up solving it by implementing a [flood fill algorithm][flood]. It's recursive, which is right in Haskell's wheelhouse, but I ended up using `Data.Sequence` instead of the standard list type as its API for indexing is better. I haven't tried it, but I think it will also be a little faster than a naive list-based version. It took a looong time to figure everything out, but I had a day off work to be able to concentrate on it! A lot more imports for this solution, as we’re exercising a lot more of the standard library. module Main where import Prelude hiding (length, filter, take) import Data.Char (ord) import Data.Sequence import Data.Foldable hiding (length) import Data.Ix (inRange) import Data.Function ((&)) import Data.Maybe (fromJust, mapMaybe, isJust) import qualified Data.Set as Set import Text.Printf (printf) import System.Environment (getArgs) Also we’ll extract the key bits from day 10 into a module and import that. import KnotHash Now we define a few data types to make the code a bit more readable. 
Sector represents the state of a particular disk sector, either free, used (but unmarked) or used and marked as belonging to a given integer-labelled group. Grid is a 2D matrix of Sector, as a sequence of sequences. data Sector = Free | Used | Mark Int deriving (Eq) instance Show Sector where show Free = " ." show Used = " #" show (Mark i) = printf "%4d" i type GridRow = Seq Sector type Grid = Seq (GridRow) Some utility functions to make it easier to view the grids (which can be quite large): used for debugging but not in the finished solution. subGrid :: Int -> Grid -> Grid subGrid n = fmap (take n) . take n printRow :: GridRow -> IO () printRow row = do mapM_ (putStr . show) row putStr "\n" printGrid :: Grid -> IO () printGrid = mapM_ printRow makeKey generates the hash key for a given row. makeKey :: String -> Int -> String makeKey input n = input ++ "-" ++ show n stringToGridRow converts a binary string of ‘1’ and ‘0’ characters to a sequence of Sector values. stringToGridRow :: String -> GridRow stringToGridRow = fromList . map convert where convert x | x == '1' = Used | x == '0' = Free makeRow and makeGrid build up the grid to use based on the provided input string. makeRow :: String -> Int -> GridRow makeRow input n = stringToGridRow $ concatMap (printf "%08b") $ dense $ fullKnotHash 256 $ map ord $ makeKey input n makeGrid :: String -> Grid makeGrid input = fromList $ map (makeRow input) [0..127] Utility functions to count the number of used and free sectors, to give the solution to part 1. countEqual :: Sector -> Grid -> Int countEqual x = sum . fmap (length . filter (==x)) countUsed = countEqual Used countFree = countEqual Free Now the real meat begins! findUnmarked finds the location of the next used sector that we haven’t yet marked. It returns a Maybe value, which is Just (x, y) if there is still an unmarked block or Nothing if there’s nothing left to mark. findUnmarked :: Grid -> Maybe (Int, Int) findUnmarked g | y == Nothing = Nothing | otherwise = Just (fromJust x, fromJust y) where hasUnmarked row = isJust $ elemIndexL Used row x = findIndexL hasUnmarked g y = case x of Nothing -> Nothing Just x' -> elemIndexL Used $ index g x' floodFill implements a very simple recursive flood fill. It takes a target and replacement value and a starting location, and fills in the replacement value for every connected location that currently has the target value. We use it below to replace a connected used region with a marked region. floodFill :: Sector -> Sector -> (Int, Int) -> Grid -> Grid floodFill t r (x, y) g | inRange (0, length g - 1) x && inRange (0, length g - 1) y && elem == t = let newRow = update y r row newGrid = update x newRow g in newGrid & floodFill t r (x+1, y) & floodFill t r (x-1, y) & floodFill t r (x, y+1) & floodFill t r (x, y-1) | otherwise = g where row = g `index` x elem = row `index` y markNextGroup looks for an unmarked group and marks it if found. If no more groups are found it returns Nothing. markAllGroups then repeatedly applies markNextGroup until Nothing is returned. markNextGroup :: Int -> Grid -> Maybe Grid markNextGroup i g = case findUnmarked g of Nothing -> Nothing Just loc -> Just $ floodFill Used (Mark i) loc g markAllGroups :: Grid -> Grid markAllGroups g = markAllGroups' 1 g where markAllGroups' i g = case markNextGroup i g of Nothing -> g Just g' -> markAllGroups' (i+1) g' onlyMarks filters a grid row and returns a list of (possibly duplicated) group numbers in the row. onlyMarks :: GridRow -> [Int] onlyMarks = mapMaybe getMark .
toList where getMark Free = Nothing getMark Used = Nothing getMark (Mark i) = Just i Finally, countGroups puts all the group numbers into a set to get rid of duplicates and returns the size of the set, i.e. the total number of separate groups. countGroups :: Grid -> Int countGroups g = Set.size groupSet where groupSet = foldl' Set.union Set.empty $ fmap rowToSet g rowToSet = Set.fromList . toList . onlyMarks As always, every Haskell program needs a main function to drive the I/O and produce the actual result. main = do input <- fmap head getArgs let grid = makeGrid input used = countUsed grid marked = countGroups $ markAllGroups grid putStrLn $ "Used sectors: " ++ show used putStrLn $ "Groups: " ++ show marked Packet Scanners — Haskell — #adventofcode Day 13 Today’s challenge requires us to sneak past a firewall made up of a series of scanners. → Full code on GitHub !!! commentary I wasn’t really thinking straight when I solved this challenge. I got a solution without too much trouble, but I ended up simulating the step-by-step movement of the scanners. I finally realised that I could calculate whether or not a given scanner was safe at a given time directly with modular arithmetic, and it bugged me so much that I reimplemented the solution. Both are given below, the faster one first. First we introduce some standard library stuff and define some useful utilities. module Main where import qualified Data.Text as T import Data.Maybe (mapMaybe) strip :: String -> String strip = T.unpack . T.strip . T.pack splitOn :: String -> String -> [String] splitOn sep = map T.unpack . T.splitOn (T.pack sep) . T.pack parseScanner :: String -> (Int, Int) parseScanner s = (d, r) where [d, r] = map read $ splitOn ": " s traverseFW does all the hard work: it checks for each scanner whether or not it’s safe as we pass through, and returns a list of the severities of each time we’re caught. mapMaybe is like the standard map in many languages, but operates on a list of Haskell Maybe values, like a combined map and filter. If the value is Just x, x gets included in the returned list; if the value is Nothing, then it gets thrown away. traverseFW :: Int -> [(Int, Int)] -> [Int] traverseFW delay = mapMaybe caught where caught (d, r) = if (d + delay) `mod` (2*(r-1)) == 0 then Just (d * r) else Nothing Then the total severity of our passage through the firewall is simply the sum of each individual severity. severity :: [(Int, Int)] -> Int severity = sum . traverseFW 0 But we don’t want to know how badly we got caught, we want to know how long to wait before setting off to get through safely. findDelay tries traversing the firewall with increasing delay, and returns the delay for the first pass where we predict not getting caught. findDelay :: [(Int, Int)] -> Int findDelay scanners = head $ filter (null . flip traverseFW scanners) [0..] And finally, we put it all together and calculate and print the result. main = do scanners <- fmap (map parseScanner . 
lines) getContents putStrLn $ "Severity: " ++ (show $ severity scanners) putStrLn $ "Delay: " ++ (show $ findDelay scanners) I’m not generally bothered about performance for these challenges, but here I’ll note that my second attempt runs in a little under 2 seconds on my laptop: $ time ./13-packet-scanners-redux < 13-input.txt Severity: 1900 Delay: 3966414 ./13-packet-scanners-redux < 13-input.txt 1.73s user 0.02s system 99% cpu 1.754 total Compare that with the first, simulation-based one, which takes nearly a full minute: $ time ./13-packet-scanners < 13-input.txt Severity: 1900 Delay: 3966414 ./13-packet-scanners < 13-input.txt 57.63s user 0.27s system 100% cpu 57.902 total And for good measure, here’s the code. Notice the tick and tickOne functions, which together simulate moving all the scanners by one step; for this to work we have to track the full current state of each scanner, which is easier to read with a Haskell record-based custom data type. traverseFW is more complicated because it has to drive the simulation, but the rest of the code is mostly the same. module Main where import qualified Data.Text as T import Control.Monad (forM_) data Scanner = Scanner { depth :: Int , range :: Int , pos :: Int , dir :: Int } instance Show Scanner where show (Scanner d r p dir) = show d ++ "/" ++ show r ++ "/" ++ show p ++ "/" ++ show dir strip :: String -> String strip = T.unpack . T.strip . T.pack splitOn :: String -> String -> [String] splitOn sep str = map T.unpack $ T.splitOn (T.pack sep) $ T.pack str parseScanner :: String -> Scanner parseScanner s = Scanner d r 0 1 where [d, r] = map read $ splitOn ": " s tickOne :: Scanner -> Scanner tickOne (Scanner depth range pos dir) | pos <= 0 = Scanner depth range (pos+1) 1 | pos >= range - 1 = Scanner depth range (pos-1) (-1) | otherwise = Scanner depth range (pos+dir) dir tick :: [Scanner] -> [Scanner] tick = map tickOne traverseFW :: [Scanner] -> [(Int, Int)] traverseFW = traverseFW' 0 where traverseFW' _ [] = [] traverseFW' layer scanners@((Scanner depth range pos _):rest) -- | layer == depth && pos == 0 = (depth*range) + (traverseFW' (layer+1) $ tick rest) | layer == depth && pos == 0 = (depth,range) : (traverseFW' (layer+1) $ tick rest) | layer == depth && pos /= 0 = traverseFW' (layer+1) $ tick rest | otherwise = traverseFW' (layer+1) $ tick scanners severity :: [Scanner] -> Int severity = sum . map (uncurry (*)) . traverseFW empty :: [a] -> Bool empty [] = True empty _ = False findDelay :: [Scanner] -> Int findDelay scanners = delay where (delay, _) = head $ filter (empty . traverseFW . snd) $ zip [0..] $ iterate tick scanners main = do scanners <- fmap (map parseScanner . lines) getContents putStrLn $ "Severity: " ++ (show $ severity scanners) putStrLn $ "Delay: " ++ (show $ findDelay scanners) Digital Plumber — Python — #adventofcode Day 12 Today’s challenge has us helping a village of programs who are unable to communicate. We have a list of the the communication channels between their houses, and need to sort them out into groups such that we know that each program can communicate with others in its own group but not any others. Then we have to calculate the size of the group containing program 0 and the total number of groups. → Full code on GitHub !!! commentary This is one of those problems where I’m pretty sure that my algorithm isn’t close to being the most efficient, but it definitely works! For the sake of solving the challenge that’s all that matters, but it still bugs me. 
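For the record, the textbook tool for this kind of connected-components problem is a disjoint-set (union-find) structure, which merges groups in near-constant time per link. A minimal sketch of that idea in Python, for comparison only and not the approach I describe below:

```python
# Union-find sketch for comparison; not the solution walked through below.
def find(parent, x):
    # Follow parent pointers to the set representative, halving the path as we go.
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x

def union(parent, a, b):
    # Merge the sets containing a and b.
    ra, rb = find(parent, a), find(parent, b)
    if ra != rb:
        parent[rb] = ra

def build_groups(lines):
    # Each 'a <-> b, c' line just unions a with b and c.
    parent = {}
    for line in lines:
        head, rest = line.split(' <-> ')
        nodes = [int(head)] + [int(x) for x in rest.split(', ')]
        for n in nodes:
            parent.setdefault(n, n)
        for n in nodes[1:]:
            union(parent, nodes[0], n)
    groups = {}
    for n in parent:
        groups.setdefault(find(parent, n), set()).add(n)
    return groups
```

The size of the group containing program 0 and the total number of groups then fall straight out of the returned dictionary. Anyway, here's what I actually did.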
By now I’ve become used to using fileinput to transparently read data either from files given on the command-line or standard input if no arguments are given. import fileinput as fi First we make an initial pass through the input data, creating a group for each line representing the programs on that line (which can communicate with each other). We store this as a Python set. groups = [] for line in fi.input(): head, rest = line.split(' <-> ') group = set([int(head)]) group.update([int(x) for x in rest.split(', ')]) groups.append(group) Now we iterate through the groups, starting with the first, and merging any we find that overlap with our current group. i = 0 while i < len(groups): current = groups[i] Each pass through the groups brings more programs into the current group, so we have to go through and check their connections too. We make several merge passes, until we detect that no more merges took place. num_groups = len(groups) + 1 while num_groups > len(groups): j = i+1 num_groups = len(groups) This inner loop does the actual merging, and deletes each group as it’s merged in. while j < len(groups): if len(current & groups[j]) > 0: current.update(groups[j]) del groups[j] else: j += 1 i += 1 All that’s left to do now is to display the results. print("Number in group 0:", len([g for g in groups if 0 in g][0])) print("Number of groups:", len(groups)) Hex Ed — Python — #adventofcode Day 11 Today’s challenge is to help a program find its child process, which has become lost on a hexagonal grid. We need to follow the path taken by the child (given as input) and calculate the distance it is from home along with the furthest distance it has been at any point along the path. → Full code on GitHub !!! commentary I found this one quite interesting in that it was very quick to solve. In fact, I got lucky and my first quick implementation (max(abs(l)) below) gave the correct answer in spite of missing an obvious not-so-edge case. Thinking about it, there’s only a ⅓ chance that the first incorrect implementation would give the wrong answer! The code is shorter, so you get more words today. ☺ There are a number of different co-ordinate systems on a hexagonal grid (I discovered while reading up after solving it…). I intuitively went for the system known as ‘axial’ coordinates, where you pick two directions aligned to the grid as your x and y axes: note that these won’t be perpendicular. I chose ne/sw as the x axis and se/nw as y, but there are three other possible choices. That leads to the following definition for the directions, encoded as numpy arrays because that makes some of the code below neater. import numpy as np STEPS = {d: np.array(v) for d, v in [('ne', (1, 0)), ('se', (0, -1)), ('s', (-1, -1)), ('sw', (-1, 0)), ('nw', (0, 1)), ('n', (1, 1))]} hex_grid_dist, given a location l calculates the number of steps needed to reach that location from the centre at (0, 0). Notice that we can’t simply use the Manhattan distance here because, for example, one step north takes us to (1, 1), which would give a Manhattan distance of 2. 
Instead, we can see that moving in the n/s direction allows us to increment or decrement both coordinates at the same time: If the coordinates have the same sign: move n/s until one of them is zero, then move along the relevant ne or se axis back to the origin; in this case the number of steps is the greatest of the absolute values of the two coordinates If the coordinates have opposite signs: move independently along the ne and se axes to reduce each to 0; this time the number of steps is the sum of the absolute values of the two coordinates def hex_grid_distance(l): if sum(np.sign(l)) == 0: # i.e. opposite signs return sum(abs(l)) else: return max(abs(l)) Now we can read in the path followed by the child and follow it ourselves, tracking the maximum distance from home along the way. path = input().strip().split(',') location = np.array((0, 0)) max_distance = 0 for step in map(STEPS.get, path): location += step max_distance = max(max_distance, hex_grid_distance(location)) distance = hex_grid_distance(location) print("Child process is at", location, "which is", distance, "steps away") print("Greatest distance was", max_distance) Knot Hash — Haskell — #adventofcode Day 10 Today’s challenge asks us to help a group of programs implement a (highly questionable) hashing algorithm that involves repeatedly reversing parts of a list of numbers. → Full code on GitHub !!! commentary I went with Haskell again today, because it’s the weekend so I have a bit more time, and I really enjoyed yesterday’s Haskell implementation. Today gave me the opportunity to explore the standard library a bit more, as well as lending itself nicely to being decomposed into smaller parts to be combined using higher-order functions. You know the drill by now: import stuff we’ll use later. module Main where import Data.Char (ord) import Data.Bits (xor) import Data.Function ((&)) import Data.List (unfoldr) import Text.Printf (printf) import qualified Data.Text as T The worked example uses a concept of the “current position” as a pointer to a location in a static list. In Haskell it makes more sense to instead use the front of the list as the current position, and rotate the whole list as we progress to bring the right element to the front. rotate :: Int -> [Int] -> [Int] rotate 0 xs = xs rotate n xs = drop n' xs ++ take n' xs where n' = n `mod` length xs The simple version of the hash requires working through the input list, modifying the working list as we go, and incrementing a “skip” counter with each step. Converting this to a functional style, we simply zip up the input with an infinite list [0, 1, 2, 3, ...] to give the counter values. Notice that we also have to calculate how far to rotate the working list to get back to its original position. foldl lets us specify a function that returns a modified version of the working list and feeds the input list in one at a time. simpleKnotHash :: Int -> [Int] -> [Int] simpleKnotHash size input = foldl step [0..size-1] input' & rotate (negate finalPos) where input' = zip input [0..] finalPos = sum $ zipWith (+) input [0..] reversePart xs n = (reverse $ take n xs) ++ drop n xs step xs (n, skip) = reversePart xs n & rotate (n+skip) The full version of the hash (part 2 of the challenge) starts the same way as the simple version, except making 64 passes instead of one: we can do this by using replicate to make a list of 64 copies, then collapse that into a single list with concat.
fullKnotHash :: Int -> [Int] -> [Int] fullKnotHash size input = simpleKnotHash size input' where input' = concat $ replicate 64 input The next step in calculating the full hash collapses the full 256-element “sparse” hash down into 16 elements by XORing groups of 16 together. unfoldr is a nice efficient way of doing this. dense :: [Int] -> [Int] dense = unfoldr dense' where dense' [] = Nothing dense' xs = Just (foldl1 xor $ take 16 xs, drop 16 xs) The final hash step is to convert the list of integers into a hexadecimal string. hexify :: [Int] -> String hexify = concatMap (printf "%02x") These two utility functions put together building blocks from the Data.Text module to parse the input string. Note that no arguments are given: the functions are defined purely by composing other functions using the . operator. In Haskell this is referred to as “point-free” style. strip :: String -> String strip = T.unpack . T.strip . T.pack parseInput :: String -> [Int] parseInput = map (read . T.unpack) . T.splitOn (T.singleton ',') . T.pack Now we can put it all together, including building the weird input for the “full” hash. main = do input <- fmap strip getContents let simpleInput = parseInput input asciiInput = map ord input ++ [17, 31, 73, 47, 23] (a:b:_) = simpleKnotHash 256 simpleInput print $ (a*b) putStrLn $ fullKnotHash 256 asciiInput & dense & hexify Stream Processing — Haskell — #adventofcode Day 9 In today’s challenge we come across a stream that we need to cross. But of course, because we’re stuck inside a computer, it’s not water but data flowing past. The stream is too dangerous to cross until we’ve removed all the garbage, and to prove we can do that we have to calculate a score for the valid data “groups” and the number of garbage characters to remove. → Full code on GitHub !!! commentary One of my goals for this process was to knock the rust off my functional programming skills in Haskell, and I haven’t done that for the whole of the first week. Processing strings character by character and acting according to which character shows up seems like a good choice for pattern-matching though, so here we go. I also wanted to take a bash at test-driven development in Haskell, so I loaded up the Test.Hspec module to give it a try. I did find keeping track of all the state in arguments a bit mind boggling, and I think it could have been improved through use of a data type using record syntax and the `State` monad, so that's something to look at for a future challenge. First import the extra bits we’ll need. module Main where import Test.Hspec import Data.Function ((&)) countGroups solves the first part of the problem, counting up the “score” of the valid data in the stream. countGroups' is an auxiliary function that holds some state in its arguments. We use pattern matching for the base case: [] represents the empty list in Haskell, which indicates we’ve finished the whole stream. Otherwise, we split the remaining stream into its first character and remainder, and use guards to decide how to interpret it. If skip is true, discard the character and carry on with skip set back to false. If we find a “!”, that tells us to skip the next character. Other characters mark groups or sets of garbage: groups increase the score when they close and garbage is discarded. We continue to progress the list by recursing with the remainder of the stream and any updated state.
countGroups :: String -> Int countGroups = countGroups' 0 0 False False where countGroups' score _ _ _ [] = score countGroups' score level garbage skip (c:rest) | skip = countGroups' score level garbage False rest | c == '!' = countGroups' score level garbage True rest | garbage = case c of '>' -> countGroups' score level False False rest _ -> countGroups' score level True False rest | otherwise = case c of '{' -> countGroups' score (level+1) False False rest '}' -> countGroups' (score+level) (level-1) False False rest ',' -> countGroups' score level False False rest '<' -> countGroups' score level True False rest c -> error $ "Garbage character found outside garbage: " ++ show c countGarbage works almost identically to countGroups, except it ignores groups and counts garbage. They are structured so similarly that it would probably make more sense to combine them to a single function that returns both counts. countGarbage :: String -> Int countGarbage = countGarbage' 0 False False where countGarbage' count _ _ [] = count countGarbage' count garbage skip (c:rest) | skip = countGarbage' count garbage False rest | c == '!' = countGarbage' count garbage True rest | garbage = case c of '>' -> countGarbage' count False False rest _ -> countGarbage' (count+1) True False rest | otherwise = case c of '<' -> countGarbage' count True False rest _ -> countGarbage' count False False rest Hspec gives us a domain-specific language heavily inspired by the rspec library for Ruby: the tests read almost like natural language. I built up these tests one-by-one, gradually implementing the appropriate bits of the functions above, a process known as Test-driven development. runTests = hspec $ do describe "countGroups" $ do it "counts valid groups" $ do countGroups "{}" `shouldBe` 1 countGroups "{{{}}}" `shouldBe` 6 countGroups "{{{},{},{{}}}}" `shouldBe` 16 countGroups "{{},{}}" `shouldBe` 5 it "ignores garbage" $ do countGroups "{<a>,<a>,<a>,<a>}" `shouldBe` 1 countGroups "{{<ab>},{<ab>},{<ab>},{<ab>}}" `shouldBe` 9 it "skips marked characters" $ do countGroups "{{<!!>},{<!!>},{<!!>},{<!!>}}" `shouldBe` 9 countGroups "{{<a!>},{<a!>},{<a!>},{<ab>}}" `shouldBe` 3 describe "countGarbage" $ do it "counts garbage characters" $ do countGarbage "<>" `shouldBe` 0 countGarbage "<random characters>" `shouldBe` 17 countGarbage "<<<<>" `shouldBe` 3 it "ignores non-garbage" $ do countGarbage "{{},{}}" `shouldBe` 0 countGarbage "{{<ab>},{<ab>},{<ab>},{<ab>}}" `shouldBe` 8 it "skips marked characters" $ do countGarbage "<{!>}>" `shouldBe` 2 countGarbage "<!!>" `shouldBe` 0 countGarbage "<!!!>" `shouldBe` 0 countGarbage "<{o\"i!a,<{i<a>" `shouldBe` 10 Finally, the main function reads in the challenge input and calculates the answers, printing them on standard output. main = do runTests repeat '=' & take 78 & putStrLn input <- getContents & fmap (filter (/='\n')) putStrLn $ "Found " ++ show (countGroups input) ++ " groups" putStrLn $ "Found " ++ show (countGarbage input) ++ " characters garbage" I Heard You Like Registers — Python — #adventofcode Day 8 Today’s challenge describes a simple instruction set for a CPU, incrementing and decrementing values in registers according to simple conditions. We have to interpret a stream of these instructions, and to prove that we’ve done so, give the highest value of any register, both at the end of the program and throughout the whole program. → Full code on GitHub !!! 
commentary This turned out to be a nice straightforward one to implement, as the instruction format was easily parsed by regular expression, and Python provides the eval function which made evaluating the conditions a doddle. Import various standard library bits that we’ll use later. import re import fileinput as fi from math import inf from collections import defaultdict We could just parse the instructions by splitting the string, but using a regular expression is a little bit more robust because it won’t match at all if given an invalid instruction. INSTRUCTION_RE = re.compile(r'(\w+) (inc|dec) (-?\d+) if (.+)\s*') def parse_instruction(instruction): match = INSTRUCTION_RE.match(instruction) return match.group(1, 2, 3, 4) Executing an instruction simply checks the condition and if it evaluates to True updates the relevant register. def exec_instruction(registers, instruction): name, op, value, cond = instruction value = int(value) if op == 'dec': value = -value if eval(cond, globals(), registers): registers[name] += value highest_value returns the maximum value found in any register. def highest_value(registers): return sorted(registers.items(), key=lambda x: x[1], reverse=True)[0][1] Finally, loop through all the instructions and carry them out, updating global_max as we go. We need to be able to deal with registers that haven’t been accessed before. Keeping the registers in a dictionary means that we can evaluate the conditions directly using eval above, passing it as the locals argument. The standard dict will raise an exception if we try to access a key that doesn’t exist, so instead we use collections.defaultdict, which allows us to specify what the default value for a non-existent key will be. New registers start at 0, so we use a simple lambda to define a function that always returns 0. global_max = -inf registers = defaultdict(lambda: 0) for i in map(parse_instruction, fi.input()): exec_instruction(registers, i) global_max = max(global_max, highest_value(registers)) print('Max value:', highest_value(registers)) print('All-time max:', global_max) Recursive Circus — Ruby — #adventofcode Day 7 Today’s challenge introduces a set of processes balancing precariously on top of each other. We find them stuck and unable to get down because one of the processes is the wrong size, unbalancing the whole circus. Our job is to figure out the root from the input and then find the correct weight for the single incorrect process. → Full code on GitHub !!! commentary So I didn’t really intend to take a full polyglot approach to Advent of Code, but it turns out to have been quite fun, so I made a shortlist of languages to try. Building a tree is a classic application for object-orientation using a class to represent tree nodes, and I’ve always liked the feel of Ruby’s class syntax, so I gave it a go. First make sure we have access to Set, which we’ll use later. require 'set' Now to define the CircusNode class, which represents nodes in the tree. attr :s automatically creates a function s that returns the value of the instance attribute @s class CircusNode attr :name, :weight def initialize(name, weight, children=nil) @name = name @weight = weight @children = children || [] end Add a << operator (the same syntax for adding items to a list) that adds a child to this node. def <<(c) @children << c @total_weight = nil end total_weight recursively calculates the weight of this node and everything above it. The @total_weight ||= blah idiom caches the value so we only calculate it once. 
def total_weight @total_weight ||= @weight + @children.map {|c| c.total_weight}.sum end balance_weight does the hard work of figuring out the proper weight for the incorrect node by recursively searching through the tree. def balance_weight(target=nil) by_weight = Hash.new{|h, k| h[k] = []} @children.each{|c| by_weight[c.total_weight] << c} if by_weight.size == 1 then if target return @weight - (total_weight - target) else raise ArgumentError, 'This tree seems balanced!' end else odd_one_out = by_weight.select {|k, v| v.length == 1}.first[1][0] child_target = by_weight.select {|k, v| v.length > 1}.first[0] return odd_one_out.balance_weight child_target end end A couple of utility functions for displaying trees finish off the class. def to_s "#{@name} (#{@weight})" end def print_tree(n=0) puts "#{' '*n}#{self} -> #{self.total_weight}" @children.each do |child| child.print_tree n+1 end end end build_circus takes input as a list of lists [name, weight, children]. We make two passes over this list, first creating all the nodes, then building the tree by adding children to parents. def build_circus(data) all_nodes = {} all_children = Set.new data.each do |name, weight, children| all_nodes[name] = CircusNode.new name, weight end data.each do |name, weight, children| children.each {|child| all_nodes[name] << all_nodes[child]} all_children.merge children end root_name = (all_nodes.keys.to_set - all_children).first return all_nodes[root_name] end Finally, build the tree and solve the problem! Note that we use String.to_sym to convert the node names to symbols (written in Ruby as :symbol), because they’re faster to work with in Hashes and Sets as we do above. data = readlines.map do |line| match = /(?<parent>\w+) \((?<weight>\d+)\)(?: -> (?<children>.*))?/.match line [match['parent'].to_sym, match['weight'].to_i, match['children'] ? match['children'].split(', ').map {|x| x.to_sym} : []] end root = build_circus data puts "Root node: #{root}" puts root.balance_weight Memory Reallocation — Python — #adventofcode Day 6 Today’s challenge asks us to follow a recipe for redistributing objects in memory that bears a striking resemblance to the rules of the African game Mancala. → Full code on GitHub !!! commentary When I was doing my MSci, one of our programming exercises was to write (in Haskell, IIRC) a program to play a Mancala variant called Oware, so this had a nice ring of nostalgia. Back to Python today: it's already become clear that it's by far my most fluent language, which makes sense as it's the only one I've used consistently since my schooldays. I'm a bit behind on the blog posts, so you get this one without any explanation, for now at least! import math def reallocate(mem): max_val = -math.inf size = len(mem) for i, x in enumerate(mem): if x > max_val: max_val = x max_index = i i = max_index mem[i] = 0 remaining = max_val while remaining > 0: i = (i + 1) % size mem[i] += 1 remaining -= 1 return mem def detect_cycle(mem): mem = list(mem) steps = 0 prev_states = {} while tuple(mem) not in prev_states: prev_states[tuple(mem)] = steps steps += 1 mem = reallocate(mem) return (steps, steps - prev_states[tuple(mem)]) initial_state = list(map(int, input().split())) print("Initial state is ", initial_state) steps, cycle = detect_cycle(initial_state) print("Steps to cycle: ", steps) print("Steps in cycle: ", cycle) A Maze of Twisty Trampolines — C++ — #adventofcode Day 5 Today’s challenge has us attempting to help the CPU escape from a maze of instructions.
It’s not quite a Turing Machine, but it has that feeling of moving a read/write head up and down a tape acting on and changing the data found there. → Full code on GitHub !!! commentary I haven’t written anything in C++ for over a decade. It sounds like there have been lots of interesting developments in the language since then, with C++11, C++14 and the freshly finalised C++17 standards (built-in parallelism in the STL!). I won’t use any of those, but I thought I’d dust off my C++ and see what happened. Thankfully the Standard Template Library classes still did what I expected! As usual, we first include the parts of the standard library we’re going to use: iostream for input & output; vector for the container. We also declare that we’re using the std namespace, so that we don’t have to prepend vector and the other classes with std::. #include <iostream> #include <vector> using namespace std; steps_to_escape_part1 implements part 1 of the challenge: we read a location, move forward/backward by the number of steps given in that location, then add one to the value at that location before repeating. The result is the number of steps we take before jumping outside the list. int steps_to_escape_part1(vector<int>& instructions) { int pos = 0, iterations = 0, new_pos; while (pos < instructions.size()) { new_pos = pos + instructions[pos]; instructions[pos]++; pos = new_pos; iterations++; } return iterations; } steps_to_escape_part2 solves part 2, which is very similar, except that an offset of three or more is decremented instead of incremented before moving on. int steps_to_escape_part2(vector<int>& instructions) { int pos = 0, iterations = 0, new_pos, offset; while (pos < instructions.size()) { offset = instructions[pos]; new_pos = pos + offset; instructions[pos] += offset >=3 ? -1 : 1; pos = new_pos; iterations++; } return iterations; } Finally we pull it all together and link it up to the input. int main() { vector<int> instructions1, instructions2; int n; The cin stream lets us read data from standard input, which we then add to a vector of ints to give our list of instructions. while (true) { cin >> n; if (cin.eof()) break; instructions1.push_back(n); } Solving the problem modifies the input, so we need to take a copy to solve part 2 as well. Thankfully the STL makes this easy with iterators. instructions2.insert(instructions2.begin(), instructions1.begin(), instructions1.end()); Finally, compute the result and print it on standard output. cout << steps_to_escape_part1(instructions1) << endl; cout << steps_to_escape_part2(instructions2) << endl; return 0; } High Entropy Passphrases — Python — #adventofcode Day 4 Today’s challenge describes some simple rules supposedly intended to enforce the use of secure passwords. All we have to do is test a list of passphrases and identify which ones meet the rules. → Full code on GitHub !!! commentary Fearing that today might be as time-consuming as yesterday, I returned to Python and its hugely powerful “batteries-included” standard library. Thankfully this challenge was more straightforward, and I actually finished this before finishing day 3. First, let’s import two useful utilities. from fileinput import input from collections import Counter Part 1 requires simply that a passphrase contains no repeated words. No problem: we split the passphrase into words and count them, and check if any was present more than once. Counter is an amazingly useful class to have in a language’s standard library.
All it does is count things: you add objects to it, and then it will tell you how many of a given object you have. We’re going to use it to count those potentially duplicated words. def is_valid(passphrase): counter = Counter(passphrase.split()) return counter.most_common(1)[0][1] == 1 Part 2 requires that no word in the passphrase be an anagram of any other word. Since we don’t need to do anything else with the words afterwards, we can check for anagrams by sorting the letters in each word: “leaf” and “flea” both become “aefl” and can be compared directly. Then we count as before. def is_valid_ana(passphrase): counter = Counter(''.join(sorted(word)) for word in passphrase.split()) return counter.most_common(1)[0][1] == 1 Finally we pull everything together. sum(map(boolean_func, list)) is a common idiom in Python for counting the number of times a condition (checked by boolean_func) is true. In Python, True and False can be treated as the numbers 1 and 0 respectively, so that summing a list of Boolean values gives you the number of True values in the list. lines = list(input()) print(sum(map(is_valid, lines))) print(sum(map(is_valid_ana, lines))) Spiral Memory — Go — #adventofcode Day 3 Today’s challenge requires us to perform some calculations on an “experimental memory layout”, with cells moving outwards from the centre of a square spiral (squiral?). → Full code on GitHub !!! commentary I’ve been wanting to try my hand at Go, the memory-safe, statically typed compiled language from Google for a while. Today’s challenge seemed a bit more mathematical in nature, meaning that I wouldn’t need too many advanced language features or knowledge of a standard library, so I thought I’d give it a “go”. It might have been my imagination, but it was impressive how quickly the compiled program chomped through 60 different input values while I was debugging. I actually spent far too long on this problem because my brain led me down a blind alley trying to do the wrong calculation, but I got there in the end! The solution is a bit difficult to explain without diagrams, which I don't really have time to draw right now, but fear not because several other people have. First take a look at [the challenge itself which explains the spiral memory concept](http://adventofcode.com/2017/day/3). Then look at the [nice diagrams that Phil Tooley made with Python](http://acceleratedscience.co.uk/blog/adventofcode-day-3-spiral-memory/) and hopefully you'll be able to see what's going on! It's interesting to note that this challenge also admits of an algorithmic solution instead of the mathematical one: you can model the memory as an infinite grid using a suitable data structure and literally move around it in a spiral. In hindsight this is a much better way of solving the challenge quickly because it's easier and less error-prone to code. I'm quite pleased with my maths-ing though, and it's much quicker than the algorithmic version! First some Go boilerplate: we have to define the package we’re in (main, because it’s an executable we’re producing) and import the libraries we’ll use. package main import ( "fmt" "math" "os" ) Weirdly, Go doesn’t seem to have these basic mathematics functions for integers in its standard library (please someone correct me if I’m wrong!) so I’ll define them instead of mucking about with data types. Go doesn’t do any implicit type conversion, even between numeric types, and the math builtin package only operates on float64 values. 
func abs(n int) int { if n < 0 { return -n } return n } func min(x, y int) int { if x < y { return x } return y } func max(x, y int) int { if x > y { return x } return y } This does the heavy lifting for part one: converting from a position on the spiral to a column and row in the grid. (0, 0) is the centre of the spiral. This actually does a bit more than is necessary to calculate the distance as required for part 1, but we’ll use it again for part 2. func spiral_to_xy(n int) (int, int) { if n == 1 { return 0, 0 } r := int(math.Floor((math.Sqrt(float64(n-1)) + 1) / 2)) n_r := n - (2*r-1)*(2*r-1) o := ((n_r - 1) % (2 * r)) - r + 1 sector := (n_r - 1) / (2 * r) switch sector { case 0: return r, o case 1: return -o, r case 2: return -r, -o case 3: return o, -r } return 0, 0 } Now we use spiral_to_xy to calculate the Manhattan distance that the value at location n in the spiral memory must be carried to reach the “access port” at (0, 0). func distance(n int) int { x, y := spiral_to_xy(n) return abs(x) + abs(y) } This function does the opposite of spiral_to_xy, translating a grid position back to its position on the spiral. This is the one that took me far too long to figure out because I had a brain bug and tried to calculate the value s (which sector or quarter of the spiral we’re looking at) in a way that was never going to work! Fortunately I came to my senses. func xy_to_spiral(x, y int) int { if x == 0 && y == 0 { return 1 } r := max(abs(x), abs(y)) var s, o, n int if x+y > 0 && x-y >= 0 { s = 0 } else if x-y < 0 && x+y >= 0 { s = 1 } else if x+y < 0 && x-y <= 0 { s = 2 } else { s = 3 } switch s { case 0: o = y case 1: o = -x case 2: o = -y case 3: o = x } n = o + r*(2*s+1) + (2*r-1)*(2*r-1) return n } This is a utility function that uses xy_to_spiral to fetch the value at a given (x, y) location, and returns zero if we haven’t filled that location yet. func get_spiral(mem []int, x, y int) int { n := xy_to_spiral(x, y) - 1 if n < len(mem) { return mem[n] } return 0 } Finally we solve part 2 of the problem, which involves going round the spiral writing values into it that are the sum of some values already written. The result is the first of these sums that is greater than or equal to the given input value. func stress_test(input int) int { mem := make([]int, 1) n := 0 mem[0] = 1 for mem[n] < input { n++ x, y := spiral_to_xy(n + 1) mem = append(mem, get_spiral(mem, x+1, y)+ get_spiral(mem, x+1, y+1)+ get_spiral(mem, x, y+1)+ get_spiral(mem, x-1, y+1)+ get_spiral(mem, x-1, y)+ get_spiral(mem, x-1, y-1)+ get_spiral(mem, x, y-1)+ get_spiral(mem, x+1, y-1)) } return mem[n] } Now the last part of the program puts it all together, reading the input value from a command-line argument and printing the results of the two parts of the challenge: func main() { var n int fmt.Sscanf(os.Args[1], "%d", &n) fmt.Printf("Input is %d\n", n) fmt.Printf("Distance is %d\n", distance(n)) fmt.Printf("Stress test result is %d\n", stress_test(n)) } Corruption Checksum — Python — #adventofcode Day 2 Today’s challenge is to calculate a rather contrived “checksum” over a grid of numbers. → Full code on GitHub !!! commentary Today I went back to plain Python, and I didn’t do formal tests because only one test case was given for each part of the problem. I just got stuck in. I did write part 2 out as nested `for` loops as an intermediate step to working out the generator expression. I think that expanded version may have been more readable.
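For the curious, that intermediate version looked something like the following sketch (reconstructed from memory rather than copied from my editor history, and assuming sheet has already been read in as a list of rows of integers, as below):

total = 0
for row in sheet:
    row = sorted(row)           # sort so later elements are always >= earlier ones
    for i, x in enumerate(row):
        for y in row[i+1:]:     # only compare each pair once, with y the larger
            if y % x == 0:
                total += y // x
print(total)

It does exactly the same work as the generator expression further down, just with the loops spelled out.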
Having got that far, I couldn't then work out how to finally eliminate the need for an auxiliary function entirely without either sorting the same elements multiple times or sorting each row as it's read. First we read in the input, split it and convert it to numbers. fileinput.input() returns an iterator over the lines in all the files passed as command-line arguments, or over standard input if no files are given. from fileinput import input sheet = [[int(x) for x in l.split()] for l in input()] Part 1 of the challenge calls for finding the difference between the largest and smallest number in each row, and then summing those differences: print(sum(max(x) - min(x) for x in sheet)) Part 2 is a bit more involved: for each row we have to find the unique pair of elements that divide into each other without remainder, then sum the result of those divisions. We can make it a little easier by sorting each row; then we can take each number in turn and compare it only with the numbers after it (which are guaranteed to be larger). Doing this ensures we only make each comparison once. def rowsum_div(row): row = sorted(row) return sum(y // x for i, x in enumerate(row) for y in row[i+1:] if y % x == 0) print(sum(map(rowsum_div, sheet))) We can make this code shorter (if not easier to read) by sorting each row as it’s read: sheet = [sorted(int(x) for x in l.split()) for l in input()] Then we can just use the first and last elements in each row for part 1, as we know those are the smallest and largest respectively in the sorted row: print(sum(x[-1] - x[0] for x in sheet)) Part 2 then becomes a sum over a single generator expression: print(sum(y // x for row in sheet for i, x in enumerate(row) for y in row[i+1:] if y % x == 0)) Very satisfying! Inverse Captcha — Coconut — #adventofcode Day 1 Well, December’s here at last, and with it Day 1 of Advent of Code. … It goes on to explain that you may only leave by solving a captcha to prove you’re not a human. Apparently, you only get one millisecond to solve the captcha: too fast for a normal human, but it feels like hours to you. … As well as posting solutions here when I can, I’ll be putting them all on https://github.com/jezcope/aoc2017 too. !!! commentary After doing some challenges from last year in Haskell for a warm up, I felt inspired to try out the functional-ish Python dialect, Coconut. Now that I’ve done it, it feels a bit of an odd language, neither fish nor fowl. It’ll look familiar to any Pythonista, but is loaded with features normally associated with functional languages, like pattern matching, destructuring assignment, partial application and function composition. That makes it quite fun to work with, as it works similarly to Haskell, but because it's restricted by the basic rules of Python syntax everything feels a bit more like hard work than it should. The accumulator approach feels clunky, but it's necessary to allow [tail call elimination](https://en.wikipedia.org/wiki/Tail_call), which Coconut will do and I wanted to see in action. Lo and behold, if you take a look at the [compiled Python version](https://github.com/jezcope/aoc2017/blob/86c8100824bda1b35e5db6e02d4b80890be7a022/01-inverse-captcha.py#L675) you'll see that my recursive implementation has been turned into a non-recursive `while` loop. Then again, maybe I'm just jealous of Phil Tooley's [one-liner solution in Python](https://github.com/ptooley/aocGolf/blob/1380d78194f1258748ccfc18880cfd575baf5d37/2017.py#L8). 
import sys def inverse_captcha_(s, acc=0): case reiterable(s): match (|d, d|) :: rest: return inverse_captcha_((|d|) :: rest, acc + int(d)) match (|d0, d1|) :: rest: return inverse_captcha_((|d1|) :: rest, acc) return acc def inverse_captcha(s) = inverse_captcha_(s :: s[0]) def inverse_captcha_1_(s0, s1, acc=0): case (reiterable(s0), reiterable(s1)): match ((|d0|) :: rest0, (|d0|) :: rest1): return inverse_captcha_1_(rest0, rest1, acc + int(d0)) match ((|d0|) :: rest0, (|d1|) :: rest1): return inverse_captcha_1_(rest0, rest1, acc) return acc def inverse_captcha_1(s) = inverse_captcha_1_(s, s$[len(s)//2:] :: s) def test_inverse_captcha(): assert "1111" |> inverse_captcha == 4 assert "1122" |> inverse_captcha == 3 assert "1234" |> inverse_captcha == 0 assert "91212129" |> inverse_captcha == 9 def test_inverse_captcha_1(): assert "1212" |> inverse_captcha_1 == 6 assert "1221" |> inverse_captcha_1 == 0 assert "123425" |> inverse_captcha_1 == 4 assert "123123" |> inverse_captcha_1 == 12 assert "12131415" |> inverse_captcha_1 == 4 if __name__ == "__main__": sys.argv[1] |> inverse_captcha |> print sys.argv[1] |> inverse_captcha_1 |> print Advent of Code 2017: introduction It’s a common lament of mine that I don’t get to write a lot of code in my day-to-day job. I like the feeling of making something from nothing, and I often look for excuses to write bits of code, both at work and outside it. Advent of Code is a daily series of programming challenges for the month of December, and is about to start its third annual incarnation. I discovered it too late to take part in any serious way last year, but I’m going to give it a try this year. There are no restrictions on programming language (so of course some people delight in using esoteric languages like Brainf**k), but I think I’ll probably stick with Python for the most part. That said, I miss my Haskell days and I’m intrigued by new kids on the block Go and Rust, so I might end up throwing in a few of those on some of the simpler challenges. I’d like to focus a bit more on how I solve the puzzles. They generally come in two parts, with the second part only being revealed after successful completion of the first part. With that in mind, test-driven development makes a lot of sense, because I can verify that I haven’t broken the solution to the first part in modifying to solve the second. I may also take a literate programming approach with org-mode or Jupyter notebooks to document my solutions a bit more, and of course that will make it easier to publish solutions here so I’ll do that as much as I can make time for. On that note, here are some solutions for 2016 that I’ve done recently as a warmup. 
Day 1: Python Day 1 instructions import numpy as np import pytest as t import sys TURN = { 'L': np.array([[0, 1], [-1, 0]]), 'R': np.array([[0, -1], [1, 0]]) } ORIGIN = np.array([0, 0]) NORTH = np.array([0, 1]) class Santa: def __init__(self, location, heading): self.location = np.array(location) self.heading = np.array(heading) self.visited = [(0,0)] def execute_one(self, instruction): start_loc = self.location.copy() self.heading = self.heading @ TURN[instruction[0]] self.location += self.heading * int(instruction[1:]) self.mark(start_loc, self.location) def execute_many(self, instructions): for i in instructions.split(','): self.execute_one(i.strip()) def distance_from_start(self): return sum(abs(self.location)) def mark(self, start, end): for x in range(min(start[0], end[0]), max(start[0], end[0])+1): for y in range(min(start[1], end[1]), max(start[1], end[1])+1): if any((x, y) != start): self.visited.append((x, y)) def find_first_crossing(self): for i in range(1, len(self.visited)): for j in range(i): if self.visited[i] == self.visited[j]: return self.visited[i] def distance_to_first_crossing(self): crossing = self.find_first_crossing() if crossing is not None: return abs(crossing[0]) + abs(crossing[1]) def __str__(self): return f'Santa @ {self.location}, heading {self.heading}' def test_execute_one(): s = Santa(ORIGIN, NORTH) s.execute_one('L1') assert all(s.location == np.array([-1, 0])) assert all(s.heading == np.array([-1, 0])) s.execute_one('L3') assert all(s.location == np.array([-1, -3])) assert all(s.heading == np.array([0, -1])) s.execute_one('R3') assert all(s.location == np.array([-4, -3])) assert all(s.heading == np.array([-1, 0])) s.execute_one('R100') assert all(s.location == np.array([-4, 97])) assert all(s.heading == np.array([0, 1])) def test_execute_many(): s = Santa(ORIGIN, NORTH) s.execute_many('L1, L3, R3') assert all(s.location == np.array([-4, -3])) assert all(s.heading == np.array([-1, 0])) def test_distance(): assert Santa(ORIGIN, NORTH).distance_from_start() == 0 assert Santa((10, 10), NORTH).distance_from_start() == 20 assert Santa((-17, 10), NORTH).distance_from_start() == 27 def test_turn_left(): east = NORTH @ TURN['L'] south = east @ TURN['L'] west = south @ TURN['L'] assert all(east == np.array([-1, 0])) assert all(south == np.array([0, -1])) assert all(west == np.array([1, 0])) def test_turn_right(): west = NORTH @ TURN['R'] south = west @ TURN['R'] east = south @ TURN['R'] assert all(east == np.array([-1, 0])) assert all(south == np.array([0, -1])) assert all(west == np.array([1, 0])) if __name__ == '__main__': instructions = sys.stdin.read() santa = Santa(ORIGIN, NORTH) santa.execute_many(instructions) print(santa) print('Distance from start:', santa.distance_from_start()) print('Distance to target: ', santa.distance_to_first_crossing()) Day 2: Haskell Day 2 instructions module Main where data Pos = Pos Int Int deriving (Show) -- Magrittr-style pipe operator (|>) :: a -> (a -> b) -> b x |> f = f x swapPos :: Pos -> Pos swapPos (Pos x y) = Pos y x clamp :: Int -> Int -> Int -> Int clamp lower upper x | x < lower = lower | x > upper = upper | otherwise = x clampH :: Pos -> Pos clampH (Pos x y) = Pos x' y' where y' = clamp 0 4 y r = abs (2 - y') x' = clamp r (4-r) x clampV :: Pos -> Pos clampV = swapPos . clampH . swapPos buttonForPos :: Pos -> String buttonForPos (Pos x y) = [buttons !! y !! 
x] where buttons = [" D ", " ABC ", "56789", " 234 ", " 1 "] decodeChar :: Pos -> Char -> Pos decodeChar (Pos x y) 'R' = clampH $ Pos (x+1) y decodeChar (Pos x y) 'L' = clampH $ Pos (x-1) y decodeChar (Pos x y) 'U' = clampV $ Pos x (y+1) decodeChar (Pos x y) 'D' = clampV $ Pos x (y-1) decodeLine :: Pos -> String -> Pos decodeLine p "" = p decodeLine p (c:cs) = decodeLine (decodeChar p c) cs makeCode :: String -> String makeCode instructions = lines instructions -- split into lines |> scanl decodeLine (Pos 1 1) -- decode to positions |> tail -- drop start position |> concatMap buttonForPos -- convert to buttons main = do input <- getContents putStrLn $ makeCode input Research Data Management Forum 18, Manchester !!! intro "" Monday 20 and Tuesday 21 November 2017 I’m at the Research Data Management Forum in Manchester. I thought I’d use this as an opportunity to try liveblogging, so during the event some notes should appear in the box below (you may have to manually refresh your browser tab periodically to get the latest version). I've not done this before, so if the blog stops updating then it's probably because I've stopped updating it to focus on the conference instead! This was made possible using GitHub's cool [Gist](https://gist.github.com) tool. Draft content policy I thought it was about time I had some sort of content policy on here so this is a first draft. It will eventually wind up as a separate page. Feedback welcome! !!! aside “Content policy” This blog’s primary purpose is as a reflective learning tool for my own development; my aim in writing any given post is mainly to expose and develop my own thinking on a topic. My reasons for making a public blog rather than a private journal are: 1. If I'm lucky, someone smarter than me will provide feedback that will help me and my readers to learn more 2. If I'm extra lucky, someone else might learn from the material as well Each post, therefore, represents the state of my thinking at the time I wrote it, or perhaps a deliberate provocation or exaggeration; either way, if you don't know me personally please don't judge me based entirely on my past words. This is a request though, not an attempt to excuse bad behaviour on my part. I accept full responsibility for any consequences of my words, whether intended or not. I will not remove comments or ban individuals for disagreeing with me, only for behaving offensively or disrespectfully. I will do my best to be fair and balanced and explain decisions that I take, but I reserve the right to take those decisions without making any explanation at all if it seems likely to further inflame a situation. If I end up responding to anything simply with a link to this policy, that's probably all the explanation you're going to get. It should go without saying, but the opinions presented in this blog are my own and not those of my employer or anyone else I might at times represent. Learning to live with anxiety !!! intro "" This is a post that I’ve been writing for months, and writing in my head for years. For some it will explain aspects of my personality that you might have wondered about. For some it will just be another person banging on self-indulgently about so-called “mental health issues”. Hopefully, for some it will demystify some stuff and show that you’re not alone and things do get better. For as long as I can remember I’ve been a worrier. I’ve also suffered from bouts of what I now recognise as depression, on and off since my school days. 
It’s only relatively recently that I’ve come to the realisation that these two might be connected and that my ‘worrying’ might in fact be outside the normal range of healthy human behaviour and might more accurately be described as chronic anxiety. You probably won’t have noticed it, but it’s been there. More recently I’ve begun feeling like I’m getting on top of it and feeling “normal” for the first time in my life. Things I’ve found that help include: getting out of the house more and socialising with friends; and getting a range of exercise, outdoors and away from the city (rock climbing is mentally and physically engaging and open water swimming is indescribably joyful). But mostly it’s the cognitive behavioural therapy (CBT) and the antidepressants. Before I go any further, a word about drugs (“don’t do drugs, kids”): I’m on the lowest available dose of a common antidepressant. This isn’t because it stops me being sad all the time (I’m not) or because it makes all my problems go away (it really doesn’t). It’s because the scientific evidence points to a combination of CBT and antidepressants as being the single most effective treatment for generalised anxiety disorder. The reason for this is simple: CBT isn’t easy, because it asks you to challenge habits and beliefs you’ve held your whole life. In the short term there is going to be more anxiety, and some antidepressants are also effective at blunting the effect of this additional anxiety. In short, CBT is what makes you better, and the drugs just make it a little bit more effective. A lot of people have misconceptions about what it means to be ‘in therapy’. I suspect a lot of these are derived from the psychoanalysis we often see portrayed in (primarily US) film and TV. The problem with that type of navel-gazing therapy is that you can spend years doing it, finally reach some sort of breakthrough insight, and still have no idea what the supposed insight means for your actual life. CBT is different in that rather than addressing feelings directly it focuses on habits in your thoughts (cognitive) and actions (behavioural) with feeling better as an outcome (therapy). CBT and related forms of therapy now have decades of clinical evidence showing that they really work. CBT uses a wide range of techniques to identify, challenge and reduce various common unhelpful thoughts and behaviours. By choosing and practicing these, you can break bad mental habits that you’ve been carrying around, often for decades. For me this means giving fair weight to my successes as well as my failings, allowing flexibility into the rigid rules that I have always, subconsciously, lived by, and being a bit kinder to myself when I make mistakes. It’s not been easy and I have to remind myself to practice this every day, but it’s really helped. !!! aside “More info” If you live in the UK, you might not be aware that you can get CBT and other psychological therapies on the NHS through a scheme called IAPT (improving access to psychological therapies). You can self-refer so you don’t need to see a doctor first, but you might want to anyway if you think medication might help. They also have a progression of treatments, so you might be offered a course of “guided self-help” and then progressed to CBT or another talking therapy if need be. This is what happened to me, and it did help a bit but it was CBT that helped me the most. Becoming a librarian What is a librarian? Is it someone who has a masters degree in librarianship and information science?
Is it someone who looks after information for other people? Is it simply someone who works in a library? I’ve been grappling with this question a lot lately because I’ve worked in academic libraries for about 3 years now and I never really thought that’s something that might happen. People keep referring to me as “a librarian” but there’s some imposter feelings here because all the librarians around me have much more experience, have skills in areas like cataloguing and collection management and, generally, have a librarian masters degree. So I’ve been thinking about what it actually means to me to be a librarian or not. NB. some of these may be tongue-in-cheek Ways in which I am a librarian: I work in a library I help people to access and organise information I have a cat I like gin Ways in which I am not a librarian: I don’t have a librarianship qualification I don’t work with books 😉 I don’t knit (though I can probably remember how if pressed) I don’t shush people or wear my hair in a bun (I can confirm that this is also true of every librarian I know) Ways in which I am a shambrarian: I like beer I have more IT experience and qualification than librarianship At the end of the day, I still don’t know how I feel about this or, for that matter, how important it is. I’m probably going to accept whatever title people around me choose to bestow, though any label will chafe at times! Lean Libraries: applying agile practices to library services Kanban board Jeff Lasovski (via Wikimedia Commons) I’ve been working with our IT services at work quite closely for the last year as product owner for our new research data portal, ORDA. That’s been a fascinating process for me as I’ve been able to see first-hand some of the agile techniques that I’ve been reading about from time-to-time on the web over the last few years. They’re in the process of adopting a specific set of practices going under the name “Scrum”, which is fun because it uses some novel terminology that sounds pretty weird to non-IT folks, like “scrum master”, “sprint” and “product backlog”. On my small project we’ve had great success with the short cycle times and been able to build trust with our stakeholders by showing concrete progress on a regular basis. Modern librarianship is increasingly fluid, particularly in research services, and I think that to handle that fluidity it’s absolutely vital that we are able to work in a more agile way. I’m excited about the possibilities of some of these ideas. However, Scrum as implemented by our IT services doesn’t seem something that transfers directly to the work that we do: it’s too specialised for software development to adapt directly. What I intend to try is to steal some of the individual practices on an experimental basis and simply see what works and what doesn’t. The Lean concepts currently popular in IT were originally developed in manufacturing: if they can be translated from the production of physical goods to IT, I don’t see why we can’t make the ostensibly smaller step of translating them to a different type of knowledge work. I’ve therefore started reading around this subject to try and get as many ideas as possible. I’m generally pretty rubbish at taking notes from books, so I’m going to try and record and reflect on any insights I make on this blog. The framework for trying some of these out is clearly a Plan-Do-Check-Act continuous improvement cycle, so I’ll aim to reflect on that process too. 
I’m sure there will have been people implementing Lean in libraries already, so I’m hoping to be able to discover and learn from them instead of starting from scratch. Wish me luck! Mozilla Global Sprint 2017 Photo by Lena Bell on Unsplash Every year, the Mozilla Foundation runs a two-day Global Sprint, giving people around the world 50 hours to work on projects supporting and promoting open culture and tech. Though much of the work during the sprint is, of course, technical software development work, there are always tasks suited to a wide range of different skill sets and experience levels. The participants include writers, designers, teachers, information professionals and many others. This year, for the first time, the University of Sheffield hosted a site, providing a space for local researchers, developers and others to get out of their offices, work on #mozsprint and link up with others around the world. The Sheffield site was organised by the Research Software Engineering group in collaboration with the University Library. Our site was only small compared to others, but we still had people working on several different projects. My reason for taking part in the sprint was to contribute to the international effort on the Library Carpentry project. A team spread across four continents worked throughout the whole sprint to review and develop our lesson material. As there were no other Library Carpentry volunteers at the Sheffield site, I chose to pick up some urgent work around improving the presentation of our workshops and lessons on the web and related workflows. It was a really nice subproject to work on, requiring not only cleaning up and normalising the metadata we hold on workshops and lessons, but also digesting and formalising our current ad hoc process of lesson development. The largest group were solar physicists from the School of Maths and Statistics, working on the SunPy project, an open source environment for solar data analysis. They pushed loads of bug fixes and documentation improvements, and also mentored a new contributor through their first additions to the project. Anna Krystalli from Research Software Engineering worked on the EchoBurst project, which is building a web browser extension to help people break out of their online echo chambers. It does this by using natural language processing techniques to highlight well-written, logically sound articles that disagree with the reader’s stated views on particular topics of interest. Anna was part of an effort to begin extending this technology to online videos. We had a couple of individuals simply taking the opportunity to break out of their normal work environments to work or learn, including a couple of members of library staff who showed up for a couple of hours to learn how to use git on a new project! IDCC 2017 reflection For most of the last few years I've been lucky enough to attend the International Digital Curation Conference (IDCC). One of the main audiences attending is people who, like me, work on research data management at universities around the world and it's begun to feel like a sort of "home" conference to me. This year, IDCC was held at the Royal College of Surgeons in the beautiful city of Edinburgh.
For the last couple of years, my overall impression has been that, as a community, we're moving away from the "first-order" problem of trying to convince people (from PhD students to senior academics) to take RDM seriously and into a rich set of "second-order" problems around how to do things better and widen support to more people. This year has been no exception. Here are a few of my observations and takeaway points. Everyone has a repository now Only last year, the most common question you'd get asked by strangers in the coffee break would be "Do you have a data repository?" Now the question is more likely to be "What are you using for your data repository?", along with more subtle questions about specific components of systems and how they interact. Integrating active storage and archival systems Now that more institutions have data worth preserving, there is more interest in (and in many cases experience of) setting up more seamless integrations between active and archival storage. There are lessons here we can learn. Freezing in amber vs actively maintaining assets There seemed to be an interesting debate going on throughout the conference around the aim of preservation: should we be faithfully preserving the bits and bytes provided without trying to interpret them, or should we take a more active approach by, for example, migrating obsolete formats to newer alternatives. If the former, should we attempt to preserve the software required to access the data as well? If the latter, how much effort do we invest and how do we ensure nothing is lost or altered in the migration? Demonstrating Data Science instead of debating what it is The phrase "Data Science" was once again one of the most commonly uttered of the conference. However, there is now less abstract discussion about what, exactly, is meant by this "data science" thing; this has been replaced more by concrete demonstrations. This change was exemplified perfectly by the keynote by data scientist Alice Daish, who spent a riveting 40 minutes or so enthusing about all the cool stuff she does with data at the British Museum. Recognition of software as an issue Even as recently as last year, I've struggled to drum up much interest in discussing software sustainability and preservation at events like this; the interest was there, but there were higher priorities. So I was completely taken by surprise when we ended up with 30+ people in the Software Preservation Birds of a Feather (BoF) session, and when very little input was needed from me as chair to keep a productive discussion going for a full 90 minutes. Unashamed promotion of openness As a community we seem to have nearly overthrown our collective embarrassment about the phrase "open data" (although maybe this is just me). We've always known it was a good thing, but I know I've been a bit of an apologist in the past, feeling that I had to "soften the blow" when asking researchers to be more open. Now I feel more confident in leading with the benefits of openness, and it felt like that's a change reflected in the community more widely. Becoming more involved in the conference This year, I took a decision to try and do more to contribute to the conference itself, and I felt like this was pretty successful both in making that contribution and building up my own profile a bit. I presented a paper on one of my current passions, Library Carpentry; it felt really good to be able to share my enthusiasm. 
I presented a poster on our work integrating our data repository and digital preservation platform; this gave me more of a structure for networking during breaks, as I was able to stand by the poster and start discussions with anyone who seemed interested. I chaired a parallel session; a first for me, and a different challenge from presenting or simply attending the talks. And finally, I proposed and chaired the Software Preservation BoF session (blog post forthcoming). Renewed excitement It's weird, and possibly all in my imagination, but there seemed to be more energy at this conference than at the previous couple I've been to. More people seemed to be excited about the work we're all doing, recent achievements and the possibilities for the future. Introducing PyRefine: OpenRefine meets Python I’m knocking the rust off my programming skills by attempting to write a pure-Python interpreter for OpenRefine “scripts”. OpenRefine is a great tool for exploring and cleaning datasets prior to analysing them. It also records an undo history of all actions that you can export as a sort of script in JSON format. One thing that bugs me though is that, having spent some time interactively cleaning up your dataset, you then need to fire up OpenRefine again and do some interactive mouse-clicky stuff to apply that cleaning routine to another dataset. You can at least re-import the JSON undo history to make that as quick as possible, but there’s no getting around the fact that there’s no quick way to do it from a cold start. There is a project, BatchRefine, that extends the OpenRefine server to accept batch requests over a HTTP API, but that isn’t useful when you can’t or don’t want to keep a full Java stack running in the background the whole time. My concept is this: you use OR to explore the data interactively and design a cleaning process, but then export the process to JSON and integrate it into your analysis in Python. That way it can be repeated ad nauseam without having to fire up a full Java stack. I’m taking some inspiration from the great talk “So you want to be a wizard?" by Julia Evans (@b0rk), who recommends trying experiments as a way to learn. She gives these Rules of Programming Experiments: “it doesn’t have to be good it doesn’t have to work you have to learn something” In that spirit, my main priorities are: to see if this can be done; to see how far I can get implementing it; and to learn something. If it also turns out to be a useful thing, well, that’s a bonus. Some of the interesting possible challenges here: Implement all core operations; there are quite a lot of these, some of which will be fun (i.e. non-trivial) to implement Implement (a subset of?) GREL, the General Refine Expression Language; I guess my undergrad course on implementing parsers and compilers will come in handy after all! Generate clean, sane Python code from the JSON rather than merely executing it; more than anything, this would be a nice educational tool for users of OpenRefine who want to see how to do equivalent things in Python Selectively optimise key parts of the process; this will involve profiling the code to identify bottlenecks as well as tweaking the actual code to go faster Potentially handle contributions to the code from other people; I’d be really happy if this happened but I’m realistic… If you’re interested, the project is called PyRefine and it’s on github. Constructive criticism, issues & pull requests all welcome! 
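To make the idea a little more concrete, here’s the sort of thing I’m aiming for, sketched with pandas. This isn’t PyRefine’s actual code, and the operation and field names (“core/column-rename” and friends) are written from memory of the exported JSON so may not be exactly right; treat it purely as an illustration of replaying an exported history over a DataFrame:

import json
import pandas as pd

def apply_refine_history(df, history_path):
    # Apply a (tiny) subset of an OpenRefine JSON undo history to a DataFrame
    with open(history_path) as f:
        operations = json.load(f)
    for op in operations:
        kind = op.get("op")
        if kind == "core/column-rename":
            df = df.rename(columns={op["oldColumnName"]: op["newColumnName"]})
        elif kind == "core/column-removal":
            df = df.drop(columns=[op["columnName"]])
        else:
            raise NotImplementedError("Not handled in this sketch: " + str(kind))
    return df

# e.g. cleaned = apply_refine_history(pd.read_csv("dirty.csv"), "refine-history.json")

The real project will need to cover far more operations (and GREL), but the shape of the solution is the same.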
Implementing Yesterbox in emacs with mu4e I’ve been meaning to give Yesterbox a try for a while. The general idea is that each day you only deal with email that arrived yesterday or earlier. This forms your inbox for the day, hence “yesterbox”. Once you’ve emptied your yesterbox, or at least got through some minimum number (10 is recommended), then you can look at emails from today. Even then you only really want to be dealing with things that are absolutely urgent. Anything else can wait til tomorrow. The motivation for doing this is to get away from the feeling that we are King Canute, trying to hold back the tide. I find that when I’m processing my inbox toward zero there’s always a temptation to keep skipping to the new stuff that’s just come in. Hiding away the new email until I’ve dealt with the old is a very interesting idea. I use mu4e in emacs for reading my email, and handily the mu search syntax is very flexible so you’d think it would be easy to create a yesterbox filter: maildir:"/INBOX" date:..1d Unfortunately, 1d is interpreted as “24 hours ago from right now” so this filter misses everything that was sent yesterday but less than 24 hours ago. There was a feature request raised on the mu github repository to implement an additional date filter syntax but it seems to have died a death for now. In the meantime, the answer to this is to remember that my workplace observes fairly standard office hours, so that anything sent more than 9 hours ago is unlikely to have been sent today. The following does the trick: maildir:"/INBOX" date:..9h In my mu4e bookmarks list, that looks like this: (setq mu4e-bookmarks '(("flag:unread AND NOT flag:trashed" "Unread messages" ?u) ("flag:flagged maildir:/archive" "Starred messages" ?s) ("date:today..now" "Today's messages" ?t) ("date:7d..now" "Last 7 days" ?w) ("maildir:\"/Mailing lists.*\" (flag:unread OR flag:flagged)" "Unread in mailing lists" ?M) ("maildir:\"/INBOX\" date:..9h" "Yesterbox" ?y))) ;; <- this is the new one Rewarding good practice in research From opensource.com on Flickr Whenever I’m involved in a discussion about how to encourage researchers to adopt new practices, eventually someone will come out with some variant of the following phrase: “That’s all very well, but researchers will never do XYZ until it’s made a criterion in hiring and promotion decisions.” With all the discussion of carrots and sticks I can see where this attitude comes from, and strongly empathise with it, but it raises two main problems: It’s unfair and more than a little insulting to anyone to be lumped into one homogeneous group; and Taking all the different possible XYZs into account, that’s an awful lot of hoops to expect anyone to jump through. Firstly, “researchers” are as diverse as the rest of us in terms of what gets them out of bed in the morning. Some of us want prestige; some want to contribute to a greater good; some want to create new things; some just enjoy the work. One thing I’d argue we all have in common is this: nothing is more offputting than feeling like you’re being strongarmed into something you don’t want to do. If we rely on simplistic metrics, people will focus on those and miss the point. At best people will disengage and at worst they will actively game the system. I’ve got to do these ten things to get my next payrise, and still retain my sanity? Ok, what’s the least I can get away with and still tick them off. You see it with students taking poorly-designed assessments and grown-ups are no different.
We do need to wield carrots as well as sticks, but the whole point is that these practices are beneficial in and of themselves. The carrots are already there if we articulate them properly and clear the roadblocks (don’t you enjoy mixed metaphors?). Creating artificial benefits will just dilute the value of the real ones. Secondly, I’ve heard a similar argument made for all of the following practices and more:
Research data management
Open Access publishing
Public engagement
New media (e.g. blogging)
Software management and sharing
Some researchers devote every waking hour to their work, whether it’s in the lab, writing grant applications, attending conferences, authoring papers, teaching, and so on and so on. It’s hard to see how someone with all this in their schedule can find time to exercise any of these new skills, let alone learn them in the first place. And what about the people who sensibly restrict the hours taken by work to spend more time doing things they enjoy? Yes, all of the above practices are valuable, both for the individual and the community, but they’re all new (to most) and hence require more effort up front to learn. We have to accept that it’s inevitably going to take time for all of them to become “business as usual”. I think if the hiring/promotion/tenure process has any role in this, it’s in asking whether the researcher can build a coherent narrative as to why they’ve chosen to focus their efforts in this area or that. You’re not on Twitter but your data is being used by 200 research groups across the world? Great! You didn’t have time to tidy up your source code for github but your work is directly impacting government policy? Brilliant! We still need to convince more people to do more of these beneficial things, so how? Call me naïve, but maybe we should stick to making rational arguments, calming fears and providing low-risk opportunities to learn new skills. Acting (compassionately) like a stuck record can help. And maybe we’ll need to scale back our expectations in other areas (journal impact factors, anyone?) to make space for the new stuff.
Software Carpentry: SC Test; does your software do what you meant?
“The single most important rule of testing is to do it.” — Brian Kernighan and Rob Pike, The Practice of Programming (quote taken from SC Test page)
One of the trickiest aspects of developing software is making sure that it actually does what it’s supposed to. Sometimes failures are obvious: you get completely unreasonable output or even (shock!) a comprehensible error message. But failures are often more subtle. Would you notice if your result was out by a few percent, or consistently ignored the first row of your input data? The solution to this is testing: take some simple example input with a known output, run the code and compare the actual output with the expected one. Implement a new feature, test and repeat. Sounds easy, doesn’t it? But then you implement a new bit of code. You test it and everything seems to work fine, except that your new feature required changes to existing code and those changes broke something else. So in fact you need to test everything, and do it every time you make a change. Further than that, you probably want to test that all your separate bits of code work together properly (integration testing) as well as testing the individual bits separately (unit testing). In fact, splitting your tests up like that is a good way of holding on to your sanity.
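To give a flavour of how lightweight this can be today (my own example, not part of the original Software Carpentry materials), a unit test with a tool like pytest is just a function whose name starts with test_ and which asserts something about a known input and output:

# test_stats.py (run with: pytest)
def mean(values):
    """Arithmetic mean of a non-empty sequence of numbers."""
    return sum(values) / len(values)

def test_mean_of_simple_sequence():
    # simple example input with a known output
    assert mean([1, 2, 3, 4]) == 2.5

def test_mean_does_not_ignore_first_value():
    # guards against the "consistently ignored the first row" kind of subtle failure
    assert mean([10, 20]) == 15

Integration tests can be written the same way; many projects simply keep them in a separate directory so that each kind can be run on its own.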
This is actually a lot less scary than it sounds, because there are plenty of tools now to automate that testing: you just type a simple test command and everything is verified. There are even tools that enable you to have tests run automatically when you check the code into version control, and even automatically deploy code that passes the tests, a process known as continuous integration or CI. The big problems with testing are that it’s tedious, your code seems to work without it and no-one tells you off for not doing it. At the time when the Software Carpentry competition was being run, the idea of testing wasn’t new, but the tools to help were in their infancy. “Existing tools are obscure, hard to use, expensive, don’t actually provide much help, or all three.” The SC Test category asked entrants “to design a tool, or set of tools, which will help programmers construct and maintain black box and glass box tests of software components at all levels, including functions, modules, and classes, and whole programs.” The SC Test category is interesting in that the competition administrators clearly found it difficult to specify what they wanted to see in an entry. In fact, the whole category was reopened with a refined set of rules and expectations. Ultimately, it’s difficult to tell whether this category made a significant difference. Where the tools for writing tests used to be sparse and difficult to use, there are now several good options for most programming languages. With this proliferation, several tried-and-tested methodologies have emerged which are consistent across many different tools, so while things still aren’t perfect they are much better. In recent years there has been a culture shift in the wider software development community towards both testing in general and test-first development, where the tests for a new feature are written first, and then the implementation is coded incrementally until all tests pass. The current challenge is to transfer this culture shift to the academic research community!
Tools for collaborative markdown editing
Photo by Alan Cleaver I really love Markdown[1]. I love its simplicity; its readability; its plain-text nature. I love that it can be written and read with nothing more complicated than a text-editor. I love how nicely it plays with version control systems. I love how easy it is to convert to different formats with Pandoc and how it’s become effectively the native text format for a wide range of blogging platforms. One frustration I’ve had recently, then, is that it’s surprisingly difficult to collaborate on a Markdown document. There are various solutions that almost work but at best feel somehow inelegant, especially when compared with rock solid products like Google Docs. Finally, though, we’re starting to see some real possibilities. Here are some of the things I’ve tried, but I’d be keen to hear about other options.
1. Just suck it up
To be honest, Google Docs isn’t that bad. In fact it works really well, and has almost no learning curve for anyone who’s ever used Word (i.e. practically anyone who’s used a computer since the 90s). When I’m working with non-technical colleagues there’s nothing I’d rather use. It still feels a bit uncomfortable though, especially the vendor lock-in. You can export a Google Doc to Word, ODT or PDF, but you need to use Google Docs to do that. Plus as soon as I start working in a word processor I get tempted to muck around with formatting.
2. Git(hub)
The obvious solution to most techies is to set up a GitHub repo, commit the document and go from there. This works very well for bigger documents written over a longer time, but seems a bit heavyweight for a simple one-page proposal, especially over short timescales. Who wants to muck around with pull requests and merging changes for a document that’s going to take 2 days to write tops? This type of project doesn’t need a bug tracker or a wiki or a public homepage anyway. Even without GitHub in the equation, using git for such a trivial use case seems clunky.
3. Markdown in Etherpad/Google Docs
Etherpad is a great tool for collaborative editing, but suffers from two key problems: no syntax highlighting or preview for markdown (it’s just treated as simple text); and you need to find a server to host it or do it yourself. However, there’s nothing to stop you editing markdown with it. You can do the same thing in Google Docs, in fact, and I have. Editing a fundamentally plain-text format in a word processor just feels weird though.
4. Overleaf/Authorea
Overleaf and Authorea are two products developed to support academic editing. Authorea has built-in markdown support but lacks proper simultaneous editing. Overleaf has great simultaneous editing but only supports markdown by wrapping a bunch of LaTeX boilerplate around it. Both OK but unsatisfactory.
5. StackEdit
Now we’re starting to get somewhere. StackEdit has both Markdown syntax highlighting and near-realtime preview, as well as integrating with Google Drive and Dropbox for file synchronisation.
6. HackMD
HackMD is one that I only came across recently, but it looks like it does exactly what I’m after: a simple markdown-aware editor with live preview that also permits simultaneous editing. I’m a little circumspect simply because I know simultaneous editing is difficult to get right, but it certainly shows promise.
7. Classeur
I discovered Classeur literally today: it’s developed by the same team as StackEdit (which is now apparently no longer in development), and is currently in beta, but it looks to offer two killer features: real-time collaboration, including commenting, and pandoc-powered export to loads of different formats.
Anything else?
Those are the options I’ve come up with so far, but they can’t be the only ones. Is there anything I’ve missed?
[1] Other plain-text formats are available. I’m also a big fan of org-mode.
Software Carpentry: SC Track; hunt those bugs!
This competition will be an opportunity for the next wave of developers to show their skills to the world — and to companies like ours. — Dick Hardt, ActiveState (quote taken from SC Track page)
All code contains bugs, and all projects have features that users would like but which aren’t yet implemented. Open source projects tend to get more of these as their user communities grow and start requesting improvements to the product. As your open source project grows, it becomes harder and harder to keep track of and prioritise all of these potential chunks of work. What do you do? The answer, as ever, is to make a to-do list. Different projects have used different solutions, including mailing lists, forums and wikis, but fairly quickly a whole separate class of software evolved: the bug tracker, which includes such well-known examples as Bugzilla, Redmine and the mighty JIRA. Bug trackers are built entirely around such requests for improvement, and typically track them through workflow stages (planning, in progress, fixed, etc.)
with scope for the community to discuss and add various bits of metadata. In this way, it becomes easier both to prioritise problems against each other and to use the hive mind to find solutions. Unfortunately most bug trackers are big, complicated beasts, more suited to large projects with dozens of developers and hundreds or thousands of users. Clearly a project of this size is more difficult to manage and requires a certain feature set, but the result is that the average bug tracker is non-trivial to set up for a small single-developer project. The SC Track category asked entrants to propose a better bug tracking system. In particular, the judges were looking for something easy to set up and configure without compromising on functionality. The winning entry was a bug-tracker called Roundup, proposed by Ka-Ping Yee. Here we have another tool which is still in active use and development today. Given that there is now a huge range of options available in this area, including the mighty github, this is no small achievement. These days, of course, github has become something of a de facto standard for open source project management. Although ostensibly a version control hosting platform, each github repository also comes with a built-in issue tracker, which is also well-integrated with the “pull request” workflow system that allows contributors to submit bug fixes and features themselves. Github’s competitors, such as GitLab and Bitbucket, also include similar features. Not everyone wants to work in this way though, so it’s good to see that there is still a healthy ecosystem of open source bug trackers, and that Software Carpentry is still having an impact.
Software Carpentry: SC Config; write once, compile anywhere
Nine years ago, when I first released Python to the world, I distributed it with a Makefile for BSD Unix. The most frequent questions and suggestions I received in response to these early distributions were about building it on different Unix platforms. Someone pointed me to autoconf, which allowed me to create a configure script that figured out platform idiosyncrasies. Unfortunately, autoconf is painful to use – its grouping, quoting and commenting conventions don’t match those of the target language, which makes scripts hard to write and even harder to debug. I hope that this competition comes up with a better solution — it would make porting Python to new platforms a lot easier! — Guido van Rossum, Technical Director, Python Consortium (quote taken from SC Config page)
On to the next Software Carpentry competition category, then. One of the challenges of writing open source software is that you have to make it run on a wide range of systems over which you have no control. You don’t know what operating system any given user might be using or what libraries they have installed, or even what versions of those libraries. This means that whatever build system you use, you can’t just send the Makefile (or whatever) to someone else and expect everything to go off without a hitch. For a very long time, it’s been common practice for source packages to include a configure script that, when executed, runs a bunch of tests to see what it has to work with and sets up the Makefile accordingly. Writing these scripts by hand is a nightmare, so tools like autoconf and automake evolved to make things a little easier. They did, and if the tests you want to use are already implemented they work very well indeed.
Unfortunately they’re built on an unholy combination of shell scripting and the archaic GNU M4 macro language. That means if you want to write new tests you need to understand both of these as well as the architecture of the tools themselves — not an easy task for the average self-taught research programmer. SC Config, then, called for a re-engineering of the autoconf concept, to make it easier for researchers to make their code available in a portable, platform-independent format. The second round configuration tool winner was SapCat, “a tool to help make software portable”. Unfortunately, this one seems not to have gone anywhere, and I could only find the original proposal on the Internet Archive. There were a lot of good ideas in this category about making catalogues and databases of system quirks to avoid having to rerun the same expensive tests again the way a standard ./configure script does. I think one reason none of these ideas survived is that they were overly ambitious, imagining a grand architecture where their tool would provide some overarching source of truth. This is in stark contrast to the way most Unix-like systems work, where each tool does one very specific job well and tools are easy to combine in various ways. In the end though, I think Moore’s Law won out here, making it easier to do the brute-force checks each time than to try anything clever to save time — a good example of avoiding unnecessary optimisation. Add to that the evolution of the generic pkg-config tool from earlier package-specific tools like gtk-config, and it’s now much easier to check for particular versions and features of common packages. On top of that, much of the day-to-day coding of a modern researcher happens in interpreted languages like Python and R, which give you a fully-functioning pre-configured environment with a lot less compiling to do. As a side note, Tom Tromey, another of the shortlisted entrants in this category, is still a major contributor to the open source world. He still seems to be involved in the automake project, contributes a lot of code to the emacs community too and blogs sporadically at The Cliffs of Inanity.
Semantic linefeeds: one clause per line
I’ve started using “semantic linefeeds”, a concept I discovered on Brandon Rhodes' blog, when writing content; the idea is described in that article far better than I could manage. It turns out this is a very old idea, promoted way back in the day by Brian W Kernighan, contributor to the original Unix system, co-creator of the AWK and AMPL programming languages and co-author of a lot of seminal programming textbooks including “The C Programming Language”. The basic idea is that you break lines at natural gaps between clauses and phrases, rather than simply after the last word before you hit 80 characters. Keeping line lengths strictly to 80 characters isn’t really necessary in these days of wide aspect ratios for screens. Breaking lines at points that make semantic sense in the sentence is really helpful for editing, especially in the context of version control, because it isolates changes to the clause in which they occur rather than just the nearest 80-character block. I also like it because it makes my crappy prose feel just a little bit more like poetry. ☺
Software Carpentry: SC Build; or making a better make
Software tools often grow incrementally from small beginnings into elaborate artefacts. Each increment makes sense, but the final edifice is a mess.
make is an excellent example: a simple tool that has grown into a complex domain-specific programming language. I look forward to seeing the improvements we will get from designing the tool afresh, as a whole… — Simon Peyton Jones, Microsoft Research (quote taken from SC Build page)
Most people who have had to compile an existing software tool will have come across the venerable make tool (which usually these days means GNU Make). It allows the developer to write a declarative set of rules specifying how the final software should be built from its component parts, mostly source code, allowing the build itself to be carried out by simply typing make at the command line and hitting Enter. Given a set of rules, make will work out all the dependencies between components and ensure everything is built in the right order and nothing that is up-to-date is rebuilt. Great in principle but make is notoriously difficult for beginners to learn, as much of the logic for how builds are actually carried out is hidden beneath the surface. This also makes it difficult to debug problems when building large projects. For these reasons, the SC Build category called for a replacement build tool engineered from the ground up to solve these problems. The second round winner, ScCons, is a Python-based make-like build tool written by Steven Knight. While I could find no evidence of any of the other shortlisted entries, this project (now renamed SCons) continues in active use and development to this day. I actually use this one myself from time to time and to be honest I prefer it in many cases to trendy new tools like rake or grunt and the behemoth that is Apache Ant. Its Python-based SConstruct file syntax is remarkably intuitive and scales nicely from very simple builds up to big and complicated projects, with good dependency tracking to avoid unnecessary recompiling. It has a lot of built-in rules for performing common build & compile tasks, but it’s trivial to add your own, either by combining existing building blocks or by writing a new builder with the full power of Python. A minimal SConstruct file looks like this:
Program('hello.c')
Couldn’t be simpler! And you have the full power of Python syntax to keep your build file simple and readable. It’s interesting that all the entries in this category apart from one chose to use a Python-derived syntax for describing build steps. Python was clearly already a language of choice for flexible multi-purpose computing. The exception is the entry that chose to use XML instead, which I think is a horrible idea (oh how I used to love XML!) but has been used to great effect in the Java world by tools like Ant and Maven.
What happened to the original Software Carpentry?
“Software Carpentry was originally a competition to design new software tools, not a training course. The fact that you didn’t know that tells you how well it worked.” When I read this in a recent post on Greg Wilson’s blog, I took it as a challenge. I actually do remember the competition, although looking at the dates it was long over by the time I found it. I believe it did have impact; in fact, I still occasionally use one of the tools it produced, so Greg’s comment got me thinking: what happened to the other competition entries? Working out what happened will need a bit of digging, as most of the relevant information is now only available on the Internet Archive. It certainly seems that by November 2008 the domain name had been allowed to lapse and had been replaced with a holding page by the registrar.
There were four categories in the competition, each representing a category of tool that the organisers thought could be improved:
SC Build: a build tool to replace make
SC Config: a configuration management tool to replace autoconf and automake
SC Track: a bug tracking tool
SC Test: an easy to use testing framework
I’m hoping to be able to show that this work had a lot more impact than Greg is admitting here. I’ll keep you posted on what I find!
Changing static site generators: Nanoc → Hugo
I’ve decided to move the site over to a different static site generator, Hugo. I’ve been using Nanoc for a long time and it’s worked very well, but lately it’s been taking longer and longer to compile the site and throwing weird errors that I can’t get to the bottom of. At the time I started using Nanoc, static site generators were in their infancy. There weren’t the huge number of feature-loaded options that there are now, so I chose one and I built a whole load of blogging-related functionality myself. I did it in ways that made sense at the time but no longer work well with Nanoc’s latest versions. So it’s time to move to something that has blogging baked-in from the beginning and I’m taking the opportunity to overhaul the look and feel too. Again, when I started there weren’t many pre-existing themes so I built the whole thing myself and though I’m happy with the work I did on it it never quite felt polished enough. Now I’ve got the opportunity to adapt one of the many well-designed themes already out there, so I’ve taken one from the Hugo themes gallery and tweaked the colours to my satisfaction. Hugo also has various features that I’ve wanted to implement in Nanoc but never quite got round to it. The nicest one is proper handling of draft posts and future dates, but I keep finding others. There’s a lot of old content that isn’t quite compatible with the way Hugo does things so I’ve taken the old Nanoc-compiled content and frozen it to make sure that old links still work. I could probably fiddle with it for years without doing much so it’s probably time to go ahead and publish it. I’m still not completely happy with my choice of theme but one of the joys of Hugo is that I can change that whenever I want. Let me know what you think!
License
Except where otherwise stated, all content on eRambler by Jez Cope is licensed under a Creative Commons Attribution-ShareAlike 4.0 International license.
RDM Resources
I occasionally get asked for resources to help someone learn more about research data management (RDM) as a discipline (i.e. for those providing RDM support rather than simply wanting to manage their own data). I’ve therefore collected a few resources together on this page. If you’re lucky I might even update it from time to time! First, a caveat: this is very focussed on UK Higher Education, though much of it will still be relevant for people outside that narrow demographic. My general recommendation would be to start with the Digital Curation Centre (DCC) website and follow links out from there. I also have a slowly growing list of RDM links on Diigo, and there’s an RDM section in my list of blogs and feeds too.
Mailing lists
Jiscmail is a popular list server run for the benefit of further and higher education in the UK; the following lists are particularly relevant: RESEARCH-DATAMAN, DATA-PUBLICATION, DIGITAL-PRESERVATION and LIS-RESEARCHSUPPORT. The Research Data Alliance have a number of Interest Groups and Working Groups that discuss issues by email.
Events
International Digital Curation Conference — major annual conference
Research Data Management Forum — roughly every six months, places are limited!
RDA Plenary — also every 6 months, but only about 1 in every 3 in Europe
Books
In no particular order:
Martin, Victoria. Demystifying eResearch: A Primer for Librarians. Libraries Unlimited, 2014.
Borgman, Christine L. Big Data, Little Data, No Data: Scholarship in the Networked World. Cambridge, Massachusetts: The MIT Press, 2015.
Corti, Louise, Veerle Van den Eynden, and Libby Bishop. Managing and Sharing Research Data. Thousand Oaks, CA: SAGE Publications Ltd, 2014.
Pryor, Graham, ed. Managing Research Data. Facet Publishing, 2012.
Pryor, Graham, Sarah Jones, and Angus Whyte, eds. Delivering Research Data Management Services: Fundamentals of Good Practice. Facet Publishing, 2013.
Ray, Joyce M., ed. Research Data Management: Practical Strategies for Information Professionals. West Lafayette, Indiana: Purdue University Press, 2014.
Reports
‘Ten Recommendations for Libraries to Get Started with Research Data Management’. LIBER, 24 August 2012. http://libereurope.eu/news/ten-recommendations-for-libraries-to-get-started-with-research-data-management/.
‘Science as an Open Enterprise’. Royal Society, 2 June 2012. https://royalsociety.org/policy/projects/science-public-enterprise/Report/.
Mary Auckland. ‘Re-Skilling for Research’. RLUK, January 2012. http://www.rluk.ac.uk/wp-content/uploads/2014/02/RLUK-Re-skilling.pdf.
Journals
International Journal of Digital Curation (IJDC)
Journal of eScience Librarianship (JeSLib)
Fairphone 2: initial thoughts on the original ethical smartphone
I’ve had my eye on the Fairphone 2 for a while now, and when my current phone, an aging Samsung Galaxy S4, started playing up I decided it was time to take the plunge. A few people have asked for my thoughts on the Fairphone so here are a few notes.
Why I bought it
The thing that sparked my interest, and the main reason for buying the phone really, was the ethical stance of the manufacturer. The small Dutch company have gone to great lengths to ensure that both labour and materials are sourced as responsibly as possible. They regularly inspect the factories where the parts are made and assembled to ensure fair treatment of the workers and they source all the raw materials carefully to minimise the environmental impact and the use of conflict minerals. Another side to this ethical stance is a focus on longevity of the phone itself. This is not a product with an intentionally limited lifespan. Instead, it’s designed to be modular and as repairable as possible, by the owner themselves. Spares are available for all of the parts that commonly fail in phones (including screen and camera), and at the time of writing the Fairphone 2 is the only phone to receive 10/10 for repairability from iFixit. There are plans to allow hardware upgrades, including an expansion port on the back so that NFC or wireless charging could be added with a new case, for example.
What I like
So far, the killer feature for me is the dual SIM card slots.
I have both a personal and a work phone, and the latter was always getting left at home or in the office or running out of charge. Now I have both SIMs in the one phone: I can receive calls on either number, turn them on and off independently and choose which account to use when sending a text or making a call. The OS is very close to “standard” Android, which is nice, and I really don’t miss all the extra bloatware that came with the Galaxy S4. It also has twice the storage of that phone, which is hardly unique but is still nice to have. Overall, it seems like a solid, reliable phone, though it’s not going to outperform anything else at the same price point. It certainly feels nice and snappy for everything I want to use it for. I’m no mobile gamer, but there is that distant promise of upgradability on the horizon if you are.
What I don’t like
I only have two bugbears so far. Once or twice it’s locked up and become unresponsive, requiring a “manual reset” (removing and replacing the battery) to get going again. It also lacks NFC, which isn’t really a deal breaker, but I was just starting to make occasional use of it on the S4 (mostly experimenting with my Yubikey NEO) and it would have been nice to try out Android Pay when it finally arrives in the UK.
Overall
It’s definitely a serious contender if you’re looking for a new smartphone and aren’t bothered about serious mobile gaming. You do pay a premium for the ethical sourcing and modularity, but I feel that’s worth it for me. I’m looking forward to seeing how it works out as a phone.
Wiring my web
I’m a nut for automating repetitive tasks, so I was dead pleased a few years ago when I discovered that IFTTT let me plug different bits of the web together. I now use it for tasks such as:
Syndicating blog posts to social media
Creating scheduled/repeating todo items from a Google Calendar
Making a note to revisit an article I’ve starred in Feedly
I’d probably only be half-joking if I said that I spend more time automating things than I save not having to do said things manually. Thankfully it’s also a great opportunity to learn, and recently I’ve been thinking about reimplementing some of my IFTTT workflows myself to get to grips with how it all works. There are some interesting open source projects designed to offer a lot of this functionality, such as Huginn, but I decided to go for a simpler option for two reasons: I want to spend my time learning about the APIs of the services I use and how to wire them together, rather than learning how to use another big framework; and I only have a small Amazon EC2 server to play with and a heavy Ruby on Rails app like Huginn (plus web server) needs more memory than I have. Instead I’ve gone old-school with a little collection of individual scripts to do particular jobs. I’m using the built-in scheduling functionality of systemd, which is already part of a modern Linux operating system, to get them to run periodically. It also means I can vary the language I use to write each one depending on the needs of the job at hand and what I want to learn/feel like at the time. Currently it’s all done in Python, but I want to have a go at Lisp sometime, and there are some interesting new languages like Go and Julia that I’d like to get my teeth into as well. You can see my code on github as it develops: https://github.com/jezcope/web-plumbing. Comments and contributions are welcome (if not expected) and let me know if you find any of the code useful.
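As an illustration of the kind of script involved (a simplified sketch of my own rather than code from that repository; the feed URL and webhook address are placeholders), here is a small Python job that checks an RSS feed and pushes any new items to a webhook:

import json
from pathlib import Path

import feedparser  # pip install feedparser
import requests    # pip install requests

FEED_URL = "https://example.org/feed.xml"    # placeholder: the feed to watch
WEBHOOK_URL = "https://example.org/webhook"  # placeholder: where to send new items
SEEN_FILE = Path.home() / ".cache" / "feed-to-webhook-seen.json"

def load_seen() -> set:
    return set(json.loads(SEEN_FILE.read_text())) if SEEN_FILE.exists() else set()

def save_seen(seen: set) -> None:
    SEEN_FILE.parent.mkdir(parents=True, exist_ok=True)
    SEEN_FILE.write_text(json.dumps(sorted(seen)))

def main() -> None:
    seen = load_seen()
    for entry in feedparser.parse(FEED_URL).entries:
        uid = entry.get("id", entry.get("link"))  # fall back to the link if there's no GUID
        if uid not in seen:
            requests.post(WEBHOOK_URL, json={"title": entry.get("title"), "link": entry.get("link")}, timeout=30)
            seen.add(uid)
    save_seen(seen)

if __name__ == "__main__":
    main()

A systemd service and timer pair pointing at a script like this then stands in for IFTTT's scheduler, and swapping the webhook call for a todo-list or social media API gives the other workflows.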
Image credit: xkcd #1319, Automation
Data is like water, and language is like clothing
I admit it: I’m a grammar nerd. I know the difference between ‘who’ and ‘whom’, and I’m proud. I used to be pretty militant, but these days I’m more relaxed. I still take joy in the mechanics of the language, but I also believe that English is defined by its usage, not by a set of arbitrary rules. I’m just as happy to abuse it as to use it, although I still think it’s important to know what rules you’re breaking and why. My approach now boils down to this: language is like clothing. You (probably) wouldn’t show up to a job interview in your pyjamas[1], but neither are you going to wear a tuxedo or ballgown to the pub. Getting commas and semicolons in the right place is like getting your shirt buttons done up right. Getting it wrong doesn’t mean you’re an idiot. Everyone will know what you meant. It will affect how you’re perceived, though, and that will affect how your message is perceived. And there are former rules[2] that some still enforce that are nonetheless dropping out of regular usage. There was a time when everyone in an office job wore formal clothing. Then it became acceptable just to have a blouse, or a shirt and tie. Then the tie became optional and now there are many professions where perfectly well-respected and competent people are expected to show up wearing nothing smarter than jeans and a t-shirt. One such rule IMHO is that ‘data’ is a plural and should take pronouns like ‘they’ and ‘these’. The origin of the word ‘data’ is in the Latin plural of ‘datum’, and that idea has clung on for a considerable period. But we don’t speak Latin and the English language continues to evolve: ‘agenda’ also began life as a Latin plural, but we don’t use the word ‘agendum’ any more. It’s common everyday usage to refer to data with singular pronouns like ‘it’ and ‘this’, and it’s very rare to see someone referring to a single datum (as opposed to ‘data point’ or something). If you want to get technical, I tend to think of data as a mass noun, like ‘water’ or ‘information’. It’s uncountable: talking about ‘a water’ or ‘an information’ doesn’t make much sense, but it uses singular pronouns, as in ‘this information’. If you’re interested, the Oxford English Dictionary also takes this position, while Chambers leaves the choice of singular or plural noun up to you. There is absolutely nothing wrong, in my book, with referring to data in the plural as many people still do. But it’s no longer a rule and for me it’s weakened further from guideline to preference. It’s like wearing a bow-tie to work. There’s nothing wrong with it and some people really make it work, but it’s increasingly outdated and even a little eccentric.
[1] or maybe you’d totally rock it.
[2] Like not starting a sentence with a conjunction…
#IDCC16 day 2: new ideas
Well, I did a great job of blogging the conference for a couple of days, but then I was hit by the bug that’s been going round and didn’t have a lot of energy for anything other than paying attention and making notes during the day! I’ve now got round to reviewing my notes so here are a few reflections on day 2. Day 2 was the day of many parallel talks! So many great and inspiring ideas to take in! Here are a few of my take-home points.
Big science and the long tail
The first parallel session had examples of practical data management in the real world.
Jian Qin & Brian Dobreski (School of Information Studies, Syracuse University) worked on reproducibility with one of the research groups involved with the recent gravitational wave discovery. “Reproducibility” for this work (as with much of physics) mostly equates to computational reproducibility: tracking the provenance of the code and its input and output is key. They also found that in practice the scientists' focus was on making the big discovery, and ensuring reproducibility was seen as secondary. This goes some way to explaining why current workflows and tools don’t really capture enough metadata. Milena Golshan & Ashley Sands (Center for Knowledge Infrastructures, UCLA) investigated the use of Software-as-a-Service (SaaS, such as Google Drive, Dropbox or more specialised tools) as a way of meeting the needs of long-tail science research such as ocean science. This research is characterised by small teams, diverse data, dynamic local development of tools, local practices and difficulty disseminating data. This results in a need for researchers to be generalists, as opposed to “big science” research areas, where they can afford to specialise much more deeply. Such generalists tend to develop their own isolated workflows, which can differ greatly even within a single lab. Long-tail research also often struggles from a lack of dedicated IT support. They found that use of SaaS could help to meet these challenges, but with a high cost required to cover the needed guarantees of security and stability.
Education & training
This session focussed on the professional development of library staff. Eleanor Mattern (University of Pittsburgh) described the immersive training introduced to improve librarians' understanding of the data needs of their subject areas, as part of their RDM service delivery model. The participants each conducted a “disciplinary deep dive”, shadowing researchers and then reporting back to the group on their discoveries with a presentation and discussion. Liz Lyon (also University of Pittsburgh, formerly UKOLN/DCC) gave a systematic breakdown of the skills, knowledge and experience required in different data-related roles, obtained from an analysis of job adverts. She identified distinct roles of data analyst, data engineer and data journalist, and as well as each role’s distinctive skills, pinpointed common requirements of all three: Python, R, SQL and Excel. This work follows on from an earlier phase which identified an allied set of roles: data archivist, data librarian and data steward.
Data sharing and reuse
This session gave an overview of several specific workflow tools designed for researchers. Marisa Strong (University of California Curation Centre/California Digital Library) presented Dash, a highly modular tool for manual data curation and deposit by researchers. It’s built on their flexible backend, Stash, and though it’s currently optimised to deposit in their Merritt data repository it could easily be hooked up to other repositories. It captures DataCite metadata and a few other fields, and is integrated with ORCID to uniquely identify people. In a different vein, Eleni Castro (Institute for Quantitative Social Science, Harvard University) discussed some of the ways that Harvard’s Dataverse repository is streamlining deposit by enabling automation. It provides a number of standardised endpoints such as OAI-PMH for metadata harvest and SWORD for deposit, as well as custom APIs for discovery and deposit.
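For anyone who hasn't met OAI-PMH before, here is a rough sketch (my own illustration, not code from the talk) of harvesting Dublin Core titles from such an endpoint with plain HTTP requests; the base URL is a placeholder, and a real harvester would also follow resumption tokens to page through large result sets.

import requests
import xml.etree.ElementTree as ET

BASE_URL = "https://repository.example.edu/oai"  # placeholder OAI-PMH endpoint
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"
DC_NS = "{http://purl.org/dc/elements/1.1/}"

def list_titles():
    """Yield the Dublin Core title of each record in the first page of results."""
    params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    response = requests.get(BASE_URL, params=params, timeout=30)
    response.raise_for_status()
    root = ET.fromstring(response.content)
    for record in root.iter(f"{OAI_NS}record"):
        title = record.find(f".//{DC_NS}title")
        if title is not None and title.text:
            yield title.text

if __name__ == "__main__":
    for title in list_titles():
        print(title)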
Interesting use cases include:
An addon for the Open Science Framework to deposit in Dataverse via SWORD
An R package to enable automatic deposit of simulation and analysis results
Integration with publisher workflows such as Open Journal Systems
A growing set of visualisations for deposited data
In the future they’re also looking to integrate with DMPtool to capture data management plans and with Archivematica for digital preservation. Andrew Treloar (Australian National Data Service) gave us some reflections on the ANDS “applications programme”, a series of 25 small funded projects intended to address the fourth of their strategic transformations, single use → reusable. He observed that essentially these projects worked because they were able to throw money at a problem until they found a solution: not very sustainable. Some of them stuck to a traditional “waterfall” approach to project management, resulting in “the right solution 2 years late”. Every researcher’s needs are “special” and communities are still constrained by old ways of working. The conclusions from this programme were that:
“Good enough” is fine most of the time
Adopt/Adapt/Augment is better than Build
Existing toolkits let you focus on the 10% functionality that’s missing
Successful projects involved research champions who can: 1) articulate their community’s requirements; and 2) promote project outcomes
Summary
All in all, it was a really exciting conference, and I’ve come home with loads of new ideas and plans to develop our services at Sheffield. I noticed a continuation of some of the trends I spotted at last year’s IDCC, especially an increasing focus on “second-order” problems: we’re no longer spending most of our energy just convincing researchers to take data management seriously and are able to spend more time helping them to do it better and get value out of it. There’s also a shift in emphasis (identified by closing speaker Cliff Lynch) from sharing to reuse, and making sure that data is not just available but valuable.
#IDCC16 Day 1: Open Data
The main conference opened today with an inspiring keynote by Barend Mons, Professor in Biosemantics, Leiden University Medical Center. The talk had plenty of great stuff, but two points stood out for me. First, Prof Mons described a newly discovered link between Huntington’s Disease and a previously unconsidered gene. No-one had previously recognised this link, but on mining the literature, an indirect link was identified in more than 10% of the roughly 1 million scientific claims analysed. This is knowledge for which we already had more than enough evidence, but which could never have been discovered without such a wide-ranging computational study. Second, he described a number of behaviours which should be considered “malpractice” in science:
Relying on supplementary data in articles for data sharing: the majority of this is trash (paywalled, embedded in bitmap images, missing)
Using the Journal Impact Factor to evaluate science and ignoring altmetrics
Not writing data stewardship plans for projects (he prefers this term to “data management plan”)
Obstructing tenure for data experts by assuming that all highly-skilled scientists must have a long publication record
A second plenary talk from Andrew Sallans of the Centre for Open Science introduced a number of interesting-looking bits and bobs, including the Transparency & Openness Promotion (TOP) Guidelines which set out a pathway to help funders, publishers and institutions move towards more open science.
The rest of the day was taken up with a panel on open data, a poster session, some demos and a birds-of-a-feather session on sharing sensitive/confidential data. There was a great range of posters, but a few that stood out to me were:
Lessons learned about ISO 16363 (“Audit and certification of trustworthy digital repositories”) certification from the British Library
Two separate posters (from the Universities of Toronto and Colorado) about disciplinary RDM information & training for liaison librarians
A template for sharing psychology data developed by a psychologist-turned-information researcher from Carnegie Mellon University
More to follow, but for now it’s time for the conference dinner!
#IDCC16 Day 0: business models for research data management
I’m at the International Digital Curation Conference 2016 (#IDCC16) in Amsterdam this week. It’s always a good opportunity to pick up some new ideas and catch up with colleagues from around the world, and I always come back full of new possibilities. I’ll try and do some more reflective posts after the conference but I thought I’d do some quick reactions while everything is still fresh. Monday and Thursday are pre- and post-conference workshop days, and today I attended Developing Research Data Management Services. Joy Davidson and Jonathan Rans from the Digital Curation Centre (DCC) introduced us to the Business Model Canvas, a template for designing a business model on a single sheet of paper. The model prompts you to think about all of the key facets of a sustainable, profitable business, and can easily be adapted to the task of building a service model within a larger institution. The DCC used it as part of the Collaboration to Clarify Curation Costs (4C) project, whose output, the Curation Costs Exchange, is also worth a look. It was a really useful exercise to be able to work through the whole process for an aspect of research data management (my table focused on training & guidance provision), both because of the ideas that came up and also the experience of putting the framework into practice. It seems like a really valuable tool and I look forward to seeing how it might help us with our RDM service development. Tomorrow the conference proper begins, with a range of keynotes, panel sessions and birds-of-a-feather meetings so hopefully more then!
About me
I help people in Higher Education communicate and collaborate more effectively using technology. I currently work at the University of Sheffield focusing on research data management policy, practice, training and advocacy. In my free time, I like to: run; play the accordion; morris dance; climb; cook; read (fiction and non-fiction); write.
Better Science Through Better Data #scidata17
Better Science through Better Doughnuts. Image credit: Jez Cope
Update: fixed the link to the slides so it works now! Last week I had the honour of giving my first ever keynote talk, at an event entitled Better Science Through Better Data hosted jointly by Springer Nature and the Wellcome Trust. It was nerve-wracking but exciting and seemed to go down fairly well. I even got accidentally awarded a PhD in the programme — if only it was that easy! The slides for the talk, “Supporting Open Research: The role of an academic library”, are available online (doi:10.15131/shef.data.5537269), and the whole event was video’d for posterity and viewable online. I got some good questions too, mainly from the clever online question system.
I didn’t get to answer all of them, so I’m thinking of doing a blog post or two to address a few more. There were loads of other great presentations as well, both keynotes and 7-minute lightning talks, so I’d encourage you to take a look at at least some of it. I’ll pick out a few of my highlights.
Dr Aled Edwards (University of Toronto)
There’s a major problem with science funding that I hadn’t really thought about before. The available funding pool for research is divided up into pots by country, and often by funding body within a country. Each of these pots has robust processes to award funding to the most important problems and most capable researchers. The problem comes because there is no coordination between these pots, so researchers all over the world end up getting funded to research the most popular problems, leading to a lot of duplication of effort. Industry funding suffers from a similar problem, particularly the pharmaceutical industry. Because there is no sharing of data or negative results, multiple companies spend billions researching the same dead ends chasing after the same drugs. This is where the astronomical costs of drug development come from. Dr Edwards presented one alternative, modelled by a company called M4K Pharma. The idea is to use existing IP laws to try and give academic researchers a reasonable, morally-justifiable and sustainable profit on drugs they develop, in contrast to the current model where basic research is funded by governments while large corporations hoover up as much profit as they possibly can. This new model would develop drugs all the way to human trial within academia, then license the resulting drugs to companies to manufacture with a price cap to keep the medicines affordable to all who need them. Core to this effort is openness with data, materials and methodology, and Dr Edwards presented several examples of how this approach benefited academic researchers, industry and patients compared with a closed, competitive focus.
Dr Kirstie Whitaker (Alan Turing Institute)
This was a brilliant presentation: a practical how-to guide to doing reproducible research, from one researcher to another. I suggest you take a look at her slides yourself: Showing your working: a how-to guide to reproducible research. Dr Whitaker briefly addressed a number of common barriers to reproducible research:
Is not considered for promotion: so it should be!
Held to higher standards than others: reviewers should be discouraged from nitpicking just because the data/code/whatever is available (true unbiased peer review of these would be great though)
Publication bias towards novel findings: it is morally wrong to not publish reproductions, replications etc. so we need to address the common taboo on doing so
Plead the 5th: if you share, people may find flaws, but if you don’t they can’t — if you’re worried about this you should ask yourself why!
Support additional users: some (much?) of the burden should reasonably fall on the reuser, not the sharer
Takes time: this is only true if you hack it together after the fact; if you do it from the start, the whole process will be quicker!
Requires additional skills: important to provide training, but also to judge PhD students on their ability to do this, not just on their thesis & papers
The rest of the presentation, the “how-to” guide of the title, was a well-chosen and passionately delivered set of recommendations, but the thing that really stuck out for me is how good Dr Whitaker is at making the point that you only have to do one of these things to improve the quality of your research. It’s easy to get the impression at the moment that you have to be fully, perfectly open or not at all, but it’s actually OK to get there one step at a time, or even not to go all the way at all! Anyway, I think this is a slide deck that speaks for itself, so I won’t say any more!
Lightning talk highlights
There was plenty of good stuff in the lightning talks, which were constrained to 7 minutes each, but a few of the things that stood out for me were, in no particular order:
Code Ocean — share and run code in the cloud
dat project — peer to peer data synchronisation tool: can automate metadata creation, data syncing and versioning, and set up a secure data sharing network that keeps the data in sync but off the cloud
Berlin Institute of Health — open science course for students (pre-print paper and course materials)
InterMine — taking the pain out of data cleaning & analysis
Nix/NixOS as a component of a reproducible paper
BoneJ (ImageJ plugin for bone analysis) — developed by a scientist, used a lot, now has a Wellcome-funded RSE to develop next version
ESASky — amazing live, online archive of masses of astronomical data
Coda
I really enjoyed the event (and the food was excellent too). My thanks go out to:
The programme committee for asking me to come and give my take — I hope I did it justice!
The organising team who did a brilliant job of keeping everything running smoothly before and during the event
The University of Sheffield for letting me get away with doing things like this!
Blog platform switch
I’ve just switched my blog over to the Nikola static site generator. Hopefully you won’t notice a thing, but there might be a few weird spectres around til I get all the kinks ironed out. I’ve made the switch for a couple of main reasons:
Nikola supports Jupyter notebooks as a source format for blog posts, which will be useful to include code snippets
It’s written in Python, a language which I actually know, so I’m more likely to be able to fix things that break, customise it and potentially contribute to the open source project (by contrast, Hugo is written in Go, which I’m not really familiar with)
Chat rooms vs Twitter: how I communicate now
CC0, Pixabay
This time last year, Brad Colbow published a comic in his “The Brads” series entitled “The long slow death of Twitter”. It really encapsulates the way I’ve been feeling about Twitter for a while now. Go ahead and take a look. I’ll still be here when you come back. According to my Twitter profile, I joined in February 2009 as user #20,049,102. It was nearing its 3rd birthday and, though there were clearly a lot of people already signed up at that point, it was still relatively quiet, especially in the UK. I was a lonely PhD student just starting to get interested in educational technology, and one thing that Twitter had in great supply was (and still is) people pushing back the boundaries of what tech can do in different contexts.
Somewhere along the way Twitter got really noisy, partly because more people (especially commercial companies) are using it more to talk about stuff that doesn’t interest me, and partly because I now follow 1,200+ people and find I get several tweets a second at peak times, which no-one could be expected to handle. More recently I’ve found my attention drawn to more focussed communities instead of that big old shouting match. I find I’m much more comfortable discussing things and asking questions in small focussed communities because I know who might be interested in what. If I come across an article about a cool new Python library, I’ll geek out about it with my research software engineer friends; if I want advice on an aspect of my emacs setup, I’ll ask a bunch of emacs users. I feel like I’m talking to people who want to hear what I’m saying. Next to that experience, Twitter just feels like standing on a street corner shouting. IRC channels (mostly on Freenode), and similar things like Slack and gitter form the bulk of this for me, along with a growing number of WhatsApp group chats. Although online chat is theoretically a synchronous medium, I find that I can treat it more as “semi-synchronous”: I can have real-time conversations as they arise, but I can also close them and tune back in later to catch up if I want. Now I come to think about it, this is how I used to treat Twitter before the 1,200 follows happened. I also find I visit a handful of forums regularly, mostly of the Reddit link-sharing or StackExchange Q&A type. /r/buildapc was invaluable when I was building my latest box, /r/EarthPorn (very much not NSFW) is just beautiful. I suppose the risk of all this is that I end up reinforcing my own echo chamber. I’m not sure how to deal with that, but I certainly can’t deal with it while also suffering from information overload.
Not just certifiable…
A couple of months ago, I went to Oxford for an intensive, 2-day course run by Software Carpentry and Data Carpentry for prospective new instructors. I’ve now had confirmation that I’ve completed the checkout procedure so it’s official: I’m now a certified Data Carpentry instructor! As far as I’m aware, the certification process is now combined, so I’m also approved to teach Software Carpentry material too. And of course there’s Library Carpentry too…
SSI Fellowship 2020
I’m honoured and excited to be named one of this year’s Software Sustainability Institute Fellows. There’s not much to write about yet because it’s only just started, but I’m looking forward to sharing more with you. In the meantime, you can take a look at the 2020 fellowship announcement and get an idea of my plans from my application video:
Talks
Here is a selection of talks that I’ve given. [Table of talks on the site: date, title and location.]
escueladefiscales-com-2189 ---- Escuela de Fiscales
Escuela de Fiscales: Citizen Participation and Open Government. Join Proyecto Yarquen!
Climate change is one of the greatest threats facing our world today, so we have to use every means at our disposal to stop it. At Escuela de Fiscales we created "Proyecto Yarquen", which uses open data for environmental activism. Proyecto Yarquen consists of building a website aimed at civil society organisations, environmental activists, data journalists and people interested in environmental issues which, using an API tool currently under development, pulls in datasets from the official transparency portals of Argentina's national, provincial and municipal governments, organises them into different categories and, with an internal search engine, makes them easy to access for people who are not familiar with using and working with open data. One of the biggest difficulties open data users face in our country is that datasets are spread across dozens of different official portals, making it very hard for people who do not work with open data regularly to get complete information on a specific topic. The Proyecto Yarquen website aims to remove those barriers by allowing all the available information to be accessed from a single place, with simple search expressions that return the complete set of matching datasets. The portal will also include a section for generating freedom of information requests, for cases where some necessary information is not available on the official websites. It will also have a special section where civil society organisations and environmental activists can register, making it easier for them to connect and work together. All of this will give civil society useful tools to work for the care and preservation of the environment more effectively, with greater knowledge and information about specific issues. How can you help? If you are a programmer, designer, data journalist or environmental activist, work with open data, belong to an organisation or collective fighting to protect the environment, or simply care about climate change and want to do your part, we need you! Send an email to info@escueladefiscales.com and we will get in touch with you shortly.
escueladefiscales-com-6368 ---- Escuela de Fiscales – Participación Ciudadana y Gobierno Abierto
An Argentine project shortlisted for the "Net Zero Challenge", an international competition that rewards the use of open data for climate action.
Mar del Plata celebrated Open Data Day.
Open Data Day Mar del Plata 2021.
We took part in the creation of the first Open Congress Action Plan.
Check your details in the 2021 electoral roll (PADRÓN 2021)!
Escuela de Fiscales took part in the virtual leaders’ summit of the Open Government Partnership (OGP). Frena la Curva! To do our part in the measures against the new Covid-19 coronavirus pandemic, we joined #FrenaLaCurvaArgentina, part of the international #FrenaLaCurva network. Federal Forum against Gender-Based Violence: on Thursday 13 February we took part in the meeting convened in Chapadmalal by the national Ministry of Women, Genders and Diversity, which also brought together community organisations, civil society organisations, representatives of local and provincial governments, legislators and the general public. Escuela de Fiscales attended the launch of the 4th Open Government Plan at the Casa Rosada: Escuela de Fiscales, an organisation that promotes citizen participation and institutional and electoral transparency, took part in the launch of the 4th National Open Government Plan and in Argentina’s assumption of the co-chairship of the global Open Government Partnership, at an event held on Thursday 19 September at the Casa Rosada, also attended by government officials, ambassadors and representatives of more than 70 civil society organisations that worked on drafting the Plan. Get to know our activities: Open Government, Democracy and Elections, Gender Policies. Featured News: An Argentine project shortlisted to win the “Net Zero Challenge”, an international competition that rewards the use of open data for climate action. The project is Proyecto Yarquen, developed by the civil society organisation Escuela de Fiscales, which uses open data and technology as tools in the fight for the environment. Mar del Plata celebrated Open Data Day: for the fourth year running, Mar del Plata was part of the international calendar of “Open Data Day” events, with a gathering organised by Escuela de Fiscales that worked on the environment, technology, sustainable development, ecology and environmental activism. On Saturday 6 March, Mar del Plata once again joined the international Open Data Day events, an annual celebration … We took part in creating the first Open Congress Action Plan.
The Honourable Chamber of Deputies of the Nation began the process of drawing up the first Open Congress Action Plan, with the aim of building a more open, transparent and participatory parliament, and Escuela de Fiscales took part in the working groups and in co-creating the commitments that the HCDN will take on between March 2021 and July of … Escuela de Fiscales took part in the virtual leaders’ summit of the Open Government Partnership (OGP): on 24 September, Escuela de Fiscales took part in the OGP virtual leaders’ summit, at which the co-chairship of the organisation, which our country had held alongside Robin Hodess, was handed over to the Government of Korea and to María Baron representing civil society. Blog: An Argentine project shortlisted to win the “Net Zero Challenge”, an international competition that rewards the use of open data for climate action; Mar del Plata celebrated Open Data Day; Open Data Day Mar del Plata 2021: Escuela de Fiscales is preparing an event on the environment and … erambler-co-uk-695 ---- eRambler: recent content on eRambler. Intro to the fediverse Wow, it turns out to be 10 years since I wrote this beginners’ guide to Twitter. Things have moved on a loooooong way since then. Far from being the interesting, disruptive technology it was back then, Twitter has become part of the mainstream, the establishment. Almost everyone and everything is on Twitter now, which has both pros and cons. So what’s the problem? It’s now possible to follow all sorts of useful information feeds, from live updates on transport delays to your favourite sports team’s play-by-play performance to an almost infinite number of cat pictures. In my professional life it’s almost guaranteed that anyone I meet will be on Twitter, meaning that I can contact them to follow up at a later date without having to exchange contact details (and they have options to block me if they don’t like that). On the other hand, a medium where everyone’s opinion is equally valid regardless of knowledge or life experience has turned some parts of the internet into a toxic swamp of hatred and vitriol. It’s easier than ever to forget that we have more common ground with any random stranger than we have differences, and that’s led to some truly awful acts and a poisonous political arena.
Part of the problem here is that each of the social media platforms is controlled by a single entity with almost no accountability to anyone other than shareholders. Technological change has been so rapid that the regulatory regime has no idea how to handle them, leaving them largely free to operate how they want. This has led to a whole heap of nasty consequences that many other people have done a much better job of documenting than I could (Shoshana Zuboff’s book The Age of Surveillance Capitalism is a good example). What I’m going to focus on instead are some possible alternatives. If you accept the above argument, one obvious solution is to break up the effective monopoly enjoyed by Facebook, Twitter et al. We need to be able to retain the wonderful affordances of social media but democratise control of it, so that it can never be dominated by a small number of overly powerful players. What’s the solution? There’s actually a thing that already exists, that almost everyone is familiar with and that already works like this. It’s email. There are a hundred thousand email servers, but my email can always find your inbox if I know your address because that address identifies both you and the email service you use, and they communicate using the same protocol, Simple Mail Transfer Protocol (SMTP)1. I can’t send a message to your Twitter from my Facebook though, because they’re completely incompatible, like oil and water. Facebook has no idea how to talk to Twitter and vice versa (and the companies that control them have zero interest in such interoperability anyway). Just like email, a federated social media service like Mastodon allows you to use any compatible server, or even run your own, and follow accounts on your home server or anywhere else, even servers running different software as long as they use the same ActivityPub protocol. There’s no lock-in because you can move to another server any time you like, and interact with all the same people from your new home, just like changing your email address. Smaller servers mean that no one server ends up with enough power to take over and control everything, as the social media giants do with their own platforms. But at the same time, a small server with a small moderator team can enforce local policy much more easily and block accounts or whole servers that host trolls, nazis or other poisonous people. How do I try it? I have no problem with anyone choosing to continue to use what we’re already calling “traditional” social media; frankly, Facebook and Twitter are still useful for me to keep in touch with a lot of my friends. However, I do think it’s useful to know some of the alternatives, if only to make a more informed decision to stick with your current choices. Most of these services only ask for an email address when you sign up, and use of your real name vs a pseudonym is entirely optional, so there’s not really any risk in signing up and giving one a try. That said, make sure you take sensible precautions like not reusing a password from another account. Instead of Twitter or Facebook, try Mastodon, Pleroma or Misskey. Instead of Slack, Discord or IRC, try Matrix. Instead of WhatsApp, FB Messenger or Telegram, also try Matrix. Instead of Instagram or Flickr, try PixelFed. Instead of YouTube, try PeerTube. And instead of the web itself, try the Interplanetary File System (IPFS). 1. Which, if you can believe it, was formalised nearly 40 years ago in 1982 and has only had fairly minor changes since then!
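To make the email analogy a bit more concrete: a fediverse address like someone@example.social is resolved to the right server using WebFinger, a standard discovery endpoint that ActivityPub servers such as Mastodon expose. Here is a minimal Python sketch of that lookup; the handle and domain are made up, and this is just an illustration of the addressing model, not something an ordinary user ever needs to do.

```python
# Minimal sketch: resolve a fediverse handle (user@domain) to its ActivityPub
# actor URL via WebFinger, the same discovery step a client or another server
# performs. The handle below is hypothetical.
import requests

def resolve_actor(handle: str) -> str:
    user, domain = handle.lstrip("@").split("@", 1)
    resp = requests.get(
        f"https://{domain}/.well-known/webfinger",
        params={"resource": f"acct:{user}@{domain}"},
        timeout=10,
    )
    resp.raise_for_status()
    for link in resp.json().get("links", []):
        # The "self" link points at the ActivityPub actor document.
        if link.get("rel") == "self" and "activity+json" in link.get("type", ""):
            return link["href"]
    raise ValueError(f"No ActivityPub actor found for {handle}")

print(resolve_actor("@someone@example.social"))
```

The actor document found this way tells other servers where to deliver posts, in much the same way that DNS tells a mail server where to hand over a message.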
↩︎ Collaborations Workshop 2021: collaborative ideas & hackday My last post covered the more “traditional” lectures-and-panel-sessions approach of the first half of the SSI Collaborations Workshop. The rest of the workshop was much more interactive, consisting of a discussion session, a Collaborative Ideas session, and a whole-day hackathon! The discussion session on day one had us choose a topic (from a list of topics proposed leading up to the workshop) and join a breakout room for that topic with the aim of producing a “speed blog” by then end of 90 minutes. Those speed blogs will be published on the SSI blog over the coming weeks, so I won’t go into that in more detail. The Collaborative Ideas session is a way of generating hackday ideas, by putting people together at random into small groups to each raise a topic of interest to them before discussing and coming up with a combined idea for a hackday project. Because of the serendipitous nature of the groupings, it’s a really good way of generating new ideas from unexpected combinations of individual interests. After that, all the ideas from the session, along with a few others proposed by various participants, were pitched as ideas for the hackday and people started to form teams. Not every idea pitched gets worked on during the hackday, but in the end 9 teams of roughly equal size formed to spend the third day working together. My team’s project: “AHA! An Arts & Humanities Adventure” There’s a lot of FOMO around choosing which team to join for an event like this: there were so many good ideas and I wanted to work on several of them! In the end I settled on a team developing an escape room concept to help Arts & Humanities scholars understand the benefits of working with research software engineers for their research. Five of us rapidly mapped out an example storyline for an escape room, got a website set up with GitHub and populated it with the first few stages of the game. We decided to focus on a story that would help the reader get to grips with what an API is and I’m amazed how much we managed to get done in less than a day’s work! You can try playing through the escape room (so far) yourself on the web, or take a look at the GitHub repository, which contains the source of the website along with a list of outstanding tasks to work on if you’re interested in contributing. I’m not sure yet whether this project has enough momentum to keep going, but it was a really valuable way both of getting to know and building trust with some new people and demonstrating the concept is worth more work. Other projects Here’s a brief rundown of the other projects worked on by teams on the day. Coding Confessions Everyone starts somewhere and everyone cuts corners from time to time. Real developers copy and paste! Fight imposter syndrome by looking through some of these confessions or contributing your own. https://coding-confessions.github.io/ CarpenPI A template to set up a Raspberry Pi with everything you need to run a Carpentries (https://carpentries.org/) data science/software engineering workshop in a remote location without internet access. 
https://github.com/CarpenPi/docs/wiki Research Dugnads A guide to running an event that is a coming together of a research group or team to share knowledge, pass on skills, tidy and review code, among other software and working best practices (based on the Norwegian concept of a dugnad, a form of “voluntary work done together with other people”) https://research-dugnads.github.io/dugnads-hq/ Collaborations Workshop ideas A meta-project to collect together pitches and ideas from previous Collaborations Workshop conferences and hackdays, to analyse patterns and revisit ideas whose time might now have come. https://github.com/robintw/CW-ideas howDescribedIs Integrate existing tools to improve the machine-readable metadata attached to open research projects by integrating projects like SOMEF, codemeta.json and HowFAIRIs (https://howfairis.readthedocs.io/en/latest/index.html). Complete with CI and badges! https://github.com/KnowledgeCaptureAndDiscovery/somef-github-action Software end-of-project plans Develop a template to plan and communicate what will happen when the fixed-term project funding for your research software ends. Will maintenance continue? When will the project sunset? Who owns the IP? https://github.com/elichad/software-twilight Habeas Corpus A corpus of machine readable data about software used in COVID-19 related research, based on the CORD19 dataset. https://github.com/softwaresaved/habeas-corpus Credit-all Extend the all-contributors GitHub bot (https://allcontributors.org/) to include rich information about research project contributions such as the CASRAI Contributor Roles Taxonomy (https://casrai.org/credit/) https://github.com/dokempf/credit-all I’m excited to see so many metadata-related projects! I plan to take a closer look at what the Habeas Corpus, Credit-all and howDescribedIs teams did when I get time. I also really want to try running a dugnad with my team or for the GLAM Data Science network. Collaborations Workshop 2021: talks & panel session I’ve just finished attending (online) the three days of this year’s SSI Collaborations Workshop (CW for short), and once again it’s been a brilliant experience, as well as mentally exhausting, so I thought I’d better get a summary down while it’s still fresh it my mind. Collaborations Workshop is, as the name suggests, much more focused on facilitating collaborations than a typical conference, and has settled into a structure that starts off with with longer keynotes and lectures, and progressively gets more interactive culminating with a hack day on the third day. That’s a lot to write about, so for this post I’ll focus on the talks and panel session, and follow up with another post about the collaborative bits. I’ll also probably need to come back and add in more links to bits and pieces once slides and the “official” summary of the event become available. Updates 2021-04-07 Added links to recordings of keynotes and panel sessions Provocations The first day began with two keynotes on this year’s main themes: FAIR Research Software and Diversity & Inclusion, and day 2 had a great panel session focused on disability. All three were streamed live and the recordings remain available on Youtube: View the keynotes recording; Google-free alternative link View the panel session recording; Google-free alternative link FAIR Research Software Dr Michelle Barker, Director of the Research Software Alliance, spoke on the challenges to recognition of software as part of the scholarly record: software is not often cited. 
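A small, concrete step towards both of these threads, the machine-readable project descriptions the howDescribedIs team were integrating and software that is easier to cite, is to ship a codemeta.json file alongside the code. The snippet below is only an illustrative sketch with a made-up project; the field names come from the CodeMeta 2.0 vocabulary used by tools such as SOMEF and howfairis.

```python
# Illustrative sketch: write a minimal codemeta.json for a hypothetical
# research software project, giving metadata tools something machine-readable
# to work with and making the software easier to cite.
import json

codemeta = {
    "@context": "https://doi.org/10.5063/schema/codemeta-2.0",
    "@type": "SoftwareSourceCode",
    "name": "example-analysis-tool",  # hypothetical project
    "description": "Scripts for analysing digitised catalogue records.",
    "codeRepository": "https://github.com/example/example-analysis-tool",
    "license": "https://spdx.org/licenses/MIT",
    "version": "0.1.0",
    "author": [{"@type": "Person", "givenName": "Jane", "familyName": "Doe"}],
}

with open("codemeta.json", "w", encoding="utf-8") as f:
    json.dump(codemeta, f, indent=2)
```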
The FAIR4RS working group has been set up to investigate and create guidance on how the FAIR Principles for data can be adapted to research software as well; as they stand, the Principles are not ideally suited to software. This work will only be the beginning though, as we will also need metrics, training, career paths and much more. ReSA itself has 3 focus areas: people, policy and infrastructure. If you’re interested in getting more involved in this, you can join the ReSA email list. Equality, Diversity & Inclusion: how to go about it Dr Chonnettia Jones, Vice President of Research, Michael Smith Foundation for Health Research, spoke extensively and persuasively on the need for Equality, Diversity & Inclusion (EDI) initiatives within research, as there is abundant, robust evidence that they improve all research outcomes. She highlighted the difficulties current approaches to EDI have in effecting structural change, changing not just individual behaviours but the cultures & practices that perpetuate inequity. Where initiatives are often constructed around making up for individual deficits, a better framing is to start from an understanding of individuals having equal stature but different lived experiences. Commenting on the current focus on “research excellence”, she pointed out that the hyper-competition this promotes is deeply unhealthy, suggesting instead that true excellence requires diversity, and that we should focus on an inclusive excellence driven by inclusive leadership. Equality, Diversity & Inclusion: disability issues Day 2’s EDI panel session brought together five disabled academics to discuss the problems of disability in research. Dr Becca Wilson, UKRI Innovation Fellow, Institute of Population Health Science, University of Liverpool (Chair) Phoenix C S Andrews (PhD Student, Information Studies, University of Sheffield and Freelance Writer) Dr Ella Gale (Research Associate and Machine Learning Subject Specialist, School of Chemistry, University of Bristol) Prof Robert Stevens (Professor and Head of Department of Computer Science, University of Manchester) Dr Robin Wilson (Freelance Data Scientist and SSI Fellow) NB. The discussion flowed quite freely, so the following summary mixes up input from all the panel members. Researchers are often assumed to be single-minded in following their research calling, and aptness for jobs is often partly judged on “time served”, which disadvantages any disabled person who has been forced to take a career break. On top of this, disabled people are often time-poor because of the extra time needed to manage their condition, leaving them with less “output” to show for their time served on many common metrics. This can particularly affect early-career researchers, since resources for these are often restricted on a “years-since-PhD” criterion. Time poverty also makes funding with short deadlines that much harder to apply for. Employers add more demands right from the start: new starters are typically expected to complete a health and safety form, generally a brief affair that will suddenly become an 80-page bureaucratic nightmare if you tick the box declaring a disability. Many employers claim to be inclusive yet utterly fail to understand the needs of their disabled staff.
Wheelchairs are liberating for those who use them (despite the awful but common phrase “wheelchair-bound”) and yet employers will refuse to insure a wheelchair while travelling for work, classifying it as a “high value personal item” that the owner would take the same responsibility for as an expensive camera. Computers open up the world for blind people in a way that was never possible without them, but it’s not unusual for mandatory training to be inaccessible to screen readers. Some of these barriers can be overcome, but doing so takes yet more time that could and should be spent on more important work. What can we do about it? Academia works on patronage whether we like it or not, so be the person who supports people who are different to you rather than focusing on the one you “recognise yourself in” to mentor. As a manager, it’s important to ask each individual what they need and believe them: they are the expert in their own condition and their lived experience of it. Don’t assume that because someone else in your organisation with the same disability needs one set of accommodations, it’s invalid for your staff member to require something totally different. And remember: disability is unusual as a protected characteristic in that anyone can acquire it at any time without warning! Lightning talks Lightning talk sessions are always tricky to summarise, and while this doesn’t do them justice, here are a few highlights from my notes. Data & metadata Malin Sandstrom talked about a much-needed refinement of contributor role taxonomies for scientific computing Stephan Druskat showcased a project to crowdsource a corpus of research software for further analysis Learning & teaching/community Matthew Bluteau introduced the concept of the “coding dojo” as a way to enhance community of practice. A group of coders got together to practice & learn by working together to solve a problem and explaining their work as they go He described 2 models: a code jam, where people work in small groups, and the Randori method, where 2 people do pair programming while the rest observe. I’m excited to try this out! Steve Crouch talked about intermediate skills and helping people take the next step, which I’m also very interested in with the GLAM Data Science network Esther Plomp recounted experience of running multiple Carpentry workshops online, while Diego Alonso Alvarez discussed planned workshops on making research software more usable with GUIs Shoaib Sufi showcased the SSI’s new event organising guide Caroline Jay reported on a diary study into autonomy & agency in RSE during COVID Lopez, T., Jay, C., Wermelinger, M., & Sharp, H. (2021). How has the covid-19 pandemic affected working conditions for research software engineers? Unpublished manuscript. Wrapping up That’s not everything! But this post is getting pretty long so I’ll wrap up for now. I’ll try to follow up soon with a summary of the “collaborative” part of Collaborations Workshop: the idea-generating sessions and hackday! Time for a new look... I’ve decided to try switching this website back to using Hugo to manage the content and generate the static HTML pages. I’ve been on the Python-based Nikola for a few years now, but recently I’ve been finding it quite slow, and very confusing to understand how to do certain things. 
I used Hugo recently for the GLAM Data Science Network website and found it had come on a lot since the last time I was using it, so I thought I’d give it another go, and redesign this site to be a bit more minimal at the same time. The theme is still a work in progress so it’ll probably look a bit rough around the edges for a while, but I think I’m happy enough to publish it now. When I get round to it I might publish some more detailed thoughts on the design. Ideas for Accessible Communications The Disability Support Network at work recently ran a survey on “accessible communications”, to develop guidance on how to make communications (especially internal staff comms) more accessible to everyone. I grabbed a copy of my submission because I thought it would be useful to share more widely, so here it is. Please note that these are based on my own experiences only. I am in no way suggesting that these are the only things you would need to do to ensure your communications are fully accessible. They’re just some things to keep in mind. Policies/procedures/guidance can be stressful to use if anything is vague or inconsistent, or if it looks like there might be more information implied than is explicitly given (a common cause of this is use of jargon in e.g. HR policies). Emails relating to these policies have similar problems, made worse because they tend to be very brief. Online meetings can be very helpful, but can also be exhausting, especially if there are too many people, or not enough structure. Larger meetings & webinars without agendas (or where the agenda is ignored, or timings are allowed to drift without acknowledgement) are very stressful, as are those where there is not enough structure to ensure fair opportunities to contribute. Written reference documents and communications should: Be carefully checked for consistency and clarity Have all all key points explicitly stated Explicitly acknowledge the need for flexibility where it is necessary, rather than implying or hinting at it Clearly define jargon & acronyms where they are necessary to the point being made, and avoid them otherwise Include links to longer, more explicit versions where space is tight Provide clear bullet-point summaries with links to the details Online meetings should: Include sufficient break time (at least 10 minutes out of every hour) and not allow this to be compromised just because a speaker has misjudged the length of their talk Include initial “settling-in” time in agendas to avoid timing getting messed up from the start Ensure the agenda is stuck to, or that divergence from the agenda is acknowledged explicitly by the chair and updated timing briefly discussed to ensure everyone is clear Establish a norm for participation at the start of the meeting and stick to it e.g. 
ask people to raise hands when they have a point to make, or have specific time for round-robin contributions Ensure quiet/introverted people have space to contribute, but don’t force them to do so if they have nothing to add at the time Offer a text-based alternative to contributing verbally If appropriate, at the start of the meeting assign specific roles of: Gatekeeper: ensures everyone has a chance to contribute Timekeeper: ensures meeting runs to time Scribe: ensures a consistent record of the meeting Be chaired by someone with the confidence to enforce the above: offer training to all staff on chairing meetings to ensure everyone has the skills to run a meeting effectively Matrix self-hosting I started running my own Matrix server a little while ago. Matrix is something rather cool, a chat system similar to IRC or Slack, but open and federated. Open in that the standard is available for anyone to view, but also the reference implementations of server and client are open source, along with many other clients and a couple of nascent alternative servers. Federated in that, like email, it doesn’t matter what server you sign up with, you can talk to users on your own or any other server. I decided to host my own for three reasons. Firstly, to see if I could and to learn from it. Secondly, to try and rationalise the Cambrian explosion of Slack teams I was being added to in 2019. Thirdly, to take some control of the loss of access to historical messages in some communities that rely on Slack (especially the Carpentries and RSE communities). Since then, I’ve also added a fourth goal: taking advantage of various bridges to bring other messaging networks I use (such as Signal and Telegram) into a consistent UI. I’ve also found that my use of Matrix-only rooms has grown as more individuals & communities have adopted the platform. So, I really like Matrix and I use it daily. My problem now is whether to keep self-hosting. Synapse, the only full server implementation at the moment, is really heavy on memory, so I’ve ended up running it on a much bigger server than I thought I’d need, which seems overkill for a single-user instance. So now I have to make a decision about whether it’s worth keeping going, or shutting it down and going back to matrix.org, or setting up on one of the other servers that have sprung up in the last couple of years. There are a couple of other considerations here. Firstly, Synapse resource usage is entirely down to the size of the rooms joined by users of the homeserver, not directly the number of users. So if users have mostly overlapping interests, and thus keep to the same rooms, you can support quite a large community without significant extra resource usage. Secondly, there are a couple of alternative server implementations in development specifically addressing this issue for small servers: Dendrite and Conduit. Neither is quite ready for what I want yet, but both are getting close, and when they are they will allow running small homeservers with much more sensible resource usage. So I could start opening up for other users, and at least justify the size of the server that way. I wouldn’t ever want to make it a paid-for service but perhaps people might be willing to make occasional donations towards running costs. That still leaves me with the question of whether I’m comfortable running a service that others may come to rely on, or being responsible for the safety of their information.
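One thing that softens the decision is that, from a client’s point of view, a self-hosted homeserver behaves exactly like any other: the same few lines of code work against matrix.org or my own Synapse. Here is a rough sketch using the matrix-nio Python library; the server, user, password and room ID are all placeholders.

```python
# Rough sketch with matrix-nio: the same client code works whether the
# homeserver is matrix.org or self-hosted. All identifiers below are made up.
import asyncio
from nio import AsyncClient

async def main() -> None:
    client = AsyncClient("https://matrix.example.org", "@me:example.org")
    await client.login("not-my-real-password")
    await client.room_send(
        room_id="!abc123:example.org",
        message_type="m.room.message",
        content={"msgtype": "m.text", "body": "Hello from my own homeserver!"},
    )
    await client.close()

asyncio.run(main())
```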
I could also hold out for Dendrite or Conduit to mature enough that I’m ready to try them, which might not be more than a few months off. Hmm, seems like I’ve convinced myself to stick with it for now, and we’ll see how it goes. In the meantime, if you know me and you want to try it out let me know and I might risk setting you up with an account! What do you miss least about pre-lockdown life? @JanetHughes on Twitter: What do you miss the least from pre-lockdown life? I absolutely do not miss wandering around the office looking for a meeting room for a confidential call or if I hadn’t managed to book a room in advance. Let’s never return to that joyless frustration, hey? 10:27 AM · Feb 3, 2021 After seeing Terence Eden taking Janet Hughes' tweet from earlier this month as a writing prompt, I thought I might do the same. The first thing that leaps to my mind is commuting. At various points in my life I’ve spent between one and three hours a day travelling to and from work and I’ve never more than tolerated it at best. It steals time from your day, and societal norms dictate that it’s your leisure & self-care time that must be sacrificed. Longer commutes allow more time to get into a book or podcast, especially if not driving, but I’d rather have that time at home rather than trying to be comfortable in a train seat designed for some mythical average man shaped nothing like me! The other thing I don’t miss is the colds and flu! Before the pandemic, British culture encouraged working even when ill, which meant constantly coming into contact with people carrying low-grade viruses. I’m not immunocompromised but some allergies and residue of being asthmatic as a child meant that I would get sick 2-3 times a year. A pleasant side-effect of the COVID precautions we’re all taking is that I haven’t been sick for over 12 months now, which is amazing! Finally, I don’t miss having so little control over my environment. One of the things that working from home has made clear is that there are certain unavoidable aspects of working in my shared office that cause me sensory stress, and that are completely unrelated to my work. Working (or trying to work) next to a noisy automatic scanner; trying to find a light level that works for 6 different people doing different tasks; lacking somewhere quiet and still to eat lunch and recover from a morning of meetings or the constant vaguely-distracting bustle of a large shared office. It all takes energy. Although it’s partly been replaced by the new stress of living through a global pandemic, that old stress was a constant drain on my productivity and mood that had been growing throughout my career as I moved (ironically, given the common assumption that seniority leads to more privacy) into larger and larger open plan offices. Remarkable blogging And the handwritten blog saga continues, as I’ve just received my new reMarkable 2 tablet, which is designed for reading, writing and nothing else. It uses a super-responsive e-ink display and writing on it with a stylus is a dream. It has a slightly rough texture with just a bit of friction that makes my writing come out a lot more legibly than on a slippery glass touchscreen. If that was all there was to it, I might not have wasted my money, but it turns out that it runs on Linux and the makers have wisely decided not to lock it down but to give you full root mess. Yes, you read that right: root access. 
It presents as an ethernet device over USB, so you can SSH in with a password found in the settings and have full control over your own devices. What a novel concept. This fact alone has meant it’s built a small yet devoted community of users who have come up with some clever ways of extending its functionality. In fact, many of these are listed on this GitHub repository. Finally, from what I’ve seen so far, the handwriting recognition is impressive to say the least. This post was written on it and needed only a little editing. I think this is a device that will get a lot of use! GLAM Data Science Network fellow travellers Updates 2021-02-04 Thanks to Gene @dzshuniper@ausglam.space for suggesting ADHO and a better attribution for the opening quote (see comments below for details) See comments & webmentions for details. “If you want to go fast, go alone. If you want to go far, go together.” — African proverb, probably popularised in English by Kenyan church leader Rev. Samuel Kobia (original) This quote is a popular one in the Carpentries community, and I interpret it in this context to mean that a group of people working together is more sustainable than individuals pursuing the same goal independently. That’s something that speaks to me, and that I want to make sure is reflected in nurturing this new community for data science in galleries, archives, libraries & museums (GLAM). To succeed, this work needs to be complementary and collaborative, rather than competitive, so I want to acknowledge a range of other networks & organisations whose activities complement this. The rest of this article is an unavoidably incomplete list of other relevant organisations whose efforts should be acknowledged and potentially built on. And it should go without saying, but just in case: if the work I’m planning fits right into an existing initiative, then I’m happy to direct my resources there rather than duplicate effort. Inspirations & collaborators Groups with similar goals or undertaking similar activities, but focused on a different sector, geographic area or topic. I think we should make as much use of and contribution to these existing communities as possible since there will be significant overlap. code4lib Probably the closest existing community to what I want to build, but primarily based in the US, so timezones (and physical distance for in-person events) make it difficult to participate fully. This is a well-established community though, with regular events including an annual conference so there’s a lot to learn here. newCardigan Similar to code4lib but an Australian focus, so the timezone problem is even bigger! GLAM Labs Focused on supporting the people experimenting with and developing the infrastructure to enable scholars to access GLAM materials in new ways. In some ways, a GLAM data science network would be complementary to their work, by providing people not directly involved with building GLAM Labs with the skills to make best use of GLAM Labs infrastructure. UK Government data science community Another existing community with very similar intentions, but focused on UK Government sector. Clearly the British Library and a few national & regional museums & archives fall into this, but much of the rest of the GLAM sector does not. 
Artifical Intelligence for Libraries, Archives & Museums (AI4LAM) A multinational collaboration between several large libraries, archives and museums with a specific focus on the Artificial Intelligence (AI) subset of data science UK Reproducibility Network A network of researchers, primarily in HEIs, with an interest in improving the transparency and reliability of academic research. Mostly science-focused but with some overlap of goals around ethical and robust use of data. Museums Computer Group I’m less familiar with this than the others, but it seems to have a wider focus on technology generally, within the slightly narrower scope of museums specifically. Again, a lot of potential for collaboration. Training Several organisations and looser groups exist specifically to develop and deliver training that will be relevant to members of this network. The network also presents an opportunity for those who have done a workshop with one of these and want to know what the “next steps” are to continue their data science journey. The Carpentries, aka: Library Carpentry Data Carpentry Software Carpentry Data Science Training for Librarians (DST4L) The Programming Historian CDH Cultural Heritage Data School Supporters These misson-driven organisations have goals that align well with what I imagine for the GLAM DSN, but operate at a more strategic level. They work by providing expert guidance and policy advice, lobbying and supporting specific projects with funding and/or effort. In particular, the SSI runs a fellowship programme which is currently providing a small amount of funding to this project. Digital Preservation Coalition (DPC) Software Sustainability Institute (SSI) Research Data Alliance (RDA) Alliance of Digital Humanities Organizations (ADHO) … and its Libraries and Digital Humanities Special Interest Group (Lib&DH SIG) Professional bodies These organisations exist to promote the interests of professionals in particular fields, including supporting professional development. I hope they will provide communication channels to their various members at the least, and may be interested in supporting more directly, depending on their mission and goals. Society of Research Software Engineering Chartered Institute of Library and Information Professionals Archives & Records Association Museums Association Conclusion As I mentioned at the top of the page, this list cannot possibly be complete. This is a growing area and I’m not the only or first person to have this idea. If you can think of anything glaring that I’ve missed and you think should be on this list, leave a comment or tweet/toot at me! A new font for the blog I’ve updated my blog theme to use the quasi-proportional fonts Iosevka Aile and Iosevka Etoile. I really like the aesthetic, as they look like fixed-width console fonts (I use the true fixed-width version of Iosevka in my terminal and text editor) but they’re actually proportional which makes them easier to read. https://typeof.net/Iosevka/ Training a model to recognise my own handwriting If I’m going to train an algorithm to read my weird & awful writing, I’m going to need a decent-sized training set to work with. And since one of the main things I want to do with it is to blog “by hand” it makes sense to focus on that type of material for training. In other words, I need to write out a bunch of blog posts on paper, scan them and transcribe them as ground truth. 
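For what it’s worth, here is a rough sketch of how that ground truth might be laid out on disk once the transcriptions exist: each scanned page paired with its transcription, with a reproducible random split holding some pages back for validation. The directory names are hypothetical, and this is independent of how Transkribus stores its own data.

```python
# Sketch: pair scanned page images with their transcriptions and hold back a
# validation set. Directory layout and file names are hypothetical.
import random
from pathlib import Path

scans = sorted(Path("ground-truth/scans").glob("*.png"))
texts = Path("ground-truth/text")
# Keep only pages that have both a scan and a transcription.
pairs = [(img, texts / (img.stem + ".txt"))
         for img in scans if (texts / (img.stem + ".txt")).exists()]

random.seed(42)  # reproducible split
random.shuffle(pairs)
cut = int(0.8 * len(pairs))
train, validation = pairs[:cut], pairs[cut:]
print(f"{len(train)} pages for training, {len(validation)} for validation")
```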
The added bonus of this plan is that after transcribing, I also end up with some digital text I can use as an actual post — multitasking! So, by the time you read this, I will have already run it through a manual transcription process using Transkribus to add it to my training set, and copy-pasted it into emacs for posting. This is a fun little project because it means I can: Write more by hand with one of my several nice fountain pens, which I enjoy Learn more about the operational process some of my colleagues go through when digitising manuscripts Learn more about the underlying technology & maths, and how to tune the process Produce more lovely content! For you to read! Yay! Write in a way that forces me to put off editing until after a first draft is done and focus more on getting the whole of what I want to say down. That’s it for now — I’ll keep you posted as the project unfolds. Addendum Tee hee! I’m actually just enjoying the process of writing stuff by hand in long-form prose. It’ll be interesting to see how the accuracy turns out and if I need to be more careful about neatness. Will it be better or worse than the big but generic models used by Samsung Notes or OneNote. Maybe I should include some stylus-written text for comparison. Blogging by hand I wrote the following text on my tablet with a stylus, which was an interesting experience: So, thinking about ways to make writing fun again, what if I were to write some of them by hand? I mean I have a tablet with a pretty nice stylus, so maybe handwriting recognition could work. One major problem, of course, is that my handwriting is AWFUL! I guess I’ll just have to see whether the OCR is good enough to cope… It’s something I’ve been thinking about recently anyway: I enjoy writing with a proper fountain pen, so is there a way that I can have a smooth workflow to digitise handwritten text without just typing it back in by hand? That would probably be preferable to this, which actually seems to work quite well but does lead to my hand tensing up to properly control the stylus on the almost-frictionless glass screen. I’m surprised how well it worked! Here’s a sample of the original text: And here’s the result of converting that to text with the built-in handwriting recognition in Samsung Notes: Writing blog posts by hand So, thinking about ways to make writing fun again, what if I were to write some of chum by hand? I mean, I have a toldest winds a pretty nice stylus, so maybe handwriting recognition could work. One major problems, ofcourse, is that my , is AWFUL! Iguess I’ll just have to see whattime the Ocu is good enough to cope… It’s something I’ve hun tthinking about recently anyway: I enjoy wilting with a proper fountain pion, soischeme a way that I can have a smooch workflow to digitise handwritten text without just typing it back in by hand? That wouldprobally be preferableto this, which actually scams to work quito wall but doers load to my hand tensing up to properly couldthe stylus once almost-frictionlessg lass scream. It’s pretty good! It did require a fair bit of editing though, and I reckon we can do better with a model that’s properly trained on a large enough sample of my own handwriting. What I want from a GLAM/Cultural Heritage Data Science Network Introduction As I mentioned last year, I was awarded a Software Sustainability Institute Fellowship to pursue the project of setting up a Cultural Heritage/GLAM data science network. 
Obviously, the global pandemic has forced a re-think of many plans and this is no exception, so I’m coming back to reflect on it and make sure I’m clear about the core goals so that everything else still moves in the right direction. One of the main reasons I have for setting up a GLAM data science network is because it’s something I want. The advice to “scratch your own itch” is often given to people looking for an open project to start or contribute to, and the lack of a community of people with whom to learn & share ideas and practice is something that itches for me very much. The “motivation” section in my original draft project brief for this work said: Cultural heritage work, like all knowledge work, is increasingly data-based, or at least gives opportunities to make use of data day-to-day. The proper skills to use this data enable more effective working. Knowledge and experience thus gained improves understanding of and empathy with users also using such skills. But of course, I have my own reasons for wanting to do this too. In particular, I want to: Advocate for the value of ethical, sustainable data science across a wide range of roles within the British Library and the wider sector Advance the sector to make the best use of data and digital sources in the most ethical and sustainable way possible Understand how and why people use data from the British Library, and plan/deliver better services to support that Keep up to date with relevant developments in data science Learn from others' skills and experiences, and share my own in turn Those initial goals imply some further supporting goals: Build up the confidence of colleagues who might benefit from data science skills but don’t feel they are “technical” or “computer literate” enough Further to that, build up a base of colleagues with the confidence to share their skills & knowledge with others, whether through teaching, giving talks, writing or other channels Identify common awareness gaps (skills/knowledge that people don’t know they’re missing) and address them Develop a communal space (primarily online) in which people feel safe to ask questions Develop a body of professional practice and help colleagues to learn and contribute to the evolution of this, including practices of data ethics, software engineering, statistics, high performance computing, … Break down language barriers between data scientists and others I’ll expand on this separately as my planning develops, but here are a few specific activities that I’d like to be able to do to support this: Organise less-formal learning and sharing events to complement the more formal training already available within organisations and the wider sector, including “show and tell” sessions, panel discussions, code cafés, masterclasses, guest speakers, reading/study groups, co-working sessions, … Organise training to cover intermediate skills and knowledge currently missing from the available options, including the awareness gaps and professional practice mentioned above Collect together links to other relevant resources to support self-led learning Decisions to be made There are all sorts of open questions in my head about this right now, but here are some of the key ones. Is it GLAM or Cultural Heritage? When I first started planning this whole thing, I went with “Cultural Heritage”, since I was pretty transparently targeting my own organisation. The British Library is fairly unequivocally a CH organisation. 
But as I’ve gone along I’ve found myself gravitating more towards the term “GLAM” (which stands for Galleries, Libraries, Archives, Museums) as it covers a similar range of work but is clearer (when you spell out the acronym) about what kinds of work are included. What skills are relevant? This turns out to be surprisingly important, at least in terms of how the community is described, as they define the boundaries of the community and can be the difference between someone feeling welcome or excluded. For example, I think that some introductory statistics training would be immensely valuable for anyone working with data to understand what options are open to them and what limitations those options have, but is the word “statistics” offputting per se to those who’ve chosen a career in arts & humanities? I don’t know because I don’t have that background and perspective. Keep it internal to the BL, or open up early on? I originally planned to focus primarily on my own organisation to start with, feeling that it would be easier to organise events and build a network within a single organisation. However, the pandemic has changed my thinking significantly. Firstly, it’s now impossible to organise in-person events and that will continue for quite some time to come, so there is less need to focus on the logistics of getting people into the same room. Secondly, people within the sector are much more used to attending remote events, which can easily be opened up to multiple organisations in many countries, timezones allowing. It now makes more sense to focus primarily on online activities, which opens up the possibility of building a critical mass of active participants much more quickly by opening up to the wider sector. Conclusion This is the type of post that I could let run and run without ever actually publishing, but since it’s something I need feedback and opinions on from other people, I’d better ship it! I really want to know what you think about this, whether you feel it’s relevant to you and what would make it useful. Comments are open below, or you can contact me via Mastodon or Twitter. Writing About Not Writing Under Construction Grunge Sign by Nicolas Raymond — CC BY 2.0 Every year, around this time of year, I start doing two things. First, I start thinking I could really start to understand monads and write more than toy programs in Haskell. This is unlikely to ever actually happen unless and until I get a day job where I can justify writing useful programs in Haskell, but Advent of Code always gets me thinking otherwise. Second, I start mentally writing this same post. You know, the one about how the blogger in question hasn’t had much time to write but will be back soon? “Sorry I haven’t written much lately…” It’s about as cliché as a Geocities site with a permanent “Under construction” GIF. At some point, not long after the dawn of ~time~ the internet, most people realised that every website was permanently under construction and publishing something not ready to be published was just pointless. So I figured this year I’d actually finish writing it and publish it. After all, what’s the worst that could happen? If we’re getting all reflective about this, I could probably suggest some reasons why I’m not writing much: For a start, there’s a lot going on in both my world and The World right now, which doesn’t leave a lot of spare energy after getting up, eating, housework, working and a few other necessary activities. 
As a result, I’m easily distracted and I tend to let myself get dragged off in other directions before I even get to writing much of anything. If I do manage to focus on this blog in general, I’ll often end up working on some minor tweak to the theme or functionality. I mean, right now I’m wondering if I can do something clever in my text-editor (Emacs, since you’re asking) to streamline my writing & editing process so it’s more elegant, efficient, ergonomic and slightly closer to perfect in every way. It also makes me much more likely to self-censor, and to indulge my perfectionist tendencies to try and tweak the writing until it’s absolutely perfect, which of course never happens. I’ve got a whole heap of partly-written posts that are juuuust waiting for the right motivation for me to just finish them off. The only real solution is to accept that: I’m not going to write much and that’s probably OK What I do write won’t always be the work of carefully-researched, finely crafted genius that I want it to be, and that’s probably OK too Also to remember why I started writing and publishing stuff in the first place: to reflect and get my thoughts out onto a (virtual) page so that I can see them, figure out whether I agree with myself and learn; and to stimulate discussion and get other views on my (possibly uninformed, incorrect or half-formed) thoughts, also to learn. In other words, a thing I do for me. It’s easy to forget that and worry too much about whether anyone else wants to read my s—t. Will you notice any changes? Maybe? Maybe not? Who knows. But it’s a new year and that’s as good a time for a change as any. When is a persistent identifier not persistent? Or an identifier? I wrote a post on the problems with ISBNs as persistent identifiers (PIDS) for work, so check it out if that sounds interesting. IDCC20 reflections I’m just back from IDCC20, so here are a few reflections on this year’s conference. You can find all the available slides and links to shared notes on the conference programme. There’s also a list of all the posters and an overview of the Unconference Skills for curation of diverse datasets Here in the UK and elsewhere, you’re unlikely to find many institutions claiming to apply a deep level of curation to every dataset/software package/etc deposited with them. There are so many different kinds of data and so few people in any one institution doing “curation” that it’s impossible to do this for everything. Absent the knowledge and skills required to fully evaluate an object the best that can be done is usually to make a sense check on the metadata and flag up with the depositor potential for high-level issues such as accidental disclosure of sensitive personal information. The Data Curation Network in the United States is aiming to address this issue by pooling expertise across multiple organisations. The pilot has been highly successful and they’re now looking to obtain funding to continue this work. The Swedish National Data Service is experimenting with a similar model, also with a lot of success. As well as sharing individual expertise, the DCN collaboration has also produced some excellent online quick-reference guides for curating common types of data. We had some further discussion as part of the Unconference on the final day about what it would look like to introduce this model in the UK. There was general agreement that this was a good idea and a way to make optimal use of sparse resources. 
There were also very valid concerns that it would be difficult in the current financial climate for anyone to justify doing work for another organisation, apparently for free. In my mind there are two ways around this, which are not mutually exclusive by any stretch of the imagination. First is to Just Do It: form an informal network of curators around something simple like a mailing list, and give it a try. Second is for one or more trusted organisations to provide some coordination and structure. There are several candidates for this including DCC, Jisc, DPC and the British Library; we all have complementary strengths in this area so it’s my hope that we’ll be able to collaborate around it. In the meantime, I hope the discussion continues. Artificial intelligence, machine learning et al As you might expect at any tech-oriented conference there was a strong theme of AI running through many presentations, starting from the very first keynote from Francine Berman. Her talk, The Internet of Things: Utopia or Dystopia? used self-driving cars as a case study to unpack some of the ethical and privacy implications of AI. For example, driverless cars can potentially increase efficiency, both through route-planning and driving technique, but also by allowing fewer vehicles to be shared by more people. However, a shared vehicle is not a private space in the way your own car is: anything you say or do while in that space is potentially open to surveillance. Aside from this, there are some interesting ideas being discussed, particularly around the possibility of using machine learning to automate increasingly complex actions and workflows such as data curation and metadata enhancement. I didn’t get the impression anyone is doing this in the real world yet, but I’ve previously seen theoretical concepts discussed at IDCC make it into practice so watch this space! Playing games! Training is always a major IDCC theme, and this year two of the most popular conference submissions described games used to help teach digital curation concepts and skills. Mary Donaldson and Matt Mahon of the University of Glasgow presented their use of Lego to teach the concept of sufficient metadata. Participants build simple models before documenting the process and breaking them down again. Then everyone had to use someone else’s documentation to try and recreate the models, learning important lessons about assumptions and including sufficient detail. Kirsty Merrett and Zosia Beckles from the University of Bristol brought along their card game “Researchers, Impact and Publications (RIP)”, based on the popular “Cards Against Humanity”. RIP encourages players to examine some of the reasons for and against data sharing with plenty of humour thrown in. Both games were trialled by many of the attendees during Thursday’s Unconference. Summary I realised in Dublin that it’s 8 years since I attended my first IDCC, held at the University of Bristol in December 2011 while I was still working at the nearby University of Bath. While I haven’t been every year, I’ve been to every one held in Europe since then and it’s interesting to see what has and hasn’t changed. We’re no longer discussing data management plans, data scientists or various other things as abstract concepts that we’d like to encourage, but dealing with the real-world consequences of them. The conference has also grown over the years: this year was the biggest yet, boasting over 300 attendees. 
There has been especially big growth in attendees from North America, Australasia, Africa and the Middle East. That’s great for the diversity of the conference as it brings in more voices and viewpoints than ever. With more people around to interact with I have to work harder to manage my energy levels but I think that’s a small price to pay. Iosevka: a nice fixed-width-font Iosevka is a nice, slender monospace font with a lot of configurable variations. Check it out: https://typeof.net/Iosevka/ Replacing comments with webmentions Just a quickie to say that I’ve replaced the comment section at the bottom of each post with webmentions, which allows you to comment by posting on your own site and linking here. It’s a fundamental part of the IndieWeb, which I’m slowly getting to grips with having been a halfway member of it for years by virtue of having my own site on my own domain. I’d already got rid of Google Analytics to stop forcing that tracking on my visitors, I wanted to get rid of Disqus too because I’m pretty sure the only way that is free for me is if they’re selling my data and yours to third parties. Webmention is a nice alternative because it relies only on open standards, has no tracking and allows people to control their own comments. While I’m currently using a third-party service to help, I can switch to self-hosted at any point in the future, completely transparently. Thanks to webmention.io, which handles incoming webmentions for me, and webmention.js, which displays them on the site, I can keep it all static and not have to implement any of this myself, which is nice. It’s a bit harder to comment because you have to be able to host your own content somewhere, but then almost no-one ever commented anyway, so it’s not like I’ll lose anything! Plus, if I get Bridgy set up right, you should be able to comment just by replying on Mastodon, Twitter or a few other places. A spot of web searching shows that I’m not the first to make the Disqus -> webmentions switch (yes, I’m putting these links in blatantly to test outgoing webmentions with Telegraph…): So long Disqus, hello webmention — Nicholas Hoizey Bye Disqus, hello Webmention! — Evert Pot Implementing Webmention on a static site — Deluvi Let’s see how this goes! Bridging Carpentries Slack channels to Matrix It looks like I’ve accidentally taken charge of bridging a bunch of The Carpentries Slack channels over to Matrix. Given this, it seems like a good idea to explain what that sentence means and reflect a little on my reasoning. I’m more than happy to discuss the pros and cons of this approach If you just want to try chatting in Matrix, jump to the getting started section What are Slack and Matrix? Slack (see also on Wikipedia), for those not familiar with it, is an online text chat platform with the feel of IRC (Internet Relay Chat), a modern look and feel and both web and smartphone interfaces. By providing a free tier that meets many peoples' needs on its own Slack has become the communication platform of choice for thousands of online communities, private projects and more. One of the major disadvantages of using Slack’s free tier, as many community organisations do, is that as an incentive to upgrade to a paid service your chat history is limited to the most recent 10,000 messages across all channels. For a busy community like The Carpentries, this means that messages older than about 6-7 weeks are already inaccessible, rendering some of the quieter channels apparently empty. 
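As a rough sanity check on that six-to-seven-week figure (my own back-of-the-envelope arithmetic, not a number from the Carpentries), a 10,000-message window disappearing that quickly implies an overall traffic level of a couple of hundred messages a day across all channels:

```python
# Back-of-the-envelope check: how fast does a 10,000-message window scroll away?
FREE_TIER_VISIBLE = 10_000   # messages visible on Slack's free tier
WEEKS_UNTIL_HIDDEN = 6.5     # rough figure quoted above

messages_per_day = FREE_TIER_VISIBLE / (WEEKS_UNTIL_HIDDEN * 7)
print(f"About {messages_per_day:.0f} messages/day across all channels")
# About 220 messages/day, which is very plausible for a busy community workspace.
```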
As Slack is at pains to point out, that history isn’t gone, just archived and hidden from view unless you pay the low, low price of $1/user/month. That doesn’t seem too pricy, unless you’re a non-profit organisation with a lot of projects you want to fund and an active membership of several hundred worldwide, at which point it soon adds up. Slack does offer to waive the cost for registered non-profit organisations, but only for one community. The Carpentries is not an independent organisation, but one fiscally sponsored by Community Initiatives, which has already used its free quota of one elsewhere, rendering the Carpentries ineligible. Other umbrella organisations such as NumFocus (and, I expect, Mozilla) also run into this problem with Slack. So, we have a community which is slowly and inexorably losing its own history behind a paywall. For some people this is simply annoying, but from my perspective as a facilitator of the preservation of digital things the community is haemorrhaging an important record of its early history. Enter Matrix. Matrix is a chat platform similar to IRC, Slack or Discord. It’s divided into separate channels, and users can join one or more of these to take part in the conversation happening in those channels. What sets it apart from older technology like IRC and walled gardens like Slack & Discord is that it’s federated. Federation means simply that users on any server can communicate with users and channels on any other server. Usernames and channel addresses specify both the individual identifier and the server it calls home, just as your email address contains all the information needed for my email server to route messages to it. While users are currently tied to their home server, channels can be mirrored and synchronised across multiple servers, making the overall system much more resilient. Can’t connect to your favourite channel on server X? No problem: just connect via its alias on server Y and when X comes back online it will be resynchronised. The technology used is much more modern and secure than the ageing IRC protocol, and there’s no vendor lock-in like there is with closed platforms like Slack and Discord. On top of that, Matrix channels can easily be “bridged” to channels/rooms on other platforms, including, yes, Slack, so that you can join on Matrix and transparently talk to people connected to the bridged room, or vice versa. So, to summarise: The current Carpentries Slack channels could be bridged to Matrix at no cost and with no disruption to existing users The history of those channels from that point on would be retained on matrix.org and accessible even when it’s no longer available on Slack If at some point in the future The Carpentries chose to invest in its own Matrix server, it could adopt and become the main Matrix home of these channels without disruption to users of either Matrix or (if it’s still in use at that point) Slack Matrix is an open protocol, with a reference server implementation and a wide range of clients all available as free software, which aligns with the values of the Carpentries community On top of this: I’m fed up of having so many different Slack teams to switch between to see the channels in all of them, and prefer having all the channels I regularly visit in a single unified interface; I wanted to see how easy this would be and whether others would also be interested.
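To make the user and room addressing just described a little more concrete, here is a small illustrative sketch. It is my own toy code rather than part of any Matrix library, and the example identifiers are just examples (though @petrichor:matrix.org is the address given later in this post); it simply splits an identifier into its sigil, local part and home server, mirroring the email analogy above.

```python
# Toy illustration of Matrix addressing: @user:server for people, #alias:server
# for room aliases, !id:server for raw room IDs. Not a real Matrix client.
def parse_matrix_id(matrix_id: str):
    """Return (kind, localpart, homeserver) for e.g. '@petrichor:matrix.org'."""
    kinds = {'@': 'user', '#': 'room alias', '!': 'room id'}
    sigil, rest = matrix_id[0], matrix_id[1:]
    if sigil not in kinds or ':' not in rest:
        raise ValueError(f"Not a Matrix identifier: {matrix_id!r}")
    localpart, homeserver = rest.split(':', 1)
    return kinds[sigil], localpart, homeserver

print(parse_matrix_id('@petrichor:matrix.org'))  # ('user', 'petrichor', 'matrix.org')
print(parse_matrix_id('#some-room:matrix.org'))  # ('room alias', 'some-room', 'matrix.org')
```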
Given all this, I thought I’d go ahead and give it a try to see if it made things more manageable for me and to see what the reaction would be from the community. How can I get started? !!! reminder Please remember that, like any other Carpentries space, the Code of Conduct applies in all of these channels. First, sign up for a Matrix account. The quickest way to do this is on the Matrix “Try now” page, which will take you to the Riot Web client which for many is synonymous with Matrix. Other clients are also available for the adventurous. Second, join one of the channels. The links below will take you to a page that will let you connect via your preferred client. You’ll need to log in as they are set not to allow guest access, but, unlike Slack, you won’t need an invitation to be able to join. #general — the main open channel to discuss all things Carpentries #random — anything that would be considered offtopic elsewhere #welcome — join in and introduce yourself! That’s all there is to getting started with Matrix. To find all the bridged channels there’s a Matrix “community” that I’ve added them all to: Carpentries Matrix community. There’s a lot more, including how to bridge your favourite channels from Slack to Matrix, but this is all I’ve got time and space for here! If you want to know more, leave a comment below, or send me a message on Slack (jezcope) or maybe Matrix (@petrichor:matrix.org)! I’ve also made a separate channel for Matrix-Slack discussions: #matrix on Slack and Carpentries Matrix Discussion on Matrix MozFest19 first reflections Discussions of neurodiversity at #mozfest Photo by Jennifer Riggins The other weekend I had my first experience of Mozilla Festival, aka #mozfest. It was pretty awesome. I met quite a few people in real life that I’ve previously only known (/stalked) on Twitter, and caught up with others that I haven’t seen for a while. I had the honour of co-facilitating a workshop session on imposter syndrome and how to deal with it with the wonderful Yo Yehudi and Emmy Tsang. We all learned a lot and hope our participants did too; we’ll be putting together a summary blog post as soon as we can get our act together! I also attended a great session, led by Kiran Oliver (psst, they’re looking for a new challenge), on how to encourage and support a neurodiverse workforce. I was only there for the one day, and I really wish that I’d taken the plunge and committed to the whole weekend. There’s always next year though! To be honest, I’m just disappointed that I never had the courage to go sooner, Music for working Today1 the office conversation turned to blocking out background noise. (No, the irony is not lost on me.) Like many people I work in a large, open-plan office, and I’m not alone amongst my colleagues in sometimes needing to find a way to boost concentration by blocking out distractions. Not everyone is like this, but I find music does the trick for me. I also find that different types of music are better for different types of work, and I use this to try and manage my energy better. There are more distractions than auditory noise, and at times I really struggle with visual noise. Rather than have this post turn into a rant about the evils of open-plan offices, I’ll just mention that the scientific evidence doesn’t paint them in a good light2, or at least suggests that the benefits are more limited in scope than is commonly thought3, and move on to what I actually wanted to share: good music for working to. 
There are a number of genres that I find useful for working. Generally, these have in common a consistent tempo, a lack of lyrics, and enough variation to prevent boredom without distracting. Familiarity helps my concentration too so I’ll often listen to a restricted set of albums for a while, gradually moving on by dropping one out and bringing in another. In my case this includes: Traditional dance music, generally from northern and western European traditions for me. This music has to be rhythmically consistent to allow social dancing, and while the melodies are typically simple repeated phrases, skilled musicians improvise around that to make something beautiful. I tend to go through phases of listening to particular traditions; I’m currently listening to a lot of French, Belgian and Scandinavian. Computer game soundtracks, which are specifically designed to enhance gameplay without distracting, making them perfect for other activities requiring a similar level of concentration. Chiptunes and other music incorporating it; partly overlapping with the previous category, chiptunes is music made by hacking the audio chips from (usually) old computers and games machines to become an instrument for new music. Because of the nature of the instrument, this will have millisecond-perfect rhythm and again makes for undistracting noise blocking with an extra helping of nostalgia! Purists would disagree with me, but I like artists that combine chiptunes with other instruments and effects to make something more complete-sounding. Retrowave/synthwave/outrun, synth-driven music that’s instantly familiar as the soundtrack to many 90s sci-fi and thriller movies. Atmospheric, almost dreamy, but rhythmic with a driving beat, it’s another genre that fits into the “pleasing but not too surprising” category for me. So where to find this stuff? One of the best resources I’ve found is Music for Programming which provides carefully curated playlists of mostly electronic music designed to energise without distracting. They’re so well done that the tracks move seamlessly, one to the next, without ever getting boring. Spotify is an obvious option, and I do use it quite a lot. However, I’ve started trying to find ways to support artists more directly, and Bandcamp seems to be a good way of doing that. It’s really easy to browse by genre, or discover artists similar to what you’re currently hearing. You can listen for free as long as you don’t mind occasional nags to buy the music you’re hearing, but you can also buy tracks or albums. Music you’ve paid for is downloadable in several open, DRM-free formats for you to keep, and you know that a decent chunk of that cash is going directly to that artist. I also love noise generators; not exactly music, but a variety of pleasant background noises, some of which nicely obscure typical office noise. I particularly like mynoise.net, which has a cornucopia of different natural and synthetic noises. Each generator comes with a range of sliders allowing you to tweak the composition and frequency range, and will even animate them randomly for you to create a gently shifting soundscape. A much simpler, but still great, option is Noisli with it’s nice clean interface. Both offer apps for iOS and Android. For bonus points, you can always try combining one or more of the above. Adding in a noise generator allows me to listen to quieter music while still getting good environmental isolation when I need concentration. 
Another favourite combo is to open both the cafe and rainfall generators from myNoise, made easier by the ability to pop out a mini-player then open up a second generator. I must be missing stuff though. What other musical genres should I try? What background sounds are nice to work to? Well, you know. The other day. Whatever. ↩︎ See e.g.: Lee, So Young, and Jay L. Brand. ‘Effects of Control over Office Workspace on Perceptions of the Work Environment and Work Outcomes’. Journal of Environmental Psychology 25, no. 3 (1 September 2005): 323–33. https://doi.org/10.1016/j.jenvp.2005.08.001. ↩︎ Open plan offices can actually work under certain conditions, The Conversation ↩︎ Working at the British Library: 6 months in It barely seems like it, but I’ve been at the British Library now for nearly 6 months. It always takes a long time to adjust, and from experience I know it’ll be another year before I feel fully settled, but my team, department and other colleagues have really made me feel welcome and like I belong. One thing that hasn’t got old yet is the occasional thrill of remembering that I work at my national library now. Every now and then I’ll catch a glimpse of the collections at Boston Spa or step into one of the reading rooms and think “wow, I actually work here!” I also like having a national and international role to play, which means I get to travel a bit more than I used to. Budgets are still tight so there are limits, and I still prefer to be home more often than not, but there is more scope in this job than I’ve had previously for travelling to conferences, giving talks that change the way people think, and learning in different contexts. I’m learning a lot too, especially how to work with and manage people split across multiple sites, and the care and feeding of budgets. As well as missing my old team at Sheffield, I do also miss some of the direct contact I had with researchers in HE. I especially miss the teaching work, but also the higher-level influencing of more senior academics to change practices on a wider scale. Still, I get to use those influencing skills in different ways now, and I’m still involved with the Carpentries, which should let me keep my hand in with teaching. I still deal with my general tendency to try and do All The Things, and as before I’m slowly learning to recognise it, tame it and very occasionally turn it to my advantage. That also leads to feelings of imposterism that are only magnified by the knowledge that I now work at a national institution! It’s a constant struggle some days to believe that I’ve actually earned my place here through hard work. Even if I don’t always feel that I have, my colleagues here certainly have, so I should have more faith in their opinion of me. Finally, I couldn’t write this type of thing without mentioning the commute. I’ve gone from 90 minutes each way on a good day (up to twice that if the trains were disrupted) to 35 minutes each way along fairly open roads. I have less time to read, but much more time at home. On top of that, the library has implemented flexitime across all pay grades, with even senior managers strongly encouraged to make full use of it. Not only is this an important enabler of equality across the organisation, it also relieves, for me personally, the pressure to work over my contracted hours and the guilt I’ve always felt at leaving work even 10 minutes early. If I work late, it’s now a choice I’m making based on business needs instead of guilt, and in full knowledge that I’ll get that time back later.
So that’s where I am right now. I’m really enjoying the work and the culture, and I look forward to what the next 6 months will bring! RDA Plenary 13 reflection Photo by me I sit here writing this in the departure lounge at Philadelphia International Airport, waiting for my Aer Lingus flight back after a week at the 13th Research Data Alliance (RDA) Plenary (although I’m actually publishing this a week or so later at home). I’m pretty exhausted, partly because of the jet lag, and partly because it’s been a very full week with so much to take in. It’s my first time at an RDA Plenary, and it was quite a new experience for me! First off, it’s my first time outside Europe, and thus my first time crossing quite so many timezones. I’ve been waking at 5am and ready to drop by 8pm, but I’ve struggled on through! Secondly, it’s the biggest conference I’ve been to for a long time, both in number of attendees and number of parallel sessions. There’s been a lot of sustained input so I’ve been very glad to have a room in the conference hotel and be able to escape for a few minutes when I needed to recharge. Thirdly, it’s not really like any other conference I’ve been to: rather than having large numbers of presentations submitted by attendees, each session comprises lots of parallel meetings of RDA interest groups and working groups. It’s more community-oriented: an opportunity for groups to get together face to face and make plans or show off results. I found it pretty intense and struggled to take it all in, but incredibly valuable nonetheless. Lots of information to process (I took a lot of notes) and a few contacts to follow up on too, so overall I loved it! Using Pipfile in Binder Photo by Sear Greyson on Unsplash I recently attended a workshop, organised by the excellent team of the Turing Way project, on a tool called BinderHub. BinderHub, along with public hosting platform MyBinder, allows you to publish computational notebooks online as “binders” such that they’re not static but fully interactive. It’s able to do this by using a tool called repo2docker to capture the full computational environment and dependencies required to run the notebook. !!! aside “What is the Turing Way?” The Turing Way is, in its own words, “a lightly opinionated guide to reproducible data science.” The team is building an open textbook and running a number of workshops for scientists and research software engineers, and you should check out the project on Github. You could even contribute! The Binder process goes roughly like this: Do some work in a Jupyter Notebook or similar Put it into a public git repository Add some extra metadata describing the packages and versions your code relies on Go to mybinder.org and tell it where to find your repository Open the URL it generates for you Profit Other than step 5, which can take some time to build the binder, this is a remarkably quick process. It supports a number of different languages too, including built-in support for R, Python and Julia and the ability to configure pretty much any other language that will run on Linux. However, the Python support currently requires you to have either a requirements.txt or Conda-style environment.yml file to specify dependencies, and I commonly use a Pipfile for this instead. Pipfile allows you to specify a loose range of compatible versions for maximal convenience, but then locks in specific versions for maximal reproducibility. 
You can upgrade packages any time you want, but you’re fully in control of when that happens, and the locked versions are checked into version control so that everyone working on a project gets consistency. Since Pipfile is emerging as something of a standard, I thought I’d see if I could use that in a binder, and it turns out to be remarkably simple. The reference implementation of Pipfile is a tool called pipenv by the prolific Kenneth Reitz. All you need to use this in your binder is two files of one line each. requirements.txt tells repo2docker to build a Python-based binder, and contains a single line to install the pipenv package: pipenv Then postBuild is used by repo2docker to install all other dependencies using pipenv: pipenv install --system The --system flag tells pipenv to install packages globally (its default behaviour is to create a Python virtualenv). With these two files, the binder builds and runs as expected. You can see a complete example that I put together during the workshop here on Gitlab. What do you think I should write about? I’ve found it increasingly difficult to make time to blog, and it’s not so much not having the time — I’m pretty privileged in that regard — but finding the motivation. Thinking about what used to motivate me, one of the big things was writing things that other people wanted to read. Rather than try to guess, I thought I’d ask! Those who know what I'm about, what would you read about, if it was written by me? I'm trying to break through the blog-writers block and would love to know what other people would like to see my ill-considered opinions on.— Jez Cope (@jezcope) March 7, 2019 I’m still looking for ideas, so please tweet me or leave me a comment below. Below are a few thoughts that I’m planning to do something with. Something taking one of the more techy aspects of Open Research, breaking it down and explaining the benefits for non-techy folks?— Dr Beth 🏳️‍🌈 🐺 (@PhdGeek) March 7, 2019 Skills (both techy and non techy) that people need to most effectively support RDM— Kate O'Neill (@KateFONeill) March 7, 2019 Sometimes I forget that my background makes me well-qualified to take some of these technical aspects of the job and break them down for different audiences. There might be a whole series in this… Carrying on our conversation last week I'd love to hear more about how you've found moving from an HE lib to a national library and how you see the BL's role in RDM. Appreciate this might be a bit niche/me looking for more interesting things to cite :)— Rosie Higman (@RosieHLib) March 7, 2019 This is interesting, and something I’d like to reflect on; moving from one job to another always has lessons and it’s easy to miss them if you’re not paying attention. Another one for the pile. Life without admin rights to your computer— Mike Croucher (@walkingrandomly) March 7, 2019 This is so frustrating as an end user, but at the same time I get that endpoint security is difficult and there are massive risks associated with letting end users have admin rights. This is particularly important at the BL: as custodians of a nation’s cultural heritage, the risk for us is bigger than for many, and for this reason we are now Cyber Essentials Plus certified. At some point I’d like to do some research and have a conversation with someone who knows a lot more about InfoSec to work out what the proper approach to this is, maybe involving VMs and a demilitarized zone on the network.
I’m always looking for more inspiration, so please leave a comment if you’ve got anything you’d like to read my thoughts on. If you’re not familiar with my writing, please take a minute or two to explore the blog; the tags page is probably a good place to get an overview. Ultimate Hacking Keyboard: first thoughts Following on from the excitement of having built a functioning keyboard myself, I got a parcel on Monday. Inside was something that I’ve been waiting for since September: an Ultimate Hacking Keyboard! Where the custom-built Laplace is small and quiet for travelling, the UHK is to be my main workhorse in the study at home. Here are my first impressions: Key switches I went with Kailh blue switches from the available options. In stark contrast to the quiet blacks on the Laplace, blues are NOISY! They have an extra piece of plastic inside the switch that causes an audible and tactile click when the switch activates. This makes them very satisfying to type on and should help as I train my fingers not to bottom out while typing, but does make them unsuitable for use in a shared office! Here are some animations showing how the main types of key switch vary. Layout This keyboard has what’s known as a 60% layout: no number pad, arrows or function keys. As with the more spartan Laplace, these “missing” keys are made up for with programmable layers. For example, the arrow keys are on the Mod layer on the I/J/K/L keys, so I can access them without moving from the home row. I actually find this preferable to having to move my hand to the right to reach them, and I really never used the number pad in any case. Split This is a split keyboard, which means that the left and right halves can be separated to place the hands further apart which eases strain across the shoulders. The UHK has a neat coiled cable joining the two which doesn’t get in the way. A cool design feature is that the two halves can be slotted back together and function perfectly well as a non-split keyboard too, held together by magnets. There are even electrical contacts so that when the two are joined you don’t need the linking cable. Programming The board is fully programmable, and this is achieved via a custom (open source) GUI tool which talks to the (open source) firmware on the board. You can have multiple keymaps, each of which has a separate Base, Mod, Fn and Mouse layer, and there’s an LED display that shows a short mnemonic for the currently active map. I already have a customised Dvorak layout for day-to-day use, plus a standard QWERTY for not-me to use and an alternative QWERTY which will be slowly tweaked for games that don’t work well with Dvorak. Mouse keys One cool feature that the designers have included in the firmware is the ability to emulate a mouse. There’s a separate layer that allows me to move the cursor, scroll and click without moving my hands from the keyboard. Palm rests Not much to say about the palm rests, other than they are solid wood, and chunky, and really add a little something. I have to say, I really like it so far! Overall it feels really well designed, with every little detail carefully thought out and excellent build quality and a really solid feeling. Custom-built keyboard I’m typing this post on a keyboard I made myself, and I’m rather excited about it! Why make my own keyboard? 
I wanted to learn a little bit about practical electronics, and I like to learn by doing I wanted to have the feeling of making something useful with my own hands I actually need a small, keyboard with good-quality switches now that I travel a fair bit for work and this lets me completely customise it to my needs Just because! While it is possible to make a keyboard completely from scratch, it makes much more sense to put together some premade parts. The parts you need are: PCB (printed circuit board): the backbone of the keyboard, to which all the other electrical components attach, this defines the possible physical locations for each key Switches: one for each key to complete a circuit whenever you press it Keycaps: switches are pretty ugly and pretty uncomfortable to press, so each one gets a cap; these are what you probably think of as the “keys” on your keyboard and come in almost limitless variety of designs (within the obvious size limitation) and are the easiest bit of personalisation Controller: the clever bit, which detects open and closed switches on the PCB and tells your computer what keys you pressed via a USB cable Firmware: the program that runs on the controller starts off as source code like any other program, and altering this can make the keyboard behave in loads of different ways, from different layouts to multiple layers accessed by holding a particular key, to macros and even emulating a mouse! In my case, I’ve gone for the following: PCB Laplace from keeb.io, a very compact 47-key (“40%") board, with no number pad, function keys or number row, but a lot of flexibility for key placement on the bottom row. One of my key design goals was small size so I can just pop it in my bag and have on my lap on the train. Controller Elite-C, designed specifically for keyboard builds to be physically compatible with the cheaper Pro Micro, with a more-robust USB port (the Pro Micro’s has a tendency to snap off), and made easier to program with a built-in reset button and better bootloader. Switches Gateron Black: Gateron is one of a number of manufacturers of mechanical switches compatible with the popular Cherry range. The black switch is linear (no click or bump at the activation point) and slightly heavier sprung than the more common red. Cherry also make a black switch but the Gateron version is slightly lighter and having tested a few I found them smoother too. My key goal here was to reduce noise, as the stronger spring will help me type accurately without hitting the bottom of the keystroke with an audible sound. Keycaps Blank grey PBT in DSA profile: this keyboard layout has a lot of non-standard sized keys, so blank keycaps meant that I wouldn’t be putting lots of keys out of their usual position; they’re also relatively cheap, fairly classy IMHO and a good placeholder until I end up getting some really cool caps on a group buy or something; oh, and it minimises the chance of someone else trying the keyboard and getting freaked out by the layout… Firmware QMK (Quantum Mechanical Keyboard), with a work-in-progress layout, based on Dvorak. QMK has a lot of features and allows you to fully program each and every key, with multiple layers accessed through several different routes. Because there are so few keys on this board, I’ll need to make good use of layers to make all the keys on a usual keyboard available. 
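Real QMK keymaps are written in C, but to illustrate the idea of layers mentioned above, here is a deliberately simplified toy model in Python. It is my own illustration rather than QMK's data structures, and the exact arrow placement on I/J/K/L is my guess based on the Mod-layer arrangement described for the UHK earlier:

```python
# Toy model of keyboard layers: each layer maps physical keys to actions, and a
# held modifier selects which layer is consulted, falling back to the base
# layer. Purely illustrative; real QMK keymaps are C arrays.
BASE = {'I': 'i', 'J': 'j', 'K': 'k', 'L': 'l'}
MOD = {'I': 'Up', 'J': 'Left', 'K': 'Down', 'L': 'Right'}

def resolve(key: str, mod_held: bool = False) -> str:
    """Return the action for a keypress given the currently active layer."""
    layer = MOD if mod_held else BASE
    return layer.get(key, BASE.get(key, key))

assert resolve('J') == 'j'
assert resolve('J', mod_held=True) == 'Left'
```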
Dvorak Simplified Keyboard I’m grateful to the folks of the Leeds Hack Space, especially Nav & Mark who patiently coached me in various soldering techniques and good practice, but also everyone else who were so friendly and welcoming and interested in my project. I’m really pleased with the result, which is small, light and fully customisable. Playing with QMK firmware features will keep me occupied for quite a while! This isn’t the end though, as I’ll need a case to keep the dust out. I’m hoping to be able to 3D print this or mill it from wood with a CNC mill, for which I’ll need to head back to the Hack Space! Less, but better “Wenniger aber besser” — Dieter Rams {:.big-quote} I can barely believe it’s a full year since I published my intentions for 2018. A lot has happened since then. Principally: in November I started a new job as Data Services Lead at The British Library. One thing that hasn’t changed is my tendency to try to do too much, so this year I’m going to try and focus on a single intention, a translation of designer Dieter Rams' famous quote above: Less, but better. This chimes with a couple of other things I was toying with over the Christmas break, as they’re essentially other ways of saying the same thing: Take it steady One thing at a time I’m also going to keep in mind those touchstones from last year: What difference is this making? Am I looking after myself? Do I have evidence for this? I mainly forget to think about them, so I’ll be sticking up post-its everywhere to help me remember! How to extend Python with Rust: part 1 Python is great, but I find it useful to have an alternative language under my belt for occasions when no amount of Pythonic cleverness will make some bit of code run fast enough. One of my main reasons for wanting to learn Rust was to have something better than C for that. Not only does Rust have all sorts of advantages that make it a good choice for code that needs to run fast and correctly, it’s also got a couple of rather nice crates (libraries) that make interfacing with Python a lot nicer. Here’s a little tutorial to show you how easy it is to call a simple Rust function from Python. If you want to try it yourself, you’ll find the code on GitHub. !!! prerequisites I’m assuming for this tutorial that you’re already familiar with writing Python scripts and importing & using packages, and that you’re comfortable using the command line. You’ll also need to have installed Rust. The Rust bit The quickest way to get compiled code into Python is to use the builtin ctypes package. This is Python’s “Foreign Function Interface” or FFI: a means of calling functions outside the language you’re using to make the call. ctypes allows us to call arbitrary functions in a shared library1, as long as those functions conform to certain standard C language calling conventions. Thankfully, Rust tries hard to make it easy for us to build such a shared library. The first thing to do is to create a new project with cargo, the Rust build tool: $ cargo new rustfrompy Created library `rustfrompy` project $ tree . ├── Cargo.toml └── src └── lib.rs 1 directory, 2 files !!! aside I use the fairly common convention that text set in fixed-width font is either example code or commands to type in. For the latter, a $ precedes the command that you type (omit the $), and lines that don’t start with a $ are output from the previous command. I assume a basic familiarity with Unix-style command line, but I should probably put in some links to resources if you need to learn more! 
We need to edit the Cargo.toml file and add a [lib] section: [package] name = "rustfrompy" version = "0.1.0" authors = ["Jez Cope <j.cope@erambler.co.uk>"] [dependencies] [lib] name = "rustfrompy" crate-type = ["cdylib"] This tells cargo that we want to make a C-compatible dynamic library (crate-type = ["cdylib"]) and what to call it, plus some standard metadata. We can then put our code in src/lib.rs. We’ll just use a simple toy function that adds two numbers together: #[no_mangle] pub fn add(a: i64, b: i64) -> i64 { a + b } Notice the pub keyword, which instructs the compiler to make this function accessible to other modules, and the #[no_mangle] annotation, which tells it to use the standard C naming conventions for functions. If we don’t do this, then Rust will generate a new name for the function for its own nefarious purposes, and as a side effect we won’t know what to call it when we want to use it from Python. Being good developers, let’s also add a test: #[cfg(test)] mod test { use ::*; #[test] fn test_add() { assert_eq!(4, add(2, 2)); } } We can now run cargo test which will compile that code and run the test: $ cargo test Compiling rustfrompy v0.1.0 (file:///home/jez/Personal/Projects/rustfrompy) Finished dev [unoptimized + debuginfo] target(s) in 1.2 secs Running target/debug/deps/rustfrompy-3033caaa9f5f17aa running 1 test test test::test_add ... ok test result: ok. 1 passed; 0 failed; 0 ignored; 0 measured; 0 filtered out Everything worked! Now just to build that shared library and we can try calling it from Python: $ cargo build Compiling rustfrompy v0.1.0 (file:///home/jez/Personal/Projects/rustfrompy) Finished dev [unoptimized + debuginfo] target(s) in 0.30 secs Notice that the build is unoptimized and includes debugging information: this is useful in development, but once we’re ready to use our code it will run much faster if we compile it with optimisations. Cargo makes this easy: $ cargo build --release Compiling rustfrompy v0.1.0 (file:///home/jez/Personal/Projects/rustfrompy) Finished release [optimized] target(s) in 0.30 secs The Python bit After all that, the Python bit is pretty short. First we import the ctypes package (which is included in all recent Python versions): from ctypes import cdll Cargo has tidied our shared library away into a folder, so we need to tell Python where to load it from. On Linux, it will be called lib<something>.so where the “something” is the crate name from Cargo.toml, “rustfrompy”: lib = cdll.LoadLibrary('target/release/librustfrompy.so') Finally we can call the function anywhere we want. Here it is in a pytest-style test: def test_rust_add(): assert lib.add(27, 15) == 42 If you have pytest installed (and you should!) you can run the whole test like this: $ pytest --verbose test.py ====================================== test session starts ====================================== platform linux -- Python 3.6.4, pytest-3.1.1, py-1.4.33, pluggy-0.4.0 -- /home/jez/.virtualenvs/datasci/bin/python cachedir: .cache rootdir: /home/jez/Personal/Projects/rustfrompy, inifile: collected 1 items test.py::test_rust_add PASSED It worked! I’ve put both the Rust and Python code on github if you want to try it for yourself. Shortcomings Ok, so that was a pretty simple example, and I glossed over a lot of things. For example, what would happen if we did lib.add(2.0, 2)? This causes Python to throw an error because our Rust function only accepts integers (64-bit signed integers, i64, to be precise), and we gave it a floating point number. 
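One way to catch that kind of mistake up front (this is standard ctypes usage rather than anything Rust-specific) is to declare the function's signature on the Python side, so that ctypes checks and converts arguments before the call ever crosses the FFI boundary; the c_int64 types below are chosen to match the i64 parameters in the Rust function:

```python
from ctypes import cdll, c_int64

lib = cdll.LoadLibrary('target/release/librustfrompy.so')

# Declare the signature so ctypes knows how to marshal arguments and the result.
lib.add.argtypes = [c_int64, c_int64]  # matches the i64 arguments in Rust
lib.add.restype = c_int64              # matches the i64 return type

assert lib.add(2, 2) == 4
# lib.add(2.0, 2) is now rejected with a clear ctypes.ArgumentError before the call.
```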
ctypes can’t guess what type(s) a given function will work with, but it can at least tell us when we get it wrong. To fix this properly, we need to do some extra work along the lines sketched above, telling the ctypes library what the argument and return types for each function are. For a more complex library, there will probably be more housekeeping to do, such as translating return codes from functions into more Pythonic-style errors. For a small example like this there isn’t much of a problem, but the bigger your compiled library the more extra boilerplate is required on the Python side just to use all the functions. When you’re working with an existing library you don’t have much choice about this, but if you’re building it from scratch specifically to interface with Python, there’s a better way using the Python C API. You can call this directly in Rust, but there are a couple of Rust crates that make life much easier, and I’ll be taking a look at those in a future blog post. .so on Linux, .dylib on Mac and .dll on Windows ↩︎ New Year's irresolution Photo by Andrew Hughes on Unsplash I’ve chosen not to make any specific resolutions this year; I’ve found that they just don’t work for me. Like many people, all I get is a sense of guilt when I inevitably fail to live up to the expectations I set myself at the start of the year. However, I have set a couple of what I’m referring to as “themes” for the year: touchstones that I’ll aim to refer to when setting priorities or just feeling a bit overwhelmed or lacking in direction. They are: Contribution Self-care Measurement I may do some blog posts expanding on these, but in the meantime, I’ve put together a handful of questions to help me think about priorities and get perspective when I’m doing (or avoiding doing) something. What difference is this making? I feel more motivated when I can figure out how I’m contributing to something bigger than myself. In society? In my organisation? To my friends & family? Am I looking after myself? I focus a lot on the expectations others have (or at least that I think they have) of me, but I can’t do anything well unless I’m generally happy and healthy. Is this making me happier and healthier? Is this building my capacity to look after myself, my family & friends and do my job? Is this worth the amount of time and energy I’m putting in? Do I have evidence for this? I don’t have to base decisions purely on feelings/opinions: I have the skills to obtain, analyse and interpret data. Is this fact or opinion? What are the facts? Am I overthinking this? Can I put a confidence interval on this? Build documents from code and data with Saga !!! tldr “TL;DR” I’ve made Saga, a thing for compiling documents by combining code and data with templates. What is it? Saga is a very simple command-line tool that reads in one or more data files, runs one or more scripts, then passes the results into a template to produce a final output document. It enables you to maintain a clean separation between data, logic and presentation and produce data-based documents that can easily be updated. That allows the flow of data through the document to be easily understood, a cornerstone of reproducible analysis. You run it like this: saga build -d data.yaml -d other_data.yaml \ -s analysis.py -t report.md.tmpl \ -O report.md Any scripts specified with -s will have access to the data in local variables, and any changes to local variables in a script will be retained when everything is passed to the template for rendering.
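If it helps to picture that data -> script -> template flow, here is a minimal standalone sketch of the same idea using PyYAML and Mako directly. This is my own illustration of the concept, not Saga's implementation, and the inline strings stand in for the data.yaml, analysis.py and report.md.tmpl files named above.

```python
# Minimal sketch of the data -> script -> template idea (not Saga's actual code).
import yaml
from mako.template import Template

# 1. Data: would normally live in a file like data.yaml
env = yaml.safe_load("samples: [4, 8, 15, 16, 23, 42]")

# 2. Logic: would normally live in analysis.py; results land in the same namespace
env["total"] = sum(env["samples"])
env["mean"] = env["total"] / len(env["samples"])

# 3. Presentation: would normally live in report.md.tmpl
template = Template("We analysed ${len(samples)} samples; the mean value was ${'%.1f' % mean}.")
print(template.render(**env))
# prints: We analysed 6 samples; the mean value was 18.0.
```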
For debugging, you can also do: saga dump -d data.yaml -d other_data.yaml -s analysis.py which will print out the full environment that would be passed to your template with saga build. Features Right now this is a really early version. It does the job but I have lots of ideas for features to add if I ever have time. At present it does the following: Reads data from one or more YAML files Transforms data with one or more Python scripts Renders a template in Mako format Works with any plain-text output format, including Markdown, LaTeX and HTML Use cases Write reproducible reports & papers based on machine-readable data Separate presentation from content in any document, e.g. your CV (example coming soon) Yours here? Get it! I haven’t released this on PyPI yet, but all the code is available on GitHub to try out. If you have pipenv installed (and if you use Python you should!), you can try it out in an isolated virtual environment by doing: git clone https://github.com/jezcope/sagadoc.git cd sagadoc pipenv install pipenv run saga or you can set up for development and run some tests: pipenv install --dev pipenv run pytest Why? Like a lot of people, I have to produce reports for work, often containing statistics computed from data. Although these generally aren’t academic research papers, I see no reason not to aim for a similar level of reproducibility: after all, if I’m telling other people to do it, I’d better take my own advice! A couple of times now I’ve done this by writing a template that holds the text of the report and placeholders for values, along with a Python script that reads in the data, calculates the statistics I want and completes the template. This is valuable for two main reasons: If anyone wants to know how I processed the data and calculated those statistics, it’s all there: no need to try and remember and reproduce a series of button clicks in Excel; If the data or calculations change, I just need to update the relevant part and run it again, and all the relevant parts of the document will be updated. This is particularly important if changing a single data value requires recalculation of dozens of tables, charts, etc. It also gives me the potential to factor out and reuse bits of code in the future, add tests and version control everything. Now that I’ve done this more than once (and it seems likely I’ll do it again) it makes sense to package that script up in a more portable form so I don’t have to write it over and over again (or, shock horror, copy & paste it!). It saves time, and gives others the possibility to make use of it. Prior art I’m not the first person to think of this, but I couldn’t find anything that did exactly what I needed. Several tools will let you interweave code and prose, including the results of evaluating each code snippet in the document: chief among these are Jupyter and Rmarkdown. There are also tools that let you write code in the order that makes most sense to read and then rearrange it into the right order to execute, so-call literate programming. The original tool for this is the venerable noweb. Sadly there is very little that combine both of these and allow you to insert the results of various calculations at arbitrary points in a document, independent of the order of either presenting or executing the code. The only two that I’m aware of are: Dexy and org-mode. Unfortunately, Dexy currently only works on Legacy Python (/Python 2) and org-mode requires emacs (which is fine but not exactly portable). 
Rmarkdown comes close and supports a range of languages but the full feature set is only available with R. Actually, my ideal solution is org-mode without the emacs dependency, because that’s the most flexible solution; maybe one day I’ll have both the time and skill to implement that. It’s also possible I might be able to figure out Dexy’s internals to add what I want to it, but until then Saga does the job! Future work There are lots of features that I’d still like to add when I have time: Some actual documentation! And examples! More data formats (e.g. CSV, JSON, TOML) More languages (e.g. R, Julia) Fetching remote data over http Caching of intermediate results to speed up rebuilds For now, though, I’d love for you to try it out and let me know what you think! As ever, comment here, tweet me or start an issue on GitHub. Why try Rust for scientific computing? When you’re writing analysis code, Python (or R, or JavaScript, or …) is usually the right choice. These high-level languages are set up to make you as productive as possible, and common tasks like array manipulation have been well optimised. However, sometimes you just can’t get enough speed and need to turn to a lower-level compiled language. Often that will be C, C++ or Fortran, but I thought I’d do a short post on why I think you should consider Rust. One of my goals for 2017’s Advent of Code was to learn a modern, memory-safe, statically-typed language. I now know that there are quite a lot of options in this space, but two seem to stand out: Go & Rust. I gave both of them a try, and although I’ll probably go back to give Go a more thorough test at some point I found I got quite hooked on Rust. Both languages, though young, are definitely production-ready. Servo, the core of the new Firefox browser, is entirely written in Rust. In fact, Mozilla have been trying to rewrite the rendering core in C for nearly a decade, and switching to Rust let them get it done in just a couple of years. !!! tldr “TL;DR” - It’s fast: competitive with idiomatic C/C++, and no garbage-collection overhead - It’s harder to write buggy code, and compiler errors are actually helpful - It’s C-compatible: you can call into Rust code anywhere you’d call into C, call C/C++ from Rust, and incrementally replace C/C++ code with Rust - It has sensible modern syntax that makes your code clearer and more concise - Support for scientific computing are getting better all the time (matrix algebra libraries, built-in SIMD, safe concurrency) - It has a really friendly and active community - It’s production-ready: Servo, the new rendering core in Firefox, is built entirely in Rust Performance To start with, as a compiled language Rust executes much faster than a (pseudo-)interpreted language like Python or R; the price you pay for this is time spent compiling during development. However, having a compile step also allows the language to enforce certain guarantees, such as type-correctness and memory safety, which between them prevent whole classes of bugs from even being possible. Unlike Go (which, like many higher-level languages, uses a garbage collector), Rust handles memory safety at compile time through the concepts of ownership and borrowing. These can take some getting used to and were a big source of frustration when I was first figuring out the language, but ultimately contribute to Rust’s reliably-fast performance. 
Performance can be unpredictable in a garbage-collected language because you can’t be sure when the GC is going to run and you need to understand it really well to stand a chance of optimising it if becomes a problem. On the other hand, code that has the potential to be unsafe will result in compilation errors in Rust. There are a number of benchmarks (example) that show Rust’s performance on a par with idiomatic C & C++ code, something that very few languages can boast. Helpful error messages Because beginner Rust programmers often get compile errors, it’s really important that those errors are easy to interpret and fix, and Rust is great at this. Not only does it tell you what went wrong, but wherever possible it prints out your code annotated with arrows to show exactly where the error is, and makes specific suggestions how to fix the error which usually turn out to be correct. It also has a nice suite of warnings (things that don’t cause compilation to fail but may indicate bugs) that are just as informative, and this can be extended even further by using the clippy linting tool to further analyse your code. warning: unused variable: `y` --> hello.rs:3:9 | 3 | let y = x; | ^ | = note: #[warn(unused_variables)] on by default = note: to avoid this warning, consider using `_y` instead Easy to integrate with other languages If you’re like me, you’ll probably only use a low-level language for performance-critical code that you can call from a high-level language, and this is an area where Rust shines. Most programmers will turn to C, C++ or Fortran for this because they have a well established ABI (Application Binary Interface) which can be understood by languages like Python and R1. In Rust, it’s trivial to make a C-compatible shared library, and the standard library includes extra features for working with C types. That also means that existing C code can be incrementally ported to Rust: see remacs for an example. On top of this, there are projects like rust-cpython and PyO3 which provide macros and structures that wrap the Python C API to let you build Python modules in Rust with minimal glue code; rustr does a similar job for R. Nice language features Rust has some really nice features, which let you write efficient, concise and correct code. Several feel particularly comfortable as they remind me of similar things available in Haskell, including: Enums, a super-powered combination of C enums and unions (similar to Haskell’s algebraic data types) that enable some really nice code with no runtime cost Generics and traits that let you get more done with less code Pattern matching, a kind of case statement that lets you extract parts of structs, tuples & enums and do all sorts of other clever things Lazy computation based on an iterator pattern, for efficient processing of lists of things: you can do for item in list { ... } instead of the C-style use of an index2, or you can use higher-order functions like map and filter Functions/closures as first-class citizens Scientific computing Although it’s a general-purpose language and not designed specifically for scientific computing, Rust’s support is improving all the time. There are some interesting matrix algebra libraries available, and built-in SIMD is incoming. The memory safety features also work to ensure thread safety, so it’s harder to write concurrency bugs. You should be able to use your favourite MPI implementation too, and there’s at least one attempt to portably wrap MPI in a more Rust-like way. 
Active development and friendly community One of the things you notice straight away is how active and friendly the Rust community is. There are several IRC channels on irc.mozilla.org including #rust-beginners, which is a great place to get help. The compiler is under constant but carefully-managed development, so that new features are landing all the time but without breaking existing code. And the fabulous Cargo build tool and crates.io are enabling the rapid growth of a healthy ecosystem of open source libraries that you can use to write less code yourself. Summary So, next time you need a compiled language to speed up hotspots in your code, try Rust. I promise you won’t regret it! Julia actually allows you to call C and Fortran functions as a first-class language feature ↩︎ Actually, since C++11 there’s for (auto item : list) { ... } but still… ↩︎ Reflections on #aoc2017 Trees reflected in a lake Joshua Reddekopp on Unsplash It seems like ages ago, but way back in November I committed to completing Advent of Code. I managed it all, and it was fun! All of my code is available on GitHub if you’re interested in seeing what I did, and I managed to get out a blog post for every one with a bit more commentary, which you can see in the series list above. How did I approach it? I’ve not really done any serious programming challenges before. I don’t get to write a lot of code at the moment, so all I wanted from AoC was an excuse to do some proper problem-solving. I never really intended to take a polyglot approach, though I did think that I might use mainly Python with a bit of Haskell. In the end, though, I used: Python (×12); Haskell (×7); Rust (×4); Go; C++; Ruby; Julia; and Coconut. For the most part, my priorities were getting the right answer, followed by writing readable code. I didn’t specifically focus on performance but did try to avoid falling into traps that I knew about. What did I learn? I found Python the easiest to get on with: it’s the language I know best and although I can’t always remember exact method names and parameters I know what’s available and where to look to remind myself, as well as most of the common idioms and some performance traps to avoid. Python was therefore the language that let me focus most on solving the problem itself. C++ and Ruby were more challenging, and it was harder to write good idiomatic code but I can still remember quite a lot. Haskell I haven’t used since university, and just like back then I really enjoyed working out how to solve problems in a functional style while still being readable and efficient (not always something I achieved…). I learned a lot about core Haskell concepts like monads & functors, and I’m really amazed by the way the Haskell community and ecosystem has grown up in the last decade. I also wanted to learn at least one modern, memory-safe compiled language, so I tried both Go and Rust. Both seem like useful languages, but Rust really intrigued me with its conceptual similarities to both Haskell and C++ and its promise of memory safety without a garbage collector. I struggled a lot initially with the “borrow checker” (the component that enforces memory safety at compile time) but eventually started thinking in terms of ownership and lifetimes after which things became easier. The Rust community seems really vibrant and friendly too. What next? I really want to keep this up, so I’m going to look out some more programming challenges (Project Euler looks interesting). 
It turns out there’s a regular Code Dojo meetup in Leeds, so hopefully I’ll try that out too. I’d like to do more realistic data-science stuff, so I’ll be taking a closer look at stuff like Kaggle too, and figuring out how to do a bit more analysis at work. I’m also feeling motivated to find an open source project to contribute to and/or release a project of my own, so we’ll see if that goes anywhere! I’ve always found the advice to “scratch your own itch” difficult to follow because everything I think of myself has already been done better. Most of the projects I use enough to want to contribute to tend to be pretty well developed with big communities and any bugs that might be accessible to me will be picked off and fixed before I have a chance to get started. Maybe it’s time to get over myself and just reimplement something that already exists, just for the fun of it! The Halting Problem — Python — #adventofcode Day 25 Today’s challenge, takes us back to a bit of computing history: a good old-fashioned Turing Machine. → Full code on GitHub !!! commentary Today’s challenge was a nice bit of nostalgia, taking me back to my university days learning about the theory of computing. Turing Machines are a classic bit of computing theory, and are provably able to compute any value that is possible to compute: a value is computable if and only if a Turing Machine can be written that computes it (though in practice anything non-trivial is mind-bendingly hard to write as a TM). A bit of a library-fest today, compared to other days! from collections import deque, namedtuple from collections.abc import Iterator from tqdm import tqdm import re import fileinput as fi These regular expressions are used to parse the input that defines the transition table for the machine. RE_ISTATE = re.compile(r'Begin in state (?P<state>\w+)\.') RE_RUNTIME = re.compile( r'Perform a diagnostic checksum after (?P<steps>\d+) steps.') RE_STATETRANS = re.compile( r"In state (?P<state>\w+):\n" r" If the current value is (?P<read0>\d+):\n" r" - Write the value (?P<write0>\d+)\.\n" r" - Move one slot to the (?P<move0>left|right).\n" r" - Continue with state (?P<next0>\w+).\n" r" If the current value is (?P<read1>\d+):\n" r" - Write the value (?P<write1>\d+)\.\n" r" - Move one slot to the (?P<move1>left|right).\n" r" - Continue with state (?P<next1>\w+).") MOVE = {'left': -1, 'right': 1} A namedtuple to provide some sugar when using a transition rule. Rule = namedtuple('Rule', 'write move next_state') The TuringMachine class does all the work. class TuringMachine: def __init__(self, program=None): self.tape = deque() self.transition_table = {} self.state = None self.runtime = 0 self.steps = 0 self.pos = 0 self.offset = 0 if program is not None: self.load(program) def __str__(self): return f"Current: {self.state}; steps: {self.steps} of {self.runtime}" Some jiggery-pokery to allow us to use self[pos] to reference an infinite tape. def __getitem__(self, i): i += self.offset if i < 0 or i >= len(self.tape): return 0 else: return self.tape[i] def __setitem__(self, i, x): i += self.offset if i >= 0 and i < len(self.tape): self.tape[i] = x elif i == -1: self.tape.appendleft(x) self.offset += 1 elif i == len(self.tape): self.tape.append(x) else: raise IndexError('Tried to set position off end of tape') Parse the program and set up the transtion table. 
def load(self, program): if isinstance(program, Iterator): program = ''.join(program) match = RE_ISTATE.search(program) self.state = match['state'] match = RE_RUNTIME.search(program) self.runtime = int(match['steps']) for match in RE_STATETRANS.finditer(program): self.transition_table[match['state']] = { int(match['read0']): Rule(write=int(match['write0']), move=MOVE[match['move0']], next_state=match['next0']), int(match['read1']): Rule(write=int(match['write1']), move=MOVE[match['move1']], next_state=match['next1']), } Run the program for the required number of steps (given by self.runtime). tqdm isn’t in the standard library but it should be: it shows a lovely text-mode progress bar as we go. def run(self): for _ in tqdm(range(self.runtime), desc="Running", unit="steps", unit_scale=True): read = self[self.pos] rule = self.transition_table[self.state][read] self[self.pos] = rule.write self.pos += rule.move self.state = rule.next_state Calculate the “diagnostic checksum” required for the answer. @property def checksum(self): return sum(self.tape) Aaand GO! machine = TuringMachine(fi.input()) machine.run() print("Checksum:", machine.checksum) Electromagnetic Moat — Rust — #adventofcode Day 24 Today’s challenge, the penultimate, requires us to build a bridge capable of reaching across to the CPU, our final destination. → Full code on GitHub !!! commentary We have a finite number of components that fit together in a restricted way from which to build a bridge, and we have to work out both the strongest and the longest bridge we can build. The most obvious way to do this is to recursively build every possible bridge and select the best, but that’s an O(n!) algorithm that could blow up quickly, so might as well go with a nice fast language! Might have to try this in Haskell too, because it’s the type of algorithm that lends itself naturally to a pure functional approach. I feel like I've applied some of the things I've learned in previous challenges I used Rust for, and spent less time mucking about with ownership, and made better use of various language features, including structs and iterators. I'm rather pleased with how my learning of this language is progressing. I'm definitely overusing `Option.unwrap` at the moment though: this is a lazy way to deal with `Option` results and will panic if the result is not what's expected. I'm not sure whether I need to be cloning the components `Vector` either, or whether I could just be passing iterators around. First, we import some bits of standard library and define some data types. The BridgeResult struct lets us use the same algorithm for both parts of the challenge and simply change the value used to calculate the maximum. 
use std::io; use std::fmt; use std::io::BufRead; #[derive(Debug, Copy, Clone, PartialEq, Eq, Hash)] struct Component(u8, u8); #[derive(Debug, Copy, Clone, Default)] struct BridgeResult { strength: u16, length: u16, } impl Component { fn from_str(s: &str) -> Component { let parts: Vec<&str> = s.split('/').collect(); assert!(parts.len() == 2); Component(parts[0].parse().unwrap(), parts[1].parse().unwrap()) } fn fits(self, port: u8) -> bool { self.0 == port || self.1 == port } fn other_end(self, port: u8) -> u8 { if self.0 == port { return self.1; } else if self.1 == port { return self.0; } else { panic!("{} doesn't fit port {}", self, port); } } fn strength(self) -> u16 { self.0 as u16 + self.1 as u16 } } impl fmt::Display for BridgeResult { fn fmt(&self, f: &mut fmt::Formatter) -> fmt::Result { write!(f, "(S: {}, L: {})", self.strength, self.length) } } best_bridge calculates the length and strength of the “best” bridge that can be built from the remaining components and fits the required port. Whether this is based on strength or length is given by the key parameter, which is passed to Iter.max_by_key. fn best_bridge<F>(port: u8, key: &F, components: &Vec<Component>) -> Option<BridgeResult> where F: Fn(&BridgeResult) -> u16 { if components.len() == 0 { return None; } components.iter() .filter(|c| c.fits(port)) .map(|c| { let b = best_bridge(c.other_end(port), key, &components.clone().into_iter() .filter(|x| x != c).collect()) .unwrap_or_default(); BridgeResult{strength: c.strength() + b.strength, length: 1 + b.length} }) .max_by_key(key) } Now all that remains is to read the input and calculate the result. I was rather pleasantly surprised to find that in spite of my pessimistic predictions about efficiency, when compiled with optimisations turned on this terminates in less than 1s on my laptop. fn main() { let stdin = io::stdin(); let components: Vec<_> = stdin.lock() .lines() .map(|l| Component::from_str(&l.unwrap())) .collect(); match best_bridge(0, &|b: &BridgeResult| b.strength, &components) { Some(b) => println!("Strongest bridge is {}", b), None => println!("No strongest bridge found") }; match best_bridge(0, &|b: &BridgeResult| b.length, &components) { Some(b) => println!("Longest bridge is {}", b), None => println!("No longest bridge found") }; } Coprocessor Conflagration — Haskell — #adventofcode Day 23 Today’s challenge requires us to understand why a coprocessor is working so hard to perform an apparently simple calculation. → Full code on GitHub !!! commentary Today’s problem is based on an assembly-like language very similar to day 18, so I went back and adapted my code from that, which works well for the first part. I’ve also incorporated some advice from /r/haskell, and cleaned up all warnings shown by the -Wall compiler flag and the hlint tool. Part 2 requires the algorithm to run with much larger inputs, and since some analysis shows that it's an `O(n^3)` algorithm it gets intractible pretty fast. There are several approaches to this. First up, if you have a fast enough processor and an efficient enough implementation I suspect that the simulation would probably terminate eventually, but that would likely still take hours: not good enough. I also thought about doing some peephole optimisations on the instructions, but the last time I did compiler optimisation was my degree so I wasn't really sure where to start. What I ended up doing was actually analysing the input code by hand to figure out what it was doing, and then just doing that calculation in a sensible way. 
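In Python terms, the hand-derived replacement boils down to counting the composite numbers between two bounds, stepping by the increment the assembly uses. Here is a minimal sketch of that idea (not part of the solution itself; the bounds 107900 and 124900 and the step of 17 come from my input, exactly as used by optimisedCalc further down):

```python
# Count the composite (non-prime) numbers from 107900 to 124900 inclusive,
# stepping by 17, using simple trial division.
def is_composite(n):
    return any(n % d == 0 for d in range(2, int(n ** 0.5) + 1))

print(sum(1 for n in range(107900, 124901, 17) if is_composite(n)))
```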
I'd like to say I managed this on my own (and I ike to think I would have) but I did get some tips on [/r/adventofcode](https://reddit.com/r/adventofcode). The majority of this code is simply a cleaned-up version of day 18, with some tweaks to accommodate the different instruction set: module Main where import qualified Data.Vector as V import qualified Data.Map.Strict as M import Control.Monad.State.Strict import Text.ParserCombinators.Parsec hiding (State) type Register = Char type Value = Int type Argument = Either Value Register data Instruction = Set Register Argument | Sub Register Argument | Mul Register Argument | Jnz Argument Argument deriving Show type Program = V.Vector Instruction data Result = Cont | Halt deriving (Eq, Show) type Registers = M.Map Char Int data Machine = Machine { dRegisters :: Registers , dPtr :: !Int , dMulCount :: !Int , dProgram :: Program } instance Show Machine where show d = show (dRegisters d) ++ " @" ++ show (dPtr d) ++ " ×" ++ show (dMulCount d) defaultMachine :: Machine defaultMachine = Machine M.empty 0 0 V.empty type MachineState = State Machine program :: GenParser Char st Program program = do instructions <- endBy instruction eol return $ V.fromList instructions where instruction = try (regOp "set" Set) <|> regOp "sub" Sub <|> regOp "mul" Mul <|> jump "jnz" Jnz regOp n c = do string n >> spaces val1 <- oneOf "abcdefgh" secondArg c val1 jump n c = do string n >> spaces val1 <- regOrVal secondArg c val1 secondArg c val1 = do spaces val2 <- regOrVal return $ c val1 val2 regOrVal = register <|> value register = do name <- lower return $ Right name value = do val <- many $ oneOf "-0123456789" return $ Left $ read val eol = char '\n' parseProgram :: String -> Either ParseError Program parseProgram = parse program "" getReg :: Char -> MachineState Int getReg r = do st <- get return $ M.findWithDefault 0 r (dRegisters st) putReg :: Char -> Int -> MachineState () putReg r v = do st <- get let current = dRegisters st new = M.insert r v current put $ st { dRegisters = new } modReg :: (Int -> Int -> Int) -> Char -> Argument -> MachineState () modReg op r v = do u <- getReg r v' <- getRegOrVal v putReg r (u `op` v') incPtr getRegOrVal :: Argument -> MachineState Int getRegOrVal = either return getReg addPtr :: Int -> MachineState () addPtr n = do st <- get put $ st { dPtr = n + dPtr st } incPtr :: MachineState () incPtr = addPtr 1 execInst :: Instruction -> MachineState () execInst (Set reg val) = do newVal <- getRegOrVal val putReg reg newVal incPtr execInst (Mul reg val) = do result <- modReg (*) reg val st <- get put $ st { dMulCount = 1 + dMulCount st } return result execInst (Sub reg val) = modReg (-) reg val execInst (Jnz val1 val2) = do test <- getRegOrVal val1 jump <- if test /= 0 then getRegOrVal val2 else return 1 addPtr jump execNext :: MachineState Result execNext = do st <- get let prog = dProgram st p = dPtr st if p >= length prog then return Halt else do execInst (prog V.! 
p) return Cont runUntilTerm :: MachineState () runUntilTerm = do result <- execNext unless (result == Halt) runUntilTerm This implements the actual calculation: the number of non-primes between (for my input) 107900 and 124900: optimisedCalc :: Int -> Int -> Int -> Int optimisedCalc a b k = sum $ map (const 1) $ filter notPrime [a,a+k..b] where notPrime n = elem 0 $ map (mod n) [2..(floor $ sqrt (fromIntegral n :: Double))] main :: IO () main = do input <- getContents case parseProgram input of Right prog -> do let c = defaultMachine { dProgram = prog } (_, c') = runState runUntilTerm c putStrLn $ show (dMulCount c') ++ " multiplications made" putStrLn $ "Calculation result: " ++ show (optimisedCalc 107900 124900 17) Left e -> print e Sporifica Virus — Rust — #adventofcode Day 22 Today’s challenge has us helping to clean up (or spread, I can’t really tell) an infection of the “sporifica” virus. → Full code on GitHub !!! commentary I thought I’d have another play with Rust, as its Haskell-like features resonate with me at the moment. I struggled quite a lot with the Rust concepts of ownership and borrowing, and this is a cleaned-up version of the code based on some good advice from the folks on /r/rust. use std::io; use std::env; use std::io::BufRead; use std::collections::HashMap; #[derive(PartialEq, Clone, Copy, Debug)] enum Direction {Up, Right, Down, Left} #[derive(PartialEq, Clone, Copy, Debug)] enum Infection {Clean, Weakened, Infected, Flagged} use self::Direction::*; use self::Infection::*; type Grid = HashMap<(isize, isize), Infection>; fn turn_left(d: Direction) -> Direction { match d {Up => Left, Right => Up, Down => Right, Left => Down} } fn turn_right(d: Direction) -> Direction { match d {Up => Right, Right => Down, Down => Left, Left => Up} } fn turn_around(d: Direction) -> Direction { match d {Up => Down, Right => Left, Down => Up, Left => Right} } fn make_move(d: Direction, x: isize, y: isize) -> (isize, isize) { match d { Up => (x-1, y), Right => (x, y+1), Down => (x+1, y), Left => (x, y-1), } } fn basic_step(grid: &mut Grid, x: &mut isize, y: &mut isize, d: &mut Direction) -> usize { let mut infect = 0; let current = match grid.get(&(*x, *y)) { Some(v) => *v, None => Clean, }; if current == Infected { *d = turn_right(*d); } else { *d = turn_left(*d); infect = 1; }; grid.insert((*x, *y), match current { Clean => Infected, Infected => Clean, x => panic!("Unexpected infection state {:?}", x), }); let new_pos = make_move(*d, *x, *y); *x = new_pos.0; *y = new_pos.1; infect } fn nasty_step(grid: &mut Grid, x: &mut isize, y: &mut isize, d: &mut Direction) -> usize { let mut infect = 0; let new_state: Infection; let current = match grid.get(&(*x, *y)) { Some(v) => *v, None => Infection::Clean, }; match current { Clean => { *d = turn_left(*d); new_state = Weakened; }, Weakened => { new_state = Infected; infect = 1; }, Infected => { *d = turn_right(*d); new_state = Flagged; }, Flagged => { *d = turn_around(*d); new_state = Clean; } }; grid.insert((*x, *y), new_state); let new_pos = make_move(*d, *x, *y); *x = new_pos.0; *y = new_pos.1; infect } fn virus_infect<F>(mut grid: Grid, mut step: F, mut x: isize, mut y: isize, mut d: Direction, n: usize) -> usize where F: FnMut(&mut Grid, &mut isize, &mut isize, &mut Direction) -> usize, { (0..n).map(|_| step(&mut grid, &mut x, &mut y, &mut d)) .sum() } fn main() { let args: Vec<String> = env::args().collect(); let n_basic: usize = args[1].parse().unwrap(); let n_nasty: usize = args[2].parse().unwrap(); let stdin = io::stdin(); let lines: 
Vec<String> = stdin.lock() .lines() .map(|x| x.unwrap()) .collect(); let mut grid: Grid = HashMap::new(); let x0 = (lines.len() / 2) as isize; let y0 = (lines[0].len() / 2) as isize; for (i, line) in lines.iter().enumerate() { for (j, c) in line.chars().enumerate() { grid.insert((i as isize, j as isize), match c {'#' => Infected, _ => Clean}); } } let basic_steps = virus_infect(grid.clone(), basic_step, x0, y0, Up, n_basic); println!("Basic: infected {} times", basic_steps); let nasty_steps = virus_infect(grid, nasty_step, x0, y0, Up, n_nasty); println!("Nasty: infected {} times", nasty_steps); } Fractal Art — Python — #adventofcode Day 21 Today’s challenge asks us to assist an artist building fractal patterns from a rulebook. → Full code on GitHub !!! commentary Another fairly straightforward algorithm: the really tricky part was breaking the pattern up into chunks and rejoining it again. I could probably have done that more efficiently, and would have needed to if I had to go for a few more iterations and the grid grows with every iteration and gets big fast. Still behind on the blog posts… import fileinput as fi from math import sqrt from functools import reduce, partial import operator INITIAL_PATTERN = ((0, 1, 0), (0, 0, 1), (1, 1, 1)) DECODE = ['.', '#'] ENCODE = {'.': 0, '#': 1} concat = partial(reduce, operator.concat) def rotate(p): size = len(p) return tuple(tuple(p[i][j] for i in range(size)) for j in range(size - 1, -1, -1)) def flip(p): return tuple(p[i] for i in range(len(p) - 1, -1, -1)) def permutations(p): yield p yield flip(p) for _ in range(3): p = rotate(p) yield p yield flip(p) def print_pattern(p): print('-' * len(p)) for row in p: print(' '.join(DECODE[x] for x in row)) print('-' * len(p)) def build_pattern(s): return tuple(tuple(ENCODE[c] for c in row) for row in s.split('/')) def build_pattern_book(lines): book = {} for line in lines: source, target = line.strip().split(' => ') for rotation in permutations(build_pattern(source)): book[rotation] = build_pattern(target) return book def subdivide(pattern): size = 2 if len(pattern) % 2 == 0 else 3 n = len(pattern) // size return (tuple(tuple(pattern[i][j] for j in range(y * size, (y + 1) * size)) for i in range(x * size, (x + 1) * size)) for x in range(n) for y in range(n)) def rejoin(parts): n = int(sqrt(len(parts))) size = len(parts[0]) return tuple(concat(parts[i + k][j] for i in range(n)) for k in range(0, len(parts), n) for j in range(size)) def enhance_once(p, book): return rejoin(tuple(book[part] for part in subdivide(p))) def enhance(p, book, n, progress=None): for _ in range(n): p = enhance_once(p, book) return p book = build_pattern_book(fi.input()) intermediate_pattern = enhance(INITIAL_PATTERN, book, 5) print("After 5 iterations:", sum(sum(row) for row in intermediate_pattern)) final_pattern = enhance(intermediate_pattern, book, 13) print("After 18 iterations:", sum(sum(row) for row in final_pattern)) Particle Swarm — Python — #adventofcode Day 20 Today’s challenge finds us simulating the movements of particles in space. → Full code on GitHub !!! commentary Back to Python for this one, another relatively straightforward simulation, although it’s easier to calculate the answer to part 1 than to simulate. import fileinput as fi import numpy as np import re First we parse the input into 3 2D arrays: using numpy enables us to do efficient arithmetic across the whole set of particles in one go. 
PARTICLE_RE = re.compile(r'p=<(-?\d+),(-?\d+),(-?\d+)>, ' r'v=<(-?\d+),(-?\d+),(-?\d+)>, ' r'a=<(-?\d+),(-?\d+),(-?\d+)>') def parse_input(lines): x = [] v = [] a = [] for l in lines: m = PARTICLE_RE.match(l) x.append([int(x) for x in m.group(1, 2, 3)]) v.append([int(x) for x in m.group(4, 5, 6)]) a.append([int(x) for x in m.group(7, 8, 9)]) return (np.arange(len(x)), np.array(x), np.array(v), np.array(a)) i, x, v, a = parse_input(fi.input()) Now we can calculate which particle will be closest to the origin in the long-term: this is simply the particle with the smallest acceleration. It turns out that several have the same acceleration, so of these, the one we want is the one with the lowest starting velocity. This is only complicated slightly by the need to get the number of the particle rather than its other information, hence the need to use numpy.argmin. a_abs = np.sum(np.abs(a), axis=1) a_min = np.min(a_abs) a_i = np.squeeze(np.argwhere(a_abs == a_min)) closest = i[a_i[np.argmin(np.sum(np.abs(v[a_i]), axis=1))]] print("Closest: ", closest) Now we define functions to simulate collisions between particles. We have to use the return_index and return_counts options to numpy.unique to be able to get rid of all the duplicate positions (the standard usage is to keep one of each duplicate). def resolve_collisions(x, v, a): (_, i, c) = np.unique(x, return_index=True, return_counts=True, axis=0) i = i[c == 1] return x[i], v[i], a[i] The termination criterion for this loop is an interesting aspect: the most robust to my mind seems to be that eventually the particles will end up sorted in order of their initial acceleration in terms of distance from the origin, so you could check for this but that’s pretty computationally expensive. In the end, all that was needed was a bit of trial and error: terminating arbitrarily after 1,000 iterations seems to work! In fact, all the collisions are over after about 40 iterations for my input but there was always the possibility that two particles with very slightly different accelerations would eventually intersect much later. def simulate_collisions(x, v, a, iterations=1000): for _ in range(iterations): v += a x += v x, v, a = resolve_collisions(x, v, a) return len(x) print("Remaining particles: ", simulate_collisions(x, v, a)) A Series of Tubes — Rust — #adventofcode Day 19 Today’s challenge asks us to help a network packet find its way. → Full code on GitHub !!! commentary Today’s challenge was fairly straightforward, following an ASCII art path, so I thought I’d give Rust another try. I’m a bit behind on the blog posts, so I’m presenting the code below without any further commentary. I’m not really convinced this is good idiomatic Rust, and it was interesting turning a set of strings into a 2D array of characters because there are both u8 (byte) and char types to deal with. use std::io; use std::io::BufRead; const ALPHA: &'static str = "ABCDEFGHIJKLMNOPQRSTUVWXYZ"; fn change_direction(dia: &Vec<Vec<u8>>, x: usize, y: usize, dx: &mut i32, dy: &mut i32) { assert_eq!(dia[x][y], b'+'); if dx.abs() == 1 { *dx = 0; if y + 1 < dia[x].len() && (dia[x][y + 1] == b'-' || ALPHA.contains(dia[x][y + 1] as char)) { *dy = 1; } else if dia[x][y - 1] == b'-' || ALPHA.contains(dia[x][y - 1] as char) { *dy = -1; } else { panic!("Huh? 
{} {}", dia[x][y+1] as char, dia[x][y-1] as char); } } else { *dy = 0; if x + 1 < dia.len() && (dia[x + 1][y] == b'|' || ALPHA.contains(dia[x + 1][y] as char)) { *dx = 1; } else if dia[x - 1][y] == b'|' || ALPHA.contains(dia[x - 1][y] as char) { *dx = -1; } else { panic!("Huh?"); } } } fn follow_route(dia: Vec<Vec<u8>>) -> (String, i32) { let mut x: i32 = 0; let mut y: i32; let mut dx: i32 = 1; let mut dy: i32 = 0; let mut result = String::new(); let mut steps = 1; match dia[0].iter().position(|x| *x == b'|') { Some(i) => y = i as i32, None => panic!("Could not find '|' in first row"), } loop { x += dx; y += dy; match dia[x as usize][y as usize] { b'A'...b'Z' => result.push(dia[x as usize][y as usize] as char), b'+' => change_direction(&dia, x as usize, y as usize, &mut dx, &mut dy), b' ' => return (result, steps), _ => (), } steps += 1; } } fn main() { let stdin = io::stdin(); let lines: Vec<Vec<u8>> = stdin.lock().lines() .map(|l| l.unwrap().into_bytes()) .collect(); let result = follow_route(lines); println!("Route: {}", result.0); println!("Steps: {}", result.1); } Duet — Haskell — #adventofcode Day 18 Today’s challenge introduces a type of simplified assembly language that includes instructions for message-passing. First we have to simulate a single program (after humorously misinterpreting the snd and rcv instructions as “sound” and “recover”), but then we have to simulate two concurrent processes and the message passing between them. → Full code on GitHub !!! commentary Well, I really learned a lot from this one! I wanted to get to grips with more complex stuff in Haskell and this challenge seemed like an excellent opportunity to figure out a) parsing with the parsec library and b) using the State monad to keep the state of the simulator. As it turned out, that wasn't all I'd learned: I also ran into an interesting situation whereby lazy evaluation was creating an infinite loop where there shouldn't be one, so I also had to learn how to selectively force strict evaluation of values. I'm pretty sure this isn't the best Haskell in the world, but I'm proud of it. First we have to import a bunch of stuff to use later, but also notice the pragma on the first line which instructs the compiler to enable the BangPatterns language extension, which will be important later. {-# LANGUAGE BangPatterns #-} module Main where import qualified Data.Vector as V import qualified Data.Map.Strict as M import Data.List import Data.Either import Data.Maybe import Control.Monad.State.Strict import Control.Monad.Loops import Text.ParserCombinators.Parsec hiding (State) First up we define the types that will represent the program code itself. data DuetVal = Reg Char | Val Int deriving Show type DuetQueue = [Int] data DuetInstruction = Snd DuetVal | Rcv DuetVal | Jgz DuetVal DuetVal | Set DuetVal DuetVal | Add DuetVal DuetVal | Mul DuetVal DuetVal | Mod DuetVal DuetVal deriving Show type DuetProgram = V.Vector DuetInstruction Next we define the types to hold the machine state, which includes: registers, instruction pointer, send & receive buffers and the program code, plus a counter of the number of sends made (to provide the solution). 
type DuetRegisters = M.Map Char Int data Duet = Duet { dRegisters :: DuetRegisters , dPtr :: Int , dSendCount :: Int , dRcvBuf :: DuetQueue , dSndBuf :: DuetQueue , dProgram :: DuetProgram } instance Show Duet where show d = show (dRegisters d) ++ " @" ++ show (dPtr d) ++ " S" ++ show (dSndBuf d) ++ " R" ++ show (dRcvBuf d) defaultDuet = Duet M.empty 0 0 [] [] V.empty type DuetState = State Duet program is a parser built on the cool parsec library to turn the program text into a Haskell format that we can work with, a Vector of instructions. Yes, using a full-blown parser is overkill here (it would be much simpler just to split each line on whitespace, but I wanted to see how Parsec works. I’m using Vector here because we need random access to the instruction list, which is much more efficient with Vector: O(1) compared with the O(n) of the built in Haskell list ([]) type. parseProgram applies the parser to a string and returns the result. program :: GenParser Char st DuetProgram program = do instructions <- endBy instruction eol return $ V.fromList instructions where instruction = try (oneArg "snd" Snd) <|> oneArg "rcv" Rcv <|> twoArg "set" Set <|> twoArg "add" Add <|> try (twoArg "mul" Mul) <|> twoArg "mod" Mod <|> twoArg "jgz" Jgz oneArg n c = do string n >> spaces val <- regOrVal return $ c val twoArg n c = do string n >> spaces val1 <- regOrVal spaces val2 <- regOrVal return $ c val1 val2 regOrVal = register <|> value register = do name <- lower return $ Reg name value = do val <- many $ oneOf "-0123456789" return $ Val $ read val eol = char '\n' parseProgram :: String -> Either ParseError DuetProgram parseProgram = parse program "" Next up we have some utility functions that sit in the DuetState monad we defined above and perform common manipulations on the state: getting/setting/updating registers, updating the instruction pointer and sending/receiving messages via the relevant queues. getReg :: Char -> DuetState Int getReg r = do st <- get return $ M.findWithDefault 0 r (dRegisters st) putReg :: Char -> Int -> DuetState () putReg r v = do st <- get let current = dRegisters st new = M.insert r v current put $ st { dRegisters = new } modReg :: (Int -> Int -> Int) -> Char -> DuetVal -> DuetState Bool modReg op r v = do u <- getReg r v' <- getRegOrVal v putReg r (u `op` v') incPtr return False getRegOrVal :: DuetVal -> DuetState Int getRegOrVal (Reg r) = getReg r getRegOrVal (Val v) = return v addPtr :: Int -> DuetState () addPtr n = do st <- get put $ st { dPtr = n + dPtr st } incPtr = addPtr 1 send :: Int -> DuetState () send v = do st <- get put $ st { dSndBuf = (dSndBuf st ++ [v]), dSendCount = dSendCount st + 1 } recv :: DuetState (Maybe Int) recv = do st <- get case dRcvBuf st of (x:xs) -> do put $ st { dRcvBuf = xs } return $ Just x [] -> return Nothing execInst implements the logic for each instruction. It returns False as long as the program can continue, but True if the program tries to receive from an empty buffer. 
execInst :: DuetInstruction -> DuetState Bool execInst (Set (Reg reg) val) = do newVal <- getRegOrVal val putReg reg newVal incPtr return False execInst (Mul (Reg reg) val) = modReg (*) reg val execInst (Add (Reg reg) val) = modReg (+) reg val execInst (Mod (Reg reg) val) = modReg mod reg val execInst (Jgz val1 val2) = do st <- get test <- getRegOrVal val1 jump <- if test > 0 then getRegOrVal val2 else return 1 addPtr jump return False execInst (Snd val) = do v <- getRegOrVal val send v st <- get incPtr return False execInst (Rcv (Reg r)) = do st <- get v <- recv handle v where handle :: Maybe Int -> DuetState Bool handle (Just x) = putReg r x >> incPtr >> return False handle Nothing = return True execInst x = error $ "execInst not implemented yet for " ++ show x execNext looks up the next instruction and executes it. runUntilWait runs the program until execNext returns True to signal the wait state has been reached. execNext :: DuetState Bool execNext = do st <- get let prog = dProgram st p = dPtr st if p >= length prog then return True else execInst (prog V.! p) runUntilWait :: DuetState () runUntilWait = do waiting <- execNext unless waiting runUntilWait runTwoPrograms handles the concurrent running of two programs, by running first one and then the other to a wait state, then swapping each program’s send buffer to the other’s receive buffer before repeating. If you look carefully, you’ll see a “bang” (!) before the two arguments of the function: runTwoPrograms !d0 !d1. Haskell is a lazy language and usually doesn’t evaluate a computation until you ask for a result, instead carrying around a “thunk” or plan for how to carry out the computation. Sometimes that can be a problem because the amount of memory your program is using can explode unnecessarily as a long computation turns into a large thunk which isn’t evaluated until the very end. That’s not the problem here though. What happens here without the bangs is another side-effect of laziness. The exit condition of this recursive function is that a deadlock has been reached: both programs are waiting to receive, but neither has sent anything, so neither can ever continue. The check for this is (null $ dSndBuf d0') && (null $ dSndBuf d1'). As long as the first program has something in its send buffer, the test fails without ever evaluating the second part, which means the result d1' of running the second program is never needed. The function immediately goes to the recursive case and tries to continue the first program again, which immediately returns because it’s still waiting to receive. The same thing happens again, and the result is that instead of running the second program to obtain something for the first to receive, we get into an infinite loop trying and failing to continue the first program. The bang forces both d0 and d1 to be evaluated at the point we recurse, which forces the rest of the computation: running the second program and swapping the send/receive buffers. With that, the evaluation proceeds correctly and we terminate with a result instead of getting into an infinite loop! 
runTwoPrograms :: Duet -> Duet -> (Int, Int) runTwoPrograms !d0 !d1 | (null $ dSndBuf d0') && (null $ dSndBuf d1') = (dSendCount d0', dSendCount d1') | otherwise = runTwoPrograms d0'' d1'' where (_, d0') = runState runUntilWait d0 (_, d1') = runState runUntilWait d1 d0'' = d0' { dSndBuf = [], dRcvBuf = dSndBuf d1' } d1'' = d1' { dSndBuf = [], dRcvBuf = dSndBuf d0' } All that remains to be done now is to run the programs and see how many messages were sent before the deadlock. main = do prog <- fmap (fromRight V.empty . parseProgram) getContents let d0 = defaultDuet { dProgram = prog, dRegisters = M.fromList [('p', 0)] } d1 = defaultDuet { dProgram = prog, dRegisters = M.fromList [('p', 1)] } (send0, send1) = runTwoPrograms d0 d1 putStrLn $ "Program 0 sent " ++ show send0 ++ " messages" putStrLn $ "Program 1 sent " ++ show send1 ++ " messages" Spinlock — Rust/Python — #adventofcode Day 17 In today’s challenge we deal with a monstrous whirlwind of a program, eating up CPU and memory in equal measure. → Full code on GitHub (and Python driver script) !!! commentary One of the things I wanted from AoC was an opportunity to try out some popular languages that I don’t currently know, including the memory-safe, strongly-typed compiled languages Go and Rust. Realistically though, I’m likely to continue doing most of my programming in Python, and use one of these other languages when it has better tools or I need the extra speed. In which case, what I really want to know is how I can call functions written in Go or Rust from Python. I thought I'd try Rust first, as it seems to be designed to be C-compatible and that makes it easy to call from Python using [`ctypes`](https://docs.python.org/3.6/library/ctypes.html). Part 1 was another straightforward simulation: translate what the "spinlock" monster is doing into code and run it. It was pretty obvious from the story of this challenge and experience of the last few days that this was going to be another one where the simulation is too computationally expensive for part two, which turns out to be correct. So, first thing to do is to implement the meat of the solution in Rust. spinlock solves the first part of the problem by doing exactly what the monster does. Since we only have to go up to 2017 iterations, this is very tractable. The last number we insert is 2017, so we just return the number immediately after that. #[no_mangle] pub extern fn spinlock(n: usize, skip: usize) -> i32 { let mut buffer: Vec<i32> = Vec::with_capacity(n+1); buffer.push(0); buffer.push(1); let mut pos = 1; for i in 2..n+1 { pos = (pos + skip + 1) % buffer.len(); buffer.insert(pos, i as i32); } pos = (pos + 1) % buffer.len(); return buffer[pos]; } For the second part, we have to do 50 million iterations instead, which is a lot. Given that every time you insert an item in the list it has to move up all the elements after that position, I’m pretty sure the algorithm is O(n^2), so it’s going to take a lot longer than 10,000ish times the first part. Thankfully, we don’t need to build the whole list, just keep track of where 0 is and what number is immediately after it. There may be a closed-form solution to simply calculate the result, but I couldn’t think of it and this is good enough. 
#[no_mangle] pub extern fn spinlock0(n: usize, skip: usize) -> i32 { let mut pos = 1; let mut pos_0 = 0; let mut after_0 = 1; for i in 2..n+1 { pos = (pos + skip + 1) % i; if pos == pos_0 + 1 { after_0 = i; } if pos <= pos_0 { pos_0 += 1; } } return after_0 as i32; } Now it’s time to call this code from Python. Notice the #[no_mangle] pragmas and pub extern declarations for each function above, which are required to make sure the functions are exported in a C-compatible way. We can build this into a shared library like this: rustc --crate-type=cdylib -o spinlock.so 17-spinlock.rs The Python script is as simple as loading this library, reading the puzzle input from the command line and calling the functions. The ctypes module does a lot of magic so that we don’t have to worry about converting from Python types to native types and back again. import ctypes import sys lib = ctypes.cdll.LoadLibrary("./spinlock.so") skip = int(sys.argv[1]) print("Part 1:", lib.spinlock(2017, skip)) print("Part 2:", lib.spinlock0(50_000_000, skip)) This is a toy example as far as calling Rust from Python is concerned, but it’s worth noting that already we can play with the parameters to the two Rust functions without having to recompile. For more serious work, I’d probably be looking at something like PyO3 to make a proper Python module. Looks like there’s also a very early Rust numpy integration for numerical stuff. You can also do the same thing from Julia, which has a ccall function built in: ccall((:spinlock, "./spinlock.so"), Int32, (UInt64, UInt64), 2017, 377) My next thing to try might be Haskell → Python though… Permutation Promenade — Julia — #adventofcode Day 16 Today’s challenge rather appeals to me as a folk dancer, because it describes a set of instructions for a dance and asks us to work out the positions of the dancing programs after each run through the dance. → Full code on GitHub !!! commentary So, part 1 is pretty straightforward: parse the set of instructions, interpret them and keep track of the dancer positions as you go. One time through the dance. However, part 2 asks for the positions after 1 billion (yes, that’s 1,000,000,000) times through the dance. In hindsight I should have immediately become suspicious, but I thought I’d at least try the brute force approach first because it was simpler to code. So I give it a try, and after waiting for a while, having a cup of tea etc. it still hasn't terminated. I try reducing the number of iterations to 1,000. Now it terminates, but takes about 6 seconds. A spot of arithmetic suggests that running the full version will take a little over 190 years. There must be a better way than that! I'm a little embarrassed that I didn't spot the solution immediately (blaming Julia) and tried again in Python to see if I could get it to terminate quicker. When that didn't work I had to think again. A little further investigation with a while loop shows that in fact the dance position repeats (in the case of my input) every 48 times. After that it becomes much quicker! Oh, and it was time for a new language, so I wasted some extra time working out the quirks of [Julia][]. First, a function to evaluate a single move — for neatness, this dispatches to a dedicated function depending on the type of move, although this isn’t really necessary to solve the challenge. Ending a function name with a bang (!) is a Julia convention to indicate that it has side-effects.
function eval_move!(move, dancers) move_type = move[1] params = move[2:end] if move_type == 's' # spin eval_spin!(params, dancers) elseif move_type == 'x' # exchange eval_exchange!(params, dancers) elseif move_type == 'p' # partner swap eval_partner!(params, dancers) end end These take care of the individual moves. Parsing the parameters from a string every single time probably isn’t ideal, but as it turns out, that optimisation isn’t really necessary. Note the + 1 in eval_exchange!, which is necessary because Julia is one of those crazy languages where indexes start from 1 instead of 0. These actions are pretty nice to implement, because Julia has circshift as a builtin to rotate a list, and allows you to assign to list slices and swap values in place with a single statement. function eval_spin!(params, dancers) shift = parse(Int, params) dancers[1:end] = circshift(dancers, shift) end function eval_exchange!(params, dancers) i, j = map(x -> parse(Int, x) + 1, split(params, "/")) dancers[i], dancers[j] = dancers[j], dancers[i] end function eval_partner!(params, dancers) a, b = split(params, "/") ia = findfirst([x == a for x in dancers]) ib = findfirst([x == b for x in dancers]) dancers[ia], dancers[ib] = b, a end dance! takes a list of moves and takes the dancers once through the dance. function dance!(moves, dancers) for m in moves eval_move!(m, dancers) end end To solve part 1, we simply need to read the moves in, set up the initial positions of the dancers and run the dance through once. join is necessary to a) turn characters into length-1 strings, and b) convert the list of strings back into a single string to print out. moves = split(readchomp(STDIN), ",") dancers = collect(join(c) for c in 'a':'p') orig_dancers = copy(dancers) dance!(moves, dancers) println(join(dancers)) Part 2 requires a little more work. We run the dance through again and again until we get back to the initial position, saving the intermediate positions in a list. The list now contains every possible position available from that starting point, so we can find position 1 billion by taking 1,000,000,000 modulo the list length (plus 1 because 1-based indexing) and use that to index into the list to get the final position. dance_cycle = [orig_dancers] while dancers != orig_dancers push!(dance_cycle, copy(dancers)) dance!(moves, dancers) end println(join(dance_cycle[1_000_000_000 % length(dance_cycle) + 1])) This terminates on my laptop in about 1.6s: Brute force 0; Careful thought 1! Dueling Generators — Rust — #adventofcode Day 15 Today’s challenge introduces two pseudo-random number generators which are trying to agree on a series of numbers. We play the part of the “judge”, counting the number of times their numbers agree in the lowest 16 bits. → Full code on GitHub Ever since I used Go to solve day 3, I’ve had a hankering to try the other new kid on the memory-safe compiled language block, Rust. I found it a bit intimidating at first because the syntax wasn’t as close to the C/C++ I’m familiar with and there are quite a few concepts unique to Rust, like the use of traits. But I figured it out, so I can tick another language off my to-try list. I also implemented a version in Python for comparison: the Python version is more concise and easier to read but the Rust version runs about 10× faster. First we include the std::env “crate” which will let us get access to commandline arguments, and define some useful constants for later.
use std::env; const M: i64 = 2147483647; const MASK: i64 = 0b1111111111111111; const FACTOR_A: i64 = 16807; const FACTOR_B: i64 = 48271; gen_next generates the next number for a given generator’s sequence. gen_next_picky does the same, but for the “picky” generators, only returning values that meet their criteria. fn gen_next(factor: i64, current: i64) -> i64 { return (current * factor) % M; } fn gen_next_picky(factor: i64, current: i64, mult: i64) -> i64 { let mut next = gen_next(factor, current); while next % mult != 0 { next = gen_next(factor, next); } return next; } duel runs a single duel, and returns the number of times the generators agreed in the lowest 16 bits (found by doing a binary & with the mask defined above). Rust allows functions to be passed as parameters, so we use this to be able to run both versions of the duel using only this one function. fn duel<F, G>(n: i64, next_a: F, mut value_a: i64, next_b: G, mut value_b: i64) -> i64 where F: Fn(i64) -> i64, G: Fn(i64) -> i64, { let mut count = 0; for _ in 0..n { value_a = next_a(value_a); value_b = next_b(value_b); if (value_a & MASK) == (value_b & MASK) { count += 1; } } return count; } Finally, we read the start values from the command line and run the two duels. The expressions that begin |n| are closures (anonymous functions, often called lambdas in other languages) that we use to specify the generator functions for each duel. fn main() { let args: Vec<String> = env::args().collect(); let start_a: i64 = args[1].parse().unwrap(); let start_b: i64 = args[2].parse().unwrap(); println!( "Duel 1: {}", duel( 40000000, |n| gen_next(FACTOR_A, n), start_a, |n| gen_next(FACTOR_B, n), start_b, ) ); println!( "Duel 2: {}", duel( 5000000, |n| gen_next_picky(FACTOR_A, n, 4), start_a, |n| gen_next_picky(FACTOR_B, n, 8), start_b, ) ); } Disk Defragmentation — Haskell — #adventofcode Day 14 Today’s challenge has us helping a disk defragmentation program by identifying contiguous regions of used sectors on a 2D disk. → Full code on GitHub !!! commentary Wow, today’s challenge had a pretty steep learning curve. Day 14 was the first to directly reuse code from a previous day: the “knot hash” from day 10. I solved day 10 in Haskell, so I thought it would be easier to stick with Haskell for today as well. The first part was straightforward, but the second was pretty mind-bending in a pure functional language! I ended up solving it by implementing a [flood fill algorithm][flood]. It's recursive, which is right in Haskell's wheelhouse, but I ended up using `Data.Sequence` instead of the standard list type as its API for indexing is better. I haven't tried it, but I think it will also be a little faster than a naive list-based version. It took a looong time to figure everything out, but I had a day off work to be able to concentrate on it! A lot more imports for this solution, as we’re exercising a lot more of the standard library. module Main where import Prelude hiding (length, filter, take) import Data.Char (ord) import Data.Sequence import Data.Foldable hiding (length) import Data.Ix (inRange) import Data.Function ((&)) import Data.Maybe (fromJust, mapMaybe, isJust) import qualified Data.Set as Set import Text.Printf (printf) import System.Environment (getArgs) Also we’ll extract the key bits from day 10 into a module and import that. import KnotHash Now we define a few data types to make the code a bit more readable. 
Sector represents the state of a particular disk sector, either free, used (but unmarked) or used and marked as belonging to a given integer-labelled group. Grid is a 2D matrix of Sector, as a sequence of sequences. data Sector = Free | Used | Mark Int deriving (Eq) instance Show Sector where show Free = " ." show Used = " #" show (Mark i) = printf "%4d" i type GridRow = Seq Sector type Grid = Seq (GridRow) Some utility functions to make it easier to view the grids (which can be quite large): used for debugging but not in the finished solution. subGrid :: Int -> Grid -> Grid subGrid n = fmap (take n) . take n printRow :: GridRow -> IO () printRow row = do mapM_ (putStr . show) row putStr "\n" printGrid :: Grid -> IO () printGrid = mapM_ printRow makeKey generates the hash key for a given row. makeKey :: String -> Int -> String makeKey input n = input ++ "-" ++ show n stringToGridRow converts a binary string of ‘1’ and ‘0’ characters to a sequence of Sector values. stringToGridRow :: String -> GridRow stringToGridRow = fromList . map convert where convert x | x == '1' = Used | x == '0' = Free makeRow and makeGrid build up the grid to use based on the provided input string. makeRow :: String -> Int -> GridRow makeRow input n = stringToGridRow $ concatMap (printf "%08b") $ dense $ fullKnotHash 256 $ map ord $ makeKey input n makeGrid :: String -> Grid makeGrid input = fromList $ map (makeRow input) [0..127] Utility functions to count the number of used and free sectors, to give the solution to part 1. countEqual :: Sector -> Grid -> Int countEqual x = sum . fmap (length . filter (==x)) countUsed = countEqual Used countFree = countEqual Free Now the real meat begins! findUnmarked finds the location of the next used sector that we haven’t yet marked. It returns a Maybe value, which is Just (x, y) if there is still an unmarked block or Nothing if there’s nothing left to mark. findUnmarked :: Grid -> Maybe (Int, Int) findUnmarked g | y == Nothing = Nothing | otherwise = Just (fromJust x, fromJust y) where hasUnmarked row = isJust $ elemIndexL Used row x = findIndexL hasUnmarked g y = case x of Nothing -> Nothing Just x' -> elemIndexL Used $ index g x' floodFill implements a very simple recursive flood fill. It takes a target and replacement value and a starting location, and fills in the replacement value for every connected location that currently has the target value. We use it below to replace a connected used region with a marked region. floodFill :: Sector -> Sector -> (Int, Int) -> Grid -> Grid floodFill t r (x, y) g | inRange (0, length g - 1) x && inRange (0, length g - 1) y && elem == t = let newRow = update y r row newGrid = update x newRow g in newGrid & floodFill t r (x+1, y) & floodFill t r (x-1, y) & floodFill t r (x, y+1) & floodFill t r (x, y-1) | otherwise = g where row = g `index` x elem = row `index` y markNextGroup looks for an unmarked group and marks it if found. If no more groups are found it returns Nothing. markAllGroups then repeatedly applies markNextGroup until Nothing is returned. markNextGroup :: Int -> Grid -> Maybe Grid markNextGroup i g = case findUnmarked g of Nothing -> Nothing Just loc -> Just $ floodFill Used (Mark i) loc g markAllGroups :: Grid -> Grid markAllGroups g = markAllGroups' 1 g where markAllGroups' i g = case markNextGroup i g of Nothing -> g Just g' -> markAllGroups' (i+1) g' onlyMarks filters a grid row and returns a list of (possibly duplicated) group numbers in the row. onlyMarks :: GridRow -> [Int] onlyMarks = mapMaybe getMark .
toList where getMark Free = Nothing getMark Used = Nothing getMark (Mark i) = Just i Finally, countGroups puts all the group numbers into a set to get rid of duplicates and returns the size of the set, i.e. the total number of separate groups. countGroups :: Grid -> Int countGroups g = Set.size groupSet where groupSet = foldl' Set.union Set.empty $ fmap rowToSet g rowToSet = Set.fromList . toList . onlyMarks As always, every Haskell program needs a main function to drive the I/O and produce the actual result. main = do input <- fmap head getArgs let grid = makeGrid input used = countUsed grid marked = countGroups $ markAllGroups grid putStrLn $ "Used sectors: " ++ show used putStrLn $ "Groups: " ++ show marked Packet Scanners — Haskell — #adventofcode Day 13 Today’s challenge requires us to sneak past a firewall made up of a series of scanners. → Full code on GitHub !!! commentary I wasn’t really thinking straight when I solved this challenge. I got a solution without too much trouble, but I ended up simulating the step-by-step movement of the scanners. I finally realised that I could calculate whether or not a given scanner was safe at a given time directly with modular arithmetic, and it bugged me so much that I reimplemented the solution. Both are given below, the faster one first. First we introduce some standard library stuff and define some useful utilities. module Main where import qualified Data.Text as T import Data.Maybe (mapMaybe) strip :: String -> String strip = T.unpack . T.strip . T.pack splitOn :: String -> String -> [String] splitOn sep = map T.unpack . T.splitOn (T.pack sep) . T.pack parseScanner :: String -> (Int, Int) parseScanner s = (d, r) where [d, r] = map read $ splitOn ": " s traverseFW does all the hard work: it checks for each scanner whether or not it’s safe as we pass through, and returns a list of the severities of each time we’re caught. mapMaybe is like the standard map in many languages, but operates on a list of Haskell Maybe values, like a combined map and filter. If the value is Just x, x gets included in the returned list; if the value is Nothing, then it gets thrown away. traverseFW :: Int -> [(Int, Int)] -> [Int] traverseFW delay = mapMaybe caught where caught (d, r) = if (d + delay) `mod` (2*(r-1)) == 0 then Just (d * r) else Nothing Then the total severity of our passage through the firewall is simply the sum of each individual severity. severity :: [(Int, Int)] -> Int severity = sum . traverseFW 0 But we don’t want to know how badly we got caught, we want to know how long to wait before setting off to get through safely. findDelay tries traversing the firewall with increasing delay, and returns the delay for the first pass where we predict not getting caught. findDelay :: [(Int, Int)] -> Int findDelay scanners = head $ filter (null . flip traverseFW scanners) [0..] And finally, we put it all together and calculate and print the result. main = do scanners <- fmap (map parseScanner . 
lines) getContents putStrLn $ "Severity: " ++ (show $ severity scanners) putStrLn $ "Delay: " ++ (show $ findDelay scanners) I’m not generally bothered about performance for these challenges, but here I’ll note that my second attempt runs in a little under 2 seconds on my laptop: $ time ./13-packet-scanners-redux < 13-input.txt Severity: 1900 Delay: 3966414 ./13-packet-scanners-redux < 13-input.txt 1.73s user 0.02s system 99% cpu 1.754 total Compare that with the first, simulation-based one, which takes nearly a full minute: $ time ./13-packet-scanners < 13-input.txt Severity: 1900 Delay: 3966414 ./13-packet-scanners < 13-input.txt 57.63s user 0.27s system 100% cpu 57.902 total And for good measure, here’s the code. Notice the tick and tickOne functions, which together simulate moving all the scanners by one step; for this to work we have to track the full current state of each scanner, which is easier to read with a Haskell record-based custom data type. traverseFW is more complicated because it has to drive the simulation, but the rest of the code is mostly the same. module Main where import qualified Data.Text as T import Control.Monad (forM_) data Scanner = Scanner { depth :: Int , range :: Int , pos :: Int , dir :: Int } instance Show Scanner where show (Scanner d r p dir) = show d ++ "/" ++ show r ++ "/" ++ show p ++ "/" ++ show dir strip :: String -> String strip = T.unpack . T.strip . T.pack splitOn :: String -> String -> [String] splitOn sep str = map T.unpack $ T.splitOn (T.pack sep) $ T.pack str parseScanner :: String -> Scanner parseScanner s = Scanner d r 0 1 where [d, r] = map read $ splitOn ": " s tickOne :: Scanner -> Scanner tickOne (Scanner depth range pos dir) | pos <= 0 = Scanner depth range (pos+1) 1 | pos >= range - 1 = Scanner depth range (pos-1) (-1) | otherwise = Scanner depth range (pos+dir) dir tick :: [Scanner] -> [Scanner] tick = map tickOne traverseFW :: [Scanner] -> [(Int, Int)] traverseFW = traverseFW' 0 where traverseFW' _ [] = [] traverseFW' layer scanners@((Scanner depth range pos _):rest) -- | layer == depth && pos == 0 = (depth*range) + (traverseFW' (layer+1) $ tick rest) | layer == depth && pos == 0 = (depth,range) : (traverseFW' (layer+1) $ tick rest) | layer == depth && pos /= 0 = traverseFW' (layer+1) $ tick rest | otherwise = traverseFW' (layer+1) $ tick scanners severity :: [Scanner] -> Int severity = sum . map (uncurry (*)) . traverseFW empty :: [a] -> Bool empty [] = True empty _ = False findDelay :: [Scanner] -> Int findDelay scanners = delay where (delay, _) = head $ filter (empty . traverseFW . snd) $ zip [0..] $ iterate tick scanners main = do scanners <- fmap (map parseScanner . lines) getContents putStrLn $ "Severity: " ++ (show $ severity scanners) putStrLn $ "Delay: " ++ (show $ findDelay scanners) Digital Plumber — Python — #adventofcode Day 12 Today’s challenge has us helping a village of programs who are unable to communicate. We have a list of the the communication channels between their houses, and need to sort them out into groups such that we know that each program can communicate with others in its own group but not any others. Then we have to calculate the size of the group containing program 0 and the total number of groups. → Full code on GitHub !!! commentary This is one of those problems where I’m pretty sure that my algorithm isn’t close to being the most efficient, but it definitely works! For the sake of solving the challenge that’s all that matters, but it still bugs me. 
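As an aside, the textbook way to make this kind of grouping fast is a disjoint-set (union-find) structure, which merges groups in effectively constant time per connection. Here is a minimal Python sketch of that alternative (illustrative only, not the approach used below; it assumes the same `<->` input format that the solution parses):

```python
# Union-find sketch: read "a <-> b, c" lines and group connected programs.
import sys

parent = {}

def find(x):
    # Walk up to the root of x's group, compressing the path as we go.
    parent.setdefault(x, x)
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x

def union(a, b):
    # Link the root of a's group to the root of b's group.
    parent[find(a)] = find(b)

for line in sys.stdin:
    head, rest = line.split(' <-> ')
    for other in rest.split(', '):
        union(int(head), int(other))

roots = [find(p) for p in parent]
print("Number in group 0:", roots.count(find(0)))
print("Number of groups:", len(set(roots)))
```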
By now I’ve become used to using fileinput to transparently read data either from files given on the command-line or standard input if no arguments are given. import fileinput as fi First we make an initial pass through the input data, creating a group for each line representing the programs on that line (which can communicate with each other). We store this as a Python set. groups = [] for line in fi.input(): head, rest = line.split(' <-> ') group = set([int(head)]) group.update([int(x) for x in rest.split(', ')]) groups.append(group) Now we iterate through the groups, starting with the first, and merging any we find that overlap with our current group. i = 0 while i < len(groups): current = groups[i] Each pass through the groups brings more programs into the current group, so we have to go through and check their connections too. We make several merge passes, until we detect that no more merges took place. num_groups = len(groups) + 1 while num_groups > len(groups): j = i+1 num_groups = len(groups) This inner loop does the actual merging, and deletes each group as it’s merged in. while j < len(groups): if len(current & groups[j]) > 0: current.update(groups[j]) del groups[j] else: j += 1 i += 1 All that’s left to do now is to display the results. print("Number in group 0:", len([g for g in groups if 0 in g][0])) print("Number of groups:", len(groups)) Hex Ed — Python — #adventofcode Day 11 Today’s challenge is to help a program find its child process, which has become lost on a hexagonal grid. We need to follow the path taken by the child (given as input) and calculate the distance it is from home along with the furthest distance it has been at any point along the path. → Full code on GitHub !!! commentary I found this one quite interesting in that it was very quick to solve. In fact, I got lucky and my first quick implementation (max(abs(l)) below) gave the correct answer in spite of missing an obvious not-so-edge case. Thinking about it, there’s only a ⅓ chance that the first incorrect implementation would give the wrong answer! The code is shorter, so you get more words today. ☺ There are a number of different co-ordinate systems on a hexagonal grid (I discovered while reading up after solving it…). I intuitively went for the system known as ‘axial’ coordinates, where you pick two directions aligned to the grid as your x and y axes: note that these won’t be perpendicular. I chose ne/sw as the x axis and se/nw as y, but there are three other possible choices. That leads to the following definition for the directions, encoded as numpy arrays because that makes some of the code below neater. import numpy as np STEPS = {d: np.array(v) for d, v in [('ne', (1, 0)), ('se', (0, -1)), ('s', (-1, -1)), ('sw', (-1, 0)), ('nw', (0, 1)), ('n', (1, 1))]} hex_grid_dist, given a location l calculates the number of steps needed to reach that location from the centre at (0, 0). Notice that we can’t simply use the Manhattan distance here because, for example, one step north takes us to (1, 1), which would give a Manhattan distance of 2. 
Instead, we can see that moving in the n/s direction allows us to increment or decrement both coordinates at the same time: If the coordinates have the same sign: move n/s until one of them is zero, then move along the relevant ne or se axis back to the origin; in this case the number of steps is greatest of the absolute values of the two coordinates If the coordinates have opposite signs: move independently along the ne and se axes to reduce each to 0; this time the number of steps is the sum of the absolute values of the two coordinates def hex_grid_distance(l): if sum(np.sign(l)) == 0: # i.e. opposite signs return sum(abs(l)) else: return max(abs(l)) Now we can read in the path followed by the child and follow it ourselves, tracking the maximum distance from home along the way. path = input().strip().split(',') location = np.array((0, 0)) max_distance = 0 for step in map(STEPS.get, path): location += step max_distance = max(max_distance, hex_grid_distance(location)) distance = hex_grid_distance(location) print("Child process is at", location, "which is", distance, "steps away") print("Greatest distance was", max_distance) Knot Hash — Haskell — #adventofcode Day 10 Today’s challenge asks us to help a group of programs implement a (highly questionable) hashing algorithm that involves repeatedly reversing parts of a list of numbers. → Full code on GitHub !!! commentary I went with Haskell again today, because it’s the weekend so I have a bit more time, and I really enjoyed yesterday’s Haskell implementation. Today gave me the opportunity to explore the standard library a bit more, as well as lending itself nicely to being decomposed into smaller parts to be combined using higher-order functions. You know the drill by know: import stuff we’ll use later. module Main where import Data.Char (ord) import Data.Bits (xor) import Data.Function ((&)) import Data.List (unfoldr) import Text.Printf (printf) import qualified Data.Text as T The worked example uses a concept of the “current position” as a pointer to a location in a static list. In Haskell it makes more sense to instead use the front of the list as the current position, and rotate the whole list as we progress to bring the right element to the front. rotate :: Int -> [Int] -> [Int] rotate 0 xs = xs rotate n xs = drop n' xs ++ take n' xs where n' = n `mod` length xs The simple version of the hash requires working through the input list, modifying the working list as we go, and incrementing a “skip” counter with each step. Converting this to a functional style, we simply zip up the input with an infinite list [0, 1, 2, 3, ...] to give the counter values. Notice that we also have to calculate how far to rotate the working list to get back to its original position. foldl lets us specify a function that returns a modified version of the working list and feeds the input list in one at a time. simpleKnotHash :: Int -> [Int] -> [Int] simpleKnotHash size input = foldl step [0..size-1] input' & rotate (negate finalPos) where input' = zip input [0..] finalPos = sum $ zipWith (+) input [0..] reversePart xs n = (reverse $ take n xs) ++ drop n xs step xs (n, skip) = reversePart xs n & rotate (n+skip) The full version of the hash (part 2 of the challenge) starts the same way as the simple version, except making 64 passes instead of one: we can do this by using replicate to make a list of 64 copies, then collapse that into a single list with concat. 
fullKnotHash :: Int -> [Int] -> [Int] fullKnotHash size input = simpleKnotHash size input' where input' = concat $ replicate 64 input The next step in calculating the full hash collapses the full 256-element “sparse” hash down into 16 elements by XORing groups of 16 together. unfoldr is a nice efficient way of doing this. dense :: [Int] -> [Int] dense = unfoldr dense' where dense' [] = Nothing dense' xs = Just (foldl1 xor $ take 16 xs, drop 16 xs) The final hash step is to convert the list of integers into a hexadecimal string. hexify :: [Int] -> String hexify = concatMap (printf "%02x") These two utility functions put together building blocks from the Data.Text module to parse the input string. Note that no arguments are given: the functions are defined purely by composing other functions using the . operator. In Haskell this is referred to as “point-free” style. strip :: String -> String strip = T.unpack . T.strip . T.pack parseInput :: String -> [Int] parseInput = map (read . T.unpack) . T.splitOn (T.singleton ',') . T.pack Now we can put it all together, including building the weird input for the “full” hash. main = do input <- fmap strip getContents let simpleInput = parseInput input asciiInput = map ord input ++ [17, 31, 73, 47, 23] (a:b:_) = simpleKnotHash 256 simpleInput print $ (a*b) putStrLn $ fullKnotHash 256 asciiInput & dense & hexify Stream Processing — Haskell — #adventofcode Day 9 In today’s challenge we come across a stream that we need to cross. But of course, because we’re stuck inside a computer, it’s not water but data flowing past. The stream is too dangerous to cross until we’ve removed all the garbage, and to prove we can do that we have to calculate a score for the valid data “groups” and the number of garbage characters to remove. → Full code on GitHub !!! commentary One of my goals for this process was to knock the rust of my functional programming skills in Haskell, and I haven’t done that for the whole of the first week. Processing strings character by character and acting according to which character shows up seems like a good choice for pattern-matching though, so here we go. I also wanted to take a bash at test-driven development in Haskell, so I also loaded up the Test.Hspec module to give it a try. I did find keeping track of all the state in arguments a bit mind boggling, and I think it could have been improved through use of a data type using record syntax and the `State` monad, so that's something to look at for a future challenge. First import the extra bits we’ll need. module Main where import Test.Hspec import Data.Function ((&)) countGroups solves the first part of the problem, counting up the “score” of the valid data in the stream. countGroups' is an auxiliary function that holds some state in its arguments. We use pattern matching for the base case: [] represents the empty list in Haskell, which indicates we’ve finished the whole stream. Otherwise, we split the remaining stream into its first character and remainder, and use guards to decide how to interpret it. If skip is true, discard the character and carry on with skip set back to false. If we find a “!”, that tells us to skip the next. Other characters mark groups or sets of garbage: groups increase the score when they close and garbage is discarded. We continue to progress the list by recursing with the remainder of the stream and any updated state. 
countGroups :: String -> Int countGroups = countGroups' 0 0 False False where countGroups' score _ _ _ [] = score countGroups' score level garbage skip (c:rest) | skip = countGroups' score level garbage False rest | c == '!' = countGroups' score level garbage True rest | garbage = case c of '>' -> countGroups' score level False False rest _ -> countGroups' score level True False rest | otherwise = case c of '{' -> countGroups' score (level+1) False False rest '}' -> countGroups' (score+level) (level-1) False False rest ',' -> countGroups' score level False False rest '<' -> countGroups' score level True False rest c -> error $ "Garbage character found outside garbage: " ++ show c countGarbage works almost identically to countGroups, except it ignores groups and counts garbage. They are structured so similarly that it would probably make more sense to combine them to a single function that returns both counts. countGarbage :: String -> Int countGarbage = countGarbage' 0 False False where countGarbage' count _ _ [] = count countGarbage' count garbage skip (c:rest) | skip = countGarbage' count garbage False rest | c == '!' = countGarbage' count garbage True rest | garbage = case c of '>' -> countGarbage' count False False rest _ -> countGarbage' (count+1) True False rest | otherwise = case c of '<' -> countGarbage' count True False rest _ -> countGarbage' count False False rest Hspec gives us a domain-specific language heavily inspired by the rspec library for Ruby: the tests read almost like natural language. I built up these tests one-by-one, gradually implementing the appropriate bits of the functions above, a process known as Test-driven development. runTests = hspec $ do describe "countGroups" $ do it "counts valid groups" $ do countGroups "{}" `shouldBe` 1 countGroups "{{{}}}" `shouldBe` 6 countGroups "{{{},{},{{}}}}" `shouldBe` 16 countGroups "{{},{}}" `shouldBe` 5 it "ignores garbage" $ do countGroups "{<a>,<a>,<a>,<a>}" `shouldBe` 1 countGroups "{{<ab>},{<ab>},{<ab>},{<ab>}}" `shouldBe` 9 it "skips marked characters" $ do countGroups "{{<!!>},{<!!>},{<!!>},{<!!>}}" `shouldBe` 9 countGroups "{{<a!>},{<a!>},{<a!>},{<ab>}}" `shouldBe` 3 describe "countGarbage" $ do it "counts garbage characters" $ do countGarbage "<>" `shouldBe` 0 countGarbage "<random characters>" `shouldBe` 17 countGarbage "<<<<>" `shouldBe` 3 it "ignores non-garbage" $ do countGarbage "{{},{}}" `shouldBe` 0 countGarbage "{{<ab>},{<ab>},{<ab>},{<ab>}}" `shouldBe` 8 it "skips marked characters" $ do countGarbage "<{!>}>" `shouldBe` 2 countGarbage "<!!>" `shouldBe` 0 countGarbage "<!!!>" `shouldBe` 0 countGarbage "<{o\"i!a,<{i<a>" `shouldBe` 10 Finally, the main function reads in the challenge input and calculates the answers, printing them on standard output. main = do runTests repeat '=' & take 78 & putStrLn input <- getContents & fmap (filter (/='\n')) putStrLn $ "Found " ++ show (countGroups input) ++ " groups" putStrLn $ "Found " ++ show (countGarbage input) ++ " characters garbage" I Heard You Like Registers — Python — #adventofcode Day 8 Today’s challenge describes a simple instruction set for a CPU, incrementing and decrementing values in registers according to simple conditions. We have to interpret a stream of these instructions, and to prove that we’ve done so, give the highest value of any register, both at the end of the program and throughout the whole program. → Full code on GitHub !!! 
commentary This turned out to be a nice straightforward one to implement, as the instruction format was easily parsed by regular expression, and Python provides the eval function which made evaluating the conditions a doddle. Import various standard library bits that we’ll use later. import re import fileinput as fi from math import inf from collections import defaultdict We could just parse the instructions by splitting the string, but using a regular expression is a little bit more robust because it won’t match at all if given an invalid instruction. INSTRUCTION_RE = re.compile(r'(\w+) (inc|dec) (-?\d+) if (.+)\s*') def parse_instruction(instruction): match = INSTRUCTION_RE.match(instruction) return match.group(1, 2, 3, 4) Executing an instruction simply checks the condition and if it evaluates to True updates the relevant register. def exec_instruction(registers, instruction): name, op, value, cond = instruction value = int(value) if op == 'dec': value = -value if eval(cond, globals(), registers): registers[name] += value highest_value returns the maximum value found in any register. def highest_value(registers): return sorted(registers.items(), key=lambda x: x[1], reverse=True)[0][1] Finally, loop through all the instructions and carry them out, updating global_max as we go. We need to be able to deal with registers that haven’t been accessed before. Keeping the registers in a dictionary means that we can evaluate the conditions directly using eval above, passing it as the locals argument. The standard dict will raise an exception if we try to access a key that doesn’t exist, so instead we use collections.defaultdict, which allows us to specify what the default value for a non-existent key will be. New registers start at 0, so we use a simple lambda to define a function that always returns 0. global_max = -inf registers = defaultdict(lambda: 0) for i in map(parse_instruction, fi.input()): exec_instruction(registers, i) global_max = max(global_max, highest_value(registers)) print('Max value:', highest_value(registers)) print('All-time max:', global_max) Recursive Circus — Ruby — #adventofcode Day 7 Today’s challenge introduces a set of processes balancing precariously on top of each other. We find them stuck and unable to get down because one of the processes is the wrong size, unbalancing the whole circus. Our job is to figure out the root from the input and then find the correct weight for the single incorrect process. → Full code on GitHub !!! commentary So I didn’t really intend to take a full polyglot approach to Advent of Code, but it turns out to have been quite fun, so I made a shortlist of languages to try. Building a tree is a classic application for object-orientation using a class to represent tree nodes, and I’ve always liked the feel of Ruby’s class syntax, so I gave it a go. First make sure we have access to Set, which we’ll use later. require 'set' Now to define the CircusNode class, which represents nodes in the tree. attr :s automatically creates a function s that returns the value of the instance attribute @s class CircusNode attr :name, :weight def initialize(name, weight, children=nil) @name = name @weight = weight @children = children || [] end Add a << operator (the same syntax for adding items to a list) that adds a child to this node. def <<(c) @children << c @total_weight = nil end total_weight recursively calculates the weight of this node and everything above it. The @total_weight ||= blah idiom caches the value so we only calculate it once. 
def total_weight @total_weight ||= @weight + @children.map {|c| c.total_weight}.sum end balance_weight does the hard work of figuring out the proper weight for the incorrect node by recursively searching through the tree. def balance_weight(target=nil) by_weight = Hash.new{|h, k| h[k] = []} @children.each{|c| by_weight[c.total_weight] << c} if by_weight.size == 1 then if target return @weight - (total_weight - target) else raise ArgumentError, 'This tree seems balanced!' end else odd_one_out = by_weight.select {|k, v| v.length == 1}.first[1][0] child_target = by_weight.select {|k, v| v.length > 1}.first[0] return odd_one_out.balance_weight child_target end end A couple of utility functions for displaying trees finish off the class. def to_s "#{@name} (#{@weight})" end def print_tree(n=0) puts "#{' '*n}#{self} -> #{self.total_weight}" @children.each do |child| child.print_tree n+1 end end end build_circus takes input as a list of lists [name, weight, children]. We make two passes over this list, first creating all the nodes, then building the tree by adding children to parents. def build_circus(data) all_nodes = {} all_children = Set.new data.each do |name, weight, children| all_nodes[name] = CircusNode.new name, weight end data.each do |name, weight, children| children.each {|child| all_nodes[name] << all_nodes[child]} all_children.merge children end root_name = (all_nodes.keys.to_set - all_children).first return all_nodes[root_name] end Finally, build the tree and solve the problem! Note that we use String.to_sym to convert the node names to symbols (written in Ruby as :symbol), because they’re faster to work with in Hashes and Sets as we do above. data = readlines.map do |line| match = /(?<parent>\w+) \((?<weight>\d+)\)(?: -> (?<children>.*))?/.match line [match['parent'].to_sym, match['weight'].to_i, match['children'] ? match['children'].split(', ').map {|x| x.to_sym} : []] end root = build_circus data puts "Root node: #{root}" puts root.balance_weight Memory Reallocation — Python — #adventofcode Day 6 Today’s challenge asks us to follow a recipe for redistributing objects in memory that bears a striking resemblance to the rules of the African game Mancala. → Full code on GitHub !!! commentary When I was doing my MSci, one of our programming exercises was to write (in Haskell, IIRC) a program to play a Mancala variant called Oware, so this had a nice ring of nostalgia. Back to Python today: it's already become clear that it's by far my most fluent language, which makes sense as it's the only one I've used consistently since my schooldays. I'm a bit behind on the blog posts, so you get this one without any explanation, for now at least! import math def reallocate(mem): max_val = -math.inf size = len(mem) for i, x in enumerate(mem): if x > max_val: max_val = x max_index = i i = max_index mem[i] = 0 remaining = max_val while remaining > 0: i = (i + 1) % size mem[i] += 1 remaining -= 1 return mem def detect_cycle(mem): mem = list(mem) steps = 0 prev_states = {} while tuple(mem) not in prev_states: prev_states[tuple(mem)] = steps steps += 1 mem = reallocate(mem) return (steps, steps - prev_states[tuple(mem)]) initial_state = map(int, input().split()) print("Initial state is ", initial_state) steps, cycle = detect_cycle(initial_state) print("Steps to cycle: ", steps) print("Steps in cycle: ", cycle) A Maze of Twisty Trampolines — C++ — #adventofcode Day 5 Today’s challenge has us attempting to help the CPU escape from a maze of instructions. 
It’s not quite a Turing Machine, but it has that feeling of moving a read/write head up and down a tape acting on and changing the data found there. → Full code on GitHub !!! commentary I haven’t written anything in C++ for over a decade. It sounds like there have been lots of interesting developments in the language since then, with C++11, C++14 and the freshly finalised C++17 standards (built-in parallelism in the STL!). I won’t use any of those, but I thought I’d dust off my C++ and see what happened. Thankfully the Standard Template Library classes still did what I expected! As usual, we first include the parts of the standard library we’re going to use: iostream for input & output; vector for the container. We also declare that we’re using the std namespace, so that we don’t have to prepend vector and the other classes with std::. #include <iostream> #include <vector> using namespace std; steps_to_escape_part1 implements part 1 of the challenge: we read the offset at the current location, move forward/backward by that many steps, then add one to the offset stored at that location before repeating. The result is the number of steps we take before jumping outside the list. int steps_to_escape_part1(vector<int>& instructions) { int pos = 0, iterations = 0, new_pos; while (pos < instructions.size()) { new_pos = pos + instructions[pos]; instructions[pos]++; pos = new_pos; iterations++; } return iterations; } steps_to_escape_part2 solves part 2, which is very similar, except that an offset of three or more is decremented instead of incremented before moving on. int steps_to_escape_part2(vector<int>& instructions) { int pos = 0, iterations = 0, new_pos, offset; while (pos < instructions.size()) { offset = instructions[pos]; new_pos = pos + offset; instructions[pos] += offset >= 3 ? -1 : 1; pos = new_pos; iterations++; } return iterations; } Finally we pull it all together and link it up to the input. int main() { vector<int> instructions1, instructions2; int n; The cin stream lets us read data from standard input, which we then add to a vector of ints to give our list of instructions. while (true) { cin >> n; if (cin.eof()) break; instructions1.push_back(n); } Solving the problem modifies the input, so we need to take a copy to solve part 2 as well. Thankfully the STL makes this easy with iterators. instructions2.insert(instructions2.begin(), instructions1.begin(), instructions1.end()); Finally, compute the result and print it on standard output. cout << steps_to_escape_part1(instructions1) << endl; cout << steps_to_escape_part2(instructions2) << endl; return 0; }

High Entropy Passphrases — Python — #adventofcode Day 4 Today’s challenge describes some simple rules supposedly intended to enforce the use of secure passwords. All we have to do is test a list of passphrases and identify which ones meet the rules. → Full code on GitHub !!! commentary Fearing that today might be as time-consuming as yesterday, I returned to Python and its hugely powerful “batteries-included” standard library. Thankfully this challenge was more straightforward, and I actually finished this before finishing day 3. First, let’s import two useful utilities. from fileinput import input from collections import Counter Part 1 requires simply that a passphrase contains no repeated words. No problem: we split the passphrase into words and count them, and check if any was present more than once. Counter is an amazingly useful class to have in a language’s standard library.
All it does is count things: you add objects to it, and then it will tell you how many of a given object you have. We’re going to use it to count those potentially duplicated words. def is_valid(passphrase): counter = Counter(passphrase.split()) return counter.most_common(1)[0][1] == 1 Part 2 requires that no word in the passphrase be an anagram of any other word. Since we don’t need to do anything else with the words afterwards, we can check for anagrams by sorting the letters in each word: “leaf” and “flea” both become “aefl” and can be compared directly. Then we count as before. def is_valid_ana(passphrase): counter = Counter(''.join(sorted(word)) for word in passphrase.split()) return counter.most_common(1)[0][1] == 1 Finally we pull everything together. sum(map(boolean_func, list)) is a common idiom in Python for counting the number of times a condition (checked by boolean_func) is true. In Python, True and False can be treated as the numbers 1 and 0 respectively, so that summing a list of Boolean values gives you the number of True values in the list. lines = list(input()) print(sum(map(is_valid, lines))) print(sum(map(is_valid_ana, lines))) Spiral Memory — Go — #adventofcode Day 3 Today’s challenge requires us to perform some calculations on an “experimental memory layout”, with cells moving outwards from the centre of a square spiral (squiral?). → Full code on GitHub !!! commentary I’ve been wanting to try my hand at Go, the memory-safe, statically typed compiled language from Google for a while. Today’s challenge seemed a bit more mathematical in nature, meaning that I wouldn’t need too many advanced language features or knowledge of a standard library, so I thought I’d give it a “go”. It might have been my imagination, but it was impressive how quickly the compiled program chomped through 60 different input values while I was debugging. I actually spent far too long on this problem because my brain led me down a blind alley trying to do the wrong calculation, but I got there in the end! The solution is a bit difficult to explain without diagrams, which I don't really have time to draw right now, but fear not because several other people have. First take a look at [the challenge itself which explains the spiral memory concept](http://adventofcode.com/2017/day/3). Then look at the [nice diagrams that Phil Tooley made with Python](http://acceleratedscience.co.uk/blog/adventofcode-day-3-spiral-memory/) and hopefully you'll be able to see what's going on! It's interesting to note that this challenge also admits of an algorithmic solution instead of the mathematical one: you can model the memory as an infinite grid using a suitable data structure and literally move around it in a spiral. In hindsight this is a much better way of solving the challenge quickly because it's easier and less error-prone to code. I'm quite pleased with my maths-ing though, and it's much quicker than the algorithmic version! First some Go boilerplate: we have to define the package we’re in (main, because it’s an executable we’re producing) and import the libraries we’ll use. package main import ( "fmt" "math" "os" ) Weirdly, Go doesn’t seem to have these basic mathematics functions for integers in its standard library (please someone correct me if I’m wrong!) so I’ll define them instead of mucking about with data types. Go doesn’t do any implicit type conversion, even between numeric types, and the math builtin package only operates on float64 values. 
func abs(n int) int { if n < 0 { return -n } return n } func min(x, y int) int { if x < y { return x } return y } func max(x, y int) int { if x > y { return x } return y } This does the heavy lifting for part one: converting from a position on the spiral to a column and row in the grid. (0, 0) is the centre of the spiral. This actually does a bit more than is necessary to calculate the distance as required for part 1, but we’ll use it again for part 2. func spiral_to_xy(n int) (int, int) { if n == 1 { return 0, 0 } r := int(math.Floor((math.Sqrt(float64(n-1)) + 1) / 2)) n_r := n - (2*r-1)*(2*r-1) o := ((n_r - 1) % (2 * r)) - r + 1 sector := (n_r - 1) / (2 * r) switch sector { case 0: return r, o case 1: return -o, r case 2: return -r, -o case 3: return o, -r } return 0, 0 } Now use spiral_to_xy to calculate the Manhattan distance that the value at location n in the spiral memory are carried to reach the “access port” at 0. func distance(n int) int { x, y := spiral_to_xy(n) return abs(x) + abs(y) } This function does the opposite of spiral_to_xy, translating a grid position back to its position on the spiral. This is the one that took me far too long to figure out because I had a brain bug and tried to calculate the value s (which sector or quarter of the spiral we’re looking at) in a way that was never going to work! Fortunately I came to my senses. func xy_to_spiral(x, y int) int { if x == 0 && y == 0 { return 1 } r := max(abs(x), abs(y)) var s, o, n int if x+y > 0 && x-y >= 0 { s = 0 } else if x-y < 0 && x+y >= 0 { s = 1 } else if x+y < 0 && x-y <= 0 { s = 2 } else { s = 3 } switch s { case 0: o = y case 1: o = -x case 2: o = -y case 3: o = x } n = o + r*(2*s+1) + (2*r-1)*(2*r-1) return n } This is a utility function that uses xy_to_spiral to fetch the value at a given (x, y) location, and returns zero if we haven’t filled that location yet. func get_spiral(mem []int, x, y int) int { n := xy_to_spiral(x, y) - 1 if n < len(mem) { return mem[n] } return 0 } Finally we solve part 2 of the problem, which involves going round the spiral writing values into it that are the sum of some values already written. The result is the first of these sums that is greater than or equal to the given input value. func stress_test(input int) int { mem := make([]int, 1) n := 0 mem[0] = 1 for mem[n] < input { n++ x, y := spiral_to_xy(n + 1) mem = append(mem, get_spiral(mem, x+1, y)+ get_spiral(mem, x+1, y+1)+ get_spiral(mem, x, y+1)+ get_spiral(mem, x-1, y+1)+ get_spiral(mem, x-1, y)+ get_spiral(mem, x-1, y-1)+ get_spiral(mem, x, y-1)+ get_spiral(mem, x+1, y-1)) } return mem[n] } Now the last part of the program puts it all together, reading the input value from a commandline argument and printing the results of the two parts of the challenge: func main() { var n int fmt.Sscanf(os.Args[1], "%d", &n) fmt.Printf("Input is %d\n", n) fmt.Printf("Distance is %d\n", distance(n)) fmt.Printf("Stress test result is %d\n", stress_test(n)) } Corruption Checksum — Python — #adventofcode Day 2 Today’s challenge is to calculate a rather contrived “checksum” over a grid of numbers. → Full code on GitHub !!! commentary Today I went back to plain Python, and I didn’t do formal tests because only one test case was given for each part of the problem. I just got stuck in. I did write part 2 out in as nested `for` loops as an intermediate step to working out the generator expression. I think that expanded version may have been more readable. 
Having got that far, I couldn't then work out how to finally eliminate the need for an auxiliary function entirely without either sorting the same elements multiple times or sorting each row as it's read. First we read in the input, split it and convert it to numbers. fileinput.input() returns an iterator over the lines in all the files passed as command-line arguments, or over standard input if no files are given. from fileinput import input sheet = [[int(x) for x in l.split()] for l in input()] Part 1 of the challenge calls for finding the difference between the largest and smallest number in each row, and then summing those differences: print(sum(max(x) - min(x) for x in sheet)) Part 2 is a bit more involved: for each row we have to find the unique pair of elements that divide into each other without remainder, then sum the result of those divisions. We can make it a little easier by sorting each row; then we can take each number in turn and compare it only with the numbers after it (which are guaranteed to be larger). Doing this ensures we only make each comparison once. def rowsum_div(row): row = sorted(row) return sum(y // x for i, x in enumerate(row) for y in row[i+1:] if y % x == 0) print(sum(map(rowsum_div, sheet))) We can make this code shorter (if not easier to read) by sorting each row as it’s read: sheet = [sorted(int(x) for x in l.split()) for l in input()] Then we can just use the first and last elements in each row for part 1, as we know those are the smallest and largest respectively in the sorted row: print(sum(x[-1] - x[0] for x in sheet)) Part 2 then becomes a sum over a single generator expression: print(sum(y // x for row in sheet for i, x in enumerate(row) for y in row[i+1:] if y % x == 0)) Very satisfying! Inverse Captcha — Coconut — #adventofcode Day 1 Well, December’s here at last, and with it Day 1 of Advent of Code. … It goes on to explain that you may only leave by solving a captcha to prove you’re not a human. Apparently, you only get one millisecond to solve the captcha: too fast for a normal human, but it feels like hours to you. … As well as posting solutions here when I can, I’ll be putting them all on https://github.com/jezcope/aoc2017 too. !!! commentary After doing some challenges from last year in Haskell for a warm up, I felt inspired to try out the functional-ish Python dialect, Coconut. Now that I’ve done it, it feels a bit of an odd language, neither fish nor fowl. It’ll look familiar to any Pythonista, but is loaded with features normally associated with functional languages, like pattern matching, destructuring assignment, partial application and function composition. That makes it quite fun to work with, as it works similarly to Haskell, but because it's restricted by the basic rules of Python syntax everything feels a bit more like hard work than it should. The accumulator approach feels clunky, but it's necessary to allow [tail call elimination](https://en.wikipedia.org/wiki/Tail_call), which Coconut will do and I wanted to see in action. Lo and behold, if you take a look at the [compiled Python version](https://github.com/jezcope/aoc2017/blob/86c8100824bda1b35e5db6e02d4b80890be7a022/01-inverse-captcha.py#L675) you'll see that my recursive implementation has been turned into a non-recursive `while` loop. Then again, maybe I'm just jealous of Phil Tooley's [one-liner solution in Python](https://github.com/ptooley/aocGolf/blob/1380d78194f1258748ccfc18880cfd575baf5d37/2017.py#L8). 
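For comparison, here is a rough plain-Python sketch of the part 1 digit-matching logic (sum every digit that matches the one after it, treating the sequence as circular) written as an ordinary loop rather than an accumulator-style recursion, which is roughly what the tail-call-eliminated version boils down to. This is just my illustration with a made-up function name, not the compiled output; the actual Coconut solution follows below.

```python
def inverse_captcha_loop(digits: str) -> int:
    """Sum every digit that matches the next digit in the circular sequence."""
    total = 0
    # Pair each digit with its successor, wrapping the last digit around to the first.
    for current, following in zip(digits, digits[1:] + digits[:1]):
        if current == following:
            total += int(current)
    return total

assert inverse_captcha_loop("1122") == 3
assert inverse_captcha_loop("91212129") == 9
```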
import sys def inverse_captcha_(s, acc=0): case reiterable(s): match (|d, d|) :: rest: return inverse_captcha_((|d|) :: rest, acc + int(d)) match (|d0, d1|) :: rest: return inverse_captcha_((|d1|) :: rest, acc) return acc def inverse_captcha(s) = inverse_captcha_(s :: s[0]) def inverse_captcha_1_(s0, s1, acc=0): case (reiterable(s0), reiterable(s1)): match ((|d0|) :: rest0, (|d0|) :: rest1): return inverse_captcha_1_(rest0, rest1, acc + int(d0)) match ((|d0|) :: rest0, (|d1|) :: rest1): return inverse_captcha_1_(rest0, rest1, acc) return acc def inverse_captcha_1(s) = inverse_captcha_1_(s, s$[len(s)//2:] :: s) def test_inverse_captcha(): assert "1111" |> inverse_captcha == 4 assert "1122" |> inverse_captcha == 3 assert "1234" |> inverse_captcha == 0 assert "91212129" |> inverse_captcha == 9 def test_inverse_captcha_1(): assert "1212" |> inverse_captcha_1 == 6 assert "1221" |> inverse_captcha_1 == 0 assert "123425" |> inverse_captcha_1 == 4 assert "123123" |> inverse_captcha_1 == 12 assert "12131415" |> inverse_captcha_1 == 4 if __name__ == "__main__": sys.argv[1] |> inverse_captcha |> print sys.argv[1] |> inverse_captcha_1 |> print Advent of Code 2017: introduction It’s a common lament of mine that I don’t get to write a lot of code in my day-to-day job. I like the feeling of making something from nothing, and I often look for excuses to write bits of code, both at work and outside it. Advent of Code is a daily series of programming challenges for the month of December, and is about to start its third annual incarnation. I discovered it too late to take part in any serious way last year, but I’m going to give it a try this year. There are no restrictions on programming language (so of course some people delight in using esoteric languages like Brainf**k), but I think I’ll probably stick with Python for the most part. That said, I miss my Haskell days and I’m intrigued by new kids on the block Go and Rust, so I might end up throwing in a few of those on some of the simpler challenges. I’d like to focus a bit more on how I solve the puzzles. They generally come in two parts, with the second part only being revealed after successful completion of the first part. With that in mind, test-driven development makes a lot of sense, because I can verify that I haven’t broken the solution to the first part in modifying to solve the second. I may also take a literate programming approach with org-mode or Jupyter notebooks to document my solutions a bit more, and of course that will make it easier to publish solutions here so I’ll do that as much as I can make time for. On that note, here are some solutions for 2016 that I’ve done recently as a warmup. 
Day 1: Python Day 1 instructions import numpy as np import pytest as t import sys TURN = { 'L': np.array([[0, 1], [-1, 0]]), 'R': np.array([[0, -1], [1, 0]]) } ORIGIN = np.array([0, 0]) NORTH = np.array([0, 1]) class Santa: def __init__(self, location, heading): self.location = np.array(location) self.heading = np.array(heading) self.visited = [(0,0)] def execute_one(self, instruction): start_loc = self.location.copy() self.heading = self.heading @ TURN[instruction[0]] self.location += self.heading * int(instruction[1:]) self.mark(start_loc, self.location) def execute_many(self, instructions): for i in instructions.split(','): self.execute_one(i.strip()) def distance_from_start(self): return sum(abs(self.location)) def mark(self, start, end): for x in range(min(start[0], end[0]), max(start[0], end[0])+1): for y in range(min(start[1], end[1]), max(start[1], end[1])+1): if any((x, y) != start): self.visited.append((x, y)) def find_first_crossing(self): for i in range(1, len(self.visited)): for j in range(i): if self.visited[i] == self.visited[j]: return self.visited[i] def distance_to_first_crossing(self): crossing = self.find_first_crossing() if crossing is not None: return abs(crossing[0]) + abs(crossing[1]) def __str__(self): return f'Santa @ {self.location}, heading {self.heading}' def test_execute_one(): s = Santa(ORIGIN, NORTH) s.execute_one('L1') assert all(s.location == np.array([-1, 0])) assert all(s.heading == np.array([-1, 0])) s.execute_one('L3') assert all(s.location == np.array([-1, -3])) assert all(s.heading == np.array([0, -1])) s.execute_one('R3') assert all(s.location == np.array([-4, -3])) assert all(s.heading == np.array([-1, 0])) s.execute_one('R100') assert all(s.location == np.array([-4, 97])) assert all(s.heading == np.array([0, 1])) def test_execute_many(): s = Santa(ORIGIN, NORTH) s.execute_many('L1, L3, R3') assert all(s.location == np.array([-4, -3])) assert all(s.heading == np.array([-1, 0])) def test_distance(): assert Santa(ORIGIN, NORTH).distance_from_start() == 0 assert Santa((10, 10), NORTH).distance_from_start() == 20 assert Santa((-17, 10), NORTH).distance_from_start() == 27 def test_turn_left(): east = NORTH @ TURN['L'] south = east @ TURN['L'] west = south @ TURN['L'] assert all(east == np.array([-1, 0])) assert all(south == np.array([0, -1])) assert all(west == np.array([1, 0])) def test_turn_right(): west = NORTH @ TURN['R'] south = west @ TURN['R'] east = south @ TURN['R'] assert all(east == np.array([-1, 0])) assert all(south == np.array([0, -1])) assert all(west == np.array([1, 0])) if __name__ == '__main__': instructions = sys.stdin.read() santa = Santa(ORIGIN, NORTH) santa.execute_many(instructions) print(santa) print('Distance from start:', santa.distance_from_start()) print('Distance to target: ', santa.distance_to_first_crossing()) Day 2: Haskell Day 2 instructions module Main where data Pos = Pos Int Int deriving (Show) -- Magrittr-style pipe operator (|>) :: a -> (a -> b) -> b x |> f = f x swapPos :: Pos -> Pos swapPos (Pos x y) = Pos y x clamp :: Int -> Int -> Int -> Int clamp lower upper x | x < lower = lower | x > upper = upper | otherwise = x clampH :: Pos -> Pos clampH (Pos x y) = Pos x' y' where y' = clamp 0 4 y r = abs (2 - y') x' = clamp r (4-r) x clampV :: Pos -> Pos clampV = swapPos . clampH . swapPos buttonForPos :: Pos -> String buttonForPos (Pos x y) = [buttons !! y !! 
x] where buttons = [" D ", " ABC ", "56789", " 234 ", " 1 "] decodeChar :: Pos -> Char -> Pos decodeChar (Pos x y) 'R' = clampH $ Pos (x+1) y decodeChar (Pos x y) 'L' = clampH $ Pos (x-1) y decodeChar (Pos x y) 'U' = clampV $ Pos x (y+1) decodeChar (Pos x y) 'D' = clampV $ Pos x (y-1) decodeLine :: Pos -> String -> Pos decodeLine p "" = p decodeLine p (c:cs) = decodeLine (decodeChar p c) cs makeCode :: String -> String makeCode instructions = lines instructions -- split into lines |> scanl decodeLine (Pos 1 1) -- decode to positions |> tail -- drop start position |> concatMap buttonForPos -- convert to buttons main = do input <- getContents putStrLn $ makeCode input Research Data Management Forum 18, Manchester !!! intro "" Monday 20 and Tuesday 21 November 2017 I’m at the Research Data Management Forum in Manchester. I thought I’d use this as an opportunity to try liveblogging, so during the event some notes should appear in the box below (you may have to manually refresh your browser tab periodically to get the latest version). I've not done this before, so if the blog stops updating then it's probably because I've stopped updating it to focus on the conference instead! This was made possible using GitHub's cool [Gist](https://gist.github.com) tool. Draft content policy I thought it was about time I had some sort of content policy on here so this is a first draft. It will eventually wind up as a separate page. Feedback welcome! !!! aside “Content policy” This blog’s primary purpose is as a reflective learning tool for my own development; my aim in writing any given post is mainly to expose and develop my own thinking on a topic. My reasons for making a public blog rather than a private journal are: 1. If I'm lucky, someone smarter than me will provide feedback that will help me and my readers to learn more 2. If I'm extra lucky, someone else might learn from the material as well Each post, therefore, represents the state of my thinking at the time I wrote it, or perhaps a deliberate provocation or exaggeration; either way, if you don't know me personally please don't judge me based entirely on my past words. This is a request though, not an attempt to excuse bad behaviour on my part. I accept full responsibility for any consequences of my words, whether intended or not. I will not remove comments or ban individuals for disagreeing with me, only for behaving offensively or disrespectfully. I will do my best to be fair and balanced and explain decisions that I take, but I reserve the right to take those decisions without making any explanation at all if it seems likely to further inflame a situation. If I end up responding to anything simply with a link to this policy, that's probably all the explanation you're going to get. It should go without saying, but the opinions presented in this blog are my own and not those of my employer or anyone else I might at times represent. Learning to live with anxiety !!! intro "" This is a post that I’ve been writing for months, and writing in my head for years. For some it will explain aspects of my personality that you might have wondered about. For some it will just be another person banging on self-indulgently about so-called “mental health issues”. Hopefully, for some it will demystify some stuff and show that you’re not alone and things do get better. For as long as I can remember I’ve been a worrier. I’ve also suffered from bouts of what I now recognise as depression, on and off since my school days. 
It’s only relatively recently that I’ve come to the realisation that these two might be connected and that my ‘worrying’ might in fact be outside the normal range of healthy human behaviour and might more accurately be described as chronic anxiety. You probably won’t have noticed it, but it’s been there. More recently I’ve begun feeling like I’m getting on top of it and feeling “normal” for the first time in my life. Things I’ve found that help include: getting out of the house more and socialising with friends; and getting a range of exercise, outdoors and away from the city (rock climbing is mentally and physically engaging and open water swimming is indescribably joyful). But mostly it’s the cognitive behavioural therapy (CBT) and the antidepressants. Before I go any further, a word about drugs (“don’t do drugs, kids”): I’m on the lowest available dose of a common antidepressant. This isn’t because it stops me being sad all the time (I’m not) or because it makes all my problems go away (it really doesn’t). It’s because the scientific evidence points to a combination of CBT and antidepressants as being the single most effective treatment for generalised anxiety disorder. The reason for this is simple: CBT isn’t easy, because it asks you to challenge habits and beliefs you’ve held your whole life. In the short term there is going to be more anxiety and some antidepressants are also effective at blunting the effect of this additional anxiety. In short, CBT is what makes you better, and the drugs just make it a little bit more effective. A lot of people have misconceptions about what it means to be ‘in therapy’. I suspect a lot of these are derived from the psychoanalysis we often see portrayed in (primarily US) film and TV. The problem with that type of navel-gazing therapy is that you can spend years doing it, finally reach some sort of breakthrough insight, and still have no idea what the supposed insight means for your actual life. CBT is different in that rather than addressing feelings directly it focuses on habits in your thoughts (cognitive) and actions (behavioural) with feeling better as an outcome (therapy). CBT and related forms of therapy now have decades of clinical evidence showing that they really work. CBT uses a wide range of techniques to identify, challenge and reduce various common unhelpful thoughts and behaviours. By choosing and practicing these, you can break bad mental habits that you’ve been carrying around, often for decades. For me this means giving fair weight to my successes as well as my failings, allowing flexibility into the rigid rules that I have always, subconsciously, lived by, and being a bit kinder to myself when I make mistakes. It’s not been easy and I have to remind myself to practice this every day, but it’s really helped. !!! aside “More info” If you live in the UK, you might not be aware that you can get CBT and other psychological therapies on the NHS through a scheme called IAPT (improving access to psychological therapies). You can self-refer so you don’t need to see a doctor first, but you might want to anyway if you think medication might help. They also have a progression of treatments, so you might be offered a course of “guided self-help” and then progressed to CBT or another talking therapy if need be. This is what happened to me, and it did help a bit but it was CBT that helped me the most.

Becoming a librarian What is a librarian? Is it someone who has a masters degree in librarianship and information science?
Is it someone who looks after information for other people? Is it simply someone who works in a library? I’ve been grappling with this question a lot lately because I’ve worked in academic libraries for about 3 years now and I never really thought that’s something that might happen. People keep referring to me as “a librarian” but there’s some imposter feelings here because all the librarians around me have much more experience, have skills in areas like cataloguing and collection management and, generally, have a librarian masters degree. So I’ve been thinking about what it actually means to me to be a librarian or not. NB. some of these may be tongue-in-cheek Ways in which I am a librarian: I work in a library I help people to access and organise information I have a cat I like gin Ways in which I am not a librarian: I don’t have a librarianship qualification I don’t work with books 😉 I don’t knit (though I can probably remember how if pressed) I don’t shush people or wear my hair in a bun (I can confirm that this is also true of every librarian I know) Ways in which I am a shambrarian: I like beer I have more IT experience and qualification than librarianship At the end of the day, I still don’t know how I feel about this or, for that matter, how important it is. I’m probably going to accept whatever title people around me choose to bestow, though any label will chafe at times! Lean Libraries: applying agile practices to library services Kanban board Jeff Lasovski (via Wikimedia Commons) I’ve been working with our IT services at work quite closely for the last year as product owner for our new research data portal, ORDA. That’s been a fascinating process for me as I’ve been able to see first-hand some of the agile techniques that I’ve been reading about from time-to-time on the web over the last few years. They’re in the process of adopting a specific set of practices going under the name “Scrum”, which is fun because it uses some novel terminology that sounds pretty weird to non-IT folks, like “scrum master”, “sprint” and “product backlog”. On my small project we’ve had great success with the short cycle times and been able to build trust with our stakeholders by showing concrete progress on a regular basis. Modern librarianship is increasingly fluid, particularly in research services, and I think that to handle that fluidity it’s absolutely vital that we are able to work in a more agile way. I’m excited about the possibilities of some of these ideas. However, Scrum as implemented by our IT services doesn’t seem something that transfers directly to the work that we do: it’s too specialised for software development to adapt directly. What I intend to try is to steal some of the individual practices on an experimental basis and simply see what works and what doesn’t. The Lean concepts currently popular in IT were originally developed in manufacturing: if they can be translated from the production of physical goods to IT, I don’t see why we can’t make the ostensibly smaller step of translating them to a different type of knowledge work. I’ve therefore started reading around this subject to try and get as many ideas as possible. I’m generally pretty rubbish at taking notes from books, so I’m going to try and record and reflect on any insights I make on this blog. The framework for trying some of these out is clearly a Plan-Do-Check-Act continuous improvement cycle, so I’ll aim to reflect on that process too. 
I’m sure there will have been people implementing Lean in libraries already, so I’m hoping to be able to discover and learn from them instead of starting from scratch. Wish me luck!

Mozilla Global Sprint 2017 Photo by Lena Bell on Unsplash Every year, the Mozilla Foundation runs a two-day Global Sprint, giving people around the world 50 hours to work on projects supporting and promoting open culture and tech. Though much of the work during the sprint is, of course, technical software development work, there are always tasks suited to a wide range of different skill sets and experience levels. The participants include writers, designers, teachers, information professionals and many others. This year, for the first time, the University of Sheffield hosted a site, providing a space for local researchers, developers and others to get out of their offices, work on #mozsprint and link up with others around the world. The Sheffield site was organised by the Research Software Engineering group in collaboration with the University Library. Our site was only small compared to others, but we still had people working on several different projects. My reason for taking part in the sprint was to contribute to the international effort on the Library Carpentry project. A team spread across four continents worked throughout the whole sprint to review and develop our lesson material. As there were no other Library Carpentry volunteers at the Sheffield site, I chose to pick up some urgent work around improving the presentation of our workshops and lessons on the web and related workflows. It was a really nice subproject to work on, requiring not only cleaning up and normalising the metadata we hold on workshops and lessons, but also digesting and formalising our current ad hoc process of lesson development. The largest group were solar physicists from the School of Maths and Statistics, working on the SunPy project, an open source environment for solar data analysis. They pushed loads of bug fixes and documentation improvements, and also mentored a new contributor through their first additions to the project. Anna Krystalli from Research Software Engineering worked on the EchoBurst project, which is building a web browser extension to help people break out of their online echo chambers. It does this by using natural language processing techniques to highlight well-written, logically sound articles that disagree with the reader’s stated views on particular topics of interest. Anna was part of an effort to begin extending this technology to online videos. We had a couple of individuals simply taking the opportunity to break out of their normal work environments to work or learn, including a couple of members of library staff who showed up for a couple of hours to learn how to use git on a new project!

IDCC 2017 reflection For most of the last few years I've been lucky enough to attend the International Digital Curation Conference (IDCC). One of the main audiences attending is people who, like me, work on research data management at universities around the world and it's begun to feel like a sort of "home" conference to me. This year, IDCC was held at the Royal College of Surgeons in the beautiful city of Edinburgh.
For the last couple of years, my overall impression has been that, as a community, we're moving away from the "first-order" problem of trying to convince people (from PhD students to senior academics) to take RDM seriously and into a rich set of "second-order" problems around how to do things better and widen support to more people. This year has been no exception. Here are a few of my observations and takeaway points. Everyone has a repository now Only last year, the most common question you'd get asked by strangers in the coffee break would be "Do you have a data repository?" Now the question is more likely to be "What are you using for your data repository?", along with more subtle questions about specific components of systems and how they interact. Integrating active storage and archival systems Now that more institutions have data worth preserving, there is more interest in (and in many cases experience of) setting up more seamless integrations between active and archival storage. There are lessons here we can learn. Freezing in amber vs actively maintaining assets There seemed to be an interesting debate going on throughout the conference around the aim of preservation: should we be faithfully preserving the bits and bytes provided without trying to interpret them, or should we take a more active approach by, for example, migrating obsolete formats to newer alternatives. If the former, should we attempt to preserve the software required to access the data as well? If the latter, how much effort do we invest and how do we ensure nothing is lost or altered in the migration? Demonstrating Data Science instead of debating what it is The phrase "Data Science" was once again one of the most commonly uttered of the conference. However, there is now less abstract discussion about what, exactly, is meant by this "data science" thing; this has been replaced more by concrete demonstrations. This change was exemplified perfectly by the keynote by data scientist Alice Daish, who spent a riveting 40 minutes or so enthusing about all the cool stuff she does with data at the British Museum. Recognition of software as an issue Even as recently as last year, I've struggled to drum up much interest in discussing software sustainability and preservation at events like this; the interest was there, but there were higher priorities. So I was completely taken by surprise when we ended up with 30+ people in the Software Preservation Birds of a Feather (BoF) session, and when very little input was needed from me as chair to keep a productive discussion going for a full 90 minutes. Unashamed promotion of openness As a community we seem to have nearly overthrown our collective embarrassment about the phrase "open data" (although maybe this is just me). We've always known it was a good thing, but I know I've been a bit of an apologist in the past, feeling that I had to "soften the blow" when asking researchers to be more open. Now I feel more confident in leading with the benefits of openness, and it felt like that's a change reflected in the community more widely. Becoming more involved in the conference This year, I took a decision to try and do more to contribute to the conference itself, and I felt like this was pretty successful both in making that contribution and building up my own profile a bit. I presented a paper on one of my current passions, Library Carpentry; it felt really good to be able to share my enthusiasm. 
I presented a poster on our work integrating our data repository and digital preservation platform; this gave me more of a structure for networking during breaks, as I was able to stand by the poster and start discussions with anyone who seemed interested. I chaired a parallel session; a first for me, and a different challenge from presenting or simply attending the talks. And finally, I proposed and chaired the Software Preservation BoF session (blog post forthcoming). Renewed excitement It's weird, and possibly all in my imagination, but there seemed to be more energy at this conference than at the previous couple I've been to. More people seemed to be excited about the work we're all doing, recent achievements and the possibilities for the future. Introducing PyRefine: OpenRefine meets Python I’m knocking the rust off my programming skills by attempting to write a pure-Python interpreter for OpenRefine “scripts”. OpenRefine is a great tool for exploring and cleaning datasets prior to analysing them. It also records an undo history of all actions that you can export as a sort of script in JSON format. One thing that bugs me though is that, having spent some time interactively cleaning up your dataset, you then need to fire up OpenRefine again and do some interactive mouse-clicky stuff to apply that cleaning routine to another dataset. You can at least re-import the JSON undo history to make that as quick as possible, but there’s no getting around the fact that there’s no quick way to do it from a cold start. There is a project, BatchRefine, that extends the OpenRefine server to accept batch requests over a HTTP API, but that isn’t useful when you can’t or don’t want to keep a full Java stack running in the background the whole time. My concept is this: you use OR to explore the data interactively and design a cleaning process, but then export the process to JSON and integrate it into your analysis in Python. That way it can be repeated ad nauseam without having to fire up a full Java stack. I’m taking some inspiration from the great talk “So you want to be a wizard?" by Julia Evans (@b0rk), who recommends trying experiments as a way to learn. She gives these Rules of Programming Experiments: “it doesn’t have to be good it doesn’t have to work you have to learn something” In that spirit, my main priorities are: to see if this can be done; to see how far I can get implementing it; and to learn something. If it also turns out to be a useful thing, well, that’s a bonus. Some of the interesting possible challenges here: Implement all core operations; there are quite a lot of these, some of which will be fun (i.e. non-trivial) to implement Implement (a subset of?) GREL, the General Refine Expression Language; I guess my undergrad course on implementing parsers and compilers will come in handy after all! Generate clean, sane Python code from the JSON rather than merely executing it; more than anything, this would be a nice educational tool for users of OpenRefine who want to see how to do equivalent things in Python Selectively optimise key parts of the process; this will involve profiling the code to identify bottlenecks as well as tweaking the actual code to go faster Potentially handle contributions to the code from other people; I’d be really happy if this happened but I’m realistic… If you’re interested, the project is called PyRefine and it’s on github. Constructive criticism, issues & pull requests all welcome! 
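For the curious, here is a minimal sketch of the sort of interpreter loop I have in mind, using pandas as a stand-in data backend. It only handles two operations, and the operation names and JSON keys are written from memory rather than checked against OpenRefine's actual export format, so treat all of them as assumptions rather than a faithful description of the project.

```python
import json
import pandas as pd

def apply_history(df: pd.DataFrame, history_path: str) -> pd.DataFrame:
    """Replay a (tiny, assumed) subset of an exported OpenRefine history on a DataFrame."""
    with open(history_path) as f:
        operations = json.load(f)  # assumed: a JSON array of operation objects

    for op in operations:
        kind = op.get("op")
        if kind == "core/column-rename":  # operation and key names here are assumptions
            df = df.rename(columns={op["oldColumnName"]: op["newColumnName"]})
        elif kind == "core/column-removal":
            df = df.drop(columns=[op["columnName"]])
        else:
            raise NotImplementedError(f"Operation not implemented yet: {kind}")
    return df
```

A real implementation would need to cover the full set of core operations (and eventually GREL), which is exactly where the interesting challenges listed above begin.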
Implementing Yesterbox in emacs with mu4e I’ve been meaning to give Yesterbox a try for a while. The general idea is that each day you only deal with email that arrived yesterday or earlier. This forms your inbox for the day, hence “yesterbox”. Once you’ve emptied your yesterbox, or at least got through some minimum number (10 is recommended), then you can look at emails from today. Even then you only really want to be dealing with things that are absolutely urgent. Anything else can wait til tomorrow. The motivation for doing this is to get away from the feeling that we are King Canute, trying to hold back the tide. I find that when I’m processing my inbox toward zero there’s always a temptation to keep skipping to the new stuff that’s just come in. Hiding away the new email until I’ve dealt with the old is a very interesting idea. I use mu4e in emacs for reading my email, and handily the mu search syntax is very flexible so you’d think it would be easy to create a yesterbox filter: maildir:"/INBOX" date:..1d Unfortunately, 1d is interpreted as “24 hours ago from right now” so this filter misses everything that was sent yesterday but less than 24 hours ago. There was a feature request raised on the mu github repository to implement an additional date filter syntax but it seems to have died a death for now. In the meantime, the answer to this is to remember that my workplace observes fairly standard office hours, so that anything sent more than 9 hours ago is unlikely to have been sent today. The following does the trick: maildir:"/INBOX" date:..9h In my mu4e bookmarks list, that looks like this: (setq mu4e-bookmarks '(("flag:unread AND NOT flag:trashed" "Unread messages" ?u) ("flag:flagged maildir:/archive" "Starred messages" ?s) ("date:today..now" "Today's messages" ?t) ("date:7d..now" "Last 7 days" ?w) ("maildir:\"/Mailing lists.*\" (flag:unread OR flag:flagged)" "Unread in mailing lists" ?M) ("maildir:\"/INBOX\" date:..9h" "Yesterbox" ?y))) ;; <- this is the new one

Rewarding good practice in research From opensource.com on Flickr Whenever I’m involved in a discussion about how to encourage researchers to adopt new practices, eventually someone will come out with some variant of the following phrase: “That’s all very well, but researchers will never do XYZ until it’s made a criterion in hiring and promotion decisions.” With all the discussion of carrots and sticks I can see where this attitude comes from, and strongly empathise with it, but it raises two main problems: It’s unfair and more than a little insulting to anyone to be lumped into one homogeneous group; and Taking all the different possible XYZs into account, that’s an awful lot of hoops to expect anyone to jump through. Firstly, “researchers” are as diverse as the rest of us in terms of what gets them out of bed in the morning. Some of us want prestige; some want to contribute to a greater good; some want to create new things; some just enjoy the work. One thing I’d argue we all have in common is this: nothing is more off-putting than feeling like you’re being strongarmed into something you don’t want to do. If we rely on simplistic metrics, people will focus on those and miss the point. At best people will disengage and at worst they will actively game the system. I’ve got to do these ten things to get my next payrise, and still retain my sanity? Ok, what’s the least I can get away with and still tick them off. You see it with students taking poorly-designed assessments and grown-ups are no different.
We do need to wield carrots as well as sticks, but the whole point is that these practices are beneficial in and of themselves. The carrots are already there if we articulate them properly and clear the roadblocks (don’t you enjoy mixed metaphors?). Creating artificial benefits will just dilute the value of the real ones. Secondly, I’ve heard a similar argument made for all of the following practices and more: Research data management Open Access publishing Public engagement New media (e.g. blogging) Software management and sharing Some researchers devote every waking hour to their work, whether it’s in the lab, writing grant applications, attending conferences, authoring papers, teaching, and so on and so on. It’s hard to see how someone with all this in their schedule can find time to exercise any of these new skills, let alone learn them in the first place. And what about the people who sensibly restrict the hours taken by work to spend more time doing things they enjoy? Yes, all of the above practices are valuable, both for the individual and the community, but they’re all new (to most) and hence require more effort up front to learn. We have to accept that it’s inevitably going to take time for all of them to become “business as usual”. I think if the hiring/promotion/tenure process has any role in this, it’s in asking whether the researcher can build a coherent narrative as to why they’ve chosen to focus their efforts in this area or that. You’re not on Twitter but your data is being used by 200 research groups across the world? Great! You didn’t have time to tidy up your source code for github but your work is directly impacting government policy? Brilliant! We still need to convince more people to do more of these beneficial things, so how? Call me naïve, but maybe we should stick to making rational arguments, calming fears and providing low-risk opportunities to learn new skills. Acting (compassionately) like a stuck record can help. And maybe we’ll need to scale back our expectations in other areas (journal impact factors, anyone?) to make space for the new stuff. Software Carpentry: SC Test; does your software do what you meant? “The single most important rule of testing is to do it.” — Brian Kernighan and Rob Pike, The Practice of Programming (quote taken from SC Test page) One of the trickiest aspects of developing software is making sure that it actually does what it’s supposed to. Sometimes failures are obvious: you get completely unreasonable output or even (shock!) a comprehensible error message. But failures are often more subtle. Would you notice if your result was out by a few percent, or consistently ignored the first row of your input data? The solution to this is testing: take some simple example input with a known output, run the code and compare the actual output with the expected one. Implement a new feature, test and repeat. Sounds easy, doesn’t it? But then you implement a new bit of code. You test it and everything seems to work fine, except that your new feature required changes to existing code and those changes broke something else. So in fact you need to test everything, and do it every time you make a change. Further than that, you probably want to test that all your separate bits of code work together properly (integration testing) as well as testing the individual bits separately (unit testing). In fact, splitting your tests up like that is a good way of holding on to your sanity.
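To make that concrete, here is a minimal sketch of a unit test using Python's pytest; the function and file names are invented for illustration, not taken from any of the tools discussed here:

# test_stats.py -- run with `pytest` in the same directory
def mean(numbers):
    """Return the arithmetic mean of a sequence of numbers."""
    return sum(numbers) / len(numbers)

def test_mean_simple():
    # Known input, known expected output
    assert mean([1, 2, 3, 4]) == 2.5

def test_mean_single_value():
    # Edge case: the mean of a single value is that value
    assert mean([42]) == 42

Because pytest discovers every function whose name starts with test_, the whole suite can be re-run with a single command after each change, which is what makes testing everything every time practical.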
This is actually a lot less scary than it sounds, because there are plenty of tools now to automate that testing: you just type a simple test command and everything is verified. There are even tools that enable you to have tests run automatically when you check the code into version control, and even automatically deploy code that passes the tests, a process known as continuous integration or CI. The big problems with testing are that it’s tedious, your code seems to work without it and no-one tells you off for not doing it. At the time when the Software Carpentry competition was being run, the idea of testing wasn’t new, but the tools to help were in their infancy. “Existing tools are obscure, hard to use, expensive, don’t actually provide much help, or all three.” The SC Test category asked entrants “to design a tool, or set of tools, which will help programmers construct and maintain black box and glass box tests of software components at all levels, including functions, modules, and classes, and whole programs.” The SC Test category is interesting in that the competition administrators clearly found it difficult to specify what they wanted to see in an entry. In fact, the whole category was reopened with a refined set of rules and expectations. Ultimately, it’s difficult to tell whether this category made a significant difference. Where the tools to write tests used to be very sparse and difficult to use they are now many and several options exist for most programming languages. With this proliferation, several tried-and-tested methodologies have emerged which are consistent across many different tools, so while things still aren’t perfect they are much better. In recent years there has been a culture shift in the wider software development community towards both testing in general and test-first development, where the tests for a new feature are written first, and then the implementation is coded incrementally until all tests pass. The current challenge is to transfer this culture shift to the academic research community! Tools for collaborative markdown editing Photo by Alan Cleaver I really love Markdown1. I love its simplicity; its readability; its plain-text nature. I love that it can be written and read with nothing more complicated than a text-editor. I love how nicely it plays with version control systems. I love how easy it is to convert to different formats with Pandoc and how it’s become effectively the native text format for a wide range of blogging platforms. One frustration I’ve had recently, then, is that it’s surprisingly difficult to collaborate on a Markdown document. There are various solutions that almost work but at best feel somehow inelegant, especially when compared with rock solid products like Google Docs. Finally, though, we’re starting to see some real possibilities. Here are some of the things I’ve tried, but I’d be keen to hear about other options. 1. Just suck it up To be honest, Google Docs isn’t that bad. In fact it works really well, and has almost no learning curve for anyone who’s ever used Word (i.e. practically anyone who’s used a computer since the 90s). When I’m working with non-technical colleagues there’s nothing I’d rather use. It still feels a bit uncomfortable though, especially the vendor lock-in. You can export a Google Doc to Word, ODT or PDF, but you need to use Google Docs to do that. Plus as soon as I start working in a word processor I get tempted to muck around with formatting. 2. 
Git(hub) The obvious solution to most techies is to set up a GitHub repo, commit the document and go from there. This works very well for bigger documents written over a longer time, but seems a bit heavyweight for a simple one-page proposal, especially over short timescales. Who wants to muck around with pull requests and merging changes for a document that’s going to take 2 days to write tops? This type of project doesn’t need a bug tracker or a wiki or a public homepage anyway. Even without GitHub in the equation, using git for such a trivial use case seems clunky. 3. Markdown in Etherpad/Google Docs Etherpad is a great tool for collaborative editing, but suffers from two key problems: no syntax highlighting or preview for markdown (it’s just treated as simple text); and you need to find a server to host it or do it yourself. However, there’s nothing to stop you editing markdown with it. You can do the same thing in Google Docs, in fact, and I have. Editing a fundamentally plain-text format in a word processor just feels weird though. 4. Overleaf/Authorea Overleaf and Authorea are two products developed to support academic editing. Authorea has built-in markdown support but lacks proper simultaneous editing. Overleaf has great simultaneous editing but only supports markdown by wrapping a bunch of LaTeX boilerplate around it. Both OK but unsatisfactory. 5. StackEdit Now we’re starting to get somewhere. StackEdit has both Markdown syntax highlighting and near-realtime preview, as well as integrating with Google Drive and Dropbox for file synchronisation. 6. HackMD HackMD is one that I only came across recently, but it looks like it does exactly what I’m after: a simple markdown-aware editor with live preview that also permits simultaneous editing. I’m a little circumspect simply because I know simultaneous editing is difficult to get right, but it certainly shows promise. 7. Classeur I discovered Classeur literally today: it’s developed by the same team as StackEdit (which is now apparently no longer in development), and is currently in beta, but it looks to offer two killer features: real-time collaboration, including commenting, and pandoc-powered export to loads of different formats. Anything else? Those are the options I’ve come up with so far, but they can’t be the only ones. Is there anything I’ve missed? Other plain-text formats are available. I’m also a big fan of org-mode. ↩︎ Software Carpentry: SC Track; hunt those bugs! This competition will be an opportunity for the next wave of developers to show their skills to the world — and to companies like ours. — Dick Hardt, ActiveState (quote taken from SC Track page) All code contains bugs, and all projects have features that users would like but which aren’t yet implemented. Open source projects tend to get more of these as their user communities grow and start requesting improvements to the product. As your open source project grows, it becomes harder and harder to keep track of and prioritise all of these potential chunks of work. What do you do? The answer, as ever, is to make a to-do list. Different projects have used different solutions, including mailing lists, forums and wikis, but fairly quickly a whole separate class of software evolved: the bug tracker, which includes such well-known examples as Bugzilla, Redmine and the mighty JIRA. Bug trackers are built entirely around such requests for improvement, and typically track them through workflow stages (planning, in progress, fixed, etc.)
with scope for the community to discuss and add various bits of metadata. In this way, it becomes easier both to prioritise problems against each other and to use the hive mind to find solutions. Unfortunately most bug trackers are big, complicated beasts, more suited to large projects with dozens of developers and hundreds or thousands of users. Clearly a project of this size is more difficult to manage and requires a certain feature set, but the result is that the average bug tracker is non-trivial to set up for a small single-developer project. The SC Track category asked entrants to propose a better bug tracking system. In particular, the judges were looking for something easy to set up and configure without compromising on functionality. The winning entry was a bug-tracker called Roundup, proposed by Ka-Ping Yee. Here we have another tool which is still in active use and development today. Given that there is now a huge range of options available in this area, including the mighty github, this is no small achievement. These days, of course, github has become something of a de facto standard for open source project management. Although github is ostensibly a version control hosting platform, each repository also comes with a built-in issue tracker, which is also well-integrated with the “pull request” workflow system that allows contributors to submit bug fixes and features themselves. Github’s competitors, such as GitLab and Bitbucket, also include similar features. Not everyone wants to work in this way though, so it’s good to see that there is still a healthy ecosystem of open source bug trackers, and that Software Carpentry is still having an impact. Software Carpentry: SC Config; write once, compile anywhere Nine years ago, when I first released Python to the world, I distributed it with a Makefile for BSD Unix. The most frequent questions and suggestions I received in response to these early distributions were about building it on different Unix platforms. Someone pointed me to autoconf, which allowed me to create a configure script that figured out platform idiosyncrasies. Unfortunately, autoconf is painful to use – its grouping, quoting and commenting conventions don’t match those of the target language, which makes scripts hard to write and even harder to debug. I hope that this competition comes up with a better solution — it would make porting Python to new platforms a lot easier! — Guido van Rossum, Technical Director, Python Consortium (quote taken from SC Config page) On to the next Software Carpentry competition category, then. One of the challenges of writing open source software is that you have to make it run on a wide range of systems over which you have no control. You don’t know what operating system any given user might be using or what libraries they have installed, or even what versions of those libraries. This means that whatever build system you use, you can’t just send the Makefile (or whatever) to someone else and expect everything to go off without a hitch. For a very long time, it’s been common practice for source packages to include a configure script that, when executed, runs a bunch of tests to see what it has to work with and sets up the Makefile accordingly. Writing these scripts by hand is a nightmare, so tools like autoconf and automake evolved to make things a little easier. They did, and if the tests you want to use are already implemented they work very well indeed.
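To give a flavour of the kind of check a configure script performs, here is a rough sketch in Python (purely illustrative, not part of autoconf or any real tool): it asks the system C compiler whether a trivial program using a given header will compile, which is roughly what an autoconf feature test does under the hood. It assumes a compiler is available on the PATH as cc.

import os
import subprocess
import tempfile

def have_header(header, compiler="cc"):
    """Return True if a trivial C program including `header` compiles."""
    source = f"#include <{header}>\nint main(void) {{ return 0; }}\n"
    with tempfile.TemporaryDirectory() as tmp:
        src = os.path.join(tmp, "conftest.c")
        with open(src, "w") as f:
            f.write(source)
        # A zero exit code from the compiler means the header was found and usable
        result = subprocess.run([compiler, src, "-o", os.path.join(tmp, "conftest")],
                                capture_output=True)
        return result.returncode == 0

print("zlib.h available:", have_header("zlib.h"))

Real configure scripts run dozens of checks like this and then write the results into the Makefile, which is where the complexity starts to pile up.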
Unfortunately they’re built on an unholy combination of shell scripting and the archaic GNU M4 macro language. That means if you want to write new tests you need to understand both of these as well as the architecture of the tools themselves — not an easy task for the average self-taught research programmer. SC Conf, then, called for a re-engineering of the autoconf concept, to make it easier for researchers to make their code available in a portable, platform-independent format. The second round configuration tool winner was SapCat, “a tool to help make software portable”. Unfortunately, this one seems not to have gone anywhere, and I could only find the original proposal on the Internet Archive. There were a lot of good ideas in this category about making catalogues and databases of system quirks to avoid having to rerun the same expensive tests again the way a standard ./configure script does. I think one reason none of these ideas survived is that they were overly ambitious, imagining a grand architecture where their tool would provide some overarching source of truth. This is in stark contrast to the way most Unix-like systems work, where each tool does one very specific job well and tools are easy to combine in various ways. In the end though, I think Moore’s Law won out here, making it easier to do the brute-force checks each time than to try anything clever to save time — a good example of avoiding unnecessary optimisation. Add to that the evolution of the generic pkg-config tool from earlier package-specific tools like gtk-config, and it’s now much easier to check for particular versions and features of common packages. On top of that, much of the day-to-day coding of a modern researcher happens in interpreted languages like Python and R, which give you a fully-functioning pre-configured environment with a lot less compiling to do. As a side note, Tom Tromey, another of the shortlisted entrants in this category, is still a major contributor to the open source world. He still seems to be involved in the automake project, contributes a lot of code to the emacs community too and blogs sporadically at The Cliffs of Inanity. Semantic linefeeds: one clause per line I’ve started using “semantic linefeeds” when writing content, a concept I discovered on Brandon Rhodes' blog, where it’s described far better than I could. It turns out this is a very old idea, promoted way back in the day by Brian W Kernighan, contributor to the original Unix system, co-creator of the AWK and AMPL programming languages and co-author of a lot of seminal programming textbooks including “The C Programming Language”. The basic idea is that you break lines at natural gaps between clauses and phrases, rather than simply after the last word before you hit 80 characters. Keeping line lengths strictly to 80 characters isn’t really necessary in these days of wide aspect ratios for screens. Breaking lines at points that make semantic sense in the sentence is really helpful for editing, especially in the context of version control, because it isolates changes to the clause in which they occur rather than just the nearest 80-character block. I also like it because it makes my crappy prose feel just a little bit more like poetry. ☺ Software Carpentry: SC Build; or making a better make Software tools often grow incrementally from small beginnings into elaborate artefacts. Each increment makes sense, but the final edifice is a mess.
make is an excellent example: a simple tool that has grown into a complex domain-specific programming language. I look forward to seeing the improvements we will get from designing the tool afresh, as a whole… — Simon Peyton-Jones, Microsoft Research (quote taken from SC Build page) Most people who have had to compile an existing software tool will have come across the venerable make tool (which usually these days means GNU Make). It allows the developer to write a declarative set of rules specifying how the final software should be built from its component parts, mostly source code, allowing the build itself to be carried out by simply typing make at the command line and hitting Enter. Given a set of rules, make will work out all the dependencies between components and ensure everything is built in the right order and nothing that is up-to-date is rebuilt. Great in principle, but make is notoriously difficult for beginners to learn, as much of the logic for how builds are actually carried out is hidden beneath the surface. This also makes it difficult to debug problems when building large projects. For these reasons, the SC Build category called for a replacement build tool engineered from the ground up to solve these problems. The second round winner, ScCons, is a Python-based make-like build tool written by Steven Knight. While I could find no evidence of any of the other shortlisted entries, this project (now renamed SCons) continues in active use and development to this day. I actually use this one myself from time to time and to be honest I prefer it in many cases to trendy new tools like rake or grunt and the behemoth that is Apache Ant. Its Python-based SConstruct file syntax is remarkably intuitive and scales nicely from very simple builds up to big and complicated projects, with good dependency tracking to avoid unnecessary recompiling. It has a lot of built-in rules for performing common build & compile tasks, but it’s trivial to add your own, either by combining existing building blocks or by writing a new builder with the full power of Python. A minimal SConstruct file looks like this: Program('hello.c') Couldn’t be simpler! And you have the full power of Python syntax to keep your build file simple and readable. It’s interesting that all the entries in this category apart from one chose to use a Python-derived syntax for describing build steps. Python was clearly already a language of choice for flexible multi-purpose computing. The exception is the entry that chose to use XML instead, which I think is a horrible idea (oh how I used to love XML!) but has been used to great effect in the Java world by tools like Ant and Maven. What happened to the original Software Carpentry? “Software Carpentry was originally a competition to design new software tools, not a training course. The fact that you didn’t know that tells you how well it worked.” When I read this in a recent post on Greg Wilson’s blog, I took it as a challenge. I actually do remember the competition, although looking at the dates it was long over by the time I found it. I believe it did have impact; in fact, I still occasionally use one of the tools it produced, so Greg’s comment got me thinking: what happened to the other competition entries? Working out what happened will need a bit of digging, as most of the relevant information is now only available on the Internet Archive. It certainly seems that by November 2008 the domain name had been allowed to lapse and had been replaced with a holding page by the registrar.
There were four categories in the competition, each representing a category of tool that the organisers thought could be improved:

SC Build: a build tool to replace make
SC Conf: a configuration management tool to replace autoconf and automake
SC Track: a bug tracking tool
SC Test: an easy-to-use testing framework

I’m hoping to be able to show that this work had a lot more impact than Greg is admitting here. I’ll keep you posted on what I find! Changing static site generators: Nanoc → Hugo I’ve decided to move the site over to a different static site generator, Hugo. I’ve been using Nanoc for a long time and it’s worked very well, but lately it’s been taking longer and longer to compile the site and throwing weird errors that I can’t get to the bottom of. At the time I started using Nanoc, static site generators were in their infancy. There weren’t the huge number of feature-loaded options that there are now, so I chose one and I built a whole load of blogging-related functionality myself. I did it in ways that made sense at the time but no longer work well with Nanoc’s latest versions. So it’s time to move to something that has blogging baked-in from the beginning and I’m taking the opportunity to overhaul the look and feel too. Again, when I started there weren’t many pre-existing themes so I built the whole thing myself and though I’m happy with the work I did on it, it never quite felt polished enough. Now I’ve got the opportunity to adapt one of the many well-designed themes already out there, so I’ve taken one from the Hugo themes gallery and tweaked the colours to my satisfaction. Hugo also has various features that I’ve wanted to implement in Nanoc but never quite got round to. The nicest one is proper handling of draft posts and future dates, but I keep finding others. There’s a lot of old content that isn’t quite compatible with the way Hugo does things so I’ve taken the old Nanoc-compiled content and frozen it to make sure that old links still work. I could probably fiddle with it for years without doing much so it’s probably time to go ahead and publish it. I’m still not completely happy with my choice of theme but one of the joys of Hugo is that I can change that whenever I want. Let me know what you think! License Except where otherwise stated, all content on eRambler by Jez Cope is licensed under a Creative Commons Attribution-ShareAlike 4.0 International license. RDM Resources I occasionally get asked for resources to help someone learn more about research data management (RDM) as a discipline (i.e. for those providing RDM support rather than simply wanting to manage their own data). I’ve therefore collected a few resources together on this page. If you’re lucky I might even update it from time to time! First, a caveat: this is very focussed on UK Higher Education, though much of it will still be relevant for people outside that narrow demographic. My general recommendation would be to start with the Digital Curation Centre (DCC) website and follow links out from there. I also have a slowly growing list of RDM links on Diigo, and there’s an RDM section in my list of blogs and feeds too.
Mailing lists Jiscmail is a popular list server run for the benefit of further and higher education in the UK; the following lists are particularly relevant: RESEARCH-DATAMAN DATA-PUBLICATION DIGITAL-PRESERVATION LIS-RESEARCHSUPPORT The Research Data Alliance have a number of Interest Groups and Working Groups that discuss issues by email Events International Digital Curation Conference — major annual conference Research Data Management Forum — roughly every six months, places are limited! RDA Plenary — also every 6 months, but only about 1 in every 3 in Europe Books In no particular order: Martin, Victoria. Demystifying eResearch: A Primer for Librarians. Libraries Unlimited, 2014. Borgman, Christine L. Big Data, Little Data, No Data: Scholarship in the Networked World. Cambridge, Massachusetts: The MIT Press, 2015. Corti, Louise, Veerle Van den Eynden, and Libby Bishop. Managing and Sharing Research Data. Thousand Oaks, CA: SAGE Publications Ltd, 2014. Pryor, Graham, ed. Managing Research Data. Facet Publishing, 2012. Pryor, Graham, Sarah Jones, and Angus Whyte, eds. Delivering Research Data Management Services: Fundamentals of Good Practice. Facet Publishing, 2013. Ray, Joyce M., ed. Research Data Management: Practical Strategies for Information Professionals. West Lafayette, Indiana: Purdue University Press, 2014. Reports ‘Ten Recommendations for Libraries to Get Started with Research Data Management’. LIBER, 24 August 2012. http://libereurope.eu/news/ten-recommendations-for-libraries-to-get-started-with-research-data-management/. ‘Science as an Open Enterprise’. Royal Society, 2 June 2012. https://royalsociety.org/policy/projects/science-public-enterprise/Report/. Auckland, Mary. ‘Re-Skilling for Research’. RLUK, January 2012. http://www.rluk.ac.uk/wp-content/uploads/2014/02/RLUK-Re-skilling.pdf. Journals International Journal of Digital Curation (IJDC) Journal of eScience Librarianship (JeSLib) Fairphone 2: initial thoughts on the original ethical smartphone I’ve had my eye on the Fairphone 2 for a while now, and when my current phone, an aging Samsung Galaxy S4, started playing up I decided it was time to take the plunge. A few people have asked for my thoughts on the Fairphone so here are a few notes. Why I bought it The thing that sparked my interest, and the main reason for buying the phone really, was the ethical stance of the manufacturer. The small Dutch company have gone to great lengths to ensure that both labour and materials are sourced as responsibly as possible. They regularly inspect the factories where the parts are made and assembled to ensure fair treatment of the workers and they source all the raw materials carefully to minimise the environmental impact and the use of conflict minerals. Another side to this ethical stance is a focus on longevity of the phone itself. This is not a product with an intentionally limited lifespan. Instead, it’s designed to be modular and as repairable as possible, by the owner themselves. Spares are available for all of the parts that commonly fail in phones (including screen and camera), and at the time of writing the Fairphone 2 is the only phone to receive 10/10 for repairability from iFixit. There are plans to allow hardware upgrades, including an expansion port on the back so that NFC or wireless charging could be added with a new case, for example. What I like So far, the killer feature for me is the dual SIM card slots.
I have both a personal and a work phone, and the latter was always getting left at home or in the office or running out of charge. Now I have both SIMs in the one phone: I can receive calls on either number, turn them on and off independently and choose which account to use when sending a text or making a call. The OS is very close to “standard” Android, which is nice, and I really don’t miss all the extra bloatware that came with the Galaxy S4. It also has twice the storage of that phone, which is hardly unique but is still nice to have. Overall, it seems like a solid, reliable phone, though it’s not going to outperform anything else at the same price point. It certainly feels nice and snappy for everything I want to use it for. I’m no mobile gamer, but there is that distant promise of upgradability on the horizon if you are. What I don’t like I only have two bugbears so far. Once or twice it’s locked up and become unresponsive, requiring a “manual reset” (removing and replacing the battery) to get going again. It also lacks NFC, which isn’t really a deal breaker, but I was just starting to make occasional use of it on the S4 (mostly experimenting with my Yubikey NEO) and it would have been nice to try out Android Pay when it finally arrives in the UK. Overall It’s definitely a serious contender if you’re looking for a new smartphone and aren’t bothered about serious mobile gaming. You do pay a premium for the ethical sourcing and modularity, but I feel that’s worth it for me. I’m looking forward to seeing how it works out as a phone. Wiring my web I’m a nut for automating repetitive tasks, so I was dead pleased a few years ago when I discovered that IFTTT let me plug different bits of the web together. I now use it for tasks such as: Syndicating blog posts to social media Creating scheduled/repeating todo items from a Google Calendar Making a note to revisit an article I’ve starred in Feedly I’d probably only be half-joking if I said that I spend more time automating things than I save not having to do said things manually. Thankfully it’s also a great opportunity to learn, and recently I’ve been thinking about reimplementing some of my IFTTT workflows myself to get to grips with how it all works. There are some interesting open source projects designed to offer a lot of this functionality, such as Huginn, but I decided to go for a simpler option for two reasons: I want to spend my time learning about the APIs of the services I use and how to wire them together, rather than learning how to use another big framework; and I only have a small Amazon EC2 server to play with and a heavy Ruby on Rails app like Huginn (plus web server) needs more memory than I have. Instead I’ve gone old-school with a little collection of individual scripts to do particular jobs. I’m using the built-in scheduling functionality of systemd, which is already part of a modern Linux operating system, to get them to run periodically. It also means I can vary the language I use to write each one depending on the needs of the job at hand and what I want to learn/feel like at the time. Currently it’s all done in Python, but I want to have a go at Lisp sometime, and there are some interesting new languages like Go and Julia that I’d like to get my teeth into as well. You can see my code on github as it develops: https://github.com/jezcope/web-plumbing. Comments and contributions are welcome (if not expected) and let me know if you find any of the code useful.
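As an example of what one of those plumbing scripts might look like, here is a hypothetical sketch (not the actual code in that repository) that polls an RSS/Atom feed with the feedparser library and prints any entries it hasn't seen before; the feed URL and state file name are placeholders:

import json
import pathlib

import feedparser  # third-party library: pip install feedparser

FEED_URL = "https://example.org/feed.xml"      # placeholder feed
SEEN_FILE = pathlib.Path("seen-entries.json")  # remembers what we've already handled

def load_seen():
    if SEEN_FILE.exists():
        return set(json.loads(SEEN_FILE.read_text()))
    return set()

def main():
    seen = load_seen()
    feed = feedparser.parse(FEED_URL)
    for entry in feed.entries:
        entry_id = entry.get("id", entry.link)
        if entry_id not in seen:
            # A real script would post to social media, file a todo, etc.
            print(f"New entry: {entry.title} -> {entry.link}")
            seen.add(entry_id)
    SEEN_FILE.write_text(json.dumps(sorted(seen)))

if __name__ == "__main__":
    main()

A systemd timer (or plain old cron) then just runs the script every few minutes.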
Image credit: xkcd #1319, Automation Data is like water, and language is like clothing I admit it: I’m a grammar nerd. I know the difference between ‘who’ and ‘whom’, and I’m proud. I used to be pretty militant, but these days I’m more relaxed. I still take joy in the mechanics of the language, but I also believe that English is defined by its usage, not by a set of arbitrary rules. I’m just as happy to abuse it as to use it, although I still think it’s important to know what rules you’re breaking and why. My approach now boils down to this: language is like clothing. You (probably) wouldn’t show up to a job interview in your pyjamas1, but neither are you going to wear a tuxedo or ballgown to the pub. Getting commas and semicolons in the right place is like getting your shirt buttons done up right. Getting it wrong doesn’t mean you’re an idiot. Everyone will know what you meant. It will affect how you’re perceived, though, and that will affect how your message is perceived. And there are former rules2 that some still enforce that are nonetheless dropping out of regular usage. There was a time when everyone in an office job wore formal clothing. Then it became acceptable just to have a blouse, or a shirt and tie. Then the tie became optional and now there are many professions where perfectly well-respected and competent people are expected to show up wearing nothing smarter than jeans and a t-shirt. One such rule IMHO is that ‘data’ is a plural and should take pronouns like ‘they’ and ‘these’. The origin of the word ‘data’ is in the Latin plural of ‘datum’, and that idea has clung on for a considerable period. But we don’t speak Latin and the English language continues to evolve: ‘agenda’ also began life as a Latin plural, but we don’t use the word ‘agendum’ any more. It’s common everyday usage to refer to data with singular pronouns like ‘it’ and ‘this’, and it’s very rare to see someone referring to a single datum (as opposed to ‘data point’ or something). If you want to get technical, I tend to think of data as a mass noun, like ‘water’ or ‘information’. It’s uncountable: talking about ‘a water’ or ‘an information’ doesn’t make much sense, but it uses singular pronouns, as in ‘this information’. If you’re interested, the Oxford English Dictionary also takes this position, while Chambers leaves the choice of singular or plural noun up to you. There is absolutely nothing wrong, in my book, with referring to data in the plural as many people still do. But it’s no longer a rule and for me it’s weakened further from guideline to preference. It’s like wearing a bow-tie to work. There’s nothing wrong with it and some people really make it work, but it’s increasingly outdated and even a little eccentric. or maybe you’d totally rock it. ↩︎ Like not starting a sentence with a conjunction… ↩︎ #IDCC16 day 2: new ideas Well, I did a great job of blogging the conference for a couple of days, but then I was hit by the bug that’s been going round and didn’t have a lot of energy for anything other than paying attention and making notes during the day! I’ve now got round to reviewing my notes so here are a few reflections on day 2. Day 2 was the day of many parallel talks! So many great and inspiring ideas to take in! Here are a few of my take-home points. Big science and the long tail The first parallel session had examples of practical data management in the real world. 
Jian Qin & Brian Dobreski (School of Information Studies, Syracuse University) worked on reproducibility with one of the research groups involved with the recent gravitational wave discovery. “Reproducibility” for this work (as with much of physics) mostly equates to computational reproducibility: tracking the provenance of the code and its input and output is key. They also found that in practice the scientists' focus was on making the big discovery, and ensuring reproducibility was seen as secondary. This goes some way to explaining why current workflows and tools don’t really capture enough metadata. Milena Golshan & Ashley Sands (Center for Knowledge Infrastructures, UCLA) investigated the use of Software-as-a-Service (SaaS, such as Google Drive, Dropbox or more specialised tools) as a way of meeting the needs of long-tail science research such as ocean science. This research is characterised by small teams, diverse data, dynamic local development of tools, local practices and difficulty disseminating data. This results in a need for researchers to be generalists, as opposed to “big science” research areas, where they can afford to specialise much more deeply. Such generalists tend to develop their own isolated workflows, which can differ greatly even within a single lab. Long-tail research also often struggles from a lack of dedicated IT support. They found that use of SaaS could help to meet these challenges, but with a high cost required to cover the needed guarantees of security and stability. Education & training This session focussed on the professional development of library staff. Eleanor Mattern (University of Pittsburgh) described the immersive training introduced to improve librarians' understanding of the data needs of their subject areas in delivering their RDM service delivery model. The participants each conducted a “disciplinary deep dive”, shadowing researchers and then reporting back to the group on their discoveries with a presentation and discussion. Liz Lyon (also University of Pittsburgh, formerly UKOLN/DCC) gave a systematic breakdown of the skills, knowledge and experience required in different data-related roles, obtained from an analysis of job adverts. She identified distinct roles of data analyst, data engineer and data journalist, and as well as each role’s distinctive skills, pinpointed common requirements of all three: Python, R, SQL and Excel. This work follows on from an earlier phase which identified an allied set of roles: data archivist, data librarian and data steward. Data sharing and reuse This session gave an overview of several specific workflow tools designed for researchers. Marisa Strong (University of California Curation Centre/California Digital Libraries) presented Dash, a highly modular tool for manual data curation and deposit by researchers. It’s built on their flexible backend, Stash, and though it’s currently optimised to deposit in their Merritt data repository it could easily be hooked up to other repositories. It captures DataCite metadata and a few other fields, and is integrated with ORCID to uniquely identify people. In a different vein, Eleni Castro (Institute for Quantitative Social Science, Harvard University) discussed some of the ways that Harvard’s Dataverse repository is streamlining deposit by enabling automation. It provides a number of standardised endpoints such as OAI-PMH for metadata harvest and SWORD for deposit, as well as custom APIs for discovery and deposit. 
Interesting use cases include: An addon for the Open Science Framework to deposit in Dataverse via SWORD An R package to enable automatic deposit of simulation and analysis results Integration with publisher workflows Open Journal Systems A growing set of visualisations for deposited data In the future they’re also looking to integrate with DMPtool to capture data management plans and with Archivematica for digital preservation. Andrew Treloar (Australian National Data Service) gave us some reflections on the ANDS “applications programme”, a series of 25 small funded projects intended to address the fourth of their strategic transformations, single use → reusable. He observed that essentially these projects worked because they were able to throw money at a problem until they found a solution: not very sustainable. Some of them stuck to a traditional “waterfall” approach to project management, resulting in “the right solution 2 years late”. Every researcher’s needs are “special” and communities are still constrained by old ways of working. The conclusions from this programme were that: “Good enough” is fine most of the time Adopt/Adapt/Augment is better than Build Existing toolkits let you focus on the 10% functionality that’s missing Successful projects involved research champions who can: 1) articulate their community’s requirements; and 2) promote project outcomes Summary All in all, it was a really exciting conference, and I’ve come home with loads of new ideas and plans to develop our services at Sheffield. I noticed a continuation of some of the trends I spotted at last year’s IDCC, especially an increasing focus on “second-order” problems: we’re no longer spending most of our energy just convincing researchers to take data management seriously and are able to spend more time helping them to do it better and get value out of it. There’s also a shift in emphasis (identified by closing speaker Cliff Lynch) from sharing to reuse, and making sure that data is not just available but valuable. #IDCC16 Day 1: Open Data The main conference opened today with an inspiring keynote by Barend Mons, Professor in Biosemantics, Leiden University Medical Center. The talk had plenty of great stuff, but two points stood out for me. First, Prof Mons described a newly discovered link between Huntington’s Disease and a previously unconsidered gene. No-one had previously recognised this link, but on mining the literature, an indirect link was identified in more than 10% of the roughly 1 million scientific claims analysed. This is knowledge for which we already had more than enough evidence, but which could never have been discovered without such a wide-ranging computational study. Second, he described a number of behaviours which should be considered “malpractice” in science: Relying on supplementary data in articles for data sharing: the majority of this is trash (paywalled, embedded in bitmap images, missing) Using the Journal Impact Factor to evaluate science and ignoring altmetrics Not writing data stewardship plans for projects (he prefers this term to “data management plan”) Obstructing tenure for data experts by assuming that all highly-skilled scientists must have a long publication record A second plenary talk from Andrew Sallons of the Centre for Open Science introduced a number of interesting-looking bits and bobs, including the Transparency & Openness Promotion (TOP) Guidelines which set out a pathway to help funders, publishers and institutions move towards more open science.
The rest of the day was taken up with a panel on open data, a poster session, some demos and a birds-of-a-feather session on sharing sensitive/confidential data. There was a great range of posters, but a few that stood out to me were: Lessons learned about ISO 16363 (“Audit and certification of trustworthy digital repositories”) certification from the British Library Two separate posters (from the Universities of Toronto and Colorado) about disciplinary RDM information & training for liaison librarians A template for sharing psychology data developed by a psychologist-turned-information researcher from Carnegie Mellon University More to follow, but for now it’s time for the conference dinner! #IDCC16 Day 0: business models for research data management I’m at the International Digital Curation Conference 2016 (#IDCC16) in Amsterdam this week. It’s always a good opportunity to pick up some new ideas and catch up with colleagues from around the world, and I always come back full of new possibilities. I’ll try and do some more reflective posts after the conference but I thought I’d do some quick reactions while everything is still fresh. Monday and Thursday are pre- and post-conference workshop days, and today I attended Developing Research Data Management Services. Joy Davidson and Jonathan Rans from the Digital Curation Centre (DCC) introduced us to the Business Model Canvas, a template for designing a business model on a single sheet of paper. The model prompts you to think about all of the key facets of a sustainable, profitable business, and can easily be adapted to the task of building a service model within a larger institution. The DCC used it as part of the Collaboration to Clarify Curation Costs (4C) project, whose output the Curation Costs Exchange is also worth a look. It was a really useful exercise to be able to work through the whole process for an aspect of research data management (my table focused on training & guidance provision), both because of the ideas that came up and also the experience of putting the framework into practice. It seems like a really valuable tool and I look forward to seeing how it might help us with our RDM service development. Tomorrow the conference proper begins, with a range of keynotes, panel sessions and birds-of-a-feather meetings so hopefully more then! About me I help people in Higher Education communicate and collaborate more effectively using technology. I currently work at the University of Sheffield focusing on research data management policy, practice, training and advocacy. In my free time, I like to: run; play the accordion; morris dance; climb; cook; read (fiction and non-fiction); write. Better Science Through Better Data #scidata17 Better Science through Better Doughnuts (photo: Jez Cope) Update: fixed the link to the slides so it works now! Last week I had the honour of giving my first ever keynote talk, at an event entitled Better Science Through Better Data hosted jointly by Springer Nature and the Wellcome Trust. It was nerve-wracking but exciting and seemed to go down fairly well. I even got accidentally awarded a PhD in the programme — if only it was that easy! The slides for the talk, “Supporting Open Research: The role of an academic library”, are available online (doi:10.15131/shef.data.5537269), and the whole event was video’d for posterity and viewable online. I got some good questions too, mainly from the clever online question system.
I didn’t get to answer all of them, so I’m thinking of doing a blog post or two to address a few more. There were loads of other great presentations as well, both keynotes and 7-minute lightning talks, so I’d encourage you to take a look at at least some of it. I’ll pick out a few of my highlights. Dr Aled Edwards (University of Toronto) There’s a major problem with science funding that I hadn’t really thought about before. The available funding pool for research is divided up into pots by country, and often by funding body within a country. Each of these pots has robust processes to award funding to the most important problems and most capable researchers. The problem comes because there is no coordination between these pots, so researchers all over the world end up getting funded to research the most popular problems, leading to a lot of duplication of effort. Industry funding suffers from a similar problem, particularly the pharmaceutical industry. Because there is no sharing of data or negative results, multiple companies spend billions researching the same dead ends chasing after the same drugs. This is where the astronomical costs of drug development come from. Dr Edwards presented one alternative, modelled by a company called M4K Pharma. The idea is to use existing IP laws to try and give academic researchers a reasonable, morally-justifiable and sustainable profit on drugs they develop, in contrast to the current model where basic research is funded by governments while large corporations hoover up as much profit as they possibly can. This new model would develop drugs all the way to human trial within academia, then license the resulting drugs to companies to manufacture with a price cap to keep the medicines affordable to all who need them. Core to this effort is openness with data, materials and methodology, and Dr Edwards presented several examples of how this approach benefited academic researchers, industry and patients compared with a closed, competitive focus. Dr Kirstie Whitaker (Alan Turing Institute) This was a brilliant presentation: a practical how-to guide to doing reproducible research, from one researcher to another. I suggest you take a look at her slides yourself: Showing your working: a how-to guide to reproducible research. Dr Whitaker briefly addressed a number of common barriers to reproducible research: Is not considered for promotion: so it should be! Held to higher standards than others: reviewers should be discouraged from nitpicking just because the data/code/whatever is available (true unbiased peer review of these would be great though) Publication bias towards novel findings: it is morally wrong to not publish reproductions, replications etc. so we need to address the common taboo on doing so Plead the 5th: if you share, people may find flaws, but if you don’t they can’t — if you’re worried about this you should ask yourself why! Support additional users: some (much?) of the burden should reasonably fall on the reuser, not the sharer Takes time: this is only true if you hack it together after the fact; if you do it from the start, the whole process will be quicker!
Requires additional skills: important to provide training, but also to judge PhD students on their ability to do this, not just on their thesis & papers The rest of the presentation, the “how-to” guide of the title, was a well-chosen and passionately delivered set of recommendations, but the thing that really stuck out for me is how good Dr Whitaker is at making the point that you only have to do one of these things to improve the quality of your research. It’s easy to get the impression at the moment that you have to be fully, perfectly open or not at all, but it’s actually OK to get there one step at a time, or even not to go all the way at all! Anyway, I think this is a slide deck that speaks for itself, so I won’t say any more! Lightning talk highlights There was plenty of good stuff in the lightning talks, which were constrained to 7 minutes each, but a few of the things that stood out for me were, in no particular order: Code Ocean — share and run code in the cloud dat project — peer-to-peer data synchronisation tool Can automate metadata creation, data syncing, versioning Set up a secure data sharing network that keeps the data in sync but off the cloud Berlin Institute of Health — open science course for students Pre-print paper Course materials InterMine — taking the pain out of data cleaning & analysis Nix/NixOS as a component of a reproducible paper BoneJ (ImageJ plugin for bone analysis) — developed by a scientist, used a lot, now has a Wellcome-funded RSE to develop next version ESASky — amazing live, online archive of masses of astronomical data Coda I really enjoyed the event (and the food was excellent too). My thanks go out to: The programme committee for asking me to come and give my take — I hope I did it justice! The organising team who did a brilliant job of keeping everything running smoothly before and during the event The University of Sheffield for letting me get away with doing things like this! Blog platform switch I’ve just switched my blog over to the Nikola static site generator. Hopefully you won’t notice a thing, but there might be a few weird spectres around til I get all the kinks ironed out. I’ve made the switch for a couple of main reasons: Nikola supports Jupyter notebooks as a source format for blog posts, which will be useful for including code snippets It’s written in Python, a language which I actually know, so I’m more likely to be able to fix things that break, customise it and potentially contribute to the open source project (by contrast, Hugo is written in Go, which I’m not really familiar with) Chat rooms vs Twitter: how I communicate now CC0, Pixabay This time last year, Brad Colbow published a comic in his “The Brads” series entitled “The long slow death of Twitter”. It really encapsulates the way I’ve been feeling about Twitter for a while now. Go ahead and take a look. I’ll still be here when you come back. According to my Twitter profile, I joined in February 2009 as user #20,049,102. It was nearing its 3rd birthday and, though there were clearly a lot of people already signed up at that point, it was still relatively quiet, especially in the UK. I was a lonely PhD student just starting to get interested in educational technology, and one thing that Twitter had in great supply was (and still is) people pushing back the boundaries of what tech can do in different contexts.
Somewhere along the way Twitter got really noisy, partly because more people (especially commercial companies) are using it more to talk about stuff that doesn’t interest me, and partly because I now follow 1,200+ people and find I get several tweets a second at peak times, which no-one could be expected to handle. More recently I’ve found my attention drawn to more focussed communities instead of that big old shouting match. I find I’m much more comfortable discussing things and asking questions in small focussed communities because I know who might be interested in what. If I come across an article about a cool new Python library, I’ll geek out about it with my research software engineer friends; if I want advice on an aspect of my emacs setup, I’ll ask a bunch of emacs users. I feel like I’m talking to people who want to hear what I’m saying. Next to that experience, Twitter just feels like standing on a street corner shouting. IRC channels (mostly on Freenode), and similar things like Slack and gitter form the bulk of this for me, along with a growing number of WhatsApp group chats. Although online chat is theoretically a synchronous medium, I find that I can treat it more as “semi-synchronous”: I can have real-time conversations as they arise, but I can also close them and tune back in later to catch up if I want. Now I come to think about it, this is how I used to treat Twitter before the 1,200 follows happened. I also find I visit a handful of forums regularly, mostly of the Reddit link-sharing or StackExchange Q&A type. /r/buildapc was invaluable when I was building my latest box, /r/EarthPorn (very much not NSFW) is just beautiful. I suppose the risk of all this is that I end up reinforcing my own echo chamber. I’m not sure how to deal with that, but I certainly can’t deal with it while also suffering from information overload. Not just certifiable… A couple of months ago, I went to Oxford for an intensive, 2-day course run by Software Carpentry and Data Carpentry for prospective new instructors. I’ve now had confirmation that I’ve completed the checkout procedure so it’s official: I’m now a certified Data Carpentry instructor! As far as I’m aware, the certification process is now combined, so I’m also approved to teach Software Carpentry material too. And of course there’s Library Carpentry too… SSI Fellowship 2020 I’m honoured and excited to be named one of this year’s Software Sustainability Institute Fellows. There’s not much to write about yet because it’s only just started, but I’m looking forward to sharing more with you. In the meantime, you can take a look at the 2020 fellowship announcement and get an idea of my plans from my application video: Talks Here is a selection of talks that I’ve given. ethereum-org-4312 ---- ERC-721 Non-Fungible Token Standard | ethereum.org
ERC-721 Non-Fungible Token Standard Introduction What is a Non-Fungible Token? A Non-Fungible Token (NFT) is used to identify something or someone in a unique way. This type of Token is perfect to be used on platforms that offer collectible items, access keys, lottery tickets, numbered seats for concerts and sports matches, etc. This special type of Token has amazing possibilities, so it deserves a proper Standard; the ERC-721 came to solve that! What is ERC-721? The ERC-721 introduces a standard for NFTs; in other words, this type of Token is unique and can have a different value from another Token in the same Smart Contract, maybe due to its age, rarity or even something else like its visual appearance. Wait, visual? Yes! All NFTs have a uint256 variable called tokenId, so for any ERC-721 Contract, the pair contract address, uint256 tokenId must be globally unique. That said, a dApp can have a "converter" that uses the tokenId as input and outputs an image of something cool, like zombies, weapons, skills or amazing kitties! Prerequisites Accounts Smart Contracts Token standards Body The ERC-721 (Ethereum Request for Comments 721), proposed by William Entriken, Dieter Shirley, Jacob Evans, Nastassia Sachs in January 2018, is a Non-Fungible Token Standard that implements an API for tokens within Smart Contracts. It provides functionality to transfer tokens from one account to another, to get the current token balance of an account, to get the owner of a specific token, and to get the total supply of tokens available on the network. Besides these, it also has some other functionality, like approving that a token owned by one account can be moved by a third-party account. If a Smart Contract implements the following methods and events it can be called an ERC-721 Non-Fungible Token Contract and, once deployed, it will be responsible to keep track of the created tokens on Ethereum.
From EIP-721: Methods 1 function balanceOf(address _owner) external view returns (uint256); 2 function ownerOf(uint256 _tokenId) external view returns (address); 3 function safeTransferFrom(address _from, address _to, uint256 _tokenId, bytes data) external payable; 4 function safeTransferFrom(address _from, address _to, uint256 _tokenId) external payable; 5 function transferFrom(address _from, address _to, uint256 _tokenId) external payable; 6 function approve(address _approved, uint256 _tokenId) external payable; 7 function setApprovalForAll(address _operator, bool _approved) external; 8 function getApproved(uint256 _tokenId) external view returns (address); 9 function isApprovedForAll(address _owner, address _operator) external view returns (bool); 10 Show all Copy Events 1 event Transfer(address indexed _from, address indexed _to, uint256 indexed _tokenId); 2 event Approval(address indexed _owner, address indexed _approved, uint256 indexed _tokenId); 3 event ApprovalForAll(address indexed _owner, address indexed _operator, bool _approved); 4 Copy Examples Let's see how a Standard is so important to make things simple for us to inspect any ERC-721 Token Contract on Ethereum. We just need the Contract Application Binary Interface (ABI) to create an interface to any ERC-721 Token. As you can see below we will use a simplified ABI, to make it a low friction example. Web3.py Example First, make sure you have installed Web3.py Python library: 1$ pip install web3 2 1from web3 import Web3 2from web3.utils.events import get_event_data 3 4 5w3 = Web3(Web3.HTTPProvider("https://cloudflare-eth.com")) 6 7ck_token_addr = "0x06012c8cf97BEaD5deAe237070F9587f8E7A266d" # CryptoKitties Contract 8 9acc_address = "0xb1690C08E213a35Ed9bAb7B318DE14420FB57d8C" # CryptoKitties Sales Auction 10 11# This is a simplified Contract Application Binary Interface (ABI) of an ERC-721 NFT Contract. 
12# It will expose only the methods: balanceOf(address), name(), ownerOf(tokenId), symbol(), totalSupply() 13simplified_abi = [ 14 { 15 'inputs': [{'internalType': 'address', 'name': 'owner', 'type': 'address'}], 16 'name': 'balanceOf', 17 'outputs': [{'internalType': 'uint256', 'name': '', 'type': 'uint256'}], 18 'payable': False, 'stateMutability': 'view', 'type': 'function', 'constant': True 19 }, 20 { 21 'inputs': [], 22 'name': 'name', 23 'outputs': [{'internalType': 'string', 'name': '', 'type': 'string'}], 24 'stateMutability': 'view', 'type': 'function', 'constant': True 25 }, 26 { 27 'inputs': [{'internalType': 'uint256', 'name': 'tokenId', 'type': 'uint256'}], 28 'name': 'ownerOf', 29 'outputs': [{'internalType': 'address', 'name': '', 'type': 'address'}], 30 'payable': False, 'stateMutability': 'view', 'type': 'function', 'constant': True 31 }, 32 { 33 'inputs': [], 34 'name': 'symbol', 35 'outputs': [{'internalType': 'string', 'name': '', 'type': 'string'}], 36 'stateMutability': 'view', 'type': 'function', 'constant': True 37 }, 38 { 39 'inputs': [], 40 'name': 'totalSupply', 41 'outputs': [{'internalType': 'uint256', 'name': '', 'type': 'uint256'}], 42 'stateMutability': 'view', 'type': 'function', 'constant': True 43 }, 44] 45 46ck_extra_abi = [ 47 { 48 'inputs': [], 49 'name': 'pregnantKitties', 50 'outputs': [{'name': '', 'type': 'uint256'}], 51 'payable': False, 'stateMutability': 'view', 'type': 'function', 'constant': True 52 }, 53 { 54 'inputs': [{'name': '_kittyId', 'type': 'uint256'}], 55 'name': 'isPregnant', 56 'outputs': [{'name': '', 'type': 'bool'}], 57 'payable': False, 'stateMutability': 'view', 'type': 'function', 'constant': True 58 } 59] 60 61ck_contract = w3.eth.contract(address=w3.toChecksumAddress(ck_token_addr), abi=simplified_abi+ck_extra_abi) 62name = ck_contract.functions.name().call() 63symbol = ck_contract.functions.symbol().call() 64kitties_auctions = ck_contract.functions.balanceOf(acc_address).call() 65print(f"{name} [{symbol}] NFTs in Auctions: {kitties_auctions}") 66 67pregnant_kitties = ck_contract.functions.pregnantKitties().call() 68print(f"{name} [{symbol}] NFTs Pregnants: {pregnant_kitties}") 69 70# Using the Transfer Event ABI to get info about transferred Kitties. 71tx_event_abi = { 72 'anonymous': False, 73 'inputs': [ 74 {'indexed': False, 'name': 'from', 'type': 'address'}, 75 {'indexed': False, 'name': 'to', 'type': 'address'}, 76 {'indexed': False, 'name': 'tokenId', 'type': 'uint256'}], 77 'name': 'Transfer', 78 'type': 'event' 79} 80 81# We need the event's signature to filter the logs 82event_signature = w3.sha3(text="Transfer(address,address,uint256)").hex() 83 84logs = w3.eth.getLogs({ 85 "fromBlock": w3.eth.blockNumber - 120, 86 "address": w3.toChecksumAddress(ck_token_addr), 87 "topics": [event_signature] 88}) 89 90# Notes: 91# - 120 blocks is the max range for CloudFlare Provider 92# - If you didn't find any Transfer event you can also try to get a tokenId at: 93# https://etherscan.io/address/0x06012c8cf97BEaD5deAe237070F9587f8E7A266d#events 94# Click to expand the event's logs and copy its "tokenId" argument 95 96recent_tx = [get_event_data(tx_event_abi, log)["args"] for log in logs] 97 98kitty_id = recent_tx[0]['tokenId'] # Paste the "tokenId" here from the link above 99is_pregnant = ck_contract.functions.isPregnant(kitty_id).call() 100print(f"{name} [{symbol}] NFTs {kitty_id} is pregnant: {is_pregnant}") 101 Show all Copy CryptoKitties Contract has some interesting Events other than the Standard ones. 
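Before looking at those, one hedged aside on the example above: the simplified tx_event_abi declares from/to/tokenId as not indexed, which is what that particular decoding assumes. Contracts that follow the final standard declare all three Transfer parameters as indexed (see the Events listing earlier on this page), in which case the decoding ABI would look more like this sketch and the values arrive as log topics rather than in the data field.

```python
from web3 import Web3

w3 = Web3(Web3.HTTPProvider("https://cloudflare-eth.com"))

# Event ABI for a contract that implements the final ERC-721 standard,
# where from, to, and tokenId are all indexed.
standard_transfer_abi = {
    'anonymous': False,
    'inputs': [
        {'indexed': True, 'name': 'from', 'type': 'address'},
        {'indexed': True, 'name': 'to', 'type': 'address'},
        {'indexed': True, 'name': 'tokenId', 'type': 'uint256'}],
    'name': 'Transfer',
    'type': 'event'
}

# The topic filter is built exactly as before; only the decoding ABI changes.
event_signature = w3.sha3(text="Transfer(address,address,uint256)").hex()
print(event_signature)
```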
Let's check two of them, Pregnant and Birth. 1# Using the Pregnant and Birth Events ABI to get info about new Kitties. 2ck_extra_events_abi = [ 3 { 4 'anonymous': False, 5 'inputs': [ 6 {'indexed': False, 'name': 'owner', 'type': 'address'}, 7 {'indexed': False, 'name': 'matronId', 'type': 'uint256'}, 8 {'indexed': False, 'name': 'sireId', 'type': 'uint256'}, 9 {'indexed': False, 'name': 'cooldownEndBlock', 'type': 'uint256'}], 10 'name': 'Pregnant', 11 'type': 'event' 12 }, 13 { 14 'anonymous': False, 15 'inputs': [ 16 {'indexed': False, 'name': 'owner', 'type': 'address'}, 17 {'indexed': False, 'name': 'kittyId', 'type': 'uint256'}, 18 {'indexed': False, 'name': 'matronId', 'type': 'uint256'}, 19 {'indexed': False, 'name': 'sireId', 'type': 'uint256'}, 20 {'indexed': False, 'name': 'genes', 'type': 'uint256'}], 21 'name': 'Birth', 22 'type': 'event' 23 }] 24 25# We need the event's signature to filter the logs 26ck_event_signatures = [ 27 w3.sha3(text="Pregnant(address,uint256,uint256,uint256)").hex(), 28 w3.sha3(text="Birth(address,uint256,uint256,uint256,uint256)").hex(), 29] 30 31# Here is a Pregnant Event: 32# - https://etherscan.io/tx/0xc97eb514a41004acc447ac9d0d6a27ea6da305ac8b877dff37e49db42e1f8cef#eventlog 33pregnant_logs = w3.eth.getLogs({ 34 "fromBlock": w3.eth.blockNumber - 120, 35 "address": w3.toChecksumAddress(ck_token_addr), 36 "topics": [ck_extra_events_abi[0]] 37}) 38 39recent_pregnants = [get_event_data(ck_extra_events_abi[0], log)["args"] for log in pregnant_logs] 40 41# Here is a Birth Event: 42# - https://etherscan.io/tx/0x3978028e08a25bb4c44f7877eb3573b9644309c044bf087e335397f16356340a 43birth_logs = w3.eth.getLogs({ 44 "fromBlock": w3.eth.blockNumber - 120, 45 "address": w3.toChecksumAddress(ck_token_addr), 46 "topics": [ck_extra_events_abi[1]] 47}) 48 49recent_births = [get_event_data(ck_extra_events_abi[1], log)["args"] for log in birth_logs] 50 Show all Copy Popular NFTs Etherscan NFT Tracker list the top NFT on Ethereum by tranfers volume. CryptoKitties is a game centered around breedable, collectible, and oh-so-adorable creatures we call CryptoKitties. Sorare is a global fantasy football game where you can collect limited editions collectibles, manage your teams and compete to earn prizes. The Ethereum Name Service (ENS) offers a secure & decentralised way to address resources both on and off the blockchain using simple, human-readable names. Unstoppable Domains is a San Francisco-based company building domains on blockchains. Blockchain domains replace cryptocurrency addresses with human-readable names and can be used to enable censorship-resistant websites. Gods Unchained Cards is a TCG on the Ethereum blockchain that uses NFT's to bring real ownership to in-game assets. Further reading EIP-721: ERC-721 Non-Fungible Token Standard OpenZeppelin - ERC-721 Docs OpenZeppelin - ERC-721 Implementation Back to top ↑ Did this page help answer your question? YesNo PreviousERC-20: Fungible Tokens NextOracles Edit page On this page Introduction Prerequisites Body Examples Popular NFTs Further reading Website last updated: April 27, 2021 Use Ethereum Ethereum Wallets Get ETH Decentralized applications (dapps) Stablecoins Stake ETH Learn What is Ethereum? What is ether (ETH)? 
Community guides and resources History of Ethereum Ethereum Whitepaper Ethereum 2.0 Ethereum Glossary Ethereum Improvement Proposals Developers Get started Documentation Tutorials Learn by coding Set up local environment Developer Resources Ecosystem Ethereum Community Ethereum Foundation Ethereum Foundation Blog Ecosystem Support Program Ecosystem Grant Programs Ethereum Brand Assets Devcon Enterprise Mainnet Ethereum Private Ethereum Enterprise About ethereum.org About us Jobs Contributing Language Support Privacy policy Terms of Use Cookie Policy Contact evergreen-ils-org-1147 ---- Evergreen Downloads – Evergreen ILS Skip to content Evergreen – Open Source Library Software Evergreen – Open Source Library Software About Us Overview Annual Reports F.A.Q. Evergreen Event Code of Conduct Software Freedom Conservancy Project Governance Trademark Policy Documentation Official Documentation Documentation Interest Group Evergreen Roadmap Evergreen Wiki Tabular Release Notes Get Involved! Get Involved! Committees & Interest Groups Communications Mailing Lists IRC Calendar Blog Jobs Proposed Development Projects Merchandise T-shirts and more Conference All Conferences 2021 Evergreen International Online Conference 2020 Evergreen International Online Conference Event Photography Policy Code of Conduct Downloads Evergreen Downloads OpenSRF Downloads Home » Evergreen Downloads Evergreen Downloads Evergreen Downloads Evergreen depends on the following technologies Perl, C, JavaScript, XML, XPath, XSLT, XMPP, OpenSRF, Apache, mod_perl, and PostgreSQL. The latest stable release of a supported Linux distribution is recommended for an Evergreen installation. For Ubuntu, please use the 18.04 64-bit LTS (long term support) Server release. Currently the latest release from the Evergreen 3.6 series is recommended for new installations and stable releases are suggested for production systems. Note: Evergreen servers and staff clients must match. For example, if you are running server version 3.1.0, you should use version 3.1.0 of the staff client. Evergreen 3.2.0+ no longer supports a separate client by default, but building a client remains as an unsupported option. Server & staff client downloads 3.7 Series 3.6 Series 3.5 Series Status stable stable stable Latest Release 3.7.0 3.6.3 3.5.4 Release Date 2021-04-14 2021-04-01 2021-04-01 Release Notes Release Notes Release Notes Release Notes Tabular release notes summary ChangeLog ChangeLog ChangeLog ChangeLog Evergreen Installation Install Instructions Install Instructions Install Instructions Upgrading Notes on upgrading from 3.6.2 TBD TBD OpenSRF Software 3.2.1 (md5) 3.2.1 (md5) 3.2.1 (md5) Server Software Source (md5) Source (md5) Source (md5) Web Staff Client Extension (“Hatch”) Windows Hatch Installer 0.3.2 (md5) – Installation Instructions (Windows & Linux) Git Repository Git Location Git Location Git Location Other Evergreen Staff Clients Staff Client Archive Windows Staff Clients for slightly older stable releases (2.11, 2.10). For Mac and Linux Installing the Evergreen client on Macs Evergreen 2.8.3 Mac Staff Client [.dmg] Evergreen 2.9.0 Mac Staff Client [.dmg] Evergreen 2.12.0 Mac Staff Client [.zip] Evergreen 3.0.0 Mac Staff Client [.zip] Pre-built MAC staff client for Evergreen 2.10 and 2.8 – Provided by SITKA Evergreen in action Visit the Evergreen catalog on our demonstration and development servers, or visit this list of live Evergreen libraries. 
You can also download an Evergreen staff client and point it at the Evergreen demo or development server (see the community servers page for details). Bug Reports Please report any Evergreen bugs/wishlist on Launchpad. To submit a vulnerability please email your report to open-ils-security@esilibrary.com. Evergreen Code Museum Older versions of Evergreen software are available from the Evergreen Code Museum. Source Code Repository A Gitweb instance sits atop the Git repositories for Evergreen and OpenSRF. You can find both repositories at git.evergreen-ils.org. Here is the running change log for the Evergreen code repository: watch us work. Trac sends code commits to two public Evergreen mailing lists: For Evergreen commits, subscribe to open-ils-commits For OpenSRF commits, subscribe to opensrf-commits About Evergreen This is the project site for Evergreen, a highly-scalable software for libraries that helps library patrons find library materials, and helps libraries manage, catalog, and circulate those materials, no matter how large or complex the libraries. © 2008-2020 GPLS and others. Evergreen is open source software, freely licensed under GNU GPLv2 or later. The Evergreen Project is a 501(c)3 nonprofit organization. Community Links Evergreen Bug Tracker Evergreen on Open HUB Evergreen Wiki Git Repositories Join IRC! IRC Logs Official Documentation · © 2021 Evergreen ILS · Powered by · Designed with the Customizr theme · evergreen-ils-org-2740 ---- None evergreen-ils-org-3150 ---- Evergreen 3.7.0 Release Notes Evergreen 3.7.0 Release Notes Table of Contents JavaScript must be enabled in your browser to display the table of contents. 1. Upgrade notes 1.1. Database Upgrade Procedure The database schema upgrade for Evergreen 3.7 has more steps than normal. The general procedure, assuming Evergreen 3.6.2 as the starting point, is: Run the main 3.6.2 ⇒ to 3.7 schema update script from the Evergreen source directory, supplying database connection parameters as needed: psql -f Open-ILS/src/sql/Pg/version-upgrade/3.6.2-3.7.0-upgrade-db.sql 2>&1 | tee 3.6.2-3.7.0-upgrade-db.log Create and ingest search suggestions: Run the following from psql to export the strings to files: \a \t \o title select value from metabib.title_field_entry; \o author select value from metabib.author_field_entry; \o subject select value from metabib.subject_field_entry; \o series select value from metabib.series_field_entry; \o identifier select value from metabib.identifier_field_entry; \o keyword select value from metabib.keyword_field_entry; \o \a \t From the command line, convert the exported words into SQL scripts to load into the database. This step assumes that you are at the top of the Evergreen source tree. $ ./Open-ILS/src/support-scripts/symspell-sideload.pl title > title.sql $ ./Open-ILS/src/support-scripts/symspell-sideload.pl author > author.sql $ ./Open-ILS/src/support-scripts/symspell-sideload.pl subject > subject.sql $ ./Open-ILS/src/support-scripts/symspell-sideload.pl series > series.sql $ ,/Open-ILS/src/support-scripts/symspell-sideload.pl identifier > identifier.sql $ ./Open-ILS/src/support-scripts/symspell-sideload.pl keyword > keyword.sql Back in psql, import the suggestions. This step can take several hours in a large databases, but the \i $FILE.sql` steps can be run in parallel. 
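One way to parallelize the imports (a hypothetical helper, not part of the official upgrade script) is to drive one psql process per class file from Python. The ALTER TABLE ... SET UNLOGGED/TRUNCATE setup and the CLUSTER/REINDEX/SET LOGGED finishing steps from the single-session procedure shown below still run once, before and after the imports; connection details here are assumptions.

```python
import os
import subprocess
from concurrent.futures import ThreadPoolExecutor

# Hypothetical helper: run the six symspell import files in parallel,
# one psql session per file. Adjust the database name/host/user as needed
# (or rely on the standard PG* environment variables).
DB = os.environ.get("PGDATABASE", "evergreen")
FILES = ["identifier.sql", "author.sql", "title.sql",
         "subject.sql", "series.sql", "keyword.sql"]

def run_import(sql_file):
    # Equivalent to running \i sql_file inside psql.
    subprocess.run(["psql", "-d", DB, "-v", "ON_ERROR_STOP=1", "-f", sql_file],
                   check=True)
    return sql_file

# NOTE: run the ALTER TABLE ... SET UNLOGGED and TRUNCATE statements from the
# procedure below first, then these imports, then the CLUSTER/REINDEX/
# SET LOGGED/VACUUM steps.
with ThreadPoolExecutor(max_workers=len(FILES)) as pool:
    for done in pool.map(run_import, FILES):
        print(f"imported {done}")
```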
ALTER TABLE search.symspell_dictionary SET UNLOGGED; TRUNCATE search.symspell_dictionary; \i identifier.sql \i author.sql \i title.sql \i subject.sql \i series.sql \i keyword.sql CLUSTER search.symspell_dictionary USING symspell_dictionary_pkey; REINDEX TABLE search.symspell_dictionary; ALTER TABLE search.symspell_dictionary SET LOGGED; VACUUM ANALYZE search.symspell_dictionary; DROP TABLE search.symspell_dictionary_partial_title; DROP TABLE search.symspell_dictionary_partial_author; DROP TABLE search.symspell_dictionary_partial_subject; DROP TABLE search.symspell_dictionary_partial_series; DROP TABLE search.symspell_dictionary_partial_identifier; DROP TABLE search.symspell_dictionary_partial_keyword; (optional) Apply the new opt-in setting for overdue and predue notices. The following query will set the circ.default_overdue_notices_enabled user setting to true (the default value) for all existing users, ensuring they continue to receive overdue/predue emails. INSERT INTO actor.usr_setting (usr, name, value) SELECT id, 'circ.default_overdue_notices_enabled', 'true' FROM actor.usr; The following query will add the circ.default_overdue_notices_enabled user setting as an opt-in setting for all action triggers that send emails based on a circ being due (unless another opt-in setting is already in use). UPDATE action_trigger.event_definition SET opt_in_setting = 'circ.default_overdue_notices_enabled', usr_field = 'usr' WHERE opt_in_setting IS NULL AND hook = 'checkout.due' AND reactor = 'SendEmail'; Evergreen admins who wish to use the new setting should run both of the above queries. Admins who do not wish to use it, or who are already using a custom opt-in setting of their own, do not need to do anything. Perform a VACUUM ANALYZE of the following tables using psql: VACUUM ANALYZE authority.full_rec; VACUUM ANALYZE authority.simple_heading; VACUUM ANALYZE metabib.identifier_field_entry; VACUUM ANALYZE metabib.combined_identifier_field_entry; VACUUM ANALYZE metabib.title_field_entry; VACUUM ANALYZE metabib.combined_title_field_entry; VACUUM ANALYZE metabib.author_field_entry; VACUUM ANALYZE metabib.combined_author_field_entry; VACUUM ANALYZE metabib.subject_field_entry; VACUUM ANALYZE metabib.combined_subject_field_entry; VACUUM ANALYZE metabib.keyword_field_entry; VACUUM ANALYZE metabib.combined_keyword_field_entry; VACUUM ANALYZE metabib.series_field_entry; VACUUM ANALYZE metabib.combined_series_field_entry; VACUUM ANALYZE metabib.real_full_rec; 1.2. New Seed Data 1.2.1. New Permissions Administer geographic location services (ADMIN_GEOLOCATION_SERVICES) Administer library groups (ADMIN_LIBRARY_GROUPS) Manage batch (subscription) hold events (MANAGE_HOLD_GROUPS) Modify patron SSO settings (SSO_ADMIN) View geographic location services (VIEW_GEOLOCATION_SERVICES) 1.2.2. New Global Flags Block the ability of expired user with the STAFF_LOGIN permission to log into Evergreen (auth.block_expired_staff_login) Offer use of geographic location services in the public catalog (opac.use_geolocation) 1.2.3. New Internal Flags Maximum search result count at which spelling suggestions may be offered (opac.did_you_mean.low_result_threshold) 1.2.4.
New Library Settings Allow both Shibboleth and native OPAC authentication (opac.login.shib_sso.allow_native) Allow renewal request if renewal recipient privileges have expired (circ.renew.expired_patron_allow) Enable Holdings Sort by Geographic Proximity ('opac.holdings_sort_by_geographic_proximity`) Enable Shibboleth SSO for the OPAC (opac.login.shib_sso.enable) Evergreen SSO matchpoint (opac.login.shib_sso.evergreen_matchpoint) Geographic Location Service to use for Addresses (opac.geographic_location_service_for_address) Keyboard distance score weighting in OPAC spelling suggestions (search.symspell.keyboard_distance.weight) Log out of the Shibboleth IdP (opac.login.shib_sso.logout) Minimum required uses of a spelling suggestions that may be offered (search.symspell.min_suggestion_use_threshold) Pg_trgm score weighting in OPAC spelling suggestions (search.symspell.pg_trgm.weight) Randomize group hold order (holds.subscription.randomize) Shibboleth SSO Entity ID (opac.login.shib_sso.entityId) Shibboleth SSO matchpoint (opac.login.shib_sso.shib_matchpoint) Show Geographic Proximity in Miles (opac.geographic_proximity_in_miles) Soundex score weighting in OPAC spelling suggestions (search.symspell.soundex.weight) 1.2.5. New Stock Action/Trigger Event Definitions Hold Group Hold Placed for Patron Email Notification 2. New Features 2.1. Administration 2.1.1. Single Sign On (Shibboleth) Public Catalog integration The Evergreen OPAC can now be used as a Service Provider (SP) in a Single Sign On infrastructure. This allows system administrators to connect the Evergreen OPAC to an identity provider (IdP). Such a scenario offers significant usability improvements to patrons: They can use the same, IdP-provided login screen and credentials that they use for other applications (SPs). If they have already logged into another participating application, when they arrive at the Evergreen OPAC, they can be logged in without needing to enter any credentials at all. Evergreen can be configured to offer a Single Sign-out service, where logging out of the Evergreen OPAC will also log the user out of all other SPs. It can also offer security benefits, if it enables a Shibboleth-enabled Evergreen installation to move away from insecure autogenerated user passwords (e.g. year of birth or last four digits of a phone number). Different Org Units can use different IdPs. This development also supports a mix of Shibboleth and non-Shibboleth libraries. Note that only the OPAC can be integrated with Shibboleth at this time; no such support exists for the staff client, self-check, etc. Also note that this development does not include automatic provisioning of accounts. At this time, matching accounts must already exist in Evergreen for a patron to successfully authenticate into the OPAC via Single Sign On. Installation Installing and configuring Shibboleth support is a complex project. In broad strokes, the process includes: Installing Shibboleth and the Shibboleth Apache module (apt install libapache2-mod-shib2 on Debian and Ubuntu) Configuring Shibboleth, including: Setting up a certificate assigning an Entity ID getting metadata about the IdP from the IdP (perhaps "locally maintained metadata", where an XML file from the IdP is copied into place on your Evergreen server) Understanding what attributes the IdP will provide about your users, and describing them in the attribute-map.xml file. Providing your Entity ID, information about possible bindings, and any other requested information to the IdP administrator. 
Much of this information will be available at http://YOUR_EVERGREEN_DOMAIN/Shibboleth.sso/Metadata Configuring Apache, including: Enabling Shibboleth authentication in the eg_vhost.conf file (Optional) Using the new sso_loc Apache variable to identify which org unit should be used as the context location when fetching Shibboleth-related library settings. As a user with the new SSO_ADMIN permission, configure Evergreen using the Library Settings Editor, including: Enable Shibboleth SSO for the OPAC (Optional) Configure whether you will use SSO exclusively, or offer patrons a choice between SSO and standard Evergreen authentication (Optional) Configure whether or not you will use Single Log Out (Optional) In scenarios where a single Evergreen installation is connected to multiple IdPs, assign org units to the relevant IdPs, referenced by the IdP’s Entity Id. Of the attributes defined in attribute-map.xml, configure which one should be used to match users in the Evergreen database. This defaults to uid. For the attribute you chose in the previous step, configure which Evergreen field it should match against. Options are usrname (default), barcode, and email. This video on the SAML protocol can be very helpful for introducing the basic concepts used in the installation and configuration processes. 2.2. Architecture 2.2.1. Block Login of Expired Staff Accounts Evergreen now has the ability to prevent staff users whose accounts have expired from logging in. This is controlled by the new global flag "auth.block_expired_staff_login", which is not enabled by default. If that flag is turned on, accounts that have the STAFF_LOGIN permission and whose expiration date is in the past are prevented from logging into any Evergreen interface, including the staff client, the public catalog, and SIP2. It should be noted that ordinary patrons are allowed to log into the public catalog if their circulation privileges have expired. This feature prevents expired staff users from logging into the public catalog (and all other Evergreen interfaces and APIs) outright in order to prevent them from getting into the staff interface anyway by creative use of Evergreen’s authentication APIs. Evergreen admins are advised to check the expiration status of staff accounts before turning on the global flag, as otherwise it is possible to lock staff users out unexpectedly. The following SQL query will identify expired but otherwise un-deleted users that would be blocked by turning on the flag: SELECT DISTINCT usrname, expire_date FROM actor.usr au, permission.usr_has_perm_at_all(id, 'STAFF_LOGIN') WHERE active AND NOT deleted AND NOT barred AND expire_date < NOW() Note that this query can take a long time to run in large databases given the general way that it checks for users that have the STAFF_LOGIN permission. Replacing the use of permission.usr_has_perm_at_all() with a query on expired users with profiles known to have the STAFF_LOGIN permission will be much faster. 2.2.2. Migration From GIST to GIN Indexes for Full Text Search Evergreen now uses GIN indexes for full text search in PostgreSQL. GIN indexes offer better performance than GIST. For more information on the differences in the two index types, please refer to the PostgreSQL documentation. An upgrade script is provided as part of this migration. If you upgrade normally from a previous release of Evergreen, this upgrade script should run as part of the upgrade process. 
The migration script recommends that you run a VACUUM ANALYZE in PostgreSQL on the tables that had the indexes changed. The migration process does not do this for you, so you should do it as soon as is convenient after the upgrade. Updating Your Own Indexes If you have added your own full text indexes of type GIST, and you wish to migrate them to GIN, you may do so. The following query, when run in your Evergreen databsase after the migration from GIST to GIN, will identify the remaining GIST indexes in your database: SELECT schemaname, indexname FROM pg_indexes WHERE indexdef ~* 'gist'; If the above query produces output, you can run the next query to output a SQL script to migrate the remaining indexes from GIST to GIN: SELECT 'DROP INDEX ' || schemaname || '.' || indexname || E';\n' || REGEXP_REPLACE(indexdef, 'gist', 'gin', 'i') || E';\n' || 'VACUUM ANAlYZE ' || schemaname || '.' || tablename || ';' FROM pg_indexes WHERE indexdef ~* 'gist'; 2.2.3. Removal of Custom Dojo Build Evergreen had a method of making a custom build of the Dojo JavaScript library. Following this procedure could improve the load times for the OPAC and other interfaces that use Dojo. However, very few sites took advantage of this process or even knew of its existence. As a part of the process, an openils_dojo.js file was built and installed along with the other Dojo files. Evergreen had many references to load this optional file. For the majority of sites that did not use this custom Dojo process, this file did not exist. Browsers would spend time and resources requesting this nonexistent file. This situation also contributed noise to the Apache logs with the 404 errors from these requests. In keeping with the goal of eliminating Dojo from Evergreen, all references to openils_dojo.js have been removed from the OPAC and other files. The profile script required to make the custom Dojo build has also been removed. 2.3. Cataloging 2.3.1. Czech language records in sample data This release adds 7 Czech-language MARC records to the sample data set (also known as Concerto data set). 2.3.2. Publisher Catalog Display Includes 264 Tag Publisher values are now extracted for display from tags 260 OR 264. Upgrade Notes A partial reingest is required to extract the new publisher data for display. This query may be long-running. WITH affected_bibs AS ( SELECT DISTINCT(bre.id) AS id FROM biblio.record_entry bre JOIN metabib.real_full_rec mrfr ON (mrfr.record = bre.id AND mrfr.tag = '264') WHERE NOT bre.deleted ) SELECT metabib.reingest_metabib_field_entries(id, TRUE, FALSE, TRUE, TRUE) FROM affected_bibs; 2.4. Circulation 2.4.1. Hold Groups This feature allows staff to add multiple users to a named hold group bucket and place title-level holds for a record for that entire set of users. Users can be added to such a hold group bucket from either the patron search result interface, via the Add to Bucket dropdown, or through a dedicated Hold Group interface available from the Circulation menu. Adding new patrons to a hold group bucket will require staff have the PLACE_HOLD permission. Holds can be placed for the users in a hold group bucket either directly from the normal staff-place hold interface in the embedded OPAC, or by supplying the record ID within the hold group bucket interface. In the latter case, the list of users for which a hold was attempted but failed to be placed can be downloaded by staff in order to address any placement issues. 
Placing a hold group bucket hold will require that staff have the MANAGE_HOLD_GROUPS permission, which is new with this development. In the event of a mistaken hold group hold, staff with the MANAGE_HOLD_GROUPS permission will have the ability to cancel all unfulfilled holds created as part of a hold group event. A link to the title’s hold interface is available from the list of hold group events in the dedicated hold group interface. 2.4.2. Scan Item as Missing Pieces Angular Port The Scan Item As Missing Pieces interface is now an Angular interface. The functionality is the same, but the interface displays more details on the item in question (title/author/callnum) before proceeding with the missing pieces process. 2.4.3. Opt-In Setting for Overdue and Predue Emails The "Receive Overdue and Courtesy Emails" user setting permits users to control whether they receive email notifications about overdue items. To use the setting, modify any action trigger event definitions which send emails about overdue items, setting the "Opt In Setting" to "circ.default_overdue_notices_enabled" and the "User Field" to "usr". You can accomplish this by running the following query in your database: UPDATE action_trigger.event_definition SET opt_in_setting = 'circ.default_overdue_notices_enabled', usr_field = 'usr' WHERE opt_in_setting IS NULL AND hook = 'checkout.due' AND reactor = 'SendEmail'; Once this is done, the patron registration screen in the staff client will show a "Receive Overdue and Courtesy Emails" checkbox, which will be checked by default. To ensure that existing patrons continue to receive email notifications, you will need to add the user setting to their accounts, which you can do by running the following query in your database: INSERT INTO actor.usr_setting (usr, name, value) SELECT id, 'circ.default_overdue_notices_enabled', 'true' FROM actor.usr; 2.4.4. Allow Circulation Renewal for Expired Patrons The "Allow renewal request if renewal recipient privileges have expired" organizational unit setting can be set to true to permit expired patrons to renew circulations. Allowing renewals for expired patrons reduces the number of auto-renewal failures and assumes that a patron with items out eligible for renewals has not been expired for very long and that such patrons are likely to renew their privileges in a timely manner. The setting is referenced based on the current circulation library for the renewal. It takes into account the global flags for "Circ: Use original circulation library on desk renewal instead of the workstation library" and "Circ: Use original circulation library on opac renewal instead of user home library." 2.5. OPAC 2.5.1. Consistent Ordering for Carousels Carousel ordering is now stable and predictable: Newly Cataloged Item and Newest Items by Shelving Location carousels are ordered from most recently cataloged to least recently cataloged. Recently Returned Item carousels are ordered from most recently returned to least recently returned. Top Circulated Items carousels are ordered from most circulated to least circulated. Manual carousels (as of now, without the ability to adjust the position of items) are in the order they are added to the backing bucket. Emptying and refilling the bucket allows reordering. 2.5.2. Default Public Catalog to the Bootstrap Skin The public catalog now defaults to the Bootstrap skin rather than the legacy TPAC skin.
Bootstrap is now the default in order to encourage more testing, but users should be aware of the following issues; certain specific functionality is available only in the TPAC skin. The TPAC skin remains available for use, but current Evergreen users should start actively considering migrating to the Bootstrap skin. In order to continue to use the TPAC skin, comment out the following line in eg_vhost.conf PerlAddVar OILSWebTemplatePath "@localstatedir@/templates-bootstrap" # Comment this line out to use the legacy TPAC 2.5.3. Did You Mean? Single word search suggestions This feature is the first in the series to add native search suggestions to the Evergreen search logic. A significant portion of the code is dedicated to infrastructure that will be used in later enhancements to the functionality. Overview When searching the public or staff catalog in a single search class (title, author, subject, series, identifier, or keyword) with a single search term users can be presented with alternate search terms. Depending on how the instance has been configured, suggestions may be provided for only misspelled words (as defined by existence in the bibliographic corpus), terms that are spelled properly but occur very few times, or on every single-term search. Settings The following new library settings control the behavior of the suggestions: Maximum search result count at which spelling suggestions may be offered Minimum required uses of a spelling suggestions that may be offered Maximum number of spelling suggestions that may be offered Pg_trgm score weighting in OPAC spelling suggestions Soundex score weighting in OPAC spelling suggestions QWERTY Keyboard similarity score weighting in OPAC spelling suggestions There are also two new internal flags: symspell.prefix_length symspell.max_edit_distance Upgrading This feature requires the addition of new Perl module dependencies. Please run the app server and database server dependency Makefiles before applying the database and code updates. At the end of the database upgrade script, the administrator is presented with a set of instructions necessary to precompute the suggestion dictionary based on the current bibliographic database. The first half of this procedure can be started even before the upgrade begins, as soon as the Evergreen database is no longer accessible to users that might cause changes to bibliographic records. For very large instances, this dictionary generation can take several hours and needs to be run on a server with significant RAM and CPU resources. Please look at the upgrade script before beginning an upgrade and plan this dictionary creation as part of the overall upgrade procedure. Given a server, such as a database server with 64G of RAM, you should be able to run all six of the shell commands in parallel in screen sessions or with a tool such as GNU parallel. These commands invoke a script that will generate a class-specific sub-set of the dictionary, and can be used to recreate the dictionary if necessary in the future. 2.5.4. Sort Holdings by Geographical Proximity This functionality integrates 3rd party geographic lookup services to allow patrons to enter an address on the record details page in the OPAC and sort the holdings for that record based on proximity of their circulating libraries to the entered address. To support this, latitude and longitude coordinates may be associated with each org unit. Care is given to not log or leak patron provided addresses or the context in which they are used. 
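As an illustration of the underlying idea only (not Evergreen's actual implementation, which is in Perl and uses the geocoding services listed below), a minimal sketch of sorting holding libraries by great-circle distance from a geocoded patron address might look like this; the library names and coordinates are made up.

```python
from math import asin, cos, radians, sin, sqrt

def haversine_km(lat1, lon1, lat2, lon2):
    # Great-circle distance between two (lat, lon) points, in kilometers.
    lat1, lon1, lat2, lon2 = map(radians, (lat1, lon1, lat2, lon2))
    a = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(a))

# Hypothetical org units with latitude/longitude stored on their physical
# addresses; the patron's address is assumed to have been geocoded already.
org_units = [
    {"name": "Branch A", "lat": 45.52, "lon": -122.68},
    {"name": "Branch B", "lat": 45.60, "lon": -122.50},
]
patron = (45.50, -122.60)

MILES_PER_KM = 0.621371  # used when "Show Geographic Proximity in Miles" is on

for ou in sorted(org_units,
                 key=lambda o: haversine_km(patron[0], patron[1], o["lat"], o["lon"])):
    km = haversine_km(patron[0], patron[1], ou["lat"], ou["lon"])
    print(f"{ou['name']}: {km:.1f} km ({km * MILES_PER_KM:.1f} mi)")
```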
Requires the following Perl modules: Geo::Coder::Free, Geo::Coder::Google, and Geo::Coder::OSM Configuration instructions: Register an account with a third party geographic location service and copy the API Key. Configure the Geographic Location Service (Server Administration > Geographic Location Service > New Geographic Location Service). Enable Global Flag by navigating to Server Administration → Global Flags and locating the opac.use_geolocation flag. (Any entry in the Value field will be ignored.) Enable Library Setting: Enable Holdings Sort by Geographic Proximity (set to True). Enable Library Setting: Geographic Location Service to use for Addresses (use the value from the Name field entered in the Geographic Location Services Configuration entry). Enable Library Setting: Show Geographic Proximity in Miles (if not set, it will default to kilometers). Set the geographic coordinates for each location by navigating to Server Administration > Organizational Units. Select the org unit, switch to the Physical Address subtab and either manually enter Latitude and Longitude values or use the Get Coordinate button. Two new permissions, VIEW_GEOLOCATION_SERVICES and ADMIN_GEOLOCATION_SERVICES, control viewing and editing values in the Geolocation Location Services interface. They are added to the System Administrator and Global Administrator permissions groups by default. 2.5.5. Library Groups The Library Groups search feature revives a longstanding internal concept in Evergreen called “Lassos,” which allows an administrator to define a group of organizational units for searching outside of the standard organizational unit hierarchy. Use case examples include creating a group of law or science libraries within a university consortium, or grouping all school libraries together within a mixed school/public library consortium. Searches can be restricted to a particular Library Group from the library selector in the public catalog basic search page and from the new "Where" selector on the advanced search page. Restricting catalog searches by Library Group is available only in the public catalog and "traditional" staff catalog; it is not available in the Angular staff catalog. This feature adds a new permission, ADMIN_LIBRARY_GROUPS, that allows updating Library Groups and Library Group Maps. This permission is not associated with any profiles by default, and replaces the CREATE_LASSO, UPDATE_LASSO, and DELETE_LASSO permissions. To define new library groups, use the Server Administration Library Groups and Library Group Maps pages. An autogen and a reload of Apache should be performed after making changes to Library Groups. 2.5.6. Easier Styling of Public Catalog Logo and Cart Images Evergreen now has IDs associated with logos and cart images in the TPAC and Bootstrap OPACs to aid in customization. Images are as follows: small Evergreen logo in navigation bar is topnav_logo_image the large Evergreen logo in the center of the splash page of the TPAC is homesearch_main_logo_image the cart icon is cart_icon_image the small logo in the footer is footer_logo_image The Bootstrap OPAC does not have a homesearch logo icon as it is added in the background by CSS and can be directly styled through the CSS. 2.5.7. Easier TPAC Customization via colors.tt2 Twelve new colors for TPAC have been added to the colors.tt2 file as well as having corresponding changes to the style.css.tt2 file. These use descriptive rather than abstract names. 
These changes help avoid situations where unreadable values are placed on top of each other and where different values are wanted for elements that only referenced a single color previously. Guidelines are below for setting values that correspond to the previous values used in the colors.tt2 file. For more diverse customizations, the OPAC should be reviewed before a production load. footer is used for the background color of the footer. It replaces primary. footer_text sets the text color in the footer and replaces text_invert header sets the background of the header and replaces primary_fade header_text sets the color of text in the header and replaces text_invert header_links_bar sets the background of the links bar that separates the header on the front page of the opac and replaces background_invert header_links_text sets the text on the links bar and replaces text_invert header_links_text_hover sets the hover text color on the links bar and replaces primary opac_button sets the background color of the My Opac button and replaces control opac_button_text explicitly sets the text color on the My Opac button opac_button_hover sets the background color of the My Opac button when the mouse is hovering over it and replaces primary opac_button_hover_text sets the text color of the My Opac button when the mouse is hovering over it and replaces text_invert Note that this patch is primarily meant for users who wish to continue using TPAC rather than the Bootstrap skin for a while; new Evergreen users are advised to use the now-default Bootstrap skin. 2.5.8. Configurable Read More Accordion for OPAC Search and Record View (TPAC) Read More Button Public catalog record fields (in the TPAC skin only) now truncate themselves based on a configurable number of characters. The full field may be displayed upon hitting a (Read More) link, which will then toggle into a (Read Less) link to re-truncate the field. Configuration Open-ILS/src/templates/opac/parts/config.tt2 contains two new configuration variables: truncate_contents (default: 1) contents_truncate_length (default: 50). Setting truncate_contents to 0 will disable the read more functionality. The variable contents_truncate_length corresponds to the number of characters to display before truncating the text. If contents_truncate_length is removed, it will default to 100. Additional configuration for note fields can be made in Open-ILS/src/templates/opac/parts/record/contents.tt2, allowing a trunc_length variable for each individual type of note, which will override contents_truncate_length for that specific type of note. Adding Read More Functionality to further fields To add Read More functionality to any additional fields, you may use the macro accordion(), defined in misc_util.tt2. It can take three variables: str, trunc_length, and element. str corresponds to the string you want to apply it to, trunc_length (optional) will override contents_truncate_length if supplied, and element (optional) provides an alternative HTML element to look at for the truncation process (useful in situations such as the Authors and Cast fields, where each field is processed individually, but needs to be treated as a single field). 2.6. Reports 2.6.1. Reports Scheduler Improvements Previously, the reports scheduler allowed duplicated reports under certain circumstances. A uniqueness constraint now disallows this without adversely affecting the reports process. 3.
Miscellaneous The Create Reservation form in the Booking module now includes an option to search for the patron by attributes other than just their barcode. (Bug 1816655) The form to add a user to a Course now includes an option to search for the patron by attributes other than just their barcode. (Bug 1907921) For consistency with the menu action Cataloging ⇒ Retrieve Record by TCN Value, the staff catalog Numeric Search ⇒ TCN search now includes deleted bib records. (Bug 1881650) Add a new command-line script, overdrive-api-checker.pl, for testing the OverDrive API. (Bug 1696825) The Shelving Location Groups editor is ported to Angular. (Bug 1852321) The staff catalog now has the ability to add all search results (up to 1,000 titles) to the basket in one fell swoop. (Bug 1885179) Add All Videos as a search format. (Bug 1917826) Server-side print templates can now have print contexts set. (Bug 1891550) Add ability to set the print context for a print template to "No-Print" to specify, well, that a given receipt should never be printed. (Bug 1891550) Add Check Number as an available column to the Bill History grids. (Bug 1705693) Adds a new control to the item table in the TPAC public catalog only to specify that only items that are available should be displayed. (Bug 1853006) Adds warning before deleting bib records with holds (Bug 1398107) Library scope on (Angular) Administration pages now defaults to workstation location rather than consortium (Bug 173322) Pending users now set last four digits of phone number as password when library setting is enabled (Bug 1887852) 4. Acknowledgments The Evergreen project would like to acknowledge the following organizations that commissioned developments in this release of Evergreen: BC Libraries Cooperative Community Library (Sunbury) Consortium of Ohio Libraries (COOL) Evergreen Community Development Initiative Evergreen Indiana Georgia PINES Linn-Benton Community College Pennsylvania Integrated Library System (PaILS) We would also like to thank the following individuals who contributed code, translations, documentation, patches, and tests to this release of Evergreen: John Amundson Zavier Banks Felicia Beaudry Jason Boyer Dan Briem Andrea Buntz Neiman Christine Burns Galen Charlton Garry Collum Eva Cerniňáková Dawn Dale Elizabeth Davis Jeff Davis Martha Driscoll Bill Erickson Jason Etheridge Ruth Frasur Blake Graham-Henderson Katie Greenleaf Martin Rogan Hamby Elaine Hardy Kyle Huckins Angela Kilsdonk Tiffany Little Mary Llewellyn Terran McCanna Chauncey Montgomery Gina Monti Michele Morgan Carmen Oleskevich Jennifer Pringle Mike Risher Mike Rylander Jane Sandberg Chris Sharp Ben Shum Remington Steed Jason Stephenson Jennifer Weston Beth Willis We also thank the following organizations whose employees contributed patches: BC Libraries Cooperative Calvin College Catalyte CW MARS Equinox Open Library Initiative Georgia Public Library Service Kenton County Public Library King County Library System Linn-Benton Community College MOBIUS NOBLE Westchester Library System We regret any omissions. If a contributor has been inadvertently missed, please open a bug at http://bugs.launchpad.net/evergreen/ with a correction. Last updated 2021-04-14 15:04:29 EDT evergreen-ils-org-4101 ---- Evergreen 3.7.0 Release Notes Evergreen 3.7.0 Release Notes Table of Contents JavaScript must be enabled in your browser to display the table of contents. 1. Upgrade notes 1.1. Database Upgrade Procedure The database schema upgrade for Evergreen 3.7 has more steps than normal. 
The general procedure, assuming Evergreen 3.6.2 as the starting point, is: Run the main 3.6.2 ⇒ to 3.7 schema update script from the Evergreen source directory, supplying database connection parameters as needed: psql -f Open-ILS/src/sql/Pg/version-upgrade/3.6.2-3.7.0-upgrade-db.sql 2>&1 | tee 3.6.2-3.7.0-upgrade-db.log Create and ingest search suggestions: Run the following from psql to export the strings to files: \a \t \o title select value from metabib.title_field_entry; \o author select value from metabib.author_field_entry; \o subject select value from metabib.subject_field_entry; \o series select value from metabib.series_field_entry; \o identifier select value from metabib.identifier_field_entry; \o keyword select value from metabib.keyword_field_entry; \o \a \t From the command line, convert the exported words into SQL scripts to load into the database. This step assumes that you are at the top of the Evergreen source tree. $ ./Open-ILS/src/support-scripts/symspell-sideload.pl title > title.sql $ ./Open-ILS/src/support-scripts/symspell-sideload.pl author > author.sql $ ./Open-ILS/src/support-scripts/symspell-sideload.pl subject > subject.sql $ ./Open-ILS/src/support-scripts/symspell-sideload.pl series > series.sql $ ,/Open-ILS/src/support-scripts/symspell-sideload.pl identifier > identifier.sql $ ./Open-ILS/src/support-scripts/symspell-sideload.pl keyword > keyword.sql Back in psql, import the suggestions. This step can take several hours in a large databases, but the \i $FILE.sql` steps can be run in parallel. ALTER TABLE search.symspell_dictionary SET UNLOGGED; TRUNCATE search.symspell_dictionary; \i identifier.sql \i author.sql \i title.sql \i subject.sql \i series.sql \i keyword.sql CLUSTER search.symspell_dictionary USING symspell_dictionary_pkey; REINDEX TABLE search.symspell_dictionary; ALTER TABLE search.symspell_dictionary SET LOGGED; VACUUM ANALYZE search.symspell_dictionary; DROP TABLE search.symspell_dictionary_partial_title; DROP TABLE search.symspell_dictionary_partial_author; DROP TABLE search.symspell_dictionary_partial_subject; DROP TABLE search.symspell_dictionary_partial_series; DROP TABLE search.symspell_dictionary_partial_identifier; DROP TABLE search.symspell_dictionary_partial_keyword; (optional) Apply the new opt-in setting for overdue and preduce notices. The following query will set the circ.default_overdue_notices_enabled user setting to true (the default value) for all existing users, ensuring they continue to receive overdue/predue emails. INSERT INTO actor.usr_setting (usr, name, value) SELECT id, circ.default_overdue_notices_enabled, true FROM actor.usr; The following query will add the circ.default_overdue_notices_enabled user setting as an opt-in setting for all action triggers that send emails based on a circ being due (unless another opt-in setting is already in use). UPDATE action_trigger.event_definition SET opt_in_setting = circ.default_overdue_notices_enabled, usr_field = usr WHERE opt_in_setting IS NULL AND hook = checkout.due AND reactor = SendEmail; Evergreen admins who wish to use the new setting should run both of the above queries. Admins who do not wish to use it, or who are already using a custom opt-in setting of their own, do not need to do anything. 
Perform a VACUUM ANALYZE of the following tables using psql: VACUUM ANALYZE authority.full_rec; VACUUM ANALYZE authority.simple_heading; VACUUM ANALYZE metabib.identifier_field_entry; VACUUM ANALYZE metabib.combined_identifier_field_entry; VACUUM ANALYZE metabib.title_field_entry; VACUUM ANALYZE metabib.combined_title_field_entry; VACUUM ANALYZE metabib.author_field_entry; VACUUM ANALYZE metabib.combined_author_field_entry; VACUUM ANALYZE metabib.subject_field_entry; VACUUM ANALYZE metabib.combined_subject_field_entry; VACUUM ANALYZE metabib.keyword_field_entry; VACUUM ANALYZE metabib.combined_keyword_field_entry; VACUUM ANALYZE metabib.series_field_entry; VACUUM ANALYZE metabib.combined_series_field_entry; VACUUM ANALYZE metabib.real_full_rec; 1.2. New Seed Data 1.2.1. New Permissions Administer geographic location services (ADMIN_GEOLOCATION_SERVICES) Administer library groups (ADMIN_LIBRARY_GROUPS) Manage batch (subscription) hold events (MANAGE_HOLD_GROUPS) Modify patron SSO settings (SSO_ADMIN) View geographic location services (VIEW_GEOLOCATION_SERVICES) 1.2.2. New Global Flags Block the ability of expired user with the STAFF_LOGIN permission to log into Evergreen (auth.block_expired_staff_login) Offer use of geographic location services in the public catalog (opac.use_geolocation) 1.2.3. New Internal Flags Maximum search result count at which spelling suggestions may be offered (opac.did_you_mean.low_result_threshold) 1.2.4. New Library Settings Allow both Shibboleth and native OPAC authentication (opac.login.shib_sso.allow_native) Allow renewal request if renewal recipient privileges have expired (circ.renew.expired_patron_allow) Enable Holdings Sort by Geographic Proximity ('opac.holdings_sort_by_geographic_proximity`) Enable Shibboleth SSO for the OPAC (opac.login.shib_sso.enable) Evergreen SSO matchpoint (opac.login.shib_sso.evergreen_matchpoint) Geographic Location Service to use for Addresses (opac.geographic_location_service_for_address) Keyboard distance score weighting in OPAC spelling suggestions (search.symspell.keyboard_distance.weight) Log out of the Shibboleth IdP (opac.login.shib_sso.logout) Minimum required uses of a spelling suggestions that may be offered (search.symspell.min_suggestion_use_threshold) Pg_trgm score weighting in OPAC spelling suggestions (search.symspell.pg_trgm.weight) Randomize group hold order (holds.subscription.randomize) Shibboleth SSO Entity ID (opac.login.shib_sso.entityId) Shibboleth SSO matchpoint (opac.login.shib_sso.shib_matchpoint) Show Geographic Proximity in Miles (opac.geographic_proximity_in_miles) Soundex score weighting in OPAC spelling suggestions (search.symspell.soundex.weight) 1.2.5. New Stock Action/Trigger Event Definitions Hold Group Hold Placed for Patron Email Notification 2. New Features 2.1. Administration 2.1.1. Single Sign On (Shibboleth) Public Catalog integration The Evergreen OPAC can now be used as a Service Provider (SP) in a Single Sign On infrastructure. This allows system administrators to connect the Evergreen OPAC to an identity provider (IdP). Such a scenario offers significant usability improvements to patrons: They can use the same, IdP-provided login screen and credentials that they use for other applications (SPs). If they have already logged into another participating application, when they arrive at the Evergreen OPAC, they can be logged in without needing to enter any credentials at all. 
Evergreen can be configured to offer a Single Sign-out service, where logging out of the Evergreen OPAC will also log the user out of all other SPs. It can also offer security benefits, if it enables a Shibboleth-enabled Evergreen installation to move away from insecure autogenerated user passwords (e.g. year of birth or last four digits of a phone number). Different Org Units can use different IdPs. This development also supports a mix of Shibboleth and non-Shibboleth libraries. Note that only the OPAC can be integrated with Shibboleth at this time; no such support exists for the staff client, self-check, etc. Also note that this development does not include automatic provisioning of accounts. At this time, matching accounts must already exist in Evergreen for a patron to successfully authenticate into the OPAC via Single Sign On. Installation Installing and configuring Shibboleth support is a complex project. In broad strokes, the process includes: Installing Shibboleth and the Shibboleth Apache module (apt install libapache2-mod-shib2 on Debian and Ubuntu) Configuring Shibboleth, including: Setting up a certificate assigning an Entity ID getting metadata about the IdP from the IdP (perhaps "locally maintained metadata", where an XML file from the IdP is copied into place on your Evergreen server) Understanding what attributes the IdP will provide about your users, and describing them in the attribute-map.xml file. Providing your Entity ID, information about possible bindings, and any other requested information to the IdP administrator. Much of this information will be available at http://YOUR_EVERGREEN_DOMAIN/Shibboleth.sso/Metadata Configuring Apache, including: Enabling Shibboleth authentication in the eg_vhost.conf file (Optional) Using the new sso_loc Apache variable to identify which org unit should be used as the context location when fetching Shibboleth-related library settings. As a user with the new SSO_ADMIN permission, configure Evergreen using the Library Settings Editor, including: Enable Shibboleth SSO for the OPAC (Optional) Configure whether you will use SSO exclusively, or offer patrons a choice between SSO and standard Evergreen authentication (Optional) Configure whether or not you will use Single Log Out (Optional) In scenarios where a single Evergreen installation is connected to multiple IdPs, assign org units to the relevant IdPs, referenced by the IdP’s Entity Id. Of the attributes defined in attribute-map.xml, configure which one should be used to match users in the Evergreen database. This defaults to uid. For the attribute you chose in the previous step, configure which Evergreen field it should match against. Options are usrname (default), barcode, and email. This video on the SAML protocol can be very helpful for introducing the basic concepts used in the installation and configuration processes. 2.2. Architecture 2.2.1. Block Login of Expired Staff Accounts Evergreen now has the ability to prevent staff users whose accounts have expired from logging in. This is controlled by the new global flag "auth.block_expired_staff_login", which is not enabled by default. If that flag is turned on, accounts that have the STAFF_LOGIN permission and whose expiration date is in the past are prevented from logging into any Evergreen interface, including the staff client, the public catalog, and SIP2. It should be noted that ordinary patrons are allowed to log into the public catalog if their circulation privileges have expired. 
This feature prevents expired staff users from logging into the public catalog (and all other Evergreen interfaces and APIs) outright in order to prevent them from getting into the staff interface anyway by creative use of Evergreen’s authentication APIs. Evergreen admins are advised to check the expiration status of staff accounts before turning on the global flag, as otherwise it is possible to lock staff users out unexpectedly. The following SQL query will identify expired but otherwise un-deleted users that would be blocked by turning on the flag: SELECT DISTINCT usrname, expire_date FROM actor.usr au, permission.usr_has_perm_at_all(id, 'STAFF_LOGIN') WHERE active AND NOT deleted AND NOT barred AND expire_date < NOW() Note that this query can take a long time to run in large databases given the general way that it checks for users that have the STAFF_LOGIN permission. Replacing the use of permission.usr_has_perm_at_all() with a query on expired users with profiles known to have the STAFF_LOGIN permission will be much faster. 2.2.2. Migration From GIST to GIN Indexes for Full Text Search Evergreen now uses GIN indexes for full text search in PostgreSQL. GIN indexes offer better performance than GIST. For more information on the differences in the two index types, please refer to the PostgreSQL documentation. An upgrade script is provided as part of this migration. If you upgrade normally from a previous release of Evergreen, this upgrade script should run as part of the upgrade process. The migration script recommends that you run a VACUUM ANALYZE in PostgreSQL on the tables that had the indexes changed. The migration process does not do this for you, so you should do it as soon as is convenient after the upgrade. Updating Your Own Indexes If you have added your own full text indexes of type GIST, and you wish to migrate them to GIN, you may do so. The following query, when run in your Evergreen databsase after the migration from GIST to GIN, will identify the remaining GIST indexes in your database: SELECT schemaname, indexname FROM pg_indexes WHERE indexdef ~* 'gist'; If the above query produces output, you can run the next query to output a SQL script to migrate the remaining indexes from GIST to GIN: SELECT 'DROP INDEX ' || schemaname || '.' || indexname || E';\n' || REGEXP_REPLACE(indexdef, 'gist', 'gin', 'i') || E';\n' || 'VACUUM ANAlYZE ' || schemaname || '.' || tablename || ';' FROM pg_indexes WHERE indexdef ~* 'gist'; 2.2.3. Removal of Custom Dojo Build Evergreen had a method of making a custom build of the Dojo JavaScript library. Following this procedure could improve the load times for the OPAC and other interfaces that use Dojo. However, very few sites took advantage of this process or even knew of its existence. As a part of the process, an openils_dojo.js file was built and installed along with the other Dojo files. Evergreen had many references to load this optional file. For the majority of sites that did not use this custom Dojo process, this file did not exist. Browsers would spend time and resources requesting this nonexistent file. This situation also contributed noise to the Apache logs with the 404 errors from these requests. In keeping with the goal of eliminating Dojo from Evergreen, all references to openils_dojo.js have been removed from the OPAC and other files. The profile script required to make the custom Dojo build has also been removed. 2.3. Cataloging 2.3.1. 
2.2.2. Migration From GIST to GIN Indexes for Full Text Search

Evergreen now uses GIN indexes for full text search in PostgreSQL. GIN indexes offer better performance than GIST. For more information on the differences in the two index types, please refer to the PostgreSQL documentation.

An upgrade script is provided as part of this migration. If you upgrade normally from a previous release of Evergreen, this upgrade script should run as part of the upgrade process. The migration script recommends that you run a VACUUM ANALYZE in PostgreSQL on the tables that had the indexes changed. The migration process does not do this for you, so you should do it as soon as is convenient after the upgrade.

Updating Your Own Indexes

If you have added your own full text indexes of type GIST, and you wish to migrate them to GIN, you may do so. The following query, when run in your Evergreen database after the migration from GIST to GIN, will identify the remaining GIST indexes in your database:

SELECT schemaname, indexname
FROM pg_indexes
WHERE indexdef ~* 'gist';

If the above query produces output, you can run the next query to output a SQL script to migrate the remaining indexes from GIST to GIN:

SELECT 'DROP INDEX ' || schemaname || '.' || indexname || E';\n' ||
       REGEXP_REPLACE(indexdef, 'gist', 'gin', 'i') || E';\n' ||
       'VACUUM ANALYZE ' || schemaname || '.' || tablename || ';'
FROM pg_indexes
WHERE indexdef ~* 'gist';

2.2.3. Removal of Custom Dojo Build

Evergreen had a method of making a custom build of the Dojo JavaScript library. Following this procedure could improve the load times for the OPAC and other interfaces that use Dojo. However, very few sites took advantage of this process or even knew of its existence. As a part of the process, an openils_dojo.js file was built and installed along with the other Dojo files. Evergreen had many references to load this optional file. For the majority of sites that did not use this custom Dojo process, this file did not exist. Browsers would spend time and resources requesting this nonexistent file. This situation also contributed noise to the Apache logs with the 404 errors from these requests. In keeping with the goal of eliminating Dojo from Evergreen, all references to openils_dojo.js have been removed from the OPAC and other files. The profile script required to make the custom Dojo build has also been removed.

2.3. Cataloging

2.3.1. Czech language records in sample data

This release adds 7 Czech-language MARC records to the sample data set (also known as the Concerto data set).

2.3.2. Publisher Catalog Display Includes 264 Tag

Publisher values are now extracted for display from tags 260 or 264.

Upgrade Notes

A partial reingest is required to extract the new publisher data for display. This query may be long-running.

WITH affected_bibs AS (
    SELECT DISTINCT(bre.id) AS id
    FROM biblio.record_entry bre
    JOIN metabib.real_full_rec mrfr ON (mrfr.record = bre.id AND mrfr.tag = '264')
    WHERE NOT bre.deleted
)
SELECT metabib.reingest_metabib_field_entries(id, TRUE, FALSE, TRUE, TRUE)
FROM affected_bibs;

2.4. Circulation

2.4.1. Hold Groups

This feature allows staff to add multiple users to a named hold group bucket and place title-level holds for a record for that entire set of users. Users can be added to such a hold group bucket from either the patron search result interface, via the Add to Bucket dropdown, or through a dedicated Hold Group interface available from the Circulation menu. Adding new patrons to a hold group bucket will require staff to have the PLACE_HOLD permission.

Holds can be placed for the users in a hold group bucket either directly from the normal staff place hold interface in the embedded OPAC, or by supplying the record ID within the hold group bucket interface. In the latter case, the list of users for which a hold was attempted but failed to be placed can be downloaded by staff in order to address any placement issues. Placing a hold group bucket hold requires that staff have the MANAGE_HOLD_GROUPS permission, which is new with this development.

In the event of a mistaken hold group hold, staff with the MANAGE_HOLD_GROUPS permission will have the ability to cancel all unfulfilled holds created as part of a hold group event. A link to the title’s hold interface is available from the list of hold group events in the dedicated hold group interface.

2.4.2. Scan Item as Missing Pieces Angular Port

The Scan Item As Missing Pieces interface is now an Angular interface. The functionality is the same, but the interface displays more details on the item in question (title/author/callnum) before proceeding with the missing pieces process.

2.4.3. Opt-In Setting for Overdue and Predue Emails

The "Receive Overdue and Courtesy Emails" user setting permits users to control whether they receive email notifications about overdue items. To use the setting, modify any action trigger event definitions which send emails about overdue items, setting the "Opt In Setting" to "circ.default_overdue_notices_enabled" and the "User Field" to "usr". You can accomplish this by running the following query in your database:

UPDATE action_trigger.event_definition
SET opt_in_setting = 'circ.default_overdue_notices_enabled', usr_field = 'usr'
WHERE opt_in_setting IS NULL
AND hook = 'checkout.due'
AND reactor = 'SendEmail';

Once this is done, the patron registration screen in the staff client will show a "Receive Overdue and Courtesy Emails" checkbox, which will be checked by default. To ensure that existing patrons continue to receive email notifications, you will need to add the user setting to their accounts, which you can do by running the following query in your database:

INSERT INTO actor.usr_setting (usr, name, value)
SELECT id, 'circ.default_overdue_notices_enabled', 'true' FROM actor.usr;
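If the INSERT above has already been run once, or if some patrons already carry the setting, re-running it could create duplicate rows. A guarded variant (a sketch, assuming you only want to add the setting where it is not already present) is:

-- Hedged variant: add the opt-in setting only for patrons who lack it.
INSERT INTO actor.usr_setting (usr, name, value)
SELECT au.id, 'circ.default_overdue_notices_enabled', 'true'
  FROM actor.usr au
 WHERE NOT EXISTS (
         SELECT 1
           FROM actor.usr_setting aus
          WHERE aus.usr = au.id
            AND aus.name = 'circ.default_overdue_notices_enabled'
       );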
2.4.4. Allow Circulation Renewal for Expired Patrons

The "Allow renewal request if renewal recipient privileges have expired" organizational unit setting can be set to true to permit expired patrons to renew circulations. Allowing renewals for expired patrons reduces the number of auto-renewal failures; it assumes that a patron with items out that are eligible for renewal has not been expired for very long, and that such patrons are likely to renew their privileges in a timely manner.

The setting is referenced based on the current circulation library for the renewal. It takes into account the global flags for "Circ: Use original circulation library on desk renewal instead of the workstation library" and "Circ: Use original circulation library on opac renewal instead of user home library."

2.5. OPAC

2.5.1. Consistent Ordering for Carousels

Carousel ordering is now stable and predictable:

- Newly Cataloged Item and Newest Items by Shelving Location carousels are ordered from most recently cataloged to least recently cataloged.
- Recently Returned Item carousels are ordered from most recently returned to least recently returned.
- Top Circulated Items carousels are ordered from most circulated to least circulated.
- Manual carousels (as of now, without the ability to adjust the position of items) are in the order they are added to the backing bucket. Emptying and refilling the bucket allows reordering.

2.5.2. Default Public Catalog to the Bootstrap Skin

The public catalog now defaults to the Bootstrap skin rather than the legacy TPAC skin. Bootstrap is now the default in order to encourage more testing, but users should be aware that certain specific functionality is available only in the TPAC skin. The TPAC skin remains available for use, but current Evergreen users should start actively considering migrating to the Bootstrap skin. In order to continue to use the TPAC skin, comment out the following line in eg_vhost.conf:

PerlAddVar OILSWebTemplatePath "@localstatedir@/templates-bootstrap" # Comment this line out to use the legacy TPAC

2.5.3. Did You Mean? Single word search suggestions

This feature is the first in a series to add native search suggestions to the Evergreen search logic. A significant portion of the code is dedicated to infrastructure that will be used in later enhancements to the functionality.

Overview

When searching the public or staff catalog in a single search class (title, author, subject, series, identifier, or keyword) with a single search term, users can be presented with alternate search terms. Depending on how the instance has been configured, suggestions may be provided for only misspelled words (as defined by existence in the bibliographic corpus), terms that are spelled properly but occur very few times, or on every single-term search.

Settings

The following new library settings control the behavior of the suggestions:

- Maximum search result count at which spelling suggestions may be offered
- Minimum required uses of a spelling suggestions that may be offered
- Maximum number of spelling suggestions that may be offered
- Pg_trgm score weighting in OPAC spelling suggestions
- Soundex score weighting in OPAC spelling suggestions
- QWERTY Keyboard similarity score weighting in OPAC spelling suggestions

There are also two new internal flags:

- symspell.prefix_length
- symspell.max_edit_distance
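Both internal flags are stored alongside Evergreen's other internal flags, so their current values can be reviewed from the database before any tuning. A minimal sketch, assuming the stock config.internal_flag table:

-- Sketch: inspect the symspell-related internal flags and their values.
SELECT name, value, enabled
  FROM config.internal_flag
 WHERE name LIKE 'symspell.%';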
Upgrading

This feature requires the addition of new Perl module dependencies. Please run the app server and database server dependency Makefiles before applying the database and code updates.

At the end of the database upgrade script, the administrator is presented with a set of instructions necessary to precompute the suggestion dictionary based on the current bibliographic database. The first half of this procedure can be started even before the upgrade begins, as soon as the Evergreen database is no longer accessible to users that might cause changes to bibliographic records. For very large instances, this dictionary generation can take several hours and needs to be run on a server with significant RAM and CPU resources. Please look at the upgrade script before beginning an upgrade and plan this dictionary creation as part of the overall upgrade procedure. Given a server, such as a database server with 64G of RAM, you should be able to run all six of the shell commands in parallel in screen sessions or with a tool such as GNU parallel. These commands invoke a script that will generate a class-specific subset of the dictionary, and can be used to recreate the dictionary if necessary in the future.

2.5.4. Sort Holdings by Geographical Proximity

This functionality integrates third-party geographic lookup services to allow patrons to enter an address on the record details page in the OPAC and sort the holdings for that record based on the proximity of their circulating libraries to the entered address. To support this, latitude and longitude coordinates may be associated with each org unit. Care is given to not log or leak patron-provided addresses or the context in which they are used.

Requires the following Perl modules: Geo::Coder::Free, Geo::Coder::Google, and Geo::Coder::OSM.

Configuration instructions:

- Register an account with a third party geographic location service and copy the API Key.
- Configure the Geographic Location Service (Server Administration > Geographic Location Service > New Geographic Location Service).
- Enable the Global Flag by navigating to Server Administration > Global Flags and locating the opac.use_geolocation flag. (Any entry in the Value field will be ignored.)
- Enable Library Setting: Enable Holdings Sort by Geographic Proximity (set to True).
- Enable Library Setting: Geographic Location Service to use for Addresses (use the value from the Name field entered in the Geographic Location Services Configuration entry).
- Enable Library Setting: Show Geographic Proximity in Miles (if not set, it will default to kilometers).
- Set the geographic coordinates for each location by navigating to Server Administration > Organizational Units. Select the org unit, switch to the Physical Address subtab, and either manually enter Latitude and Longitude values or use the Get Coordinate button.

Two new permissions, VIEW_GEOLOCATION_SERVICES and ADMIN_GEOLOCATION_SERVICES, control viewing and editing values in the Geographic Location Services interface. They are added to the System Administrator and Global Administrator permissions groups by default.
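Sites that want staff other than System or Global Administrators to manage these entries can grant the new permissions to another profile group. The sketch below is illustrative only: the group ID 10 and the consortium-level depth of 0 are placeholder assumptions, not values from the release.

-- Hedged sketch: grant the geolocation admin permission to a local group.
INSERT INTO permission.grp_perm_map (grp, perm, depth, grantable)
SELECT 10, id, 0, FALSE
  FROM permission.perm_list
 WHERE code = 'ADMIN_GEOLOCATION_SERVICES';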
2.5.5. Library Groups

The Library Groups search feature revives a longstanding internal concept in Evergreen called “Lassos,” which allows an administrator to define a group of organizational units for searching outside of the standard organizational unit hierarchy. Use case examples include creating a group of law or science libraries within a university consortium, or grouping all school libraries together within a mixed school/public library consortium.

Searches can be restricted to a particular Library Group from the library selector in the public catalog basic search page and from the new "Where" selector on the advanced search page. Restricting catalog searches by Library Group is available only in the public catalog and "traditional" staff catalog; it is not available in the Angular staff catalog.

This feature adds a new permission, ADMIN_LIBRARY_GROUPS, that allows updating Library Groups and Library Group Maps. This permission is not associated with any profiles by default, and replaces the CREATE_LASSO, UPDATE_LASSO, and DELETE_LASSO permissions. To define new library groups, use the Server Administration Library Groups and Library Group Maps pages. An autogen and a reload of Apache should be performed after making changes to Library Groups.

2.5.6. Easier Styling of Public Catalog Logo and Cart Images

Evergreen now has IDs associated with logos and cart images in the TPAC and Bootstrap OPACs to aid in customization. Images are as follows:

- the small Evergreen logo in the navigation bar is topnav_logo_image
- the large Evergreen logo in the center of the splash page of the TPAC is homesearch_main_logo_image
- the cart icon is cart_icon_image
- the small logo in the footer is footer_logo_image

The Bootstrap OPAC does not have a homesearch logo icon, as it is added in the background by CSS and can be directly styled through the CSS.

2.5.7. Easier TPAC Customization via colors.tt2

Twelve new colors have been added to the TPAC colors.tt2 file, with corresponding changes to the style.css.tt2 file. These use descriptive rather than abstract names. These changes help avoid situations where unreadable values are placed on top of each other, and where different values are wanted for elements that previously referenced a single color. Guidelines are below for setting values that correspond to the previous values used in the colors.tt2 file. For more diverse customizations, the OPAC should be reviewed before a production load.

- footer is used for the background color of the footer. It replaces primary.
- footer_text sets the text color in the footer and replaces text_invert
- header sets the background of the header and replaces primary_fade
- header_text sets the color of text in the header and replaces text_invert
- header_links_bar sets the background of the links bar that separates the header on the front page of the opac and replaces background_invert
- header_links_text sets the text on the links bar and replaces text_invert
- header_links_text_hover sets the hover text color on the links bar and replaces primary
- opac_button sets the background color of the My Opac button and replaces control
- opac_button_text explicitly sets the text color on the My Opac button
- opac_button_hover sets the background color of the My Opac button when the mouse is hovering over it and replaces primary
- opac_button_hover_text sets the text color of the My Opac button when the mouse is hovering over it and replaces text_invert

Note that this patch is primarily meant for users who wish to continue using TPAC rather than the Bootstrap skin for a while; new Evergreen users are advised to use the now-default Bootstrap skin.

2.5.8. Configurable Read More Accordion for OPAC Search and Record View (TPAC)

Read More Button

Public catalog record fields (in the TPAC skin only) now truncate themselves based on a configurable number of characters. The full field may be displayed upon hitting a (Read More) link, which will then toggle into a (Read Less) link to re-truncate the field.
Configuration

Open-ILS/src/templates/opac/parts/config.tt2 contains two new configuration variables: truncate_contents (default: 1) and contents_truncate_length (default: 50). Setting truncate_contents to 0 will disable the read more functionality. The variable contents_truncate_length corresponds to the number of characters to display before truncating the text. If contents_truncate_length is removed, it will default to 100.

Additional configuration for note fields can be made in Open-ILS/src/templates/opac/parts/record/contents.tt2, allowing a trunc_length variable for each individual type of note, which will override contents_truncate_length for that specific type of note.

Adding Read More Functionality to further fields

To add Read More functionality to any additional fields, you may use the macro accordion(), defined in misc_util.tt2. It can take three variables: str, trunc_length, and element. str corresponds to the string you want to apply it to, trunc_length (optional) will override contents_truncate_length if supplied, and element (optional) provides an alternative HTML element to look at for the truncation process (useful in situations such as the Authors and Cast fields, where each field is processed individually, but needs to be treated as a single field).

2.6. Reports

2.6.1. Reports Scheduler Improvements

Previously, the reports scheduler allowed duplicated reports under certain circumstances. A uniqueness constraint now disallows this without adversely affecting the reports process.

3. Miscellaneous

- The Create Reservation form in the Booking module now includes an option to search for the patron by attributes other than just their barcode. (Bug 1816655)
- The form to add a user to a Course now includes an option to search for the patron by attributes other than just their barcode. (Bug 1907921)
- For consistency with the menu action Cataloging ⇒ Retrieve Record by TCN Value, the staff catalog Numeric Search ⇒ TCN search now includes deleted bib records. (Bug 1881650)
- Add a new command-line script, overdrive-api-checker.pl, for testing the OverDrive API. (Bug 1696825)
- The Shelving Location Groups editor is ported to Angular. (Bug 1852321)
- The staff catalog now has the ability to add all search results (up to 1,000 titles) to the basket in one fell swoop. (Bug 1885179)
- Add All Videos as a search format. (Bug 1917826)
- Server-side print templates can now have print contexts set. (Bug 1891550)
- Add ability to set the print context for a print template to "No-Print" to specify, well, that a given receipt should never be printed. (Bug 1891550)
- Add Check Number as an available column to the Bill History grids. (Bug 1705693)
- Adds a new control to the item table in the TPAC public catalog only to specify that only items that are available should be displayed. (Bug 1853006)
- Adds warning before deleting bib records with holds (Bug 1398107)
- Library scope on (Angular) Administration pages now defaults to workstation location rather than consortium (Bug 173322)
- Pending users now set last four digits of phone number as password when library setting is enabled (Bug 1887852)

4.
Acknowledgments The Evergreen project would like to acknowledge the following organizations that commissioned developments in this release of Evergreen: BC Libraries Cooperative Community Library (Sunbury) Consortium of Ohio Libraries (COOL) Evergreen Community Development Initiative Evergreen Indiana Georgia PINES Linn-Benton Community College Pennsylvania Integrated Library System (PaILS) We would also like to thank the following individuals who contributed code, translations, documentation, patches, and tests to this release of Evergreen: John Amundson Zavier Banks Felicia Beaudry Jason Boyer Dan Briem Andrea Buntz Neiman Christine Burns Galen Charlton Garry Collum Eva Cerniňáková Dawn Dale Elizabeth Davis Jeff Davis Martha Driscoll Bill Erickson Jason Etheridge Ruth Frasur Blake Graham-Henderson Katie Greenleaf Martin Rogan Hamby Elaine Hardy Kyle Huckins Angela Kilsdonk Tiffany Little Mary Llewellyn Terran McCanna Chauncey Montgomery Gina Monti Michele Morgan Carmen Oleskevich Jennifer Pringle Mike Risher Mike Rylander Jane Sandberg Chris Sharp Ben Shum Remington Steed Jason Stephenson Jennifer Weston Beth Willis We also thank the following organizations whose employees contributed patches: BC Libraries Cooperative Calvin College Catalyte CW MARS Equinox Open Library Initiative Georgia Public Library Service Kenton County Public Library King County Library System Linn-Benton Community College MOBIUS NOBLE Westchester Library System We regret any omissions. If a contributor has been inadvertently missed, please open a bug at http://bugs.launchpad.net/evergreen/ with a correction. Last updated 2021-04-14 15:04:29 EDT evergreen-ils-org-4187 ---- Evergreen 3.7.0 released – Evergreen ILS Skip to content Evergreen – Open Source Library Software Evergreen – Open Source Library Software About Us Overview Annual Reports F.A.Q. Evergreen Event Code of Conduct Software Freedom Conservancy Project Governance Trademark Policy Documentation Official Documentation Documentation Interest Group Evergreen Roadmap Evergreen Wiki Tabular Release Notes Get Involved! Get Involved! Committees & Interest Groups Communications Mailing Lists IRC Calendar Blog Jobs Proposed Development Projects Merchandise T-shirts and more Conference All Conferences 2021 Evergreen International Online Conference 2020 Evergreen International Online Conference Event Photography Policy Code of Conduct Downloads Evergreen Downloads OpenSRF Downloads Home » Announcements » Evergreen 3.7.0 released Evergreen 3.7.0 released This entry was posted in Announcements Releases on 4/14/2021 by Galen Charlton The Evergreen Community is pleased to announce the release of Evergreen 3.7.0. Evergreen is highly-scalable software for libraries that helps library patrons find library materials and helps libraries manage, catalog, and circulate those materials, no matter how large or complex the libraries. 
Evergreen 3.7.0 is a major release that includes the following new features of note:

- Support for SAML-based Single Sign On
- Hold Groups, a feature that allows staff to add multiple users to a named hold group bucket and place title-level holds for a record for that entire set of users
- The Bootstrap public catalog skin is now the default
- “Did you mean?” functionality for catalog search focused on making suggestions for single search terms
- Holdings on the public catalog record details page can now be sorted by geographic proximity
- Library Groups, a feature that allows defining groups of organizational units outside of the hierarchy that can be used to limit catalog search results
- Expired staff accounts can now be blocked from logging in
- Publisher data in the public catalog display is now drawn from both the 260 and 264 fields
- The staff catalog can now save all search results (up to 1,000) to a bucket in a single operation
- New opt-in settings for overdue and predue email notifications
- A new setting to allow expired patrons to renew loans
- Porting of additional interfaces to Angular, including Scan Item as Missing Pieces and Shelving Location Groups

Evergreen admins installing or upgrading to 3.7.0 should be aware of the following: The minimum version of PostgreSQL required to run Evergreen 3.7 is PostgreSQL 9.6. The minimum version of OpenSRF is 3.2. This release adds a new OpenSRF service, open-ils.geo. The release also adds several new Perl module dependencies: Geo::Coder::Google, Geo::Coder::OSM, String::KeyboardDistance, and Text::Levenshtein::Damerau::XS. The database update procedure has more steps than usual; please consult the upgrade section of the release notes.
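Because the PostgreSQL floor is the requirement most likely to trip up an upgrade, it is worth confirming the server version before scheduling the work; one quick check from a psql session:

-- Confirm the running PostgreSQL version (must be 9.6 or later for 3.7.0).
SHOW server_version;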
The release is available on the Evergreen downloads page. Additional information, including a full list of new features, can be found in the release notes.

evergreen-ils-org-5070 ---- Evergreen 3.7-rc available – Evergreen ILS

Evergreen 3.7-rc available

This entry was posted in Development Update on 4/12/2021 by Galen Charlton The Evergreen Community is pleased to announce the availability of the release candidate for Evergreen 3.7. This release follows up on the recent beta release. The general release of 3.7.0 is planned for Wednesday, 14 April 2021. Between now and then, please download the release candidate and try it out. Additional information, including a full list of new features, can be found in the release notes.

evergreen-ils-org-5339 ---- Evergreen Downloads – Evergreen ILS

Evergreen Downloads

Evergreen depends on the following technologies Perl, C, JavaScript, XML, XPath, XSLT, XMPP, OpenSRF, Apache, mod_perl, and PostgreSQL. The latest stable release of a supported Linux distribution is recommended for an Evergreen installation. For Ubuntu, please use the 18.04 64-bit LTS (long term support) Server release. Currently the latest release from the Evergreen 3.6 series is recommended for new installations and stable releases are suggested for production systems.

Note: Evergreen servers and staff clients must match. For example, if you are running server version 3.1.0, you should use version 3.1.0 of the staff client. Evergreen 3.2.0+ no longer supports a separate client by default, but building a client remains as an unsupported option.
Server & staff client downloads (3.7 Series / 3.6 Series / 3.5 Series)

- Status: stable / stable / stable
- Latest Release: 3.7.0 / 3.6.3 / 3.5.4
- Release Date: 2021-04-14 / 2021-04-01 / 2021-04-01
- Release Notes: Release Notes / Release Notes / Release Notes (plus a tabular release notes summary)
- ChangeLog: ChangeLog / ChangeLog / ChangeLog
- Evergreen Installation: Install Instructions / Install Instructions / Install Instructions
- Upgrading: Notes on upgrading from 3.6.2 / TBD / TBD
- OpenSRF Software: 3.2.1 (md5) / 3.2.1 (md5) / 3.2.1 (md5)
- Server Software: Source (md5) / Source (md5) / Source (md5)
- Web Staff Client Extension (“Hatch”): Windows Hatch Installer 0.3.2 (md5) – Installation Instructions (Windows & Linux)
- Git Repository: Git Location / Git Location / Git Location

Other Evergreen Staff Clients

- Staff Client Archive: Windows Staff Clients for slightly older stable releases (2.11, 2.10).
- For Mac and Linux: Installing the Evergreen client on Macs
- Evergreen 2.8.3 Mac Staff Client [.dmg]
- Evergreen 2.9.0 Mac Staff Client [.dmg]
- Evergreen 2.12.0 Mac Staff Client [.zip]
- Evergreen 3.0.0 Mac Staff Client [.zip]
- Pre-built MAC staff client for Evergreen 2.10 and 2.8 – Provided by SITKA

Evergreen in action

Visit the Evergreen catalog on our demonstration and development servers, or visit this list of live Evergreen libraries. You can also download an Evergreen staff client and point it at the Evergreen demo or development server (see the community servers page for details).

Bug Reports

Please report any Evergreen bugs/wishlist on Launchpad. To submit a vulnerability please email your report to open-ils-security@esilibrary.com.

Evergreen Code Museum

Older versions of Evergreen software are available from the Evergreen Code Museum.

Source Code Repository

A Gitweb instance sits atop the Git repositories for Evergreen and OpenSRF. You can find both repositories at git.evergreen-ils.org. Here is the running change log for the Evergreen code repository: watch us work. Trac sends code commits to two public Evergreen mailing lists: For Evergreen commits, subscribe to open-ils-commits. For OpenSRF commits, subscribe to opensrf-commits.

evergreen-ils-org-8730 ---- Evergreen ILS – Evergreen – Open Source Library Software
evergreen-ils-org-9796 ---- Evergreen 3.7-beta available – Evergreen ILS
Evergreen 3.7-beta available

This entry was posted in Development Update on 4/1/2021 by Galen Charlton The Evergreen Community is pleased to announce the availability of the beta release for Evergreen 3.7. This release contains various new features and enhancements, including:

- Support for SAML-based Single Sign On
- Hold Groups, a feature that allows staff to add multiple users to a named hold group bucket and place title-level holds for a record for that entire set of users
- The Bootstrap public catalog skin is now the default
- “Did you mean?” functionality for catalog search focused on making suggestions for single search terms
- Holdings on the public catalog record details page can now be sorted by geographic proximity
- Library Groups, a feature that allows defining groups of organizational units outside of the hierarchy that can be used to limit catalog search results
- Expired staff accounts can now be blocked from logging in
- Publisher data in the public catalog display is now drawn from both the 260 and 264 fields
- The staff catalog can now save all search results (up to 1,000) to a bucket in a single operation
- New opt-in settings for overdue and predue email notifications
- A new setting to allow expired patrons to renew loans
- Porting of additional interfaces to Angular, including Scan Item as Missing Pieces and Shelving Location Groups

Evergreen admins installing the beta or upgrading a test system to the beta should be aware of the following: The minimum version of PostgreSQL required to run Evergreen 3.7 is PostgreSQL 9.6. The minimum version of OpenSRF is 3.2. This release adds a new OpenSRF service, open-ils.geo. The release also adds several new Perl module dependencies: Geo::Coder::Google, Geo::Coder::OSM, String::KeyboardDistance, and Text::Levenshtein::Damerau::XS. The database update procedure has more steps than usual; please consult the upgrade section of the release notes. The beta release should not be used for production. Additional information, including a full list of new features, can be found in the release notes.

everybodyslibraries-com-2729 ---- Everybody's Libraries

Everybody's Libraries
Libraries for everyone, by everyone, shared with everyone, about everything

Public Domain Day 2021: Honoring a lost generation

It’s Public Domain Day again.
In much of Europe, and other countries with “life+70 years” copyright terms, works by authors who died in 1950, such as George Orwell, Karin Michaelis, George Bernard Shaw, and Edna St. Vincent Millay, have joined … Continue reading → Counting down to 1925 in the public domain We’re rapidly approaching another Public Domain Day, the day at the start of the year when a year’s worth of creative work joins the public domain. This will be the third year in a row that the US will have … Continue reading → From our subjects to yours (and vice versa) (TL;DR: I’m starting to implement services and publish data to support searching across library collections that use customized subject headings, such as the increasingly-adopted substitutes for LCSH terms like “Illegal aliens”. Read on for what I’m doing, why, and where … Continue reading → Everybody’s Library Questions: Finding films in the public domain Welcome to another installment of Everybody’s Library Questions, where I give answers to questions people ask me (in comments or email) that seem to be useful for general consumption. Before I start, though, I want to put in a plug … Continue reading → Build a better registry: My intended comments to the Library of Congress on the next Register of Copyrights The Library of Congress is seeking public input on abilities and priorities desired for the next Register of Copyrights, who heads the Copyright Office, a department within the Library of Congress.  The deadline for comments as I write this is … Continue reading → Welcome to everybody’s online libraries As coronavirus infections spread throughout the world, lots of people are staying home to slow down the spread and save lives.  In the US, many universities, schools, and libraries have closed their doors.  (Here’s what happening at the library where … Continue reading → Public Domain Day 2020: Coming Around Again I’m very happy for 2020 to be arriving.  As the start of the 2020s, it represents a new decade in which we can have a fresh start, and hope to make better decisions and have better outcomes than some of … Continue reading → 2020 vision #5: Rhapsody in Blue by George Gershwin It’s only a few hours from the new year where I write this, but before I ring in the new year, and a new year’s worth of public domain material, I’d like to put in a request for what music … Continue reading → 2020 vision #4: Ding Dong Merrily on High by George Ratcliffe Woodward and others It’s beginning to sound a lot like Christmas everywhere I go.  The library where I work had its holiday party earlier this week, where I joined librarian colleagues singing Christmas, Hanukkah, and winter-themed songs in a pick-up chorus.  Radio stations … Continue reading → 2020 vision #3: The Most Dangerous Game by Richard Connell “Be a realist. The world is made up of two classes–the hunters and the huntees. 
Luckily, you and I are hunters.” Sanger Rainsford speaks these words at the start of “The Most Dangerous Game”, one of the most famous short … Continue reading → everybodyslibraries-com-9960 ---- Everybody's Libraries | Libraries for everyone, by everyone, shared with everyone, about everything Everybody's Libraries Libraries for everyone, by everyone, shared with everyone, about everything Skip to content Home About About the Free Decimal Correspondence Free Decimal Correspondence ILS services for discovery applications John Mark Ockerbloom The Metadata Challenge ← Older posts Public Domain Day 2021: Honoring a lost generation Posted on January 1, 2021 by John Mark Ockerbloom It’s Public Domain Day again. In much of Europe, and other countries with “life+70 years” copyright terms, works by authors who died in 1950, such as George Orwell, Karin Michaelis, George Bernard Shaw, and Edna St. Vincent Millay, have joined the public domain. Canada, and other countries that still have the Berne Convention’s “life+50 years” copyright terms, get works by authors like E. M. Forster, Nelly Sachs, Bertrand Russell, Elsa Triolet, and other authors who died in 1970 in the public domain. And in the United States, copyrights from 1925 that are still in force have expired, introducing to the public domain a wide variety of works I’ve covered in my prior blog post. The new public domain work that I’ve seen most widely noted is F. Scott Fitzgerald’s Jazz Age novel The Great Gatsby. My library has a copy of the first edition, and its scan of the volume became available on HathiTrust today. Though he doesn’t use the term in Gatsby, Fitzgerald and many other authors writing around 1925 are often considered both members and chroniclers of the “Lost Generation”. The term was coined by Gertrude Stein, and made famous by Ernest Hemingway, who used it in the epigraph to his novel The Sun Also Rises (one of many more works scheduled to join the US public domain a year from now). The Lost Generation describes an age cohort that was disrupted by the First World War, and all the deaths caused by that war and by the influenza pandemic that arose in its wake. Society would never be the same afterwards. It’s ironic that some of the definitive creations of that generation are themselves part of a largely lost generation. At the time of their publication, they were supposed to enter the public domain after 56 years at most, but that maximum term has been extended by 39 more years, well over a generation’s worth of time. The creators of these works that got the full copyright term are almost all now dead, and many of the less famous works in this cohort have also become lost from most people’s memories. Some, including many fragile films of that era, now have all copies lost as well. The generation that now sees these works joining the public domain also has many of the makings of a new “lost generation”. The number of deaths from COVID-19 in the United States, which badly botched its response compared to many similar countries, far exceeds the number of American deaths in World War I, and is a sizable and rapidly growing fraction of all the American deaths from the 1918-1920 flu pandemic. Many more people who have dealt with illness and quarantine have also experienced what feels like a lost year, one that hasn’t ended yet despite today’s change in the calendar. But it’s also important to recognize the key role of the public domain and of open access publications in preventing further loss. 
While Philadelphia, where I live, has been hit hard by this pandemic, it hasn’t been hit as hard as some other places, in part because masking and other behavioral changes have been more widely used and accepted here. Not long before the current pandemic started, the Mutter Museum’s Spit Spreads Death exhibit reminded us of the horrifying death toll of the 1918 flu pandemic here, caused in large part by failing to stop mass gatherings that made the flu spread like wildfire here. The exhibit’s narrative, which many other local media outlets further elaborated on, was able to freely draw on a wide variety of source materials of the era that were all in the public domain due to their age. The freely available sources from 1918 helped spread public health awareness here in 2020. Open access to resources also spurred the rapid development and testing of effective treatments against COVID. Open sharing of the novel coronavirus genomes, and related scientific data, enabled research on the virus and effective responses to be carried out by many different labs across the globe, and many of the resulting research papers and research materials have also been made freely available in venues that are usually limited to paid subscribers. While much of this work is not public domain, strictly speaking, it is being shared and built on largely as if it were. That has enabled vaccines to be safely rolled out much more quickly than they have been for other diseases. While we celebrate today’s belated additions to the public domain, it’s also important to promote and protect it, because there are still efforts to freeze it or roll it back. The successor to the NAFTA trade deal requires Canada to add 20 years to its copyright terms, for instance (though Canada has not yet implemented that provision). And while there is no current legislation to extend US copyright terms any further, such extensions have been proposed in the past, and we’ve just seen in Congress’s recent funding bill how questionable changes to copyright law can be jammed into “must-pass” legislation with little or no warning or recourse. The public domain enriches our culture, reminds us and lets us learn from our past, and helps us make better futures. As 2021 gives us opportunities to turn the page, let’s celebrate the new opportunities we have to enjoy, share, reuse, and build on our newly public domain works. And let’s make sure we don’t lose any more generations. Posted in online books, open access, publicdomain | 3 Comments Counting down to 1925 in the public domain Posted on December 15, 2020 by John Mark Ockerbloom We’re rapidly approaching another Public Domain Day, the day at the start of the year when a year’s worth of creative work joins the public domain. This will be the third year in a row that the US will have a full crop of new public domain works (after a prior 20-year drought), and once again, I’m noting and celebrating works that will be entering the public domain shortly. Approaching 2019, I wrote a one-post-a-day Advent Calendar for 1923 works throughout the month of December, and approaching 2020, I highlighted a few 1924 works, and related copyright issues, in a series of December posts called 2020 Vision. This year I took to Twitter, making one tweet per day featuring a different 1925 work and creator using the #PublicDomainDayCountdown hashtag. 
Tweets are shorter than blog posts, but I started 99 days out, so by the time I finish the series at the end of December, I’ll have written short notices on more works than ever. Since not everyone reads Twitter, and there’s no guarantee that my tweets will always be accessible on that site, I’ll reproduce them here. (This post will be updated to include all the tweets up to 2021.) The tweet links have been reformatted for the blog, a couple of 2-tweet threads have been recombined, and some typos may be corrected. If you’d like to comment yourself on any of the works mentioned here, or suggest others I can feature, feel free to reply here or on Twitter. (My account there is @JMarkOckerbloom. You’ll also find some other people tweeting on the #PublicDomainDayCountdown hashtag, and you’re welcome to join in as well.) September 24: It’s F. Scott Fitzgerald’s birthday. His best-known book, The Great Gatsby, joins the US public domain 99 days from now, along with other works with active 1925 copyrights. #PublicDomainDayCountdown (Links to free online books by Fitzgerald here.) September 25: C. K. Scott-Moncrieff’s birthday’s today. He translated Proust’s Remembrance of Things Past (a controversial title, as the Public Domain Review notes). The Guermantes Way, his translation of Proust’s 3rd volume, joins the US public domain in 98 days. #PublicDomainDayCountdown September 26: Today is T.S. Eliot’s birthday. His poem “The Hollow Men” (which ends “…not with a bang but a whimper”) was first published in full in 1925, & joins the US public domain in 97 days. #PublicDomainDayCountdown More by & about him here. September 27: Lady Cynthia Asquith, born today in 1887, edited a number of anthologies that have long been read by children and fans of fantasy and supernatural fiction. Her first major collection, The Flying Carpet, joins the US public domain in 96 days. #PublicDomainDayCountdown September 28: As @Marketplace reported tonight, Agatha Christie’s mysteries remain popular after 100 years. In 95 days, her novel The Secret of Chimneys will join the US public domain, as will the expanded US Poirot Investigates collection. #PublicDomainDayCountdown September 29: Homer Hockett’s and Arthur Schlesinger, Sr.’s Political and Social History of the United States first came out in 1925, and was an influential college textbook for years thereafter. The first edition joins the public domain in 94 days. #PublicDomainDayCountdown September 30: Inez Haynes Gillmore Irwin died 50 years ago this month, after a varied, prolific writing career. This 2012 blog post looks at 4 of her books, including Gertrude Haviland’s Divorce, which joins the public domain in 93 days. #PublicDomainDayCountdown October 1: For some, spooky stories and themes aren’t just for October, but for the whole year. We’ll be welcoming a new year’s worth of Weird Tales to the public domain in 3 months. See what’s coming, and what’s already free online, here. #PublicDomainDayCountdown October 2: Misinformation and quackery has been a threat to public health for a long time. In 13 weeks, the 1925 book The Patent Medicine and the Public Health, by American quack-fighter Arthur J. Cramp joins the public domain. #PublicDomainDayCountdown October 3: Sophie Treadwell, born this day in 1885, was a feminist, modernist playwright with several plays produced on Broadway, but many of her works are now hard to find. Her 1925 play “Many Mansions” joins the public domain in 90 days. #PublicDomainDayCountdown October 4: It’s Edward Stratemeyer’s birthday. 
Books of his syndicate joining the public domain in 89 days include the debuts of Don Sturdy & the Blythe Girls, & further adventures of Tom Swift, Ruth Fielding, Baseball Joe, Betty Gordon, the Bobbsey Twins, & more. #PublicDomainDayCountdown October 5: Russell Wilder was a pioneering diabetes doctor, testing newly invented insulin treatments that saved many patients’ lives. His 1925 book Diabetes: Its Cause and its Treatment with Insulin joins the public domain in 88 days. #PublicDomainDayCountdown October 6: Queer British Catholic author Radclyffe Hall is best known for The Well of Loneliness. Hall’s earlier novel A Saturday Life is lighter, though it has some similar themes in subtext. It joins the US public domain in 87 days. #PublicDomainDayCountdown October 7: Edgar Allan Poe’s stories have long been public domain, but some work unpublished when he died (on this day in 1849) stayed in © much longer. In 86 days, the Valentine Museum’s 1925 book of his previously unpublished letters finally goes public domain. #PublicDomainDayCountdown October 8: In 1925, the Nobel Prize in Literature went to George Bernard Shaw. In 85 days, his Table-Talk, published that year, will join the public domain in the US, and all his solo works published in his lifetime will be public domain nearly everywhere else. #PublicDomainDayCountdown October 9: Author and editor Edward Bok was born this day in 1863. In Twice Thirty (1925), he follows up his Pulitzer-winning memoir The Americanization of Edward Bok with a set of essays from the perspective of his 60s. It joins the public domain in 84 days. #PublicDomainDayCountdown October 10: In the 1925 silent comedy “The Freshman”, Harold Lloyd goes to Tate University, “a large football stadium with a college attached”, and goes from tackling dummy to unlikely football hero. It joins the public domain in 83 days. #PublicDomainDayCountdown October 11: It’s François Mauriac’s birthday. His Le Desert de l’Amour, a novel that won the 1926 Grand Prix of the Académie Française, joins the US public domain in 82 days. Published translations may stay copyrighted, but Americans will be free to make new ones. #PublicDomainDayCountdown October 12: Pulitzer-winning legal scholar Charles Warren’s Congress, the Constitution, and the Supreme Court (1925) analyzes controversies, some still argued, over relations between the US legislature and the US judiciary. It joins the public domain in 81 days. #PublicDomainDayCountdown October 13: Science publishing in 1925 was largely a boys’ club, but some areas were more open to women authors, such as nursing & science education. I look forward to Maude Muse’s Textbook of Psychology for Nurses going public domain in 80 days. #PublicDomainDayCountdown #AdaLovelaceDay October 14: Happy birthday to poet E. E. Cummings, born this day in 1894. (while some of his poetry is lowercase he usually still capitalized his name when writing it out) His collection XLI Poems joins the public domain in 79 days. #PublicDomainDayCountdown October 15: It’s PG Wodehouse’s birthday. In 78 days more of his humorous stories join the US public domain, including Sam in the Suburbs. It originally ran as a serial in the Saturday Evening Post in 1925. All that year’s issues also join the public domain then. #PublicDomainDayCountdown October 16: Playwright and Nobel laureate Eugene O’Neill was born today in 1888. His “Desire Under the Elms” entered the US public domain this year; in 77 days, his plays “Marco’s Millions” and “The Great God Brown” will join it. 
#PublicDomainDayCountdown October 17: Not everything makes it to the end of the long road to the US public domain. In 76 days, the copyright for the film Man and Maid (based on a book by Elinor Glyn) expires, but no known copies survive. Maybe someone will find one? #PublicDomainDayCountdown October 18: Corra Harris became famous for her novel A Circuit Rider’s Wife and her World War I reporting. The work she considered her best, though, was As a Woman Thinks. It joins the public domain in 75 days. #PublicDomainDayCountdown October 19: Edna St. Vincent Millay died 70 years ago today. All her published work joins the public domain in 74 days in many places outside the US. Here, magazine work like “Sonnet to Gath” (in Sep 1925 Vanity Fair) will join, but renewed post-’25 work stays in ©. #PublicDomainDayCountdown October 20: All songs eventually reach the public domain. Authors can put them there themselves, like Tom Lehrer just did for his lyrics. But other humorous songs arrive by the slow route, like Tilzer, Terker, & Heagney’s “Pardon Me (While I Laugh)” will in 73 days. #PublicDomainDayCountdown October 21: Sherwood Anderson’s Winesburg, Ohio wasn’t a best-seller when it came out, but his Dark Laughter was. Since Joycean works fell out of fashion, that book’s been largely forgotten, but may get new attention when it joins the public domain in 72 days. #PublicDomainDayCountdown October 22: Artist NC Wyeth was born this day in 1882. The Brandywine Museum near Philadelphia shows many of his works. His illustrated edition of Francis Parkman’s book The Oregon Trail joins the public domain in 71 days. #PublicDomainDayCountdown October 23: Today (especially at 6:02, on 10/23) many chemists celebrate #MoleDay. In 70 days, they’ll also get to celebrate historically important chemistry publications joining the US public domain, including all 1925 issues of Justus Liebigs Annalen der Chemie. #PublicDomainDayCountdown October 24: While some early Alfred Hitchcock films were in the US public domain for a while due to formality issues, the GATT accords restored their copyrights. His directorial debut, The Pleasure Garden, rejoins the public domain (this time for good) in 69 days. #PublicDomainDayCountdown (Addendum: There may still be one more year of copyright to this film as of 2021; see the comments to this post for details.) October 25: Albert Barnes took a different approach to art than most of his contemporaries. The first edition of The Art in Painting, where he explains his theories and shows examples from his collection, joins the public domain in 68 days. #PublicDomainDayCountdown October 26: Prolific writer Carolyn Wells had a long-running series of mystery novels featuring Fleming Stone. Here’s a blog post by The Passing Tramp on one of them, The Daughter of the House, which will join the public domain in 67 days. #PublicDomainDayCountdown October 27: Theodore Roosevelt was born today in 1858, and died over 100 years ago, but some of his works are still copyrighted. In 66 days, 2 volumes of his correspondence with Henry Cabot Lodge, written from 1884-1918 and published in 1925, join the public domain. #PublicDomainDayCountdown October 28: American composer and conductor Howard Hanson was born on this day in 1896. His choral piece “Lament for Beowulf” joins the public domain in 65 days. #PublicDomainDayCountdown October 29: “Skitter Cat” was a white Persian cat who had adventures in several children’s books by Eleanor Youmans, illustrated by Ruth Bennett. 
The first of the books joins the public domain in 64 days. #PublicDomainDayCountdown #NationalCatDay October 30: “Secret Service Smith” was a detective created by Canadian author R. T. M. Maitland. His first magazine appearance was in 1920; his first original full-length novel, The Black Magician, joins the public domain in 9 weeks. #PublicDomainDayCountdown October 31: Poet John Keats was born this day in 1795. Amy Lowell’s 2-volume biography links his Romantic poetry with her Imagist poetry. (1 review.) She finished and published it just before she died. It joins the public domain in 62 days. #PublicDomainDayCountdown November 1: “Not just for an hour, not for just a day, not for just a year, but always.” Irving Berlin gave the rights to this song to his bride in 1926. Both are gone now, and in 2 months it will join the public domain for all of us, always. #PublicDomainDayCountdown November 2: Mikhail Fokine’s The Dying Swan dance, set to music by Camille Saint-Saëns, premiered in 1905, but its choreography wasn’t published until 1925, the same year a film of it was released. It joins the public domain in 60 days. #PublicDomainDayCountdown (Choreography copyright is weird. Not only does the term not start until publication, which can be long after 1st performance, but what’s copyrightable has also changed. Before 1978 it had to qualify as dramatic; now it doesn’t, but it has to be more than a short step sequence.) November 3: Herbert Hoover was the only sitting president to be voted out of office between 1912 & 1976. Before taking office, he wrote the foreword to Carolyn Crane’s Everyman’s House, part of a homeowners’ campaign he co-led. It goes out of copyright in 59 days. #PublicDomainDayCountdown November 4: “The Golden Cocoon” is a 1925 silent melodrama featuring an election, jilted lovers, and extortion. The Ruth Cross novel it’s based on went public domain this year. The film will join it there in 58 days. #PublicDomainDayCountdown November 5: Investigative journalist Ida Tarbell was born today in 1857. Her History of Standard Oil helped break up that trust in 1911, but her Life of Elbert H. Gary wrote more admiringly of his chairmanship of US Steel. It joins the public domain in 57 days. #PublicDomainDayCountdown November 6: Harold Ross was born on this day in 1892. He was the first editor of The New Yorker, which he established in coöperation with his wife, Jane Grant. After ninety-five years, the magazine’s first issues are set to join the public domain in fifty-six days. #PublicDomainDayCountdown November 7: “Sweet Georgia Brown” by Ben Bernie & Maceo Pinkard (lyrics by Kenneth Casey) is a jazz standard, the theme tune of the Harlem Globetrotters, and a song often played in celebration. One thing we can celebrate in 55 days is it joining the public domain. #PublicDomainDayCountdown November 8: Today I hiked on the Appalachian Trail. It was completed in 1937, but parts are much older. Walter Collins O’Kane’s Trails and Summits of the White Mountains, published in 1925 when the AT was more idea than reality, goes public domain in 54 days. #PublicDomainDayCountdown November 9: In Sinclair Lewis’ Arrowsmith, a brilliant medical researcher deals with personal and ethical issues as he tries to find a cure for a deadly epidemic. The novel has stayed relevant well past its 1925 publication, and joins the public domain in 53 days. #PublicDomainDayCountdown November 10: John Marquand was born today in 1893. 
He’s known for his spy stories and satires, but an early novel, The Black Cargo, features a sailor curious about a mysterious payload on a ship he’s been hired onto. It joins the US public domain in 52 days. #PublicDomainDayCountdown November 11: The first world war, whose armistice was 102 years ago today, cast a long shadow. Among the many literary works looking back to it is Ford Madox Ford’s novel No More Parades, part of his “Parade’s End” tetralogy. It joins the public domain in 51 days. #PublicDomainDayCountdown November 12: Anne Parrish was born on this day in 1888. In 1925, The Dream Coach, co-written with her brother, got a Newbery honor , and her novel The Perennial Bachelor was a best-seller. The latter book joins the public domain in 50 days. #PublicDomainDayCountdown November 13: In “The Curse of the Golden Cross”, G. K. Chesterton’s Father Brown once again finds a natural explanation to what seem to be preternatural symbols & events. As of today, Friday the 13th, the 1925 story is exactly 7 weeks away from the US public domain. #PublicDomainDayCountdown November 14: The pop standard “Yes Sir, That’s My Baby” was the baby of Walter Donaldson (music) and Gus Kahn (lyrics). It’s been performed by many artists since its composition, and in 48 days, this baby steps out into the public domain. #PublicDomainDayCountdown November 15: Marianne Moore, born on this day in 1887, had a long literary career, including editing the influential modernist magazine The Dial from 1925 on. In 47 days, all 1925 issues of that magazine will be fully in the public domain. #PublicDomainDayCountdown November 16: George S. Kaufman, born today in 1889, wrote or directed a play in every Broadway season from 1921 till 1958. In 46 days, several of his plays join the public domain, including his still-performed comedy “The Butter and Egg Man”. #PublicDomainDayCountdown November 17: Shen of the Sea was a Newbery-winning collection of stories presented as “Chinese” folktales, but written by American author Arthur Bowie Chrisman. Praised when first published, seen more as appropriation later, it’ll be appropriable itself in 45 days. #PublicDomainDayCountdown November 18: I share a birthday today with Jacques Maritain, a French Catholic philosopher who influenced the Universal Declaration of Human Rights. His book on 3 reformers (Luther, Descartes, and Rousseau) joins the public domain in 44 days. #PublicDomainDayCountdown November 19: Prevailing views of history change a lot over 95 years. The 1926 Pulitzer history prize went to a book titled “The War for Southern Independence”. The last volume of Edward Channing’s History of the United States, it joins the public domain in 43 days. #PublicDomainDayCountdown November 20: Alfred North Whitehead’s Science and the Modern World includes a nuanced discussion of science and religion differing notably from many of his contemporaries’. (A recent review of it.) It joins the US public domain in 6 weeks. November 21: Algonquin Round Table member Robert Benchley tried reporting, practical writing, & reviews, but soon found that humorous essays & stories were his forte. One early collection, Pluck and Luck, joins the public domain in 41 days. #PublicDomainDayCountdown November 22: I’ve often heard people coming across a piano sit down & pick out Hoagy Carmichael’s “Heart and Soul”. He also had other hits, one being “Washboard Blues“. His original piano instrumental version becomes public domain in 40 days. 
#PublicDomainDayCountdown November 23: Harpo Marx, the Marx Brothers mime, was born today in 1888. In his oldest surviving film, “Too Many Kisses” he does “speak”, but silently (like everyone else in it), without his brothers. It joins the public domain in 39 days. #PublicDomainDayCountdown November 24: In The Man Nobody Knows, Bruce Barton likened the world of Jesus to the world of business. Did he bring scriptural insight to management, or subordinate Christianity to capitalism? It’ll be easier to say, & show, after it goes public domain in 38 days. #PublicDomainDayCountdown November 25: Before Virgil Thomson (born today in 1896) was well-known as a composer, he wrote a music column for Vanity Fair. His first columns, and the rest of Vanity Fair for 1925, join the public domain in 37 days. #PublicDomainDayCountdown November 26: “Each moment that we’re apart / You’re never out of my heart / I’d rather be lonely and wait for you only / Oh how I miss you tonight” Those staying safe by staying apart this holiday might appreciate this song, which joins the public domain in 36 days. #PublicDomainDayCountdown (The song, “Oh, How I Miss You Tonight” is by Benny Davis, Joe Burke, and Mark Fisher, was published in 1925, and performed and recorded by many musicians since then, some of whom are mentioned in this Wikipedia article.) November 27: Feminist author Katharine Anthony, born today in 1877, was best known for her biographies. Her 1925 biography of Catherine the Great, which drew extensively on the empress’s private memoirs, joins the public domain in 35 days. #PublicDomainDayCountdown November 28: Tonight in 1925 “Barn Dance” (soon renamed “Grand Ole Opry”) debuted in Nashville. Most country music on it & similar shows then were old favorites, but there were new hits too, like “The Death of Floyd Collins”, which joins the public domain in 34 days. #PublicDomainDayCountdown (The song, with words by Andrew Jenkins and music by John Carson, was in the line of other disaster ballads that were popular in the 1920s. This particular disaster had occurred earlier in the year, and became the subject of song, story, drama, and film.) November 29: As many folks get ready for Christmas, many Christmas-themed works are also almost ready to join the public domain in 33 days. One is The Holly Hedge, and Other Christmas Stories by Temple Bailey. More on the book & author. #PublicDomainDayCountdown November 30: In 1925 John Maynard Keynes published The Economic Consequences of Sterling Parity objecting to Winston Churchill returning the UK to the gold standard. That policy ended in 1931; the book’s US copyright lasted longer, but will finally end in 32 days. #PublicDomainDayCountdown December 1: Du Bose Heyward’s novel Porgy has a distinguished legacy of adaptations, including a 1927 Broadway play, and Gershwin’s opera “Porgy and Bess”. When the book joins the public domain a month from now, further adaptation possibilities are limitless. #PublicDomainDayCountdown December 2: In Dorothy Black’s Romance — The Loveliest Thing a young Englishwoman “inherits a small sum of money, buys a motor car and goes off in search of adventure and romance”. First serialized in Ladies’ Home Journal, it joins the public domain in 30 days. #PublicDomainDayCountdown December 3: Joseph Conrad was born on this day in 1857, and died in 1924, leaving unfinished his Napoleonic novel Suspense. But it was still far enough along to get serialized in magazines and published as a book in 1925, and it joins the public domain in 29 days. 
#PublicDomainDayCountdown December 4: Ernest Hemingway’s first US-published story collection In Our Time introduced his distinctive style to an American audience that came to view his books as classics of 20th century fiction: It joins the public domain in 28 days. #PublicDomainDayCountdown December 5: Libertarian author Rose Wilder Lane helped bring her mother’s “Little House” fictionalized memoirs into print. Before that, she published biographical fiction based on the life of Jack London, called He Was a Man. It joins the public domain in 27 days. #PublicDomainDayCountdown December 6: Indiana naturalist and author Gene Stratton-Porter died on this day in 1924. Her final novel, The Keeper of the Bees, was published the following year, and joins the public domain in 26 days. One review. #PublicDomainDayCountdown December 7: Willa Cather was born today in 1873. Her novel The Professor’s House depicts 1920s cultural dislocation from a different angle than F. Scott Fitzgerald’s better-known Great Gatsby. It too joins the public domain in 25 days. #PublicDomainDayCountdown December 8: The last symphony published by Finnish composer Jean Sibelius (born on this day in 1865) is described in the Grove Dictionary as his “most remarkable compositional achievement”. It joins the public domain in the US in 24 days. #PublicDomainDayCountdown December 9: When the Habsburg Empire falls, what comes next for the people & powers of Vienna? The novel Old Wine, by Phyllis Bottome (wife of the local British intelligence head) depicts a society undergoing rapid change. It joins the US public domain in 23 days. #PublicDomainDayCountdown December 10: Lewis Browne was “a world traveler, author, rabbi, former rabbi, lecturer, socialist and friend of the literary elite”. His first book, Stranger than Fiction: A Short History of the Jews, joins the public domain in 22 days. #PublicDomainDayCountdown December 11: In 1925, John Scopes was convicted for teaching evolution in Tennessee. Books explaining the science to lay audiences were popular that year, including Henshaw Ward’s Evolution for John Doe. It becomes public domain in 3 weeks. #PublicDomainDayCountdown December 12: Philadelphia artist Jean Leon Gerome Ferris was best known for his “Pageant of a Nation” paintings. Three of them, “The Birth of Pennsylvania”, “Gettysburg, 1863”, and “The Mayflower Compact”, join the public domain in 20 days. #PublicDomainDayCountdown December 13: The Queen of Cooks, and Some Kings was a memoir of London hotelier Rosa Lewis, as told to Mary Lawton. Her life story was the basis for the BBC and PBS series “The Duchess of Duke Street”. It joins the public domain in 19 days. #PublicDomainDayCountdown December 14: Today we’re celebrating new films being added to the National Film Registry. In 18 days, we can also celebrate more Registry films joining the public domain. One is The Clash of the Wolves, starring Rin Tin Tin. #PublicDomainDayCountdown December 15: Etsu Inagaki Sugimoto, daughter of a high-ranking Japanese official, moved to the US in an arranged marriage after her family fell on hard times. Her 1925 memoir, A Daughter of the Samurai, joins the public domain in 17 days. #PublicDomainDayCountdown December 16: On the Trail of Negro Folk-Songs compiled by Dorothy Scarborough assisted by Ola Lee Gulledge, has over 100 songs. Scarborough’s next of kin (not Gulledge, or any of their sources) renewed its copyright in 1953. But in 16 days, it’ll be free for all. 
#PublicDomainDayCountdown December 17: Virginia Woolf’s writings have been slowly entering the public domain in the US. We’ve had the first part of her Mrs. Dalloway for a while. The complete novel, and her first Common Reader essay collection, join it in 15 days. #PublicDomainDayCountdown December 18: Lovers in Quarantine with Harrison Ford sounds like a movie made for 2020, but it’s actually a 1925 silent comedy (with a different Harrison Ford). It’ll be ready to go out into the public domain after a 14-day quarantine. #PublicDomainDayCountdown December 19: Ma Rainey wrote, sang, and recorded many blues songs in a multi-decade career. Two of her songs becoming public domain in 13 days are “Shave ’em Dry” (written with William Jackson) & “Army Camp Harmony Blues” (with Hooks Tilford). #PublicDomainDayCountdown December 20: For years we’ve celebrated the works of prize-winning novelist Edith Wharton as her stories join the public domain. In 12 days, The Writing of Fiction, her book on how she writes her memorable tales, will join that company. #PublicDomainDayCountdown December 21: Albert Payson Terhune, born today in 1872, raised and wrote about dogs he kept at what’s now a public park in New Jersey. His book about Wolf, who died heroically and is buried there, will also be in the public domain in 11 days. #PublicDomainDayCountdown December 22: In the 1920s it seemed Buster Keaton could do anything involving movies. Go West, a 1925 feature film that he co-wrote, directed, co-produced, and starred in, is still enjoyed today, and it joins the public domain in 10 days. #PublicDomainDayCountdown December 23: In 9 days, not only will Theodore Dreiser’s massive novel An American Tragedy be in the public domain, but so will a lot of the raw material that went into it. Much of it is in @upennlib‘s special collections. #PublicDomainDayCountdown December 24: Johnny Gruelle, born today in 1880, created the Raggedy Ann doll, and a series of books sold with it that went under many Christmas trees. Two of them, Raggedy Ann’s Alphabet Book and Raggedy Ann’s Wishing Pebble, join the public domain in 8 days. #PublicDomainDayCountdown December 25: Written in Hebrew by Joseph Klausner, translated into English by Anglican priest Herbert Danby, Jesus of Nazareth reviewed Jesus’s life and teachings from a Jewish perspective. It made a stir when published in 1925, & joins the public domain in 7 days. #PublicDomainDayCountdown December 26: “It’s a travesty that this wonderful, hilarious, insightful book lives under the inconceivably large shadow cast by The Great Gatsby.” A review of Anita Loos’s Gentlemen Prefer Blondes, also joining the public domain in 6 days. #PublicDomainDayCountdown December 27: “On revisiting Manhattan Transfer, I came away with an appreciation not just for the breadth of its ambition, but also for the genius of its representation.” A review of the John Dos Passos novel becoming public domain in 5 days. #PublicDomainDayCountdown December 28: All too often legal systems and bureaucracies can be described as “Kafkaesque”. The Kafka work most known for that sense of arbitrariness and doom is Der Prozess (The Trial), reviewed here. It joins the public domain in 4 days. #PublicDomainDayCountdown December 29: Chocolate Kiddies, an African American music and dance revue that toured Europe in 1925, featured songs by Duke Ellington and Jo Trent including “Jig Walk”, “Jim Dandy”, and “With You”. They join the public domain in 3 days. 
#PublicDomainDayCountdown December 30: Lon Chaney starred in 2 of the top-grossing movies of 1925. The Phantom of the Opera has long been in the public domain due to copyright nonrenewal. The Unholy Three, which was renewed, joins it in the public domain in 2 days. #PublicDomainDayCountdown (If you’re wondering why some of the other big film hits of 1925 haven’t been in this countdown, in many cases it’s also because their copyrights weren’t renewed. Or they weren’t actually copyrighted in 1925.) December 31: “…You might as well live.” Dorothy Parker published “Resumé” in 1925, and ultimately outlived most of her Algonquin Round Table-mates. This poem, and her other 1925 writing for periodicals, will be in the public domain tomorrow. #PublicDomainDayCountdown Posted in copyright, publicdomain | 3 Comments From our subjects to yours (and vice versa) Posted on December 3, 2020 by John Mark Ockerbloom (TL;DR: I’m starting to implement services and publish data to support searching across library collections that use customized subject headings, such as the increasingly-adopted substitutes for LCSH terms like “Illegal aliens”. Read on for what I’m doing, why, and where I would value advice and discussion on how to proceed.) I’ve run the Forward to Libraries service for a few years now. As I’ve noted in earlier posts here, it’s currently used on The Online Books Page and in some Wikipedia articles to search for resources in your local library (or any other library you’re interested in) on a subject you’re exploring. One of the key pieces of infrastructure that makes it work is the Library of Congress Subject Headings (LCSH) system, which many research libraries use to describe their holdings. Using the headings in the system, along with mappings between it and other systems for describing subjects (such as the English Wikipedia article titles that Forward to Libraries knows how to relate to LCSH) allows researchers to find materials on the same subjects across multiple collections, using common terminology. There are limitations to relying on LCSH for cross-collection subject searches, though. First of all, many libraries, particularly those outside the US, do not use LCSH. Some use other subject vocabularies. If a mapping has been defined between LCSH and another subject vocabulary (as has been done, for example, with MeSH) one can use that mapping to determine search terms to use in libraries that use that subject vocabulary. We don’t yet have that capability in Forward to Libraries, but I’m hoping to add it eventually. Changing the subjects I’m now also seeing more libraries that use LCSH, but that also use different terms for certain subjects that they find more appropriate for their users. While there is a process for updating LCSH terms (and its terms get updated on a monthly basis) the process can be slow, hard for non-specialists to participate in, and contentious, particularly for larger-scale subject heading changes. It can also be subject to pressure by non-librarians. The Library of Congress ultimately answers to Congress (as its name suggests), and members of Congress have used funding bills to block changes in subject headings that the librarian-run process had approved. They did that in 2016 for the subject heading “Illegal aliens”, where librarians had recommended using other terms to cover subjects related to unauthorized immigration. The documentary film “Change the Subject” (linked with context in this article) has a detailed report on this controversy. 
Four years after the immigration subject changes were blocked, some libraries have decided not to wait for LCSH to change, and are introducing their own subject terms. The University of Colorado Boulder, for example, announced in 2018 that they would use the term “Undocumented immigrants” where the Library of Congress had “Illegal aliens”. Other libraries have recently announced similar changes. Some library consortia have organized systematic programs to supersede outdated and offensive terms in LCSH in their catalogs. Some groups now maintain specialized subject vocabularies that can both supplement and supersede LCSH terms, such as Homosaurus for LGBT+-related subjects. And there’s also been increasing interest in using subject terms and classifications adapted to local communities. For instance, the Brian Deer Classification System is intended to be both used and shaped by local indigenous communities, and therefore libraries in different locations that use it may well use different terms for some subjects, depending on local usage and interests.
Supporting cross-collection search in a community of localized catalogs
We can still search across collections that use local terms, as long as we know what those terms are and how to translate between them. Forward to Libraries already uses a data file indicating Wikipedia article titles that correspond closely to LCSH subjects, and vice versa. By extension, we can also create a data file indicating terms to use at a given library that correspond to terms in LCSH and other vocabularies, so we can see what resources are available at different places on a given topic. You can see how that works in practice at The Online Books Page. As I write this, we’re still using the unaltered LCSH subjects (updated to October 2020), so we have a subject page showing free online books on “Illegal aliens”. You can follow links from there to see what other libraries have. If you select the “elsewhere” link in the upper left column and choose the Library of Congress as the library to search, you’ll see what they hold under that subject heading. But if you instead choose the University of Colorado Boulder, you’ll see what they have under “Undocumented immigrants”, the subject term they’ve adopted there. Similar routing happens from Wikipedia. The closest related Wikipedia article at present is “Illegal immigration”, and if you go down to the Further Reading section and select links in the Library Resources box, selecting “Online books” or most libraries will currently take you to their “Illegal aliens” subject search. But selecting University of Colorado Boulder (from “Resources in other libraries” if you don’t already have it specified as your preferred library in Wikipedia) will take you to their “Undocumented immigrants” search. This routing applies two mappings, one from Wikipedia terms to LCSH terms, and another from LCSH terms to local library terms.
A common data resource
These sorts of transformations are fundamentally data-driven. My Forward to Libraries Github repository now includes a data file listing local subject terms that different libraries use, and how they relate to LCSH subject terms. (The library codes used in the file are the same ones that are used in my libraries data file, and are based on OCLC and/or ISIL identifiers.)
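To make the two-step routing concrete, here is a minimal sketch in Python. The dictionaries, library codes, and function name below are hypothetical stand-ins invented for illustration; the actual Forward to Libraries data files and code have their own formats and identifiers.

```python
# A minimal sketch of the two-step subject routing described above.
# The data here is a hypothetical stand-in for the Forward to Libraries
# data files; the real files use their own formats and library codes.

# Wikipedia article title -> LCSH heading
WIKIPEDIA_TO_LCSH = {
    "Illegal immigration": "Illegal aliens",
}

# (library code, LCSH heading) -> local heading used at that library
LOCAL_TERMS = {
    ("EXAMPLE-LIB", "Illegal aliens"): "Undocumented immigrants",
}


def local_subject_term(wikipedia_title: str, library_code: str) -> str:
    """Return the subject term to search at a given library for a Wikipedia topic.

    Falls back to the LCSH heading itself when the library has no local override.
    """
    lcsh_heading = WIKIPEDIA_TO_LCSH.get(wikipedia_title, wikipedia_title)
    return LOCAL_TERMS.get((library_code, lcsh_heading), lcsh_heading)


print(local_subject_term("Illegal immigration", "ANOTHER-LIB"))  # "Illegal aliens"
print(local_subject_term("Illegal immigration", "EXAMPLE-LIB"))  # "Undocumented immigrants"
```

The design point is simply that the local overrides live in a small, shareable lookup table layered on top of the existing Wikipedia-to-LCSH mapping, so adding a library's customizations does not require changing the base mapping.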
The local subject terms file is very short for now; as I write this, it only has enough data for the examples I’ve described above, but I’ll be adding more data shortly for other libraries that have announced and implemented subject heading changes. (And I’ll be glad to hear about more so I can add them.) As with other data in this repository, the data in this file is CC0, so it can be used by anyone for any purpose. In particular, it could be used by services other than my Forward to Libraries tool, such as by aggregated catalogs that incorporate data from multiple libraries, some of which might use localized subject terms that have LCSH analogues.
Where to go next
What I’ve shown so far is not far removed from a proof-of-concept demo, but I hope it suggests ways that services can be developed to support searches among and across library collections with diverse subject headings. As I mentioned, I’ll be adding more data on localized subject headings as I hear about it, as well as adding more functionality to the Forward to Libraries service (such as the ability to link from a collection with localized subject headings, so I can support them in The Online Books Page, or in other libraries that have such headings and want to use the service). There are some extensions that could be done to the basic data model to support scaling up these sorts of localizations, such as customizations used by all the libraries in a given consortium, or ones that adopt wholesale an alternative set of subjects, whether that be MeSH, Homosaurus, or the subject thesaurus of a national library outside the US. Even with data declarations supporting those sorts of “bulk” subject mappings, a universal subject mapping knowledge base could get large over time. I’ve created my own mapping file for my services, and for now I’m happy to grow it as needed and share the data freely. But if there is another suitable mapping hub already available or in the works, I’m happy to consider using that instead. It’s important to support exploration across a community of diverse libraries with a diverse array of subject terms and descriptions. I hope the tools and data I’ve described here will help advance us towards that goal, and that I can help grow them from their current nascent state to make them more broadly useful.
Posted in discovery, metadata, subjects, wikipedia | Leave a comment
Everybody’s Library Questions: Finding films in the public domain
Posted on March 30, 2020 by John Mark Ockerbloom
Welcome to another installment of Everybody’s Library Questions, where I give answers to questions people ask me (in comments or email) that seem to be useful for general consumption. Before I start, though, I want to put in a plug for your local librarians. Even though many library buildings are closed now (as they should be) while we’re trying to get propagation and treatment for COVID-19 under control, many of those libraries offer online services, including interactive online help from librarians. (Many of our libraries are also expanding the scope and hours of these services during this health crisis.) Your local librarians will have the best knowledge of what’s available to you, can find out more about your needs when they talk to you, and will usually be able to respond to questions faster than I or other specific folks on the Internet can. Check out your favorite library’s website, and look for links like “get help” or “online chat” and see what they offer.
OK, now here’s the question, extracted from a comment made by Nicholas Escobar to a recent post:
I am currently studying at the University of Edinburgh getting masters degree in film composition. For my final project I am required to score a 15 minute film. I was thinking of picking a short silent film (any genre) in the public domain that is 15 minutes (or very close to that length) and was wondering if you had any suggestions?
There are three questions implied by this one: First, how do you find out what films exist that meet your content criteria? Second, how do you find out whether films in that set are in the public domain? Finally, how can you get access to a film so you can do things with it (such as write a score for it)?
There are a few ways you can come up with films to consider. One is to ask your local librarian (see above) or professor to recommend reference works or data sources that feature short films. (Information about feature films, which run longer, is often easier to find, but there’s a fair bit out there as well on short films.) Another is to search some of the reference works and online data sources I’ll mention in the other answers below.
The answer to the copyright question depends on where you are. In the United States, there are basically three categories of public domain films:
First, there are films copyrighted before 1925. All such films’ copyrights have now expired in the US. This covers most, but not all, of the commercial silent-film era; once The Jazz Singer came out in 1927, movie studios quickly switched to films with sound.
Second, there are US films that entered the public domain because they did not take the steps required to secure or maintain their copyrights. Researching whether this has occurred with a particular film can be complicated, but because there’s been so much interest in cinema history, others have already researched the copyright history of many US films. The Wikipedia article “List of films in the public domain in the United States” cites a number of reference sources you can check for the status of various films. (It also lists specific films believed to be in the public domain, but you should check sources cited in the article for those films, and not just take the word of what could be a random Internet user before relying on that information.)
Third, there are films created in their entirety by the US government. There’s a surprisingly large number of these, in various genres and lengths, with tens of thousands or more digitized in the Internet Archive’s United States Government film collection or listed in the National Archives catalog. You can do lots of things with works of the United States government, which are generally not subject to copyright.
That’s the situation in the United States, at least. However, if you’re not in the United States, different rules may apply. In Edinburgh and elsewhere in the United Kingdom (and in most of the rest of Europe), works are generally copyrighted until the end of the 70th year after the death of the last author. In the UK, the authors of a film are considered to be the principal director, the screenwriter(s), and the composer(s). (For more specifics, see the relevant portion of UK law.) However, some countries will also let the copyrights of foreign works expire when they do in their country of origin, and in those countries a US film that’s in the public domain in the US would also be public domain.
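For readers who like to see the decision process laid out, here is a rough Python sketch of the US-side triage just described. It is a simplification for illustration only: the function and its parameters are invented for this sketch, it encodes only the three broad categories above, and it is no substitute for checking renewal records, copyright notices, and GATT restoration; as the next paragraphs note, non-US rules differ as well.

```python
from typing import Optional


def us_film_pd_triage(
    publication_year: int,
    us_government_work: bool = False,
    formalities_maintained: Optional[bool] = None,
) -> str:
    """Rough triage of a film's US public domain status (as of 2020).

    Encodes only the three broad categories discussed above; real
    determinations require research into renewals, notices, and
    GATT restoration for foreign films.
    """
    if us_government_work:
        return "public domain: US government work"
    if publication_year < 1925:
        return "public domain: copyright expired"
    if formalities_maintained is False:
        return "likely public domain: formalities not met (verify with cited sources)"
    return "assume in copyright until research shows otherwise"


# Example: a 1923 short subject vs. a 1940 studio feature of unknown status.
print(us_film_pd_triage(1923))  # public domain: copyright expired
print(us_film_pd_triage(1940))  # assume in copyright until research shows otherwise
```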
As you can see in the UK law section I link to, the UK does apply such a “rule of the shorter term” to films from outside the European Economic Area (EEA), if none of the authors are EEA nationals. So you might be good to go in the UK with many, but not all, US films that are public domain in the US. (I’m not a UK copyright expert, though; you might want to talk to one to be sure.)
Let’s suppose you’ve come up with some suitable possible films, either ones that are in the public domain, ones that have suitable Creative Commons licenses or for which you can otherwise get permission to score, or ones that are in-copyright but that you could score in the context of a study project, even if you couldn’t publish the resulting audiovisual work. (Educational fair use is a thing, though its scope also varies from country to country. Here’s a guide from the British Library on how it works in the UK.) We then move on to the last question: How do you get hold of a copy so you can write a score for it?
The answer to that question depends on your situation. Right now, the situation for many of us is that we’re stuck at home, and can’t visit libraries or archives in person. (And our ability to get physical items like DVDs or videotapes may be limited too.) So for now, you may be limited to films you can obtain online. There are various free sources of public domain films: I’ve already mentioned the Internet Archive, whose moving image archive includes many films that are in the public domain (and many that are not, so check rights before choosing one to score). The Library of Congress also offers more than 2,000 compilations and individual films free to all online. And your local library may well offer more, as digital video, or as physical recordings (if you can still obtain those). A number of streaming services that libraries or individuals can subscribe to offer films in the public domain that you can feel free to set to music. Check with your librarian or browse the collection of your favorite streaming service.
I’m not an expert in films myself. Folks reading this who know more, or have more suggestions, should feel free to add comments to this post while comments are open. In general, the first librarians you talk to won’t usually be experts about the questions you ask. But even when we can’t give definitive answers on our own, we’re good at sending researchers in productive directions, whether that’s to useful research and reference sources, or to more knowledgeable people. I hope you’ll take advantage of your librarians’ help, especially during this health crisis. And, for my questioner and other folks who are interested in scoring or otherwise building on public domain films, I’ll be very interested in hearing about the new works you produce from them.
Posted in copyright, publicdomain, Questions | Comments Off on Everybody’s Library Questions: Finding films in the public domain
Build a better registry: My intended comments to the Library of Congress on the next Register of Copyrights
Posted on March 19, 2020 by John Mark Ockerbloom
The Library of Congress is seeking public input on abilities and priorities desired for the next Register of Copyrights, who heads the Copyright Office, a department within the Library of Congress. The deadline for comments as I write this is March 20, though I’m currently having trouble getting the form to accept my input, and operations at the Library, like many other places, are in flux due to the COVID-19 pandemic.
Below I reproduce the main portion of the comments I’m hoping to get in before the deadline, in the hope that they will be useful both for them and for others interested in copyright. I’ve added a few hyperlinks for context.
At root, the Register of Copyrights needs to do the job the position title implies: Build and maintain an effective copyright registry. A well designed, up-to-date digital registry should make it easy for rightsholders to register, and for the public to use registration information. Using today’s copyright registry involves outdated, cumbersome, and costly technologies and practices. Much copyright data is not online, and the usability of what is online is limited.
The Library of Congress is now redesigning its catalogs for linked data and modern interfaces. Its Copyright Office thus also has an opportunity to build a modern copyright registry linked to Library databases and to the world, with compatible linked data technologies, robust APIs, and free open bulk downloads. The Copyright Office’s registry and the Library of Congress’s bibliographic and authority knowledge bases could share data, using global identifiers to name and describe entities they both cover, including publications, works, creators, rightsholders, publishers, serials and other aggregations, registrations, relationships, and transactions. The Copyright Office need not convert wholesale to BIBFRAME, or to other Library-specific systems. It simply needs to create and support identifiers for semantic entities described in the registry (“things, not strings“), associate data with them, and exchange data in standard formats with the Library of Congress catalog and other knowledge bases. As a comprehensive US registry for creative works of all types, the Copyright Office is uniquely positioned to manage such data.
The Deep Backfile project at the University of Pennsylvania (which I maintain) provides one example of uses that can be made of linked copyright data. One of its pages shows selected copyrights associated with Collier’s Magazine (1888-1957). It links to online copies of public domain issues, contents and descriptive information from external sources like FictionMags, Wikidata, and Wikipedia, and rights contact information for some of its authors. The information shown has no rights restrictions, and can be used by humans and machines. JSON files, and the entire Deep Backfile knowledge base, are available from this page and from Github.
It is not the Copyright Office’s job to produce applications like these. But it can provide data that powers them. Much of our Deep Backfile data was copied manually from scanned Catalog of Copyright Entries pages, and from online catalogs lacking easily exported or linked data. The Copyright Office and the Library of Congress could instead produce such data natively (first prospectively, eventually retrospectively). In the process, they could also cross-pollinate each other’s knowledge bases.
To implement this vision, the Register needs to understand library standards and linked open data technologies, gather and manage a skilled implementation team, and be sufficiently persuasive, trusted, and organized to bring stakeholders together inside and outside the Copyright Office and the Library of Congress to support and fund a new system’s development. If explained and implemented well, a registry of the sort described here could greatly benefit copyright holders and copyright users alike.
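As a purely illustrative aside, here is a sketch (in Python, serialized as JSON) of what an identifier-centered registration record of the “things, not strings” sort might look like. Every URI, property name, and value below is invented for the sketch; a real registry would publish its own vocabulary, identifiers, and serializations, and nothing here reflects an actual Copyright Office or Library of Congress schema.

```python
import json

# An invented, illustrative registration record organized around identifiers
# rather than free-text strings. None of these URIs or property names come
# from an actual Copyright Office or Library of Congress schema.
registration = {
    "id": "https://registry.example.gov/registration/B000001",
    "type": "CopyrightRegistration",
    "registrationDate": "1925-09-05",
    "registers": {
        "id": "https://registry.example.gov/work/example-serial-issue-1925-09-05",
        "type": "SerialIssue",
        "partOf": "https://registry.example.gov/serial/example-magazine",
    },
    "claimant": "https://registry.example.gov/agent/example-publisher",
}

# Publishing records like this in bulk, in a standard serialization, would let
# library catalogs and projects like Deep Backfile link to the same entities
# by identifier instead of re-keying strings from scanned catalog pages.
print(json.dumps(registration, indent=2))
```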
The Register of Copyrights should also know copyright law thoroughly, implement sensible regulations required by copyright law and policy, and be a trusted and inclusive expert that rightsholders, users, and policymakers can consult. I expect other commenters to go into more detail about these skills, which are also useful in building a trustworthy registry of the sort I describe. But the Copyright Office is long overdue to be led by a Register who can revitalize its defining purpose: Register copyrights, in up-to-date, scalable, and flexible ways that encourage wide use of the creations they cover, and thus promote the progress of science and useful arts.
Update, March 20: As of the late afternoon on the day of the deadline, the form appears to be still rejecting my submission, without a clear error message. It did, however, accept a very short submission without any attachment, and with a URL pointing here. So below I include the rest of my intended comment, listing 3 top priorities. (The essay above was for the longer comment asked for about knowledge, skills, and abilities.) These priorities largely restate in summary form what I wrote above. If anyone else reading this was unable to post their full comment by the deadline due to technical difficulties, you can try emailing something to me (or leaving a comment to this post) and posting a simple comment to that effect on the LC site, and I’ll do my best to get your full comment posted on this blog.
Priority #1: Make copyright registration data easy to use: Data should be easy to search, consult, and analyze, individually and in bulk, by people and machines, linked with the Library of Congress’s rich bibliographic data, facilitating verification of copyright ownership, licensing from rightsholders, and cataloging and analysis by libraries, publishers, vendors, and researchers.
Priority #2: Make effective copyright registration easy to do: Ensure copyright registration is simple, inexpensive, supports a variety of electronic and physical deposits, and where possible supports persistent, addressable identifiers and accompanying data for semantic entities described in registrations, and their relationships.
Priority #3: Be a trusted, inclusive resource for understanding copyright and its uses: Creators, publishers, consumers, and policymakers all are concerned with copyright, and with possible reforms. The Register should help all understand their rights, and provide expert and impartial advice and mediation for diverse copyright stakeholders and policymaking priorities.
Other factors: The Register of Copyrights should also be capable of creating, implementing, and keeping up to date appropriate regulations and practices required or implied by Congressional statutes. (For the “additional comments” attachment, I had a static PDF attachment showing the Collier’s web page linked from my main essay, as it was on March 19.)
Posted in copyright, data, metadata, open access, serials | Comments Off on Build a better registry: My intended comments to the Library of Congress on the next Register of Copyrights
Welcome to everybody’s online libraries
Posted on March 16, 2020 by John Mark Ockerbloom
As coronavirus infections spread throughout the world, lots of people are staying home to slow down the spread and save lives. In the US, many universities, schools, and libraries have closed their doors. (Here’s what’s happening at the library where I work, which as I write this has closed all its buildings.)
But lots of people are still looking for information, to continue studies online, or just to find something good to read. Libraries are stepping up to provide these things online. Many libraries have provided online information for years, through our own websites, electronic resources that we license, create, or link to, and other online services. During this crisis, as our primary forms of interaction move online, many of us will be working hard to meet increased demand for digital materials and services (even as many library workers also have to cope with increased demands and stresses on their personal lives). Services are likely to be in flux for a while. I have a few suggestions for the near term:
Check your libraries’ web sites regularly. They should tell you whether the libraries are now physically open or closed (many are closed now, for good reason), and what services the library is currently offering. Those might change over time, sometimes quickly. Our main library location at Penn, for instance, was declared closed indefinitely last night, less than 12 hours before it was next due to reopen. On the other hand, some digitally mediated library services and resources might not be available initially, but then become available after we have safe and workable procedures set up for them and sufficient staffing. Many library web sites also prominently feature their most useful electronic resources and services, and have extensive collections of electronic resources in their catalogs or online directories. They may be acquiring more electronic resources to meet increased user demand for online content. Some providers are also increasing what they offer to their library customers during the crisis, and sometimes making some of their material free for all to access. If you need particular things from your library during this crisis, reach out to them using the contact information given on their website. When libraries know what their users need, they can often make those needs a priority, and can let you know if and when they can provide them.
Check out other free online library services. I run one of them, The Online Books Page, which now lists over 3 million books and serials freely readable online due to their public domain status or the generosity of their rightsholders. We’ll be adding more material there over the next few weeks as we incorporate the listings of more collections, and respond to your requests. There are many other services online as well. Wikipedia serves not only as a crowd-sourced collection of articles on millions of topics, but also as a directory of further online resources related to those topics. And the Internet Archive also offers access to millions of books and other information resources no longer readily commercially available, many through controlled digital lending and other manifestations of fair use. (While the limits of fair use are often subject to debate, library copyright specialists make a good case that its bounds tend to increase during emergencies like this one. See also Kyle Courtney’s blog for more discussion of useful things libraries can do in a health crisis with their copyright powers.)
Support the people who provide the informative and creative resources you value. The current health crisis has also triggered an economic crisis that will make life more precarious for many creators. If you have funds you can spare, send some of them their way so they can keep making and publishing the content you value.
Humble Bundles, for instance, offer affordable packages of ebooks, games, and other online content you can enjoy while you’re staying home, and pay for to support their authors, publishers, and associated charities. (I recently bought their Tachyon SF bundle with that in mind; it’s on offer for two more weeks as I write this.) Check the websites of your favorite authors and artists to see if they offer ways to sponsor their work, or specific projects they’re planning. Buy books from your favorite independent booksellers (and if they’re closed now, check their website or call them to see if you can buy gift cards to keep them afloat now and redeem them for books later on). Pay for journalism you value. Support funding robust libraries in your community.
Consider ways you can help build up online libraries. Many research papers on COVID-19 and related topics have been opened to free access by their authors or publishers since the crisis began. Increasing numbers of scholarly and other works are also being made open access, especially by those who have already been paid for creating them. If you’re interested in sharing your work more broadly, and want to learn more about how you can secure rights to do so, the Authors’ Alliance has some useful resources. As libraries shift focus from in-person to online service, some librarians may be busy with new tasks, while others may be left hanging until new plans and procedures get put into motion. If you’re in the latter category, and want something to do, there are various library-related projects you can work on or learn about. One that I’m running is the deep backfile project to identify serial issues that are in the public domain in less-than-obvious ways, and to find or create free digital copies of these serials (so that, among other things, people who are stuck at home can read them online). I’ve recently augmented my list of serial backfiles to research to include serials held by the library in which I work, in the hopes that we could eventually find or produce digital surrogates for some of them that our readers (and anyone else interested) could access from afar. I can also add sets for other libraries; if you’re interested in one for yours, let me know and I can go into more detail about the data I’m looking for. (I’m not too worried about creating too many serial sets to research, especially since once information about a serial is added into one of the serial sets, it also gets automatically added into any other sets that include that serial.)
Take care of yourself, and your loved ones. Whether you work in libraries or just use them, this is a stressful time. Give yourself and those around you room and resources to cope, as we disengage from many of our previous activities, and deal with new responsibilities and concerns. I’m gratified to see the response of the Wikimedia Foundation, for instance, which is committed both to keeping the world well-informed and up-to-date through Wikipedia and related projects, and also to letting its staff and contractors work half-time for the same pay during the crisis, and waiving sick-day limits. Among new online community support initiatives, I’m also pleased to see librarian-created resources like the Ontario Library Association’s pandemic information brief, with useful information for library users and workers, and the COVID4GLAM Discord community, a discussion space to support the professional and personal needs of people working in libraries, archives, galleries and museums.
These will be difficult times ahead. Our libraries can make a difference online, even as our doors are closed. I hope you’ll be able to put them to good use.
Posted in libraries, online books, open access | 4 Comments
Public Domain Day 2020: Coming Around Again
Posted on January 1, 2020 by John Mark Ockerbloom
I’m very happy for 2020 to be arriving. As the start of the 2020s, it represents a new decade in which we can have a fresh start, and hope to make better decisions and have better outcomes than some of what we’ve gone through in recent years. And I’m also excited to have a full year’s worth of copyrighted works entering the public domain in much of the world, including in the US for the second year in a row after a 20-year public domain freeze.
Outside the US, in countries that still use the Berne Convention’s “life plus 50 years” copyright terms, works by authors who died in 1969 are now in the public domain. (Such countries include Canada, New Zealand, and a number of other countries mostly in Asia and Africa.) Many other countries, including most European countries, have extended copyright terms to life of the author(s) plus 70 years, often under pressure from the United States or the European Union. In those countries, works by authors who died in 1949 are now in the public domain. The Public Domain Review has a “class of 2020” post featuring some of these authors, along with links to lists of other people who died in the relevant years.
In the US, nearly all remaining copyrights from 1924 have now expired, just as copyrights from 1923 expired at the start of last year. (The exceptions are sound recordings, which will still be under copyright for a little while longer. But thanks to recent changes in copyright law, those too will join the public domain soon instead of remaining indefinitely in state copyright.) I discussed some of the works joining the public domain in a series of blog posts last month, in the last one linking to some posts by others that mentioned new public domain arrivals from 1924. But I’m happy not just because of these specific works, but also because new arrivals to the US public domain are now an annual event, and not just something that happens with published works at rare intervals. I could get used to this.
It isn’t all good news this year. The most recent draft of the intellectual property chapter of the US-Canada-Mexico trade agreement requires Canada to extend its copyrights another 20 years, making it freeze its public domain not long after we’ve unfrozen our own in the US. But the agreement hasn’t yet been ratified, and could conceivably still be changed or rejected. And the continued force of copyrights from the second half of the previous ’20s while we’re entering a new set of ’20s is a reminder that US copyright terms remain overlong; so long, in fact, that many works from that era are lost or severely deteriorated before their copyrights expire.
But there’s now an annual checklist of things to do for me and for many other library organizations. For me, some of the things to do for The Online Books Page include:
Updating our documentation on what’s public domain (done) and on what versions of our site are public domain (also done; as in previous years, I’m dedicating to the public domain works that I wrote whose copyrights I control that were published more than 14 years ago. This year that includes the 2005 copyrights to The Online Books Page.)
Removing the “no US access” notices from 1924 books I’d linked to at non-US sites, when I couldn’t previously establish that they were public domain here; and removing “US access only” notices for 1879 volumes at HathiTrust, which over the next few days will be making 140-year-old volumes globally accessible without requiring author-death-date review. (This and other activities below will start tomorrow and continue until done.)
Updating our list of first active renewals for serials and our “Determining copyright status of serial issues” decision guide to reflect the expiration of 1924’s copyrights. As part of this process, I’ll be deleting all the 1924 serial issue and contribution renewals currently recorded in our serials knowledge base, since they’re no longer in force. If anyone wants to know what they were for historical or other analytical purposes, I have a zipped collection of all our serial renewals records as of the end of 2019, available on request. They can also be found in the January 1, 2020 commit of this Github directory.
Adding newly opened or scanned 1924 books to our listings, through our automated OAI harvests of selected digital collections, readers’ suggestions and requests, surveys of prize winners and other relevant collections, and our own bibliographer selections.
All of this is work I’m glad to be doing this year, and hope to be doing more in the years to come. (And I’m already streamlining our processes to make it easier to do in years to come.) It’s the job of libraries to collect and preserve works of knowledge and creativity and make them easy for people to discover, access, and use. It’s also our job to empower our users to draw on those works to make new ones. As the public domain grows, we can freely collect and widely share more works, and our users can likewise build on and reuse more public domain works in their own creations. Supporting the public domain, then, is supporting the work and mission of libraries. I therefore hope that all libraries and their users will support a robust public domain, and have more works to celebrate and work with every year. Happy Public Domain Day!
Posted in publicdomain | Comments Off on Public Domain Day 2020: Coming Around Again
2020 vision #5: Rhapsody in Blue by George Gershwin
Posted on December 31, 2019 by John Mark Ockerbloom
It’s only a few hours from the new year where I write this, but before I ring in the new year, and a new year’s worth of public domain material, I’d like to put in a request for what music to ring it in with: George Gershwin’s Rhapsody in Blue, which joins the public domain in the US as the clock strikes twelve, over 95 years after it was first performed. The unofficial song for Public Domain Day 2019 turned out to be “Yes! We Have No Bananas”, one of the members of the first big class of US public domain works in the last 20 years. That’s a fun novelty song, and certainly memorable, but not something I necessarily want to hear a lot. In contrast, for me Rhapsody in Blue has a freshness that makes it a joy to hear repeatedly, right from the opening clarinet glissando (apparently the idea of clarinetist Ross Gorman, who took the scale that Gershwin had composed for the piece and gave it the bendy, slidy wail that tells you right away that this is no ordinary concert piece).
It’s brought together classical, popular, high-art and everyday music, as it’s been played and recorded countless times by jazz bands (the original scoring is for jazz band and piano), symphony orchestras, and pop musicians like Billy Joel. Even its licensing as a theme tune for an airline hasn’t diminished it.
There’s lots of other work joining the public domain along with Gershwin’s tune. I’ve only had a chance to mention a few others in my short series, but others have mentioned more works you may find of interest. At the Internet Archive’s blog, Elizabeth Townsend Gard writes about Vera Brittain’s Not without Honour and other 1924 works that will be in the public domain very soon. Duke’s Public Domain Day 2020 post mentions various books, films, and musical compositions joining the public domain as well (and has more to say on Rhapsody in Blue). Wikipedia’s various 1924 articles also mention various works that will either be joining the public domain, or becoming more clearly established there. And HathiTrust will begin opening access to tens of thousands of scanned volumes from 1924 over the next few days.
I’ll have more to say on the new arrivals tomorrow, sometime after the midnight bells chime. By tradition, the first tune played in the New Year is usually the public domain song “Auld Lang Syne”. But after that, at your New Year’s party or at a later Public Domain celebration, you might enjoy hearing or playing Gershwin’s new arrival in the public domain.
Posted in publicdomain | Comments Off on 2020 vision #5: Rhapsody in Blue by George Gershwin
2020 vision #4: Ding Dong Merrily on High by George Ratcliffe Woodward and others
Posted on December 19, 2019 by John Mark Ockerbloom
It’s beginning to sound a lot like Christmas everywhere I go. The library where I work had its holiday party earlier this week, where I joined librarian colleagues singing Christmas, Hanukkah, and winter-themed songs in a pick-up chorus. Radio stations and shopping centers play a familiar rotation of popular seasonal songs whose biggest hits are from a surprisingly narrow date range centered in the 1950s. And more traditional familiar Christmas carols, hymns, and songs are being sung and played in concert halls and churches well into January.
The more “classic” Christmas music often feels timeless to those of us singing and hearing it. But while their roots often go back far, the form in which we know them is often much newer than we might think. Notice how the list in the previous link, for instance, includes “Carol of the Bells”, dated 1936. That’s when it was first published as a Christmas song, one that’s still under copyright. Its roots are older, and darker, as is made clear in a recent Slate article well worth reading. As noted there, the melody is based on a Ukrainian folk tune (date unknown), its full musical setting composed by Mykola Leontovych (assassinated by a Soviet agent in 1921), and Christmas-themed lyrics written by the Ukrainian-descended American musician Peter Wilhousky (who lived until 1978).
While “Carol of the Bells” still has a number of years left to go on its copyright, another classic Christmas carol will most likely be joining the public domain in the US in just under two weeks. Like Carol of the Bells, “Ding Dong Merrily on High” is based on a folk tune, in this case a secular dance tune first published in France in the 16th century under the title “Branle de l’Official”.
In 1924, George Ratcliffe Woodward, an English cleric already known for publishing collections of old songs, wrote lyrics for the tune recalling earlier ages, and included them in the Cambridge Carol-Book, published that year by the Society for Promoting Christian Knowledge. Charles Wood, who’d collaborated with Woodward on the earlier Cowley Carol Book,  wrote a harmonization to go with it.  While you won’t hear it at every Christmas service, it remains widely sung this time of year.  That’s in large part because it’s so much fun to sing, with its dance-like rhythms, its long bell-like vocal runs on “Gloria” (something also heard in “Angels We Have Heard on High“), and its praise of various forms of music (musicians liking to hear good things about themselves as much as anyone else). I don’t actually know for sure that “Ding Dong Merrily on High” is still under copyright here.  I have not found a 1951 or 1952 copyright renewal for the song or the book it was published in, but I’m assuming that, if nothing else, GATT restoration retroactively secured and automatically renewed a 1924 US copyright for the song as published in the Cambridge Carol-Book.  (Folks with more knowledge or legal expertise are free to correct me on that.)  Later published arrangements of the song may continue to have active copyrights, but only for material original to those arrangements.  1924’s remaining copyrights, on the other hand, all end in the US on January 1.   (And since Woodward and Wood both died over 70 years ago, the song’s already public domain in most other countries.) The arrival of 2020, then, should at least clear up any ambiguity about the public domain status of the basic carol.  I appreciate that, in part because this song, like many other Christmas carols, lives in a sort of liminal space between the private property regimes set up for copyright holders and the older, more informal understandings of folk culture.  Both kinds of spaces have good reason to exist. On the one hand, it’s good to have more than a few people who can earn a living through music, and one important way many musicians do so is by controlling rights to their compositions.  On the other hand, the folk process, which originally gave rise to the tunes for both “Ding Dong Merrily on High” and “Carol of the Bells”, is also a very good way of creating and passing on shared cultural works. Conflict can rage when two different sets of cultural expectations around creative works try to occupy the same space.  That’s one reason we’ve seen decades of conflict in academia over open access, where scholarly work is largely published by companies that depend on its control and sale to earn money, while it’s largely written by scholars who earn their money in other ways, and tend to prefer free, widespread availability of their work.  Sometimes informal arrangements work best to keep the peace.  Publishers, for instance, have grown more used to free preprint servers, and memes and fan fiction communities have become more widely accepted (and even winning awards) as long as they stay well away from unauthorized commercial exploitation (where both big and small creators tend to draw the line). Sometimes, though, it’s best to have a more formal understanding that works are free for anyone to freely use as we like.  That’s what we’ll have when 1924’s copyrights end, and the works they cover, such as “Ding Dong Merrily on High” are clearly seen to be in the public domain.  
And then, those of us who are so inclined can freely sing “hosanna in excelsis!“ Posted in publicdomain | Comments Off on 2020 vision #4: Ding Dong Merrily on High by George Ratcliffe Woodward and others 2020 vision #3: The Most Dangerous Game by Richard Connell Posted on December 13, 2019 by John Mark Ockerbloom “Be a realist. The world is made up of two classes–the hunters and the huntees. Luckily, you and I are hunters.” Sanger Rainsford speaks these words at the start of “The Most Dangerous Game”, one of the most famous short stories of all time. First published in Collier’s magazine in 1924, it’s been reprinted in numerous anthologies, been adapted for radio, TV, and multiple movies, and assigned in countless middle and high school English classes.  The tropes established in the story, in which a hunter finds himself a “huntee”, are so well-established in present-day American culture that there are lengthy TV Tropes pages not just for the story itself, but for the trope named by its title. Up until now, the story’s been under copyright in the US, as well as in Europe and other countries that have “life plus 70 years” copyright terms.  (The author, Richard Connell,  died just over 70 years ago in 1949, so as of January 1, it will be public domain nearly everywhere in the world.)  Anyone reprinting the story, or explicitly adapting it for drama or art has had to get permission or pay a royalty.  On the other hand, many creators have reused its basic idea– humans being hunted for sport or entertainment– without getting such permission. That’s because ideas themselves are not copyrightable, but rather the expression of those ideas.  And the basic idea long predates this particular story: Consider, for instance, gladiators in Roman arenas, or tributes being hunted down in the Labyrinth by the Minotaur of Greek mythology.  But the particular formulation in Connell’s short story, in which General Zaroff, a former nobleman bored with hunting animals, lures humans to his private island to hunt and kill them for sport, is both distinctively memorable, and copyrightable.  Stray too close to it, or quote too much from the story, and you may find yourself the target of lawyers.  (But perhaps not if you yourself are dangerous enough game.  I don’t know if the makers of “The Incredibles“, which also featured a rich recluse using his wits and inventions to hunt humans on a private island, paid royalties to Connell’s estate, or relied on fair use or arguments about uncopyrightable ideas.  But in any case, Disney is better equipped to either negotiate or defend themselves against infringement lawsuits than others would be.) Rereading the story recently, I’m struck by both how it reflects its time in some ways, and in how its action is surprisingly economical.  In 1924, we were still living in the shadow of the First World War, in which multiple empires and noble houses fell, while others continued but began to teeter.  The deadly spectacles of public executions and lynchings were still not uncommon in the United States.  And the dividing of people into two classes– those who are inherently privileged and those who are left in the cold or even considered fair game– was particularly salient that year, as the second incarnation of the Ku Klux Klan neared its peak in popularity, and as immigration law was changed to explicitly keep out people of the “wrong” national origin or race.  Those sorts of division haunt our society to this day. 
Rainsford objects to Zaroff’s dehumanizing game in what we now tend to think of as the story’s setup, which actually takes up most of the story’s telling.  (The description of the hunt itself is relatively brief, and no words at all are used to describe the final showdown, which implicitly takes place in the gap between the story’s last two sentences.)  In the end, though, Rainsford prevails by beating his opponent at his own game.  He doesn’t want to kill another human being, but when pressed to the extreme, he adopts his opponent’s rules (at the end giving Zaroff the sporting warning “I am still a beast at bay… Get ready”) and proves to be the better killer. With the story entering the public domain in less than three weeks, we’ll have the chance to reuse, adapt, and critique the story in quotation more freely than ever before.  I hope we use the opportunity not just to recapitulate the story, but to go beyond it in new ways. That’s what happens in the best reuses of tropes.  Consider, for instance, how in the Hunger Games books the main character, Katniss, repeatedly finds ways to subvert the trope of killing others for entertainment.  Instead of prevailing by beating opponents at the deadly human-hunting game the enemy has created, she and her allies find ways to reject the game’s premise, cut it short, or prevent its recurrence. When, in 19 days, we get another year’s worth of public domain works, I hope we too find ways not just to revisit what’s come before, but to make new and better work out of it.  That’s something that the public domain allows everyone, and not just members of some privileged class, to do. Posted in publicdomain | Comments Off on 2020 vision #3: The Most Dangerous Game by Richard Connell
faillab-wordpress-com-5107 ---- Fail!lab Fail!lab technology, libraries and the future!
faillab-wordpress-com-7233 ---- Fail!lab | technology, libraries and the future! Fail!lab technology, libraries and the future! Luddites, Trumpism and Change: A crossroads for libraries Posted on December 6, 2016 by mryanhess “Globalization is a proxy for technology-powered capitalism, which tends to reward fewer and fewer members of society.” – Om Malik Corner someone and they will react. We may be seeing this across the world as change, globalization, technology and economic dislocation force more and more people into the corner of benefit-nots. They are reacting out of desperation. It’s not rational. It’s not pretty. But it shouldn’t be surprising. Years ago at a library conference, one of the keynote speakers forecast that there would be a return to the analog (sorry, my Twitter-based memory does not identify the person). The rapidity of digitization would be met by a reaction. People would scurry back to the familiar, he said. They always do. Fast forward to 2016, where the decades-long trends toward globalization, borderless labor markets, denationalization, exponential technological change and corresponding social revolutions have hit the wall of public reaction. Brexit. Global Trumpism. Call it what you will. We’re in a change moment. The reaction is here. Reacting to the Reaction People in the Blue Zones, the Technorati, the beneficiaries of cheap foreign labor, free trade and technological innovation are scratching their heads. For all their algorithms and AI, they didn’t see this coming. Everything looked good on their feeds. No danger could possibly burst their self-assured bubble of inevitability. All was quiet. It was like a clear blue September 2001 morning in New York City. It was like the boardroom in the Federal Reserve in 2006. The serenity was over in an instant. Since Brexit, and then Trump’s election, the Glittery Digitarians have initiated a period of introspection. They’re looking up from their stock tickers and gold-plated smart watches to find a grim reality: the world is crowded with people who have lost much ground to the global maelstrom that has elevated a very small, lucky few to greatness. They are now seeing, as if for the first time, the shuttered towns. The empty retail stores. The displaced and homeless. Suddenly their confident talk of personal AI assistants has turned from technolust to terror. Their success suddenly looks short-sighted. Om Malik wrote in his recent New Yorker op-ed that Silicon Valley may soon find itself equated with the super villains on Wall Street. He posits that a new business model needs to account for the public good…or else.
I recently read Throwing Rocks at the Google Bus: How Growth Became the Enemy of Prosperity by Douglas Rushkoff. If you haven’t read it, now would be a good time. Like Bernie Sanders and others, Rushkoff has been warning of this kind of reaction for a while. The system is not designed for the public good, but only around a narrow set of shareholder requirements. All other considerations do not compute. My Reaction Let me put this in personal perspective. In my work, I engage the public in “the heart of Silicon Valley” on what they want from their community and what’s missing. What I hear is concern about the loss of quiet, of connection to others, of a pace of life that is not 24/7 and always a click away. This is consistent. People feel overwhelmed. As one of the chief technologists for my library, I find this puts me in a strange place. And I’ve been grappling with it for the past few months. On the one hand, people are curious. They’re happy to try the next big thing. But you also hear the frustration. Meanwhile, the burden of the Tech Industry is more than inflated rents and traffic. There’s a very obvious divide between long-time residents and newcomers. There’s a sense that something has been lost. There’s anger too, even here in the shadow of Google and Facebook. The Library as a Philosophy The other day, I was visited by a European Library Director who wanted to talk about VR. He asked me where I thought we’d be in ten years. I hesitated. My thoughts immediately went back to the words of despair that I’d been hearing from the public lately. Of course, the genie’s out of the bottle. We can’t stop the digital era. VR interface revolutions will likely emerge. The robots will come. But we can harness this change to our benefit. We can add rules to bend it to our collective needs. This is where the Library comes in. We have a sharing culture. A model that values bridging divides, pooling resources and re-distributing knowledge. It’s a model that is practically unique to the library, if you think about it. As I read Rushkoff, I kept coming back to the librarian’s philosophy on sharing. In his book, he contends that we need to re-imagine (re-code) our economy to work for people. He recalls technologies like HTTP and RSS, which were invented and then given away to the world to share and re-use. This sounded very ‘librarian’ to me. We share knowledge in the form of access to technology, after all. We host training on new maker gear, coding, robotics, virtual reality. Perhaps we need to double down on this philosophy. Perhaps we can be more than just a bridge. Maybe we can be the engine driving our communities to the other side. We can not just advocate, but do. Having a hackathon? Build a public alternative to the Airbnb app to be used by people in your town. Know the Future In the end, libraries, technologists and digitarians need to tell a better story. We need to get outside our bubbles and tell that story with words that resonate with the benefit-nots. And more, we need that story to be backed up with real-world benefits. It starts with asking the community what kind of world they want to live in. What obstacles keep them from living that way? And then asking how the library and technology can help make change. We have the philosophy, we have the spaces and we have public permission. Let’s get to work. Posted in innovation, librarianship, society, technology, Uncategorized | Leave a comment Is 3D Printing Dying? 
Posted on October 12, 2016 by mryanhess Inc.’s John Brandon recently wrote about The Slow, Sad, and Ultimately Predictable Decline of 3D Printing. Uh, not so fast. 3D Printing is just getting started. For libraries whose adopted mission is to introduce people to emerging technologies, this is a fantastic opportunity to do so. But it has to be done right. Another dead end? Brandon cites a few reasons for his pessimism: 3D printed objects are low quality and the printers are finicky 3D printing growth is falling behind initial estimates people in manufacturing are not impressed and the costs are too high I won’t get into all that’s wrong with this analysis, as I feel like most of it is incorrect, or at the very least, a temporary problem typical of a new technology. Instead, I’d like to discuss this in the library maker context. And in fact, you can apply these ideas to any tech project. How to make failure a win—no matter what Libraries are quick to jump on tech. Remember those QR Codes that would revolutionize mobile access? Did your library consider a Second Life branch? How about those Chromebooks! Inevitably, these experiments are going to fail. But that’s okay. As this blog often suggests, failure is a win when doing so teaches you something. Experimenting is the first step in the process of discovery. And that’s really what all these kinds of projects need to be. In the case of a 3D Printing project at your library, it’s important to keep this notion front and center. A 3D Printing pilot with the goal of introducing the public to the technology can be successful if people simply try it out. That seems easy enough. But to be really successful, even this kind of basic 3D Printing project needs to have a fair amount of up-front planning attached to it. Chicago Public Library created a successful Maker Lab. Their program was pretty simple: Hold regular classes showing people how to use the 3D printers and then allow those that completed the introductory course to use the printers in open studio lab times. When I tried this out at CPL, it was quite difficult to get a spot in the class due to popularity. The grant-funded project was so successful, based on the number of attendees, that it was extended and continues to this day. As a grant-funded endeavor, CPL likely wrote out the specifics before any money was handed over. But even an internally-funded project should do this. Keep the goals simple and clear so expectations on the front line match those up the chain of command. Figure out what your measurements of success are before you even purchase the first printer. Be realistic. Always document everything. And return to that documentation throughout the project’s timeline. Taking it to the next level San Diego Public Library is an example of a Maker Project that went to the next level. Uyen Tran saw an opportunity to merge startup seminars with their maker tools at her library. She brought aspiring entrepreneurs into her library for a Startup Weekend event where budding innovators learned how the library could be a resource for them as they launched their companies. 3D printers were part of this successful program. It’s important to note that Uyen already had the maker lab in place before she launched this project. And it would be risky for a library to skip the establishment of a rudimentary 3D printer program before trying for this more ambitious program. But it could be done if that library was well organized with solid project managers and deep roots in the target community. 
But that’s a tall order to fill. What’s the worst thing that could go wrong? The worst thing that could go wrong is doubling down on failure: repeating one failed project after another without changing the flawed approach behind it. I’d also add that libraries are often out ahead of the public on these technologies, so dead ends are inevitable. To address this, I would also add one more tactic to your tech projects: listening. The public has lots of concerns about a variety of things. If you ask them, they’ll tell you all about them. Many of their concerns are directly related to libraries, but we can often help. We have permission to do so. People trust us. It’s a great position to be in. But we have to ask them to tell us what’s on their mind. We have to listen. And then we need to think creatively. Listening and thinking outside the box was how San Diego took their 3D Printers to the next level. The Long Future of 3D Printing The Wright Brothers first flight managed only 120 feet in the air. A year later, they flew 24 miles. These initial attempts looked nothing like the jet age and yet the technology of flight was born from these humble experiments. Already, 3D printing is being adopted in multiple industries. Artists are using it to prototype their designs. Astronauts are using it to print parts aboard the International Space Station. Bio-engineers are now looking at printing stem-cell structures to replace organs and bones. We’re decades away from the jet age of 3D printing, but this tech is here to stay. John Brandon’s read is incorrect simply because he’s looking at the current state and not seeing the long-term promise. When he asks a Ford engineer for his take on 3D Printing in the assembly process, he gets a smirk. Not a hotbed of innovation. What kind of reaction would he have gotten from an engineer at Tesla? At Apple? Fundamentally, he’s approaching 3D Printers from the wrong perspective and this is why it looks doomed. Libraries should not make this mistake. The world is changing ever more quickly and the public needs us to help them navigate the new frontier. We need to do this methodically, with careful planning and a good dose of optimism. Posted in innovation, technology | Tagged 3D printing, innovation, project planning | 2 Comments The State of the Library Website Posted on September 28, 2016 by mryanhess T’was a time when the Library Website was an abomination. Those dark days have lightened significantly. But new clouds have appeared on the horizon. Darkest Before the Dawn In the dark ages of Library Websites, users suffered under UX regimes that were rigid, unhelpful and confusing. This was before responsive design became a standard in the library world. It was before search engine optimization started to creep into Library meetings. It was before user experience became an actual librarian job title. We’ve come a long way since I wrote The Ugly Truth About Library Websites. Most libraries have evolved beyond the old “website as pamphlet” paradigm to one that is dynamic and focused on user tasks. Public libraries have deployed platforms like BiblioCommons to serve responsive, task-oriented interfaces that integrate their catalogs, programming and website into a single social platform. Books, digital resources, programs and even loanable equipment are all accessible via a single search. What’s more, the critical social networking aspects of library life are also embedded along the user’s path. 
Celebrated examples of this integrated solution include the San Francisco Public Library and Chicago Public Library. Queens is also hard at work to develop a custom solution. In the academic realm, libraries have turned to unified discovery layers like WorldCat Discovery and EBSCO Discovery Service to simplify (Googlize) the research process. These systems put a single-search box front and center that access resources on the shelf, but also all those electronic resources that make up the bulk of academic budgets. And while there are still many laggards, few libraries ignore these problems outright. The Storm Ahead While the general state of online library interfaces has improved, the unforgiving, hyperbolic curve of change continues to press forward. And libraries cannot stay put. Indeed, we need to quicken our pace and prepare our organizations for ongoing recalibration as the tempo of change increases. The biggest problem for library websites, is that there is little future for the library website. That’s because people will get less and less information through web browsers. Indeed, consider how often you use a web browser on your phone versus an app. Developments in AI, Augmented Reality and Virtual Reality will compound that trend. If you’re like Chris Milk, videographer and VR evangelist, you see the writing on the wall. The modes of how we experience information are about to undergo a fundamental revolution. Milk likens the current state of VR to the old black and white silent films at the dawn of motion pictures. I’d extend this line of thinking to the web page. Within a decade or two, I expect people will look back on web pages as a brief, transitory medium bridging print information to linked data. And as our AI, VR and AR technologies take off, they will liberate information from the old print paradigms altogether. In short, people will interact with information in more direct ways. They will ask a computer to provide them the answer. They will virtually travel to a “space” where they can experience the information they seek. Get Ready to Re-invent the Library…again So where does the library fit into this virtualized and automated future? One possibility is that the good work to transform library data into linked data will enable us to survive this revolution. In fact, it may be our best hope. Another hope is that we continue to emphasize the library as a social space for people to come together around ideas. Whether its a virtual library space or a physical one, the library can be the place in both local and global communities where people meet their universal thirst for connecting with others. The modes of those ideas (books, ebooks, videos, games) will matter far less than the act of connecting. In a sense, you could define the future online library as something between an MMORPG, Meetup.com and the TED conference. So, the library website is vastly improved, but we won’t have long to rest on our laurels. Ready Player One? Put on your VR goggles. Call up Siri. Start rethinking everything you know about the Library website.     Posted in information architecture, librarianship | Tagged internet, libraries, user experience, web design, websites | 1 Comment Virtual Realty is Getting Real in the Library Posted on June 20, 2016 by mryanhess My library just received three Samsung S7 devices with Gear VR goggles. We put them to work right away. The first thought I had was: Wow, this will change everything. My second thought was: Wow, I can’t wait for Apple to make a VR device! 
The Samsung Gear VR experience is grainy and fraught with limitations, but you can see the potential right away. The virtual reality is, after all, working off a smartphone. There is no high-end graphics card working under the hood. Really, the goggles are just a plastic case holding the phone up to your eyes. But still, despite all this, it’s amazing. Within twenty-four hours, I’d surfed beside the world’s top surfers on giant waves off Hawaii, hung out with the Masai in Africa and shared an intimate moment with a pianist and his dog in their (New York?) apartment. It was all beautiful. We’ve Been Here Before Remember when the Internet came online? If you’re old enough, you’ll recall the crude attempts to chat on digital bulletin board systems (BBS) or, much later, the publication of the first colorful (often jarringly so) HTML pages. It’s the Hello World! moment for VR now. People are just getting started. You can tell the content currently available is just scratching the surface of potentialities for this medium. But once you try VR and consider the ways it can be used, you start to realize nothing will be the same again. The Internet Will Disappear So said Google CEO Erik Schmidt in 2015. He was talking about the rise of AI, wearable tech and many other emerging technologies that will transform how we access data. For Schmidt, the Internet will simply fade into these technologies to the point that it will be unrecognizable. I agree. But being primarily a web librarian, I’m mostly concerned with how new technologies will translate in the library context. What will VR mean for library websites, online catalogs, eBooks, databases and the social networking aspects of libraries. So after trying out VR, I was already thinking about all this. Here are some brief thoughts: Visiting the library stacks in VR could transform the online catalog experience Library programming could break out of the physical world (virtual speakers, virtual locations) VR book discussions could incorporate virtual tours of topics/locations touched on in books Collections of VR experiences could become a new source for local collections VR maker spaces and tools for creatives to create VR experiences/objects Year Zero? Still, VR makes your eyes tired. It’s not perfect. It has a long way to go. But based on my experience sharing this technology with others, it’s addictive. People love trying it. They can’t stop talking about it afterward. So, while it may be some time before the VR revolution disrupts the Internet (and virtual library services with it), it sure feels imminent. Posted in innovation, librarianship, technology | Tagged gear vr, internet, oculus, samsung, virtual reality, vr | Leave a comment W3C’s CSS Framework Review Posted on May 10, 2016 by mryanhess I’m a longtime Bootstrap fan, but recently I cheated on my old framework. Now I’m all excited by the W3C’s new framework. Like Bootstrap, the W3C’s framework comes with lots of nifty utilities and plug and play classes and UI features. Even if you have a good CMS, you’ll find many of their code libraries quite handy. And if you’re CMS-deficient, this framework will save you time and headaches! Why a Framework? Frameworks are great for saving time. You don’t have to reinvent the wheel for standard UI chunks like navigation, image positioning, responsive design, etc. All you need to do is reference the framework in your code and you can start calling the classes to make your site pop. 
And this is really great since not all well-meaning web teams have an eye for good design. Most quality frameworks look really nice, and they get updated periodically to keep up with design trends. And coming from this well-known standards body, you can also be assured that the W3C’s framework complies with all the nitty-gritty standards all websites should aspire to. Things to Love Some of the things I fell in love with include: CSS-driven navigation menus. There’s really no good reason to rely on JavaScript for a responsive, interactive navigation menu. The W3C agrees. Icon support. This framework allows you to choose from three popular icon sets to bring icons right into your interface. Image support: Lots of great image styling including circular cropping, shadowing, etc. Cards. Gotta love cards in your websites and this framework has some very nice looking card designs for you to use. Built-in colors. Nuff sed. Animations. There are plenty of other nice touches like buttons that lift off the screen, elements that drop into place and much more. I give it a big thumbs up! Check it out at the W3C.org.     Posted in reviews | Tagged css, frameworks, w3c, web design | 1 Comment AI First Posted on May 2, 2016 by mryanhess Looking to the future, the next big step will be for the very concept of the “device” to fade away. Over time, the computer itself—whatever its form factor—will be an intelligent assistant helping you through your day. We will move from mobile first to an AI first world. Google Founder’s Letter, April 2016 My Library recently finalized a Vision Document for our virtual library presence. Happily, our vision was aligned with the long-term direction of technology as understood by movers and shakers like Google. As I’ve written previously, the Library Website will disappear. But this is because the Internet (as we currently understand it) will also disappear. In its place, a new mode of information retrieval and creation will move us away from the paper-based metaphor of web pages. Information will be more ubiquitous. It will be more free-form, more adaptable, more contextualized, more interactive. Part of this is already underway. For example, people are becoming a data set. And other apps are learning about you and changing how they work based on who you are. Your personal data set contains location data, patterns in speech and movement around the world, consumer history, keywords particular to your interests, associations based on your social networks, etc. AI Emerging All of this information makes it possible for emerging AI systems like Siri and Cortana to better serve you. Soon, it will allow AI to control the flow of information based on your mood and other factors to help you be more productive. And like a good friend that knows you very, very well, AI will even be able to alert you to serendipitous events or inconveniences so that you can navigate life more happily. People’s expectations are already being set for this kind of experience. Perhaps you’ve noticed yourself getting annoyed when your personal assistant just fetches a Wikipedia article when you ask it something. You’re left wanting. What we want is that kernel of gold we asked about. But what we get right now, is something too general to be useful. But soon, that will all change. Nascent AI will soon be able to provide exactly the piece of information that you really want rather than a generalized web page. 
This is what Google means when they make statements like “AI First” or “the Web will die.” They’re talking about a world where information is not only presented as article-like web pages, but broken down into actual kernels of information that are both discrete and yet interconnected. AI First in the Library Library discussions often focus on building better web pages or navigation menus or providing responsive websites. But the conversation we need to have is about pulling our data out of siloed systems and websites and making it available to all modes like AI, apps and basic data harvesters. You hear this conversation in bits and pieces. The ongoing linked data project is part of this long-term strategy. So too with next-gen OPACs. But on the ground, in our local strategy meetings, we need to tie every big project we do to this emerging reality where web browsers are increasingly no longer relevant. We need to think AI First. Posted in librarianship, society, tech industry | Tagged artificial intelligence, google, internet, libraries, linked data | Leave a comment Google Analytics and Privacy Posted on April 27, 2016 by mryanhess Collecting web usage data through services like Google Analytics is a top priority for any library. But what about user privacy? Most libraries (and websites for that matter) lean on Google Analytics to measure website usage and learn about how people access their online content. It’s a great tool. You can learn about where people are coming from (the geolocation of their IP addresses anyway), what devices, browsers and operating systems they are using. You can learn about how big their screen is. You can identify your top pages and much much more. Google Analytics is really indispensable for any organization with an online presence. But then there’s the privacy issue. Is Google Analytics a Privacy Concern? The question is often asked, what personal information is Google Analytics actually collecting? And then, how does this data collection jive with our organization’s privacy policies. It turns out, as a user of Google Analytics, you’ve already agreed to publish a privacy document on your site outlining the why and what of your analytics program. So if you haven’t done so, you probably should if only for the sake of transparency. Personally Identifiable Data Fact is, if someone really wanted to learn about a particular person, it’s not entirely outside the realm of possibility that they could glean a limited set of personal attributes from the generally anonymized data Google Analytics collects. IP addresses can be loosely linked to people. If you wanted to, you could set up filters in Google Analytics that look at a single IP. Of course, on the Google side, any user that is logged into their Gmail, YouTube or other Google account, is already being tracked and identified by Google. This is a broadly underappreciated fact. And it’s a critical one when it comes to how approach the question of dealing with the privacy issue. In both the case of what your organization collects with Google Analytics and what all those web trackers, including Google’s trackers, collect, the onus falls entirely on the user. The Internet is Public Over the years, the Internet has become a public space and users of the Web should understand it as such. Everything you do, is recorded and seen. Companies like Google, Facebook, Mircosoft, Yahoo! and many, many others are all in the data mining business. Carriers and Internet Service Providers are also in this game. 
They deploy technologies in websites that identify you and then sell your interests, shopping habits, web searches and other activities to companies interested in selling to you. They’ve made billions on selling your data. Ever done a search on Google and then seen ads all over the Web trying to sell you that thing you searched last week? That’s the tracking at work. Only You Can Prevent Data Fires The good news is that with little effort, individuals can stop most (but not all) of the data collection. Browsers like Chrome and Firefox have plugins like Ghostery, Avast and many others that will block trackers. Google Analytics can be stopped cold by these plugins. But that won’t solve all the problems. Users also need to set up their browsers to delete the cookies websites save to their browsers. And moving off of the “free” accounts provided by data mining companies, like Facebook accounts, Gmail and Google.com, can also help. But you’ll never be completely anonymous. Super cookies are a thing and are very difficult to stop without breaking websites. And some trackers are required in order to load content. So sometimes you need to pay with your data to play. Policies for Privacy Conscious Libraries All of this means that libraries wishing to be transparent and honest about their data collection need to also contextualize the information in the broader data mining debate. First and foremost, we need to educate our users on what it means to go online. We need to let them know it’s their responsibility alone to control their own data. And we need to provide instructions on doing so. Unfortunately, this isn’t an opt-in model. That’s too bad. It actually would be great if the world worked that way. But don’t expect the moneyed interests involved in data mining to allow the US Congress to pass anything that cuts into their bottom line. This ain’t Germany, after all. There are ways, with a little JavaScript, to create a temporary opt-in/opt-out feature for your site (a sketch follows at the end of this post). This will toggle tags added by Google Tag Manager on and off with a single click. But let’s be honest. Most people will ignore it. And if they do opt out, it will be very easy for them to overlook it every time without much more robust opt-in/opt-out functionality baked into your site. But for most sites and users, this is asking a lot. Meanwhile, it diverts attention from the real solution: users concerned about privacy need to protect themselves and not take a given website’s word for it. We actually do our users a service by going with the opt-out model. This underlines the larger privacy problems on the Wild Wild Web, which our sites are a part of. Posted in online security & privacy, society | Tagged data mining, google analytics, online security & privacy | 2 Comments
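The opt-in/opt-out toggle described above can be sketched in a few lines of TypeScript. This is a minimal sketch under stated assumptions, not the post's own implementation: the property ID UA-XXXXXX-Y is a placeholder, and it relies on the documented analytics.js behavior of honoring a window['ga-disable-<PROPERTY_ID>'] flag set before the tracker runs; a Google Tag Manager setup would instead gate its Analytics tag on a consent variable.

```typescript
// Hypothetical property ID -- replace with your own Google Analytics tracker ID.
const GA_PROPERTY = 'UA-XXXXXX-Y';
const DISABLE_KEY = `ga-disable-${GA_PROPERTY}`;
const PREF_KEY = 'analytics-opt-out';

// Re-apply a stored opt-out choice on every page load, before analytics runs.
function applyStoredPreference(): void {
  const optedOut = window.localStorage.getItem(PREF_KEY) === 'true';
  (window as any)[DISABLE_KEY] = optedOut;
}

// Record the visitor's choice and flip the flag immediately.
function toggleAnalytics(optOut: boolean): void {
  window.localStorage.setItem(PREF_KEY, String(optOut));
  (window as any)[DISABLE_KEY] = optOut;
}

applyStoredPreference();

// Example wiring: <input type="checkbox" id="ga-opt-out"> in the page footer.
document.getElementById('ga-opt-out')?.addEventListener('change', (event) => {
  toggleAnalytics((event.target as HTMLInputElement).checked);
});
```

As the post argues, most visitors will never touch such a control, which is why a plainly written privacy page and user education matter at least as much as the widget itself.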
The L Word Posted on March 21, 2016 by mryanhess I’ve been working with my team on a vision document for what we want our future digital library platform to look like. This exercise keeps bringing us back to defining the library of the future. And that means addressing the very use of the term, ‘Library.’ When I first exited my library (and information science) program, I was hired by Adobe Systems to work in a team of other librarians. My manager warned us against using the word ‘Librarian’ among our non-librarian colleagues. I think the gist was: too much baggage there. So, we used the word ‘Information Specialist.’ Fast forward a few years to my time in an academic environment at DePaul University Library, and this topic came up in the context of services the library provided. Faculty and students thought of the library in very traditional ways: as a quiet, book-filled space. But the way they used the library was changing despite the lag in their semantic understanding. The space and the virtual tools we put in place online helped users not only find and evaluate information, but also create, organize and share information. A case in point was our adoption of digital publishing tools like Bepress and Omeka, but also the Scholar’s Lab. I’m seeing a similar contradiction in the public library space. Say library and people think books. Walk into a public library and people do games, meetings, trainings and any number of online tasks. This disconnect between what the word ‘Library’ evokes in the mind’s eye and what it means in practice is telling. We’ve got a problem with our brand. In fact, we may need a new word. Taken literally, a library has been a word for a physical collection of written materials. The Library of Alexandria held scrolls, for example. Even code developers rely on ‘libraries’ today, which are collections of materials. In every case, the emphasis is on the collection of things. Now, I’m not suggesting that we move away from books. Books are vessels for ideas, and libraries will always be about ideas. In fact, this focus on ideas rather than any one mode for transmitting ideas is key. In today’s libraries, people not only read about ideas, they meet to discuss ideas, they brainstorm ideas. I don’t pretend to have the magic word. In fact, maybe it’s taking so long for us to drop ‘Library’ because there is not a good word in existence. Maybe we need to create a new one. One tactic that comes to mind as we navigate this terminological evolution is to retain the library, but subsume it inside something new. I’ve seen this done to various degrees in other libraries. For example, Loyola University in Chicago built an entirely new building adjacent to the book-filled library. Administratively, the building is run by the library, but it is called the Klarchek Information Commons. In that rather marvelous space looking out over Lake Michigan, you’ll find the modern ‘library’ in all its glory. Computers, collaboration booths, etc. I like this model for fixing our identity problem, and I think it would work without throwing the baby out with the bathwater. However it’s done, one thing is for sure. Our users have moved on from ‘the library’ and are left with no accurate way to describe that place they love to go to when they want to engage with ideas. Let’s put our thinking caps on and put a word on their lips that does justice to what the old library has become. Let’s get past the L Word. Posted in librarianship | Tagged branding, information commons | Leave a comment Locking Down Windows Posted on March 10, 2016 by mryanhess I’ve recently moved back to Windows for my desktop computing. But Windows 10 comes with enormous privacy and security issues that people need to take into account…and get under a semblance of control. Here’s how I did it. There has been much written on this subject, so what I’m including here is more of a digest of what I’ve found elsewhere, with perspective on how it worked out for me over time. 
Windows Tweaker This is a pretty good tool that does what Windows should do out of the box: give you one-stop access to all of Windows’ settings. As it is, Windows 10 has spread out many settings, including those for privacy, across the Settings screen as well as the Registry Editor and Group Policy Editor. There are dozens of look-and-feel tweaks, including an easy way to force Windows to use the hidden Dark Theme. The Privacy tab, however, is the single most important. There, you can easily turn off all the nasty privacy holes in Windows 10, such as how the OS sends things like keystrokes (that’s right!) back to Microsoft. The list of holes it will close is long: Telemetry, Biometrics, Advertising ID, Cortana, etc. Cortana Speaking of Cortana, I was really excited that this kind of virtual assistant was embedded in Windows 10. I looked forward to trying it out. But then I read the fine print. Cortana is a privacy nightmare. She can’t be trusted. She’s a blabbermouth and repeats back everything you tell her not just to Microsoft, but indirectly to all of their advertising partners. And who knows where all that data goes and how secure it is in the long run. Yuck! Turn her off. Pull the plug. Zero her out. The easiest way to disable her is to set up a Local Account. But there’s more info out there, including this at PC World. Local Account When you first install Windows 10, unplug the Ethernet cable and shut down Wi-Fi. Then, when you’re certain that all of MSFT’s listeners can’t communicate with your machine, go through the installation setup process, and when asked to create or log in to your Microsoft Account, don’t. Instead, use the Local Account option. The downsides of going this route are that you can’t sync your experience, accounts and apps across devices. You also won’t be able to use Cortana. The upside is that using a Local Account means you will be far more secure and private in whatever you do with your computer (as long as you maintain the many other privacy settings). Reduce Risk and Streamline Your PC Windows 10 comes crammed with many programs you may not want. Some of these may even be tracking and sharing, so if you don’t actually use them, why not lighten the load on your system and remove them? You can do this the slow way, one app at a time, or you can use the PowerShell nuclear option and kill them all at once. I did this and haven’t regretted it one bit. So fire away… Privacy Settings I won’t go into all of this. There is plenty of solid advice on reducing your exposure on other sites (like at PC World) and in some lengthy YouTube videos which you can easily find. But it is critical that you go into the Settings panel and turn everything off at the very least. That’s my feeling. Some tell you that you even need to set up IP blocks to keep your machine from reporting back to Microsoft and its advertising partners. Others say this is somewhat overblown, and not unique to Windows, like over at LifeHacker, so I’ll leave it to you to decide. Conclusion It’s really too bad that operating systems have gone down this road. Our PCs should be tools for us and not the other way around. Imagine if everything that happened on your device stayed private. Imagine if it was all encrypted and nobody could hack into your PC or Microsoft’s servers or their advertisers’ databases and learn all kinds of things about you, your family, your work, your finances, your secrets. And yet, the opposite is precisely what Microsoft (and the makers of iOS, Android and the rest) built, intentionally. 
Frankly, I think it’s bordering on criminal negligence, but good luck suing when your data gets exploited. Better safe than sorry…that’s my take. Do a little work and lock down your computer. Good luck out there… Posted in online security & privacy, technology | Tagged microsoft, online security & privacy, security, Windows | Leave a comment Killer Apps & Hacks for Windows 10 Posted on March 3, 2016 by mryanhess Did the UX people at Microsoft ever test Windows 10? Here are some must-have apps and hacks I’ve found to make life on Windows 10 quick and easy. Set Hotkeys for Apps Sometimes you just want to launch an app from your keyboard. Using a method on Laptopmag.com, you can do this for most any program. I use this in combination with macros like those noted below. Quick Switch to VPN If you’re a smart and secure Internet user, you probably already use a VPN service to encrypt the data and web requests you send over the Internet (especially while on public Wi-Fi networks). But Windows 10 makes connecting to your VPN service a bit of a chore (I use Private Internet Access, by the way). It’s weird because Windows actually placed the Connect to VPN option in the Communications Center, but you still need to click into that, then click the VPN you want and then click Connect…that’s 3 clicks if you’re counting. I’ve tried two methods to make this at least a little easier. One caveat on all of this: if you log in with an administrator account (which I don’t, because I’m concerned about security after all!), you could have your VPN client launch at start, but you’d still need to click the connect button, and anytime you put the machine to sleep, it would disconnect (why they do that is beyond me). With both methods, you need to manually add a VPN account to Windows’ built-in VPN feature. Anyway, here are my two methods: Macro Method You can record actions as a “macro” and then save it as an executable program. You can then save the program to your desktop, Start menu or taskbar. It’s a bit of a chore, and in the end, the best you get is two-click access to your VPN connection…not the one-click you would get on a Mac. If my memory serves, this method only works if you log in with an administrator account. Otherwise, you’ll be prompted for an administrator password each time…and who wants that? The steps: create a shortcut to the Settings page; add a hotkey to the shortcut; create a macro using something like JitBit that uses the new hotkey; save it as an executable; create a shortcut on the desktop and pin it to Start; and, optionally, change the icon to look pretty. The other method is to pin the Communicator VPN app to your Start pane. This is actually how I ended up going in the end. To do this, you need to ‘hack’ a shortcut that points to your VPN settings panel (where the Connect button resides). On your desktop, right-click and select New > Shortcut; when the shortcut wizard opens, paste ms-settings:network-vpn into the form. Now pin the shortcut to your Start menu and you have quick access to the Connect dialog for your VPN. Switch between Audio Devices Sometimes I want to jump between my speakers and my headphones, and because I hate clicking and loathe jumping out of Windows 10’s Metro design into the old-school-looking Audio Device Controller, I followed the advice from The Windows Club. Their solution uses freeware called Audio Switcher to assign a hotkey to different audio devices. I added Audio Switcher to my startup to make this a little more automated. 
Unfortunately, because I normally work in a non-administrator account on Windows 10, I get asked for an admin password to launch this app at startup. Egads! In my case, I can now click the F1 (headphones) and F2 (speakers) keys to switch playback devices for sound. Overcoming the Windows Education or Windows Pro watermark Windows embeds a horrible little Windows Education or Windows Pro watermark over the lower right corner of your desktop if you use one of those versions. There are two solutions to removing this remarkably distracting bit of text: use a white background to “disappear” the white text, or have an app sit over that space. I use MusicBee (recommended by LifeHacker) and position the mini-version over that spot. Supposedly there’s a Regex trick where you delete the text, but that’s a bit too much work for me for such a slight annoyance. Other Tricks There are a couple of other tricks that I’ve used to clean up Windows. Removing Metro Apps. This allows you to remove all the built-in apps that are there simply to confound your privacy and peddle your identity to Microsoft’s advertising partners. Remove them. Removing default folders from Explorer. If you’re like me and want better performance, you use a separate hard disk drive for your music, video and images and another drive (probably an SSD) for your OS and programs. Windows 10 is confusing for people with this kind of setup because it places folders in File Explorer that point to your Images, Documents, etc. on your C drive. In my case, that’s not the right drive. So I used the method linked above to remove those from Explorer. Posted in technology | Tagged life hacks, macros, vpn, windows 10 | Leave a comment
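The "Quick Switch to VPN" section above can also be scripted rather than clicked. The sketch below is an illustration, not the post's own method: it assumes a VPN profile has already been added to Windows' built-in VPN feature (the name "MyVPN" is a placeholder) and drives it with the stock rasdial command, falling back to opening the same ms-settings:network-vpn page the post pins a shortcut to.

```typescript
// vpn.ts -- one-command connect/disconnect for a built-in Windows VPN profile.
// Usage: ts-node vpn.ts connect | disconnect | settings
import { exec, execFile } from 'child_process';

const PROFILE = 'MyVPN'; // placeholder: the name of your Windows VPN entry

function run(cmd: string, args: string[]): void {
  execFile(cmd, args, (err, stdout, stderr) => {
    if (err) console.error(stderr || err.message);
    else console.log(stdout.trim());
  });
}

switch (process.argv[2]) {
  case 'connect':
    run('rasdial', [PROFILE]); // dials the stored VPN profile
    break;
  case 'disconnect':
    run('rasdial', [PROFILE, '/disconnect']);
    break;
  default:
    // Fall back to the VPN settings page mentioned in the post.
    exec('start ms-settings:network-vpn');
}
```

Bound to a hotkey with the JitBit approach described above, this gets closer to the one-click connect the post was after.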
fc18-ifca-ai-5055 ---- A Quantitative Analysis of the Impact of Arbitrary Blockchain Content on Bitcoin Roman Matzutt (1), Jens Hiller (1), Martin Henze (1), Jan Henrik Ziegeldorf (1), Dirk Müllmann (2), Oliver Hohlfeld (1), and Klaus Wehrle (1) (1) Communication and Distributed Systems, RWTH Aachen University, Germany, {matzutt,hiller,henze,ziegeldorf,hohlfeld,wehrle}@comsys.rwth-aachen.de (2) Data Protection Research Institute, Goethe University, Frankfurt/Main, muellmann@jur.uni-frankfurt.de Abstract. Blockchains primarily enable credible accounting of digital events, e.g., money transfers in cryptocurrencies. However, beyond this original purpose, blockchains also irrevocably record arbitrary data, ranging from short messages to pictures. This does not come without risk for users as each participant has to locally replicate the complete blockchain, particularly including potentially harmful content. We provide the first systematic analysis of the benefits and threats of arbitrary blockchain content. Our analysis shows that certain content, e.g., illegal pornography, can render the mere possession of a blockchain illegal. Based on these insights, we conduct a thorough quantitative and qualitative analysis of unintended content on Bitcoin's blockchain. Although most data originates from benign extensions to Bitcoin's protocol, our analysis reveals more than 1600 files on the blockchain, over 99 % of which are texts or images. Among these files there is clearly objectionable content such as links to child pornography, which is distributed to all Bitcoin participants. With our analysis, we thus highlight the importance for future blockchain designs to address the possibility of unintended data insertion and protect blockchain users accordingly. 1 Introduction Bitcoin [45] was the first completely distributed digital currency and remains the most popular and widely accepted of its kind with a market price of ∼4750 USD per bitcoin as of August 31st, 2017 [14]. The enabler and key innovation of Bitcoin is the blockchain, a public append-only and tamper-proof log of all transactions ever issued. These properties establish trust in an otherwise trustless, completely distributed environment, enabling a wide range of new applications, up to distributed general-purpose data management systems [69] and purely digital data-sharing markets [41]. In this work, we focus on the arbitrary, non-financial data on Bitcoin's famous blockchain, which primarily stores financial transactions. This non-financial data fuels, e.g., digital notary services [50], secure releases of cryptographic commitments [16], or non-equivocation schemes [62]. However, since all Bitcoin participants maintain a complete local copy of the blockchain (e.g., to ensure correctness of blockchain updates and to bootstrap new users), these desired and vital features put all users at risk when objectionable content is irrevocably stored on the blockchain. This risk potential is exemplified by the (mis)use of Bitcoin's blockchain as an anonymous and irrevocable content store [40,56,35]. In this paper, we systematically analyse non-financial content on Bitcoin's blockchain. While most of this content is harmless, there is also content to be considered objectionable in many jurisdictions, e.g., the depiction of nudity of a young woman or hundreds of links to child pornography.
As a result, it could become illegal (or even already is today) to possess the blockchain, which is required to participate in Bitcoin. Hence, objectionable content can jeopardize the currently popular multi-billion dollar blockchain systems. These observations raise the question whether or not unintended content is ultimately beneficial or destructive for blockchain-based systems. To address this question, we provide the first comprehensive and systematic study of unintended content on Bitcoin’s blockchain. We first survey and explain methods to store arbitrary, non-financial content on Bitcoin’s blockchain and discuss potential benefits as well as threats, most notably w.r.t. content considered illegal in different jurisdictions. Subsequently and in contrast to related work [56,40,12], we quantify and discuss unintended blockchain content w.r.t. the wide range of insertion methods. We believe that objectionable blockchain content is a pressuring issue despite potential benefits and hope to stimulate research to mitigate the resulting risks for novel as well as existing systems such as Bitcoin. This paper is organized as follows. We survey methods to insert arbitrary data into Bitcoin’s blockchain in Section 2 and discuss their benefits and risks in Section 3. In Section 4, we systematically analyze non-financial content in Bitcoin’s blockchain and assess resulting consequences. We discuss related work in Section 5 and conclude this paper in Section 6.
2 Data Insertion Methods for Bitcoin
Beyond intended recording of financial transactions, Bitcoin’s blockchain also allows for injection of non-financial data, either short messages via special transaction types or even complete files by encoding arbitrary data as standard transactions. We first briefly introduce Bitcoin transactions and subsequently survey methods available to store arbitrary content on the blockchain via transactions.
Bitcoin transactions transfer funds between a payer (sender) and a payee (receiver), who are identified by public-private key pairs. Payers announce their transactions to the Bitcoin network. The miners then publish these transactions in new blocks using their computational power in exchange for a fee. These fees vary, but averaged at 215 satoshi per Byte during August 2017 [4] (1 satoshi = 10⁻⁸ bitcoin). Each transaction consists of several input scripts, which unlock funds of previous transactions, and of several output scripts, which specify who receives these funds. To unlock funds, input scripts contain a signature for the previous transaction generated by the owner of the funds. To prevent malicious scripts from causing excessive transaction verification overheads, Bitcoin uses transaction script templates and expects peers to discard non-compliant scripts.
[Fig. 1: Bitcoin data insertion methods; the content insertion services shown in the figure are CryptoGraffiti, Satoshi Uploader, Apertus, and P2SH Injectors]
Table 1: Payload, costs, and efficiency of low-level data insertion methods
Method        Payload    Costs/B          Efficiency
OP RETURN     80 B       3.18–173.55 ct   poor
Coinbase      96 B       —                poor
Non-St. Out.  99 044 B   1.03–198.05 ct   poor
Non-St. In.   n/a        n/a              med.
P2PK          85 345 B   1.24–207.79 ct   high
P2PKH         58 720 B   1.87–197.58 ct   high
P2MS          92 625 B   1.11–234.33 ct   high
P2SH Out.     62 400 B   1.77–195.54 ct   high
P2SH In.      99 018 B   1.03–225.61 ct   high
Figure 1 shows the insertion methods for non-financial data we identified in Bitcoin.
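As a rough illustration of where Table 1's minimum cost-per-byte figures come from, the sketch below recomputes them from the fee and price assumptions stated in the comparison methodology (215 satoshi per transaction byte, a 546-satoshi dust burn per unspendable output, and 4748.25 USD per bitcoin). This is not the authors' code; the per-output script sizes are my own assumptions based on the standard Bitcoin script templates, so the printed values only approximate the table.

```python
# Rough reproduction of the minimum cost-per-byte (CpB) figures in Table 1.
# Assumptions (not from the paper's tooling): standard output-script sizes,
# an 8-byte value field plus a 1-byte script-length field per output, and a
# 546-satoshi dust burn for every unspendable data-carrying output.

FEE_PER_TX_BYTE = 215          # satoshi per transaction byte (August 2017 average)
DUST_BURN = 546                # satoshi burned per unspendable output
USD_PER_SATOSHI = 4748.25e-8   # market price of 4748.25 USD per BTC

# (payload bytes per output, output-script size in bytes) for P2X templates
TEMPLATES = {
    "P2PK":     (65, 67),    # 65-byte fake public key + OP_CHECKSIG
    "P2PKH":    (20, 25),    # 20-byte fake key hash inside the standard script
    "P2MS 1-3": (195, 201),  # three 65-byte fake keys in a 1-of-3 multisig
    "P2SH out": (20, 23),    # 20-byte fake script hash
}

def min_cost_per_payload_byte(payload, script_len):
    output_size = 8 + 1 + script_len       # value + length + script bytes
    fee = output_size * FEE_PER_TX_BYTE    # fee attributable to this output
    satoshi_per_byte = (fee + DUST_BURN) / payload
    return satoshi_per_byte * USD_PER_SATOSHI * 100  # US cents per payload byte

for name, (payload, script_len) in TEMPLATES.items():
    print(f"{name:8s} ~{min_cost_per_payload_byte(payload, script_len):.2f} ct/B")
```

The printed values land within a few hundredths of a cent of the table's minimum CpB column (e.g., ~1.87 ct for P2PKH and ~1.11 ct for P2MS).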
We distinguish low-level data insertion methods inserting small data chunks and content insertion services, which systematically utilize the low-level methods to insert larger chunks of data. In the following, we refer to non-financial blockchain data as content if it has a self-contained structure, e.g., a file or read- able text, or as data otherwise, e.g., fragments inserted via a low-level method. 2.1 Low-level Data Insertion Methods We first survey the efficiency of the low-level data insertion methods w.r.t. to in- sertable payload and costs per transaction (Table 1). To this end, we first explain our comparison methodology, before we detail i) intended data insertion meth- ods (OP RETURN and coinbase), ii) utilization of non-standard transactions, and iii) manipulation of standard transactions to insert arbitrary data. Comparison Methodology. We measure the payload per transaction (PpT), i.e., the number of non-financial Bytes that can be added to a single standard- sized transaction (≤ 100 000 B). Costs are given as the minimum and maximum costs per Byte (CpB) for the longest data chunk a transaction can hold, and for inserting 1 B. Costs are inflicted by paying transaction fees and possibly burning currency (at least 546 satoshi per output script), i.e., making it unspendable. For our cost analysis we assume Bitcoin’s market price of 4748.25 USD as of August 31st, 2017 [14] and the average fees of 215 satoshi per Byte as of August 2017 [4]. Note that high variation of market price and fees results in frequent changes of presented absolute costs per Byte. Finally, we rate the overall efficiency of an approach w.r.t. insertion of arbitrary-length content. Intuitively, a method is efficient if it allows for easy insertion of large payloads at low costs. OP RETURN. This special transaction template allows attaching one small data chunk to a transaction and thus provides a controlled channel to an- notate transactions without negative side effects. E.g., in typical implementa- tions peers increase performance by caching spendable transaction outputs and OP RETURN outputs can safely be excluded from this cache. However, data chunk sizes are limited to 80 B per transaction. Coinbase. In Bitcoin, each block contains exactly one coinbase transaction, which introduces new currency into the system to incentivize miners to dedi- cate their computational power to maintain the blockchain. The input script of coinbase transactions is up to 100 B long and consists of a variable-length field encoding the new block’s position in the blockchain [9]. Stating a larger size than the overall script length allows placing arbitrary data in the resulting gap. This method is inefficient as only active miners can insert only small data chunks. Non-standard Transactions. Transactions can deviate from the approved transaction templates [48] via their output scripts as well as input scripts. In the- ory, such transactions can carry arbitrarily encoded data chunks. Transactions using non-standard output scripts can carry up to 96.72 KiB at comparably low costs. However, they are inefficient as miners ignore them with high probability. Yet, non-standard output scripts occasionally enter the blockchain if miners in- sufficiently check them (cf. Section 4.2). Contrarily, non-standard input scripts are only required to match their respective output script. Hence, input scripts can be altered to carry arbitrary data if their semantics are not changed, e.g., by using dead conditional branches. 
This makes non-standard input scripts slightly better suited for large-scale content insertion than non-standard output scripts. Standard Financial Transactions. Even standard financial transactions can be (mis)used to insert data using mutable values of output scripts. There are four approved templates for standard financial transactions: Pay to public-key (P2PK) and pay to public-key hash (P2PKH) transactions send currency to a dedicated receiver, identified by an address derived from her private key, which is required to spend any funds received [48]. Similarly, multi-signature (P2MS) transactions require m out of n private keys to authorize payments. Pay to script hash (P2SH) transactions refer to a script instead of keys to enable complex spending conditions [48], e.g., to replace P2MS [10]. The respective public keys (P2PK, P2MS) and script hash values (P2PKH, P2SH) can be replaced with ar- bitrary data as Bitcoin peers can not verify their correctness before they are ref- erenced by a subsequent input script. While this method can store large amounts of content, it involves significant costs: In addition to transaction fees, the user must burn bitcoins as she replaces valid receiver identifiers with arbitrary data (i.e., invalid receiver identities), making the output unspendable. Using multi- ple outputs enables PpTs ranging from 57.34 KiB (P2PKH) to 96.70 KiB (P2SH inputs) at CpBs from 1.03 ct to 1.87 ct. As they behave similarly w.r.t. data in- sertion, we collectively refer to all standard financial transactions as P2X in the following. P2SH scripts also allow for efficient data insertion into input scripts as P2SH input scripts are published with their redeem script. Due to miners’ verification of P2SH transactions, transaction are not discarded if the redeem script is not template-compliant (but the overall P2SH transaction is). We now survey different services that systematically leverage the discussed data insertion methods to add larger amounts of content to the blockchain. 2.2 Content Insertion Services Content insertion services rely on the low-level data insertion methods to add content, i.e., files such as documents or images, to the blockchain. We identify four conceptually different content insertion services and present their protocols. CryptoGraffiti. This web-based service [30] reads and writes messages and files from and to Bitcoin’s blockchain. It adds content via multiple P2PKH output scripts within a single transaction, storing up to 60 KiB of content. To retrieve previously added content, CryptoGraffiti scans for transactions that either con- sist of at least 90 % printable characters or contain an image file. Satoshi Uploader. The Satoshi Uploader [56] inserts content using a single transaction with multiple P2X outputs. The inserted data is stored together with a length field and a CRC32 checksum to ease decoding of the content. P2SH Injectors. Several services [35] insert content via slightly varying P2SH input scripts. They store chunks of a file in P2SH input scripts. To ensure file integrity, the P2SH redeem scripts contain and verify hash values of each chunk. Apertus. This service [29] allows fragmenting content over multiple transac- tions using an arbitrary number of P2PKH output scripts. Subsequently, these fragments are referenced in an archive stored on the blockchain, which is used to retrieve and reassemble the fragments. The chosen encoding optionally allows augmenting content with a comment, file name, or digital signature. 
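The Satoshi Uploader framing described above (a length field plus a CRC32 checksum in front of the payload) is straightforward to sketch. The 8-byte header layout below follows that description, but the little-endian byte order and the helper names are illustrative assumptions rather than the service's documented wire format.

```python
import struct
import zlib

# Minimal sketch of a "length + CRC32" framing in the style of the Satoshi
# Uploader. The 4-byte length and 4-byte CRC32 header matches the paper's
# description; the little-endian layout is an assumption for illustration.

def frame_payload(payload: bytes) -> bytes:
    """Prepend a length/CRC32 header to a payload before chunking it into outputs."""
    return struct.pack("<II", len(payload), zlib.crc32(payload)) + payload

def try_unframe(blob: bytes):
    """Return the payload if the first 8 bytes are a valid length/CRC32 header."""
    if len(blob) < 8:
        return None
    length, checksum = struct.unpack("<II", blob[:8])
    payload = blob[8:8 + length]
    if len(payload) == length and zlib.crc32(payload) == checksum:
        return payload
    return None

framed = frame_payload(b"hello, blockchain")
assert try_unframe(framed) == b"hello, blockchain"
```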
To conclude, Bitcoin offers various options to insert arbitrary, non-financial data. These options range from small-scale data insertion methods exclusive to active miners to services that allow any user to store files of arbitrary length. This wide spectrum of options for data insertion raises the question which benefits and risks arise from storing content on Bitcoin’s blockchain. 3 Benefits and Risks of Arbitrary Blockchain Content Bitcoin’s design includes several methods to insert arbitrary, non-financial data into its blockchain in both intended and unintended ways. In this section, we discuss potential benefits of engraving arbitrary data into Bitcoin’s blockchain as well as risks of (mis)using these channels for content insertion. 3.1 Benefits of Arbitrary Blockchain Content Besides the manipulation of standard financial transactions, Bitcoin offers coin- base and OP RETURN transactions as explicit channels to irrevocably insert small chunks of non-financial data into its blockchain (cf. Section 2). As we discuss in the following, each insertion method has distinguishing benefits: OP RETURN. Augmenting transactions with short pieces of arbitrary data is beneficial for a wide area of applications [40,12,62]. Different services use OP RETURN to link non-financial assets, e.g., vouchers, to Bitcoin’s block- chain [40,12], to attest the existence of digital documents at a certain point of time as a digital notary service [58,50,12], to realize distributed digital rights management [70,12], or to create non-equivocation logs [62,8]. Coinbase. Coinbase transactions differ from OP RETURN as only miners, who dedicate significant computational resources to maintain the blockchain, can use them to add extra chunks of data to their newly mined blocks. Beyond advertisements or short text messages [40], coinbase transactions can aid the mining process. Adding random bytes to the coinbase transactions allows miners to increase entropy when repeatedly testing random nonces to solve the proof- of-work puzzle [48]. Furthermore, adding identifiable voting flags to transactions enables miners to vote on proposed features, e.g., the adoption of P2SH [10]. Large-scale Data Insertion. Engraving large amounts of data into the block- chain creates a long-term non-manipulable file storage. This enables, e.g., the archiving of historical data or censorship-resistant publication, which helps pro- tecting whistleblowers or critical journalists [66]. However, their content is repli- cated to all users, who do not have a choice to reject storing it. Hence, non-financial data on the blockchain enables new applications that leverage Bitcoin’s security guarantees. In the following, we discuss threats of forcing honest users to download copies of all blockchain content. 3.2 Risks of Arbitrary Blockchain Content Despite potential benefits of data in the blockchain, insertion of objectionable content can put all participants of the Bitcoin network at risk [43,11,40], as such unwanted content is unchangeable and locally replicated by each peer of the Bitcoin network as benign data. To underpin this threat, we first derive an extensive catalog of content that poses high risks if possessed by individuals and subsequently argue that objectionable blockchain content is able to harm honest users. In the following, we identify five categories of objectionable content: Copyright Violations. With the advent of file-sharing networks, pirated data has become a huge challenge for copyright holders. 
To tackle this problem, copy- right holders predominantly target users that actively distribute pirated data. E.g., German law firms sue users who distribute copyright-protected content via file-sharing networks for fines on behalf of the copyright holders [28]. In re- cent years, prosecutors also convicted downloaders of pirated data. For instance, France temporarily suspended users’ Internet access and subsequently switched to issuing high fines [36]. As users distribute their blockchain copy to new peers, copyright-protected material on the blockchain can thus provoke legal disputes about copyright infringement. Malware. Another threat is to download malware [20,42], which could poten- tially be spread via blockchains [31]. Malware has serious consequences as it can destroy sensitive documents, make devices inoperable, or cause financial losses [34]. Furthermore, blockchain malware can irritate users as it causes an- tivirus software to deny access to important blockchain files. E.g., Microsoft’s antivirus software detected a non-functional virus signature from 1987 on the blockchain, which had to be fixed manually [68]. Privacy Violations. By disclosing sensitive personal data, individuals can harm their own privacy and that of others. This threat peaks when individuals deliberately violate the privacy of others, e.g., by blackmailing victims under the threat of disclosing sensitive data about them on the blockchain. Real-world manifestations of these threats are well-known, e.g., non-consensually releasing private nude photos or videos [54] or fully disclosing an individual’s identity to the public with malicious intents [21]. Jurisdictions such as the whole European Union begin to actively prosecute the unauthorized disclosure and forwarding of private information in social networks to counter this novel threat [5]. Politically Sensitive Content. Governments have concerns regarding the leakage of classified information such as state secrets or information that other- wise harms national security, e.g., propaganda. Although whistleblowers reveal nuisances such as corruption, they force all blockchain users to keep a copy of leaked material. Depending on the jurisdiction, the intentional disclosure or the mere possession of such content may be illegal. While, e.g., the US government usually tends to prosecute intentional theft or disclosure of state secrets [63], in China the mere possession of state secrets can result in longtime prison sen- tences [49]. Furthermore, China’s definition of state secrets is vague [49] and covers, e.g., “activities for safeguarding state security” [60]. Such vague allega- tions w.r.t. state secrets have been applied to critical news in the past [18,24]. Illegal and Condemned Content. Some categories of content are virtually universally condemned and prosecuted. Most notably, possession of child pornog- raphy is illegal at least in the 112 countries [64] that ratified an optional protocol to the Convention on the Rights of the Child [65]. Religious content such as cer- tain symbols, prayers, or sacred texts can be objectionable in extremely religious countries that forbid other religions and under oppressive regimes that forbid re- ligion in general. As an example, possession of items associated with an objected religion, e.g., Bibles in Islamist countries, or blasphemy have proven risky and were sometimes even punished by death [13,38]. In conclusion, a wide range of objectionable content can cause direct harm if possessed by users. 
In contrast to systems such as social media platforms, file-sharing networks, or online storage systems, such content can be stored on blockchains anonymously and irrevocably. Since all blockchain data is down- loaded and persistently stored by users, they are liable for any objectionable content added to the blockchain by others. Consequently, it would be illegal to participate in a blockchain-based systems as soon as it contains illegal content. While this risk has previously been acknowledged [43], definitive answers re- quire court rulings yet to come. However, considering legal texts we anticipate a high potential for illegal blockchain content to jeopardize blockchain-based sys- tem such as Bitcoin in the future. Our belief stems from the fact that, w.r.t. child pornography as an extreme case of illegal content, legal texts from countries such as the USA [47], England [3], Ireland [32] deem all data illegal that can be con- verted into a visual representation of illegal content. As we stated in Section 2, it is easily possible to locate and reassemble such content on the blockchain. Hence, even though convertibility usually covers creating a visual representation by, e.g., decoding an image file, we expect that the term can be interpreted to include blockchain data in the future. For instance, this is already covered implicitly by German law, as a person is culpable for possession of illegal content if she knowingly possesses an accessible document holding said content [2]. It is criti- cal here that German law perceives the hard disk holding the blockchain as an document [1] and that users can easily reassemble any illegal content within the blockchain. Furthermore, users can be assumed to knowingly maintain control over such illegal content w.r.t. German law if sufficient media coverage causes the content’s existence to become public knowledge among Bitcoin users [61], as has been attempted by Interpol [31]. We thus believe that legislators will speak law w.r.t. non-financial blockchain content and that this has the potential to jeopardize systems such as Bitcoin if they hold illegal content. 4 Blockchain Content Landscape To understand the landscape of non-financial blockchain data and assess its potentials and risks, we thoroughly analyze Bitcoin’s blockchain as it is the most widely used blockchain today. Especially, we are interested in i) the degree of utilization of data and content insertion methods, ii) the temporal evolution of data insertion, and iii) the types of content on Bitcoin’s blockchain, especially w.r.t. objectionable content. In the following, we first outline our measurement methodology before we present an overview and the evolution of non-financial data on Bitcoin’s blockchain. Finally, we analyze files stored on the blockchain to derive if any objectionable content is already present on the blockchain. 4.1 Methodology We detect data-holding transactions recorded on Bitcoin’s blockchain based on our study of data insertion methods and content insertion services (cf. Section 2). We distinguish detectors for data insertion methods and detectors for content insertion services. To reduce false positives, e.g., due to public-key hash values that resemble text, we exclude all standard transaction outputs that include already-spent funds from analysis. This is sensible as data-holding transactions replace public keys or hashes such that spending requires computing correspond- ing private keys or pre-images, which is assumed to be infeasible. 
Contrarily, even though we thoroughly analyzed possible insertion methods, there is still a chance that we do not exhaustively detect all non-financial data. Nevertheless, our content type analysis establishes a solid lower bound as we only consider readable files retrieved from Bitcoin’s blockchain. In the following, we explain the key characteristics of the two classes of our blockchain content detectors.
Low-level Insertion Method Detectors. The first class of detectors is tailored to match individual transactions that are likely to contain non-financial data (cf. Section 2.1). These detectors detect manipulated financial transactions as well as OP RETURN, non-standard, and coinbase transactions. Our text detector scans P2X output scripts for mutable values containing ≥ 90 % printable ASCII characters (to avoid false positives). The detector returns the concatenation of all output scripts of the same transaction that contain text. Finally, we consider all coinbase and OP RETURN transactions as well as non-standard output scripts. We detect coinbase transactions based on the length field mismatch described in Section 2.1. OP RETURN scripts are detectable as they always begin with an OP RETURN operation. Non-standard output scripts comprise all output scripts which are not template-conform.
[Fig. 2: Cumulative numbers of detected transactions per data insertion method]
[Fig. 3: Ratio of transactions that utilize data insertion methods]
Service Detectors. We implemented detectors specific to the content insertion services we identified in Section 2.2. These service-specific detectors enable us to detect and extract files based on the services’ protocols. These detectors also track the data insertion method used in service-created transactions. The CryptoGraffiti detector matches transactions with an output that sends a tip to a public-key hash controlled by its provider. For such a transaction, we concatenate all mutable values of output scripts that spend fewer than 10 000 satoshi and store them in a file. This threshold is used to ignore non-manipulated output scripts, e.g., the service provider spending their earnings. To detect a Satoshi Uploader transaction, we concatenate all of its mutable values that spend the same small amount of bitcoins. If we find the first eight bytes to contain a valid combination of length and CRC32 checksum for the transaction’s payload, we store the payload as an individual file. We detect P2SH Injector content based on redeem scripts containing more than one hash operation (standard transactions use at most one). We then extract the concatenation of the second inputs of all redeem scripts (the first one contains a signature) of a transaction as one file. Finally, the Apertus detector recursively scans the blockchain for Apertus archives, i.e., Apertus-encoded lists of previous transaction identifiers. Once a referred Apertus payload does not constitute another archive, we retrieve its payload file and optional comment by parsing the Apertus protocol.
Suspicious Transaction Detector. To account for less wide-spread insertion services, we finally analyze standard transactions that likely carry non-financial data but are not detected otherwise. We only consider transactions with at least 50 suspicious outputs, i.e., roughly 1 KiB of content.
We consider a set of outputs suspicious if all outputs i) spend the same small amount (< 10 000 satoshi) and ii) are unspent. This detector trades off detection rate against false-positive rate. Due to overlaps with service detectors, we exclude matches of this detector from our quantitative analysis, but discuss individual findings in Section 4.3.
4.2 Utilization of Data Insertion Methods
Data and content insertion in Bitcoin has evolved over time, transitioning from single miners exploiting coinbase transactions to sophisticated services that enable the insertion of whole files into the blockchain. We study this evolution in terms of used data insertion methods as well as content insertion services and quantify the amount of blockchain data using our developed detectors. Our key insights are that OP RETURN constitutes a well-accepted success story while content insertion services are currently only infrequently utilized. However, the introduction of OP RETURN did not shut down other insertion methods, e.g., P2X manipulation, which enable single users to insert objectionable content.
[Fig. 4: Number of files inserted via content insertion services per month]
[Fig. 5: Cumulative sizes of transactions from content insertion services]
Our measurements are based on Bitcoin’s complete blockchain as of August 31st, 2017, containing 482 870 blocks and 250 845 217 transactions with a total disk size of 122.64 GiB. We first analyze the popularity of different data insertion methods and subsequently turn towards the utilization of content insertion services to assess how non-financial data enters the blockchain.
Data Insertion Methods. As described in Section 2.1, OP RETURN and coinbase transactions constitute intended data insertion methods, whereas P2X and non-standard P2SH inputs manipulate legitimate transaction templates to contain arbitrary data. Figure 2 shows the cumulative number of transactions containing non-financial data on a logarithmic scale. In total, our detectors found 3 535 855 transactions carrying a total payload of 118.53 MiB, i.e., only 1.4 % of Bitcoin transactions contain non-financial data. However, we strive to further understand the characteristics of non-financial blockchain content as even a single instance of objectionable content can potentially jeopardize the overall system. The vast majority of extracted transactions are OP RETURN (86.8 % of all matches) and coinbase (13.13 %) transactions. Combined, they constitute 95.90 MiB (80.91 % of all extracted data). Out of all blocks, 96.15 % have content-holding coinbase transactions. While only 0.26 % of these contain ≥ 90 % printable text, 33.49 % of them contain ≥ 15 consecutive printable ASCII characters (mostly surrounded by data without obvious structure). Of these short messages, 14.39 % contain voting flags for new features (cf. Section 3.1). Apart from this, miners often advertise themselves or leave short messages, e.g., prayer verses. OP RETURN transactions were introduced in 2013 to offer a benign way to augment single transactions with non-financial data. This feature is widely used, as shown by Figure 3. Among all methods, OP RETURN is the only one to be present with a rising tendency, with currently 1.2 % of all transactions containing OP RETURN outputs.
These transactions predominantly manage off-blockchain assets or originate from notary services [12]. While P2X transactions are contin- uously being manipulated, they make up only 0.02 % of all transactions; P2SH inputs are virtually irrelevant. Hence, short non-financial data chunks are well- accepted, viable extensions to the Bitcoin system (cf. Section 3.1). P2X transactions are asymmetric w.r.t. the number and sizes of data-carrying transactions. Although constituting only 1.6 % of all detector hits, they make up 9.08 % of non-financial data (10.76 MiB). This again highlights the high content- insertion efficiency of P2X transactions (cf. Section 2.1). Finally, we discuss non-standard transactions and non-standard P2SH in- put scripts. In total, we found 1703 transactions containing non-standard out- puts. The three first non-standard transactions (July 2010) repeatedly used the OP CHECKSIG operation. We dedicate this to an attempted DoS attack that tar- gets to cause high verification times. Furthermore, we found 23 P2PKH transac- tions from October 2011 that contained OP 0 instead of a hash value. The steady increase of non-standard transactions in 2012 is due to scripts that consist of 32 seemingly random bytes. Contrarily, P2SH input scripts sporadically carry non- standard redeem scripts and are then often used to insert larger data chunks (as they are used by P2SH Injectors). This is due to P2SH scripts not being checked for template conformity. We found 888 such transactions holding 8.37 MiB of data. Although peers should reject such transactions [48], they still often man- age to enter the blockchain. Non-standard P2SH scripts even carry a substantial amount of data (7.07 % of the total data originate from P2SH Injectors). Content Insertion Services. We now investigate to which extent content insertion services are used to store content on Bitcoin’s blockchain. Figure 4 shows utilization patterns for each service and Figure 5 shows the cumulative size of non-financial data inserted via the respective service. Notably, only few users are likely responsible for the majority of service-inserted content. In total, content insertion services account for 16.12 MiB of non-financial data. More than a half of this content (8.37 MiB) originates from P2SH In- jectors. The remainder was mostly inserted using Apertus (21.70 % of service- inserted data) and Satoshi Uploader (21.24 %). Finally, CryptoGraffiti accounts for 0.82 MiB (5.10 %) of content related to content insertion services. In the following, we study how the individual services have been used over time. Our key observation is that both CryptoGraffiti and P2SH Injectors are in- frequently but steadily used; since 2016 we recognize on average 23.65 data items being added per month using these services. Contrarily, Apertus has been used only 26 times since 2016, while the Satoshi Uploader has not been used at all. In fact, the Satoshi Uploader was effectively used only during a brief period: 92.73 % of all transactions emerged in April 2013. During this time, the service was used to upload four archives, six backup text files, and a PDF file. Although Apertus and the Satoshi Uploader have been used only infrequently, together they constitute 64.32 % of all P2X data we detected. This stems from the utilization of those services to engrave files into the blockchain, e.g., archives or documents (Satoshi Uploader), or images (Apertus). 
Similarly, P2SH Injectors are used to back up conversations regarding development of the Bitcoin client, especially online chat logs, forum threads, and emails, with a significant peak utilization between May and June 2015 (76.46 % of P2SH Injector matches). Especially Apertus is well-suited for this task as files are spread over multiple transactions. Based on the median, the average Apertus file has a size of 17.15 KiB and is spread over 10 transactions, including all overheads. The largest Apertus file is 310.72 KiB large (including overheads), i.e., three times the size of a standard transaction, and is spread over 96 transactions. The most heavily fragmented Apertus file is even spread over 664 transactions. Contrarily, 95.7 % of CryptoGraffiti matches are short text messages with a median length of 80 Byte. In conclusion, content insertion services are only infrequently used with varying intentions, and large portions of content were uploaded in bursts, indicating that only few users are likely responsible for the majority of service-inserted blockchain content. While CryptoGraffiti is mostly used to insert short text messages that also fit into one OP RETURN transaction, other services are predominantly used to store, e.g., images or documents. As such files can constitute objectionable content, we further investigate them in the following.
4.3 Investigating Blockchain Files
After quantifying basic content insertion in Bitcoin, we now focus on readable files that are extractable from the blockchain. We refer to files as findings of our content-insertion-service or suspicious-transaction detectors that are viewable using appropriate standard software. We reassemble fragmented files only if this is unambiguously possible, e.g., via an Apertus archive. Out of the 22.63 MiB of blockchain data not originating from coinbase or OP RETURN transactions, we can extract and analyze 1557 files with meaningful content. In addition to these, we could extract 59 files using our suspicious-transaction detector (92.25 % text). Table 2 summarizes the different file types of the analyzed files. The vast majority are text-based files and images (99.34 %).
Table 2: Distribution of blockchain file types according to our content-insertion-service and suspicious-transactions detectors.
File Type     Via Service (yes / no)   Overall Portion
Text          1353 / 54                87.07 %
Images        144 / 2                  9.03 %
HTML          45 / 0                   2.78 %
Source Code   7 / 3                    0.62 %
Archive       4 / 0                    0.25 %
Audio         2 / 0                    0.12 %
PDF           2 / 0                    0.12 %
Total         1557 / 59                100.0 %
In the following, we discuss our findings with respect to objectionable content. We manually evaluated all readable files with respect to the problematic categories we identified in Section 3.2. This analysis reveals that content from all those categories already exists in Bitcoin’s blockchain today. For each of these categories, we discuss the most severe examples. To protect the safety and privacy of individuals, we omit personally identifiable information and refrain from providing exact information on the location of critical content in the blockchain.
Copyright Violations. We found seven files that publish (intellectual) property and showcase Bitcoin’s potential to aid copyright violations. Engraved are the text of a book, a copy of the original Bitcoin paper [45,56], and two short textual white papers. Furthermore, we found two leaked cryptographic keys: one RSA private key and a firmware secret key.
Finally, the blockchain contains a so-called illegal prime, encoding software to break the copy protection of DVDs [56]. Malware. We could not find actual malware in Bitcoin’s blockchain. How- ever, an individual non-standard transaction contains a non-malicious cross-site scripting detector. A security researcher inserted this small piece of code which, if interpreted by an online blockchain parser, notifies the author about the vul- nerability. Such malicious code could become a threat for users as most websites offering an online blockchain parser also offer online Bitcoin accounts. Privacy Violations. Users store memorable private moments on the block- chain. We extracted six wedding-related images and one image showing a group of people, labeled with their online pseudonyms. Furthermore, 609 transactions contain online public chat logs, emails, and forum posts discussing Bitcoin, in- cluding topics such as money laundering. Storing private chat logs on the block- chain can, e.g., leak single user’s private information irrevocably. Moreover, third parties can release information without knowledge nor consent of affected users. Most notably, we found at least two instances of doxing, i.e., the complete dis- closure of another individual’s personal information. This data includes phone numbers, addresses, bank accounts, passwords, and multiple online identities. Recently, jurisdictions such as the European Union began to punish such serious privacy violations, including the distribution of doxing data [5]. Again, carrying out such assaults via blockchains fortifies the problem due to their immutability. Politically Sensitive Content. The blockchain has been used by whistleblow- ers as a censorship-resistant permanent storage for leaked information. We found backups of the WikiLeaks Cablegate data [37] as well as an online news arti- cle concerning pro-democracy demonstrations in Hong Kong in 2014 [25]. As stated in Section 3.2, restrictive governments are known to prosecute the pos- session of such content. For example, state-critical media coverage has already put individuals in China [18] or Turkey [24] at the risk of prosecution. Illegal and Condemned Content. Bitcoin’s blockchain contains at least eight files with sexual content. While five files only show, describe, or link to mildly pornographic content, we consider the remaining three instances objectionable for almost all jurisdictions: Two of them are backups of link lists to child pornog- raphy, containing 274 links to websites, 142 of which refer to Tor hidden services. The remaining instance is an image depicting mild nudity of a young woman. In an online forum this image is claimed to show child pornography, albeit this claim cannot be verified (due to ethical concerns we refrain from providing a ci- tation). Notably, two of the explicit images were only detected by our suspicious- transaction detector, i.e., they were not inserted via known services. While largely harmless, potentially objectionable blockchain content is infre- quently inserted, e.g., links to alleged child pornography or privacy violations. We thus believe that future blockchain designs must proactively cope with objec- tionable content. Peers can, e.g., filter incoming transactions or revert content- holding transactions [11,51], but this must be scalable and transparent. 
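As a rough sketch of what such transaction filtering could look like, the snippet below reuses the paper's own heuristics from Section 4.1: outputs that spend a small amount (under 10 000 satoshi) and whose mutable values are mostly printable ASCII. The tuple representation of outputs, the thresholds, and the function names are illustrative assumptions, not Bitcoin Core APIs or the authors' detector code.

```python
import string

# Illustrative incoming-transaction filter built on the paper's Section 4.1
# heuristics: flag outputs that burn a small amount (< 10,000 satoshi) and
# whose mutable value looks like printable text (>= 90% printable ASCII).

PRINTABLE = set(string.printable.encode())
SMALL_AMOUNT = 10_000  # satoshi

def looks_like_text(data: bytes, threshold: float = 0.9) -> bool:
    """Heuristic from the paper's text detector: >= 90% printable ASCII bytes."""
    if not data:
        return False
    printable = sum(1 for b in data if b in PRINTABLE)
    return printable / len(data) >= threshold

def is_suspicious_output(value_satoshi: int, mutable_bytes: bytes) -> bool:
    """Flag outputs that burn little value but carry text-like data."""
    return value_satoshi < SMALL_AMOUNT and looks_like_text(mutable_bytes)

def flag_transaction(outputs) -> bool:
    """outputs: iterable of (value_satoshi, mutable_bytes) pairs."""
    flagged = [o for o in outputs if is_suspicious_output(*o)]
    return len(flagged) >= 2  # e.g., require several data-like outputs

# Example: two 546-satoshi outputs whose "hashes" are readable text.
print(flag_transaction([(546, b"Hello from the block"), (546, b"chain, dear reader!!")]))
```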
5 Related Work Previous work related to ours comprises i) mitigating the distribution of objec- tionable content in file-sharing peer-to-peer networks, ii) studies on Bitcoin’s blockchain, iii) reports on Bitcoin’s susceptibility for content insertion, and iv) approaches to retrospectively remove blockchain content. The trade-off between enabling open systems for data distribution and risking that unwanted or even illegal content is being shared is already known from peer-to-peer networks. Peer-to-peer-based file-sharing protocols typically limit the spreading of objectionable public content by tracking the reputation of users offering files [6,26,55,73] or assigning a reputation to files themselves [19,67]. This way, users can reject objectionable content or content from untrustworthy sources. Contrarily, distributed content stores usually resort to encrypt private files before outsourcing them to other peers [17,7]. By storing only encrypted files, users can plausibly deny possessing any content of others and can thus obliviously store it on their hard disk. Unfortunately, these protection mechanisms are not applicable to blockchains, as content cannot be deleted once it has been added to the blockchain and the utilization of encryption cannot be enforced reliably. Bitcoin’s blockchain was analyzed w.r.t. different aspects by numerous stud- ies. In a first step, multiple research groups [53,33,71,72,39] studied the currency flows in Bitcoin, e.g., to perform wealth analyses. From a different line of re- search, several approaches focused on user privacy and investigated the identities used in Bitcoin [52,46,44,59,23]. These works analyzed to which extent users can be de-anonymized by clustering identities [52,46,44,59,23] and augmenting these clusters with side-channel information [52,44,59,23]. Finally, the blockchain was analyzed w.r.t. the use cases of OP RETURN transactions [12]. While this work is very close to ours, we provide a first comprehensive study of the complete landscape of non-financial data on Bitcoin’s blockchain. The seriousness of objectionable content stored on public blockchains has been motivated by multiple works [56,57,43,11,40,51]. These works, however, fo- cus on reporting individual incidents or consist of preliminary analyses of the distribution and general utilization of content insertion. To the best of our knowl- edge, this paper gives the first comprehensive analysis of this problem space, including a categorization of objectionable content and a survey of potential risks for users if such content enters the blockchain. In contrast to previously considered attacks on Bitcoin’s ecosystem [22,27], illegal content can be inserted instantly at comparably low costs and can put all participants at risk. The utilization of chameleon hash functions [15] to chain blocks recently opened up a potential approach to mitigate unwanted or illegal blockchain con- tent [11]. Here, a single blockchain maintainer or a small group of maintainers can retrospectively revert single transactions, e.g., due to illegal content. To overcome arising trust issues, µchain [51] leverages the consensus approach of traditional blockchains to vote on alterations of the blockchain history. As these approaches tackle unwanted content for newly designed blockchains, we seek to motivate a discussion on countermeasures also for existing systems, e.g., Bitcoin. 6 Conclusion The possibility to store non-financial data on cryptocurrency blockchains is both beneficial and threating for its users. 
Although controlled channels to insert non-financial data at small rates open up a field of new applications such as digital notary services, rights management, or non-equivocation systems, objectionable or even illegal content has the potential to jeopardize a whole cryptocurrency. Although court rulings do not yet exist, legislative texts from countries such as Germany, the UK, or the USA suggest that illegal content such as child pornography can make the blockchain illegal to possess for all users. As we have shown in this paper, a plethora of fundamentally different methods to store non-financial (potentially objectionable) content on the blockchain exists in Bitcoin. As of now, this can affect at least 112 countries in which possessing content such as child pornography is illegal. This especially endangers the multi-billion dollar markets powering cryptocurrencies such as Bitcoin. To assess this problem’s severity, we comprehensively analyzed the quantity and quality of non-financial blockchain data in Bitcoin today. Our quantitative analysis shows that 1.4 % of the roughly 251 million transactions in Bitcoin’s blockchain carry arbitrary data. We could retrieve over 1600 files, with new content infrequently being added. Despite a majority of arguably harmless content, we also identify different categories of objectionable content. The harmful potential of single instances of objectionable blockchain content is already showcased by findings such as links to illegal pornography or serious privacy violations.
Acknowledgements
This work has been funded by the German Federal Ministry of Education and Research (BMBF) under funding reference number 16KIS0443. The responsibility for the content of this publication lies with the authors.
References
1. German Criminal Code, Section 11 (2013) 2. German Criminal Code, Sections 184b and 184c (2013) 3. Protection of Children Act, Chapter 37, Section 7 (2015) 4. Bitcoin Transaction Fees. https://bitcoinfees.info (2016) Accessed 09/23/2017. 5. General Data Protection Regulation, Section 24 (2016) 6. Aberer, K., Despotovic, Z.: Managing Trust in a Peer-2-Peer Information System. In: ACM CIKM. (2001) pp. 310–317 7. Adya, A., Bolosky, W.J., Castro, M., Cermak, G., Chaiken, R., Douceur, J.R., Howell, J., Lorch, J.R., Theimer, M., Wattenhofer, R.P.: FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment. SIGOPS Oper. Syst. Rev. 36(SI) (2002) pp. 1–14 8. Ali, M., Shea, R., Nelson, J., Freedman, M.J.: Blockstack: A New Decentralized Internet. (2017) Accessed 09/23/2017. 9. Andresen, G.: Block v2 (Height in Coinbase). https://github.com/bitcoin/bips/blob/master/bip-0034.mediawiki (2012) Accessed 09/23/2017. 10. Andresen, G.: Pay to Script Hash. https://github.com/bitcoin/bips/blob/master/bip-0016.mediawiki (2012) Accessed 09/23/2017. 11. Ateniese, G., Magri, B., Venturi, D., Andrade, E.: Redactable Blockchain – or – Rewriting History in Bitcoin and Friends. In: IEEE EuroS&P. (2017) pp. 111–126 12. Bartoletti, M., Pompianu, L.: An analysis of Bitcoin OP RETURN metadata. In: FC Bitcoin Workshop. (2017) 13. Bellinger, J., Hussain, M.: Freedom of Speech: The Great Divide and the Common Ground between the United States and the Rest of the World. Islamic Law and International Human Rights Law: Searching for Common Ground? (2012) pp. 168–180 14. Blockchain.info: Bitcoin Charts. https://blockchain.info/charts (2011) Accessed 09/23/2017. 15.
Camenisch, J., Derler, D., Krenn, S., Pöhls, H.C., Samelin, K., Slamanig, D.: Chameleon-Hashes with Ephemeral Trapdoors. In: PKC ’17. (2017) pp. 152–182 16. Clark, J., Essex, A.: CommitCoin: Carbon Dating Commitments with Bitcoin. In: FC. (2012) pp. 390–398 17. Clarke, I., Sandberg, O., Wiley, B., Hong, T.W.: Freenet: A Distributed Anonymous Information Storage and Retrieval System. In: Designing Privacy Enhancing Technologies: Workshop on Design Issues in Anonymity and Unobservability. (2001) pp. 46–66 18. Committee to Protect Journalists: Chinese journalist accused of illegally acquiring state secrets. https://cpj.org/x/660d (2015) Accessed 09/23/2017. 19. Damiani, E., di Vimercati, D.C., Paraboschi, S., Samarati, P., Violante, F.: A Reputation-based Approach for Choosing Reliable Resources in Peer-to-peer Networks. In: ACM CCS. (2002) pp. 207–216 20. Dell Security: Annual Threat Report. (2016) Accessed 09/23/2017. 21. Douglas, D.M.: Doxing: a conceptual analysis. Ethics and Information Technology 18(3) (2016) pp. 199–210 22. Eyal, I., Sirer, E.G.: Majority Is Not Enough: Bitcoin Mining Is Vulnerable. In: FC. (2014) pp. 436–454 23. Fleder, M., Kester, M., Sudeep, P.: Bitcoin Transaction Graph Analysis. (2015) 24. Freedom House: Turkey Freedom of the Press Report. https://freedomhouse.org/report/freedom-press/2016/turkey (2016) Accessed 09/23/2017. 25. Gracie, C.: Hong Kong stages huge National Day democracy protests. http://www.bbc.com/news/world-asia-china-29430229 (2014) Accessed 09/23/2017. 26. Gupta, M., Judge, P., Ammar, M.: A Reputation System for Peer-to-peer Networks. In: ACM NOSSDAV. (2003) pp. 144–152 27. Heilman, E., Kendler, A., Zohar, A., Goldberg, S.: Eclipse Attacks on Bitcoin’s Peer-to-Peer Network. In: USENIX Security. (2015) pp. 129–144 28. Herald Union: Copyright infringement by illegal file sharing in Germany. http://www.herald-union.com/copyright-infringement-by-illegal-file-sharing-in-germany (2015) Accessed 09/23/2017. 29. HugPuddle: Apertus – Archive data on your favorite blockchains. http://apertus.io (2013) Accessed 09/23/2017. 30. “Hyena”: Cryptograffiti.info. http://cryptograffiti.info Accessed 09/23/2017. 31. Interpol: INTERPOL cyber research identifies malware threat to virtual currencies. https://www.interpol.int/News-and-media/News/2015/N2015-033 (2015) Accessed 09/23/2017. 32. Irish Office of the Attorney General: Child Trafficking and Pornography Act, Section 2. Irish Statute Book (1998) pp. 44–61 33. Kondor, D., Pósfai, M., Csabai, I., Vattay, G.: Do the Rich Get Richer? An Empirical Analysis of the Bitcoin Transaction Network. PLOS ONE 9(2) (02 2014) pp. 1–10 34. Labs, F.S.: Ransomware: How to Predict, Prevent, Detect & Respond.
Threat Response (2016) Accessed 09/23/2017. 35. Le Calvez, A.: Non-standard P2SH scripts. https://medium.com/@alcio/non-standard-p2sh-scripts-508fa6292df5 (2015) Accessed 09/23/2017. 36. Lee, D.: France ends three-strikes internet piracy ban policy. http://www.bbc.com/news/technology-23252515 (2013) Accessed 12/12/2017. 37. Lynch, L.: The Leak Heard Round the World? Cablegate in the Evolving Global Mediascape. In Brevini, B., Hintz, A., McCurdy, P., eds.: Beyond WikiLeaks: Implications for the Future of Communications, Journalism and Society. Palgrave Macmillan UK (2013) pp. 56–77 38. Lyons, K., Blight, G.: Where in the world is the worst place to be a Christian? (2015) Accessed 09/23/2017. 39. Maesa, D.D.F., Marino, A., Ricci, L.: Uncovering the Bitcoin Blockchain: An Analysis of the Full Users Graph. In: IEEE DSAA. (2016) pp. 537–546 40. Matzutt, R., Hohlfeld, O., Henze, M., Rawiel, R., Ziegeldorf, J.H., Wehrle, K.: POSTER: I Don’t Want That Content! On the Risks of Exploiting Bitcoin’s Blockchain as a Content Store. In: ACM CCS. (2016) 41. Matzutt, R., Müllmann, D., Zeissig, E.M., Horst, C., Kasugai, K., Lidynia, S., Wieninger, S., Ziegeldorf, J.H., Gudergan, G., Spiecker gen. Döhmann, I., Wehrle, K., Ziefle, M.: myneData: Towards a Trusted and User-controlled Ecosystem for Sharing Personal Data. In Eibl, M., Gaedke, M., eds.: INFORMATIK, Gesellschaft für Informatik, Bonn (2017) pp. 1073–1084 42. McAfee Labs: Threats Report (December 2016). (2016) Accessed 09/23/2017. 43. McReynolds, E., Lerner, A., Scott, W., Roesner, F., Kohno, T.: Cryptographic currencies from a tech-policy perspective: Policy issues and technical directions. In: Springer LNCS. Volume 8976. (2015) pp. 94–111 44. Meiklejohn, S., Pomarole, M., Jordan, G., Levchenko, K., McCoy, D., Voelker, G.M., Savage, S.: A Fistful of Bitcoins: Characterizing Payments Among Men with No Names. In: IMC. (2013) pp. 127–140 45. Nakamoto, S.: Bitcoin: A Peer-to-Peer Electronic Cash System. (2008) https://bitcoin.org/bitcoin.pdf. 46. Ober, M., Katzenbeisser, S., Hamacher, K.: Structure and Anonymity of the Bitcoin Transaction Graph. Future Internet 5(2) (2013) pp. 237–250 47. Office of the Law Revision Counsel of the United States House of Representatives: U.S. Code, Title 18, Chapter 110, § 2256 (2017) 48. Okupski, K.: Bitcoin Developer Reference. Technical report (2014) 49. Peerenboom, R.P.: Assessing Human Rights in China: Why the Double Standard. (2005) Accessed 09/23/2017. 50. PoEx Co., Ltd: Proof of Existence. https://proofofexistence.com (2015) Accessed 09/23/2017. 51. Puddu, I., Dmitrienko, A., Capkun, S.: µchain: How to forget without hard forks. IACR Cryptology ePrint Archive 2017/106 (2017) Accessed 09/23/2017. 52. Reid, F., Harrigan, M.: An Analysis of Anonymity in the Bitcoin System. In: Security and Privacy in Social Networks. (2013) pp. 197–223 53. Ron, D., Shamir, A.: Quantitative Analysis of the Full Bitcoin Transaction Graph. In: FC. (2013) pp. 6–24 54. Scheller, S.H.: A Picture Is Worth a Thousand Words: The Legal Implications of Revenge Porn. North Carolina Law Review 93(2) (2015) pp. 551–595 55. Selcuk, A.A., Uzun, E., Pariente, M.R.: A Reputation-based Trust Management System for P2P Networks.
In: IEEE CCGrid. (2004) pp. 251–258 56. Shirriff, K.: Hidden surprises in the Bitcoin blockchain and how they are stored: Nelson Mandela, Wikileaks, photos, and Python software. http://www. righto.com/2014/02/ascii-bernanke-wikileaks-photographs.html (2014) Ac- cessed 09/23/2017. 57. Sleiman, M.D., Lauf, A.P., Yampolskiy, R.: Bitcoin message: Data insertion on a proof-of-work cryptocurrency system. In: ACM CW. (2015) pp. 332–336 58. Snow, P., Deery, B., Lu, J., Johnston, D., Kirby, P.: Factom: Business Processes Secured by Immutable Audit Trails on the Blockchain. https://www.factom.com/ devs/docs/guide/factom-white-paper-1-0 (2014) Accessed 09/23/2017. 59. Spagnuolo, M., Maggi, F., Zanero, S.: BitIodine: Extracting Intelligence from the Bitcoin Network. In: FC. (2014) pp. 457–468 60. Standing Committee of the National People’s Congress: Law of the People’s Re- public of China on Guarding State Secrets. (1989) Accessed 09/23/2017. 61. Taylor, G.: Concepts of Intention in German Criminal Law. Oxford Journal of Legal Studies 24(1) (2004) pp. 99–127 62. Tomescu, A., Devadas, S.: Catena: Efficient non-equivocation via bitcoin. In: IEEE S&P. (2017) pp. 393–409 63. Tucker, E.: A Look at Federal Cases on Handling Classified In- formation. http://www.military.com/daily-news/2016/01/30/a-look-at- federal-cases-on-handling-classified-information.html (2016) Accessed 09/23/2017. 64. United Nations: Appendix to the Optional protocols to the Convention on the Rights of the Child on the involvement of children in armed conflict and on the sale of children, child prostitution and child pornography (2000) 65. United Nations: Optional protocols to the Convention on the Rights of the Child on the involvement of children in armed conflict and on the sale of children, child prostitution and child pornography. 2171 (2000) pp. 247–254 66. Waldman, M., Rubin, A.D., Cranor, L.: Publius: A Robust, Tamper-Evident, Censorship-Resistant and Source-Anonymous Web Publishing System. In: USENIX Security. (2000) pp. 59–72 67. Walsh, K., Sirer, E.G.: Experience with an Object Reputation System for Peer- to-peer Filesharing. In: NSDI. (2006) 68. Wei, W.: Ancient ’STONED’ Virus Signatures found in Bitcoin Block- chain. https://thehackernews.com/2014/05/microsoft-security-essential- found.html (2014) Accessed 09/23/2017. 69. Wood, G.: Ethereum: A Secure Decentralised Generalised Transaction Ledger. Ethereum Project Yellow Paper (2016) Accessed 09/23/2017. 70. Zeilinger, M.: Digital art as ‘monetised graphics’: Enforcing intellectual property on the blockchain. Philosophy & Technology (2016) 71. Ziegeldorf, J.H., Grossmann, F., Henze, M., Inden, N., Wehrle, K.: CoinParty: Secure Multi-Party Mixing of Bitcoins. In: ACM CODASPY. (2015) pp. 75–86 72. Ziegeldorf, J.H., Matzutt, R., Henze, M., Grossmann, F., Wehrle, K.: Secure and Anonymous Decentralized Bitcoin Mixing. FGCS 80 (3 2018) 448–466 73. Zimmermann, T., Rüth, J., Wirtz, H., Wehrle, K.: Maintaining Integrity and Reputation in Content Offloading. In: IEEE/IFIP WONS. (2016) pp. 
1–8
feeds-dltj-org-5409 ---- Disruptive Library Technology Jester We're Disrupted, We're Librarians, and We're Not Going to Take It Anymore More Thoughts on Pre-recording Conference Talks Over the weekend, I posted an article here about pre-recording conference talks and sent a tweet about the idea on Monday. I hoped to generate discussion about recording talks to fill in gaps—positive and negative—about the concept, and I was not disappointed. I’m particularly thankful to Lisa Janicke Hinchliffe and Andromeda Yelton along with Jason Griffey, Junior Tidal, and Edward Lim Junhao for generously sharing their thoughts. I added to the previous article’s bullet points and am expanding on some of the issues here. I’m inviting everyone mentioned to let me know if I’m mischaracterizing their thoughts, and I will correct this post if I hear from them. (I haven’t found a good comments system to hook into this static site blog.) Pre-recorded Talks Limit Presentation Format Lisa Janicke Hinchliffe made this point early in the feedback: @DataG For me downside is it forces every session into being a lecture. For two decades CfPs have emphasized how will this season be engaging/not just a talking head? I was required to turn workshops into talks this year. Even tho tech can do more. Not at all best pedagogy for learning— Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021 Jason described the “flipped classroom” model that he had in mind as the NISOplus2021 program was being developed. The flipped classroom model is one where students do the work of reading material and watching lectures, then come to the interactive time with the instructors ready with questions and comments about the material. Rather than the instructor lecturing during class time, the class time becomes a discussion about the material. For NISOplus, “the recording is the material the speaker and attendees are discussing” during the live Zoom meetings. In the previous post, I described how the speaker could respond in text chat while the recording replay is beneficial. Lisa went on to say: @DataG Q+A is useful but isn't an interactive session. To me, interactive = participants are co-creating the session, not watching then commenting on it.— Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021 She described an example: the SSP preconference she ran at CHS. I’m paraphrasing her tweets in this paragraph. The preconference had a short keynote and an “Oprah-style” panel discussion (not pre-prepared talks). This was done live; nothing was recorded. After the panel, people worked in small groups using Zoom and a set of Google Slides to guide the group work. The small groups reported their discussions back to all participants.
Andromeda points out (paraphrasing twitter-speak): “Presenters will need much more— and more specialized—skills to pull it off, and it takes a lot more work.” And Lisa adds: “Just so there is no confusion … I don’t think being online makes it harder to do interactive. It’s the pre-recording. Interactive means participants co-create the session. A pause to chat isn’t going to shape what comes next on the recording.” Increased Technical Burden on Speakers and Organizers @ThatAndromeda @DataG Totally agree on this. I had to pre-record a conference presentation recently and it was a terrible experience, logistically. I feel like it forces presenters to become video/sound editors, which is obviously another thing to worry about on top of content and accessibility.— Junior Tidal (@JuniorTidal) April 5, 2021 Andromeda also agreed with this: “I will say one of the things I appreciated about NISO is that @griffey did ALL the video editing, so I was not forced to learn how that works.” She continued, “everyone has different requirements for prerecording, and in [Code4Lib’s] case they were extensive and kept changing.” And later added: “Part of the challenge is that every conference has its own tech stack/requirements. If as a presenter I have to learn that for every conference, it’s not reducing my workload.” It is hard not to agree with this; a high-quality (stylistically and technically) recording is not easy to do with today’s tools. This is also a technical burden for meeting organizers. The presenters will put a lot of work into talks—including making sure the recordings look good; whatever playback mechanism is used has to honor the fidelity of that recording. For instance, presenters who have gone through the effort to ensure the accessibility of the presentation color scheme want the conference platform to display the talk “as I created it.” The previous post noted that recorded talks also allow for the creation of better, non-real-time transcriptions. Lisa points out that presenters will want to review that transcription for accuracy, which Jason noted adds to the length of time needed before the start of a conference to complete the preparations. Increased Logistical Burden on Presenters @ThatAndromeda @DataG @griffey Even if prep is no more than the time it would take to deliver live (which has yet to be case for me and I'm good at this stuff), it is still double the time if you are expected to also show up live to watch along with everyone else.— Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021 This is a consideration I hadn’t thought through—that presenters have to devote more clock time to the presentation because first they have to record it and then they have to watch it. (Or, as Andromeda added, “significantly more than twice the time for some people, if they are recording a bunch in order to get it right and/or doing editing.”) No. Audience. Reaction. @DataG @griffey 3) No. Audience. Reaction. I give a joke and no one laughs. Was it funny? Was it not funny? Talks are a *performance* and a *relationship*; I'm getting energy off the audience, I'm switching stuff on the fly to meet their vibe. Prerecorded/webinar is dead. Feels like I'm bombing.— Andromeda Yelton (@ThatAndromeda) April 5, 2021 Wow, yes. I imagine it would take a bit of imagination to get in the right mindset to give a talk to a small camera instead of an audience. I wonder how stand-up comedians are dealing with this as they try to put on virtual shows. 
Andromeda summed this up: @DataG @griffey oh and I mean 5) I don't get tenure or anything for speaking at conferences and goodness knows I don't get paid. So the ENTIRE benefit to me is that I enjoy doing the talk and connect to people around it. prerecorded talk + f2f conf removes one of these; online removes both.— Andromeda Yelton (@ThatAndromeda) April 5, 2021 Also in this heading could be “No Speaker Reaction”—or the inability for subsequent speakers at a conference to build on something that someone said earlier. In the Code4Lib Slack team, Daniel S noted: “One thing comes to mind on the pre-recording [is] the issue that prerecorded talks lose the ‘conversation’ aspect where some later talks at a conference will address or comment on earlier talks.” Kate Deibel added: “Exactly. Talks don’t get to spontaneously build off of each other or from other conversations that happen at the conference.” Currency of information Lisa points out that pre-recording talks before an event means there is a delay between the recording and the playback. In the example she pointed out, there was a talk at RLUK that, had it been pre-recorded, would have been about the University of California working on an Open Access deal with Elsevier; live, it was able to be “the deal we announced earlier this week”. Conclusions? Near the end of the discussion, Lisa added: @DataG @griffey @ThatAndromeda I also recommend going forward that the details re what is required of presenters be in the CfP. It was one thing for conferences that pivoted (huge effort!) but if you write the CfP since the pivot it should say if pre-record, platform used, etc.— Lisa Janicke Hinchliffe (@lisalibrarian) April 5, 2021 …and Andromeda added: “Strong agree here. I understand that this year everyone was making it up as they went along, but going forward it’d be great to know that in advance.” That means conferences will need to take these needs into account well before the Call for Proposals (CfP) is published. A conference that is thinking now about pre-recording their talks must work through these issues and set expectations with presenters early. As I hoped, the Twitter replies tempered my eagerness for the all-recorded style with some real-world experience. There could be possibilities here, but adapting face-to-face meetings to a world with less travel won’t be simple and will take significant thought beyond the issues of technology platforms. Edward Lim Junhao summarized this nicely: “I favor unpacking what makes up our prof conferences. I’m interested in recreating that shared experience, the networking, & the serendipity of learning sth you didn’t know. I feel in-person conferences now have to offer more in order to justify people traveling to attend them.” Related, Andromeda said: “Also, for a conf that ultimately puts its talks online, it’s critical that it have SOMEthing beyond content delivery during the actual conference to make it worth registering rather than just waiting for youtube. realtime interaction with the speaker is a pretty solid option.” If you have something to add, reach out to me on Twitter. Given enough responses, I’ll create another summary. Let’s keep talking about what that looks like and sharing discoveries with each other. The Tree of Tweets It was a great discussion, and I think I pulled in the major ideas in the summary above. With some guidance from Ed Summers, I’m going to embed the Twitter threads below using Treeverse by Paul Butler.
We might be stretching the boundaries of what is possible, so no guarantees that this will be viewable for the long term. Should All Conference Talks be Pre-recorded? The Code4Lib conference was last week. That meeting used all pre-recorded talks, and we saw the benefits of pre-recording for attendees, presenters, and conference organizers. Should all talks be pre-recorded, even when we are back face-to-face? Note! After I posted a link to this article on Twitter, there was a great response of thoughtful comments. I've included new bullet points below and summarized the responses in another blog post. As an entirely virtual conference, I think we can call Code4Lib 2021 a success. Success ≠ Perfect, of course, and last week the conference coordinating team got together on a Zoom call for a debriefing session. We had a lengthy discussion about what we learned and what we wanted to take forward to the 2022 conference, which we’re anticipating will be something with a face-to-face component. That last sentence was tough to compose: “…will be face-to-face”? “…will be both face-to-face and virtual”? (Or another fully virtual event?) Truth be told, I don’t think we know yet. I think we know with some certainty that the COVID pandemic will become much more manageable by this time next year—at least in North America and Europe. (Code4Lib draws from primarily North American library technologists with a few guests from other parts of the world.) I’m hearing from higher education institutions, though, that travel is going to be severely curtailed…if not for health risk reasons, then because budgets have been slashed. So one has to wonder what a conference will look like next year. I’ve been to two online conferences this year: NISOplus21 and Code4Lib. Both meetings recorded talks in advance and started playback of the recordings at a fixed point in time. This was beneficial for a couple of reasons. For organizers and presenters, pre-recording allowed technical glitches to be worked through without the pressure of a live event happening. Technology is not nearly perfect enough or ubiquitously spread to count on it working in real-time. 1 NISOplus21 also used the recordings to get transcribed text for the videos. (Code4Lib used live transcriptions on the synchronous playback.) Attendees and presenters benefited from pre-recording because the presenters could be in the text chat channel to answer questions and provide insights. Having the presenter free during the playback offers new possibilities for making talks more engaging: responding in real-time to polls, getting forehand knowledge of topics for subsequent real-time question/answer sessions, and so forth. The synchronous playback time meant that there was a point when (almost) everyone was together watching the same talk—just as in face-to-face sessions. During the Code4Lib conference coordinating debrief call, I asked the question: “If we saw so many benefits to pre-recording talks, do we want to pre-record them all next year?” In addition to the reasons above, pre-recorded talks benefit those who are not comfortable speaking English or are first-time presenters. (They have a chance to re-do their talk as many times as they need in a much less stressful environment.) “Live” demos are much smoother because a recording can be restarted if something goes wrong. Each year, at least one presenter needs to use their own machine (custom software, local development environment, etc.), and swapping out presenter computers in real-time is risky. 
And it is undoubtedly easier to impose time requirements with recorded sessions. So why not pre-record all of the talks? I get it—it would be different to sit in a ballroom watching a recording play on big screens at the front of the room while the podium is empty. But is it so different as to dramatically change the experience of watching a speaker at a podium? In many respects, we had a dry-run of this during Code4Lib 2020. It was at the early stages of the coming lockdowns when institutions started barring employee travel, and we had to bring in many presenters remotely. I wrote a blog post describing the setup we used for remote presenters, and at the end, I said: I had a few people comment that they were taken aback when they realized that there was no one standing at the podium during the presentation. Some attendees, at least, quickly adjusted to this format. For those with the means and privilege of traveling, there can still be face-to-face discussions in the hall, over meals, and social activities. For those that can’t travel (due to risks of traveling, family/personal responsibilities, or budget cuts), the attendee experience is a little more level—everyone is watching the same playback and in the same text backchannels during the talk. I can imagine a conference tool capable of segmenting chat sessions during the talk playback to “tables” where you and close colleagues can exchange ideas and then promote the best ones to a conference-wide chat room. Something like that would be beneficial as attendance grows for events with an online component, and it would be a new form of engagement that isn’t practical now. There are undoubtedly reasons not to pre-record all session talks (beyond the feels-weird-to-stare-at-an-unoccupied-ballroom-podium reasons). During the debriefing session, one person brought up that having all pre-recorded talks erodes the justification for in-person attendance. I can see a manager saying, “All of the talks are online…just watch it from your desk. Even your own presentation is pre-recorded, so there is no need for you to fly to the meeting.” That’s legitimate. So if you like bullet points, here’s how it lays out. Pre-recording all talks is better for: Accessibility: better transcriptions for recorded audio versus real-time transcription (and probably at a lower cost, too) Engagement: the speaker can be in the text chat during playback, and there could be new options for backchannel discussions Better quality: speakers can re-record their talk as many times as needed Closer equality: in-person attendees are having much the same experience during the talk as remote attendees Downsides for pre-recording all talks: Feels weird: yeah, it would be different Erodes justification: indeed a problem, especially for those for whom giving a speech is the only path to getting the networking benefits of face-to-face interaction Limits presentation format: it forces every session into being a lecture. For two decades CfPs have emphasized how will this season be engaging/not just a talking head? 
(Lisa Janicke Hinchliffe) Increased Technical Burden on Speakers and Organizers: conference organizers asking presenters to do their own pre-recording is a barrier (Junior Tidal), and organizers have added new requirements for themselves No Audience Feedback: pre-recording forces the presenter into an unnatural state relative to the audience (Andromeda Yelton) Currency of information: pre-recording talks before an event naturally introduces a delay between the recording and the playback. (Lisa Janicke Hinchliffe) I’m curious to hear of other reasons, for and against. Reach out to me on Twitter if you have some. The COVID-19 pandemic has changed our society and will undoubtedly transform it in ways that we can’t even anticipate. Is the way that we hold professional conferences one of them? Can we just pause for a moment and consider the decades of work and layers of technology that make a modern teleconference call happen? For you younger folks, there was a time when one couldn’t assume the network to be there. As in: the operating system on your computer couldn’t be counted on to have a network stack built into it. In the earliest years of my career, we were tickled pink to have Macintoshes at the forefront of connectivity through GatorBoxes. Go read the first paragraph of that Wikipedia article on GatorBoxes…TCP/IP was tunneled through LocalTalk running over PhoneNet on unshielded twisted pairs no faster than about 200 kbit/second. (And we loved it!) Now the network is expected; needing to know about TCP/IP is pushed so far down the stack as to be forgotten…assumed. Sure, the software on top now is buggy and bloated—is my Zoom client working? has Zoom’s service gone down?—but the network…we take that for granted. ↩ User Behavior Access Controls at a Library Proxy Server are Okay Earlier this month, my Twitter timeline lit up with mentions of a half-day webinar called Cybersecurity Landscape - Protecting the Scholarly Infrastructure. What had riled up the people I follow on Twitter was the first presentation: “Security Collaboration for Library Resource Access” by Cory Roach, the chief information security officer at the University of Utah. Many of the tweets and articles linked in tweets were about a proposal for a new round of privacy-invading technology coming from content providers as a condition of libraries subscribing to publisher content. One of the voices that I trust was urging caution: I highly recommend you listen to the talk, which was given by a university CIO, and judge if this is a correct representation. FWIW, I attended the event and it is not what I took away.— Lisa Janicke Hinchliffe (@lisalibrarian) November 14, 2020 As near as I can tell, much of the debate traces back to this article: Scientific publishers propose installing spyware in university libraries to protect copyrights - Coda Story https://t.co/rtCokIukBf— Open Access Tracking Project (@oatp) November 14, 2020 The article describes Cory’s presentation this way: One speaker proposed a novel tactic publishers could take to protect their intellectual property rights against data theft: introducing spyware into the proxy servers academic libraries use to allow access to their online services, such as publishers’ databases. The “spyware” moniker is quite scary. It is what made me want to seek out the recording from the webinar and hear the context around that proposal. My understanding (after watching the presentation) is that the proposal is not nearly as concerning.
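Before getting into the details, it helps to see the flavor of what is actually being proposed: the university mines its own web proxy and identity-provider logs for suspicious per-account patterns. As a toy illustration (the file name and log layout are assumptions on my part, an Apache-style log with the client IP in the first field and the authenticated username in the third, and the threshold is invented), flagging accounts that show up from an unusually large number of client addresses could be as simple as:

    # Count distinct client IPs per authenticated user in a day of proxy logs,
    # then print the accounts that exceed an arbitrary threshold.
    awk '
      $3 != "-" { seen[$3 SUBSEP $1] = 1 }
      END {
        for (k in seen) { split(k, pair, SUBSEP); ips[pair[1]]++ }
        for (user in ips) if (ips[user] > 5) print user, ips[user], "distinct client IPs"
      }
    ' ezproxy.log

A real deployment along the lines Cory describes would also pull in the identity provider's logs and automate the per-account blocking; the point of the sketch is just that this kind of analysis can run entirely on campus systems.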
Although there is one problematic area—the correlation of patron identity with requested URLs—overall, what is described is a sound and common practice for securing web applications. To the extent that it is necessary to determine a user’s identity before allowing access to licensed content (an unfortunate necessity because of the state of scholarly publishing), this is an acceptable proposal. (Through the university communications office, Cory published a statement about the reaction to his talk.) In case you didn’t know, a web proxy server ensures the patron is part of the community of licensed users, and the publisher trusts requests that come through the web proxy server. The point of Cory’s presentation is that the username/password checking at the web proxy server is a weak form of access control that is subject to four problems: phishing (sending email to trick a user into giving up their username/password) social engineering (non-email ways of tricking a user into giving up their username/password) credential reuse (systems that are vulnerable because the user used the same password in more than one place) hacktivism (users that intentionally give out their username/password so others can access resources) Right after listing these four problems, Cory says: “But anyway we look at it, we can safely say that this is primarily a people problem and the technology alone is not going to solve that problem. Technology can help us take reasonable precautions… So long as the business model involves allowing access to the data that we’re providing and also trying to protect that same data, we’re unlikely to stop theft entirely.” His proposal is to place “reasonable precautions” in the web proxy server as it relates to the campus identity management system. This is a slide from his presentation: Slide from presentation by Cory Roach I find this layout (and lack of labels) somewhat confusing, so I re-imagined the diagram as this: Revised 'Modern Library Design' The core of Cory’s presentation is to add predictive analytics and per-user blocking automation to the analysis of the log files from the web proxy server and the identity management server. By doing so, the university can react quicker to compromised usernames and passwords. In fact, it could probably do so more quickly than the publisher could do with its own log analysis and reporting back to the university. Where Cory runs into trouble is this slide: Slide from presentation by Cory Roach In this part of the presentation, Cory describes the kinds of patron-identifying data that the university could-or-would collect and analyze to further the security effort. In search engine optimization, these sorts of data points are called “signals” and are used to improve the relevance of search results; perhaps there is an equivalent term in access control technology. But for now, I’ll just call them “signals”. There are some problems in gathering these signals—most notably the correlation between user identity and “URLs Requested”. In the presentation, he says: “You can also move over to behavioral stuff. So it could be, you know, why is a pharmacy major suddenly looking up a lot of material on astrophysics or why is a medical professional and a hospital suddenly interested in internal combustion.
Things that just don’t line up and we can identify fishy behavior.” It is core to the library ethos that we make our best effort to not track what a user is interested in—to not build a profile of a user’s research unless they have explicitly opted into such data collection. As librarians, we need to gracefully describe this professional ethos and work that into the design of the systems used on campus (and at the publishers). Still, there is much to be said for using some of the other signals to analyze whether a particular request is from an authorized community member. For instance, Cory says: “We commonly see this user coming in from the US and today it’s coming in from Botswana. You know, has there been enough time that they could have traveled from the US to Botswana and actually be there? Have they ever access resources from that country before is there residents on record in that country?” The best part of what Cory is proposing is that the signals’ storage and processing is at the university and not at the publisher. I’m not sure if Cory knew this, but a recent version of EZProxy added a UsageLimit directive that builds in some of these capabilities. It can set per-user limits based on the number of page requests or the amount of downloaded information over a specified interval. One wonders if somewhere in OCLC’s development queue is the ability to detect IP addresses from multiple networks (geographic detection) and browser differences across a specified interval. Still, pushing this up to the university’s identity provider allows for a campus-wide view of the signals…not just the ones coming through the library. Also, in designing the system, there needs to be clarity about how the signals are analyzed and used. I think Cory knew this as well: “we do have to be careful about not building bias into the algorithms.” Yeah, the need for this technology sucks. Although it was the tweet to the Coda Story about the presentation that blew up, the thread of the story goes through TechDirt to a tangential paragraph from Netzpolitik in an article about Germany’s licensing struggle with Elsevier. With this heritage, any review of the webinar’s ideas are automatically tainted by the disdain the library community in general has towards Elsevier. It is reality—an unfortunate reality, in my opinion—that the traditional scholarly journal model has publishers exerting strong copyright protection on research and ideas behind paywalls. (Wouldn’t it be better if we poured the anti-piracy effort into improving scholarly communication tools in an Open Access world? Yes, but that isn’t the world we live in.) Almost every library deals with this friction by employing a web proxy server as an agent between the patron and the publisher’s content. The Netzpolitik article says: …but relies on spyware in the fight against „cybercrime“ Of Course, Sci-Hub and other shadow libraries are a thorn in Elsevier’s side. Since they have existed, libraries at universities and research institutions have been much less susceptible to blackmail. Their staff can continue their research even without a contract with Elsevier. Instead of offering transparent open access contracts with fair conditions, however, Elsevier has adopted a different strategy in the fight against shadow libraries. These are to be fought as „cybercrime“, if necessary also with technological means. 
Within the framework of the „Scholarly Networks Security Initiative (SNSI)“, which was founded together with other large publishers, Elsevier is campaigning for libraries to be upgraded with security technology. In a SNSI webinar entitled „Cybersecurity Landscape – Protecting the Scholarly Infrastructure“*, hosted by two high-ranking Elsevier managers, one speaker recommended that publishers develop their own proxy or a proxy plug-in for libraries to access more (usage) data („develop or subsidize a low cost proxy or a plug-in to existing proxies“). With the help of an „analysis engine“, not only could the location of access be better narrowed down, but biometric data (e.g. typing speed) or conspicuous usage patterns (e.g. a pharmacy student suddenly interested in astrophysics) could also be recorded. Any doubts that this software could also be used—if not primarily—against shadow libraries were dispelled by the next speaker. An ex-FBI analyst and IT security consultant spoke about the security risks associated with the use of Sci-Hub. The other commentary that I saw was along similar lines: [Is the SNSI the new PRISM? bjoern.brembs.blog](http://bjoern.brembs.net/2020/10/is-the-snsi-the-new-prism/) [Academics band together with publishers because access to research is a cybercrime chorasimilarity](https://chorasimilarity.wordpress.com/2020/11/14/academics-band-together-with-publishers-because-access-to-research-is-a-cybercrime/) [WHOIS behind SNSI & GetFTR? Motley Marginalia](https://csulb.edu/~ggardner/2020/11/16/snsi-getftr/) Let’s face it: any friction beyond follow-link-to-see-PDF is more friction than a researcher deserves. I doubt we would design a scholarly communication system this way were we to start from scratch. But the system is built on centuries of evolving practice, organizations, and companies. It really would be a better world if we didn’t have to spend time and money on scholarly publisher paywalls. And I’m grateful for the Open Access efforts that are pivoting scholarly communications into an open-to-all paradigm. That doesn’t negate the need to provide better options for content that must exist behind a paywall. So what is this SNSI thing? The webinar where Cory presented was the first mention I’d seen of a new group called the Scholarly Networks Security Initiative (SNSI). SNSI is the latest in a series of publisher-driven initiatives to reduce the paywall’s friction for paying users or library patrons coming from licensing institutions. GetFTR (my thoughts) and Seamless Access (my thoughts). (Disclosure: I’m serving on two working groups for Seamless Access that are focused on making it possible for libraries to sensibly and sanely integrate the goals of Seamless Access into campus technology and licensing contracts.) Interestingly, while the Seamless Access initiative is driven by a desire to eliminate web proxy servers, this SNSI presentation upgrades a library’s web proxy server and makes it a more central tool between the patron and the content. One might argue that all access on campus should come through the proxy server to benefit from this kind of access control approach. It kinda makes one wonder about the coordination of these efforts. Still, SNSI is on my radar now, and I think it will be interesting to see what the next events and publications are from this group. As a Cog in the Election System: Reflections on My Role as a Precinct Election Official I may nod off several times in composing this post the day after election day. 
Hopefully, in reading it, you won’t. It is a story about one corner of democracy. It is a journal entry about how it felt to be a citizen doing what I could do to make other citizens’ voices be heard. It needed to be written down before the memories and emotions are erased by time and naps. Yesterday I was a precinct election officer (PEO—a poll worker) for Franklin County—home of Columbus, Ohio. It was my third election as a PEO. The first was last November, and the second was the election aborted by the onset of the coronavirus in March. (Not sure that second one counts.) It was my first as a Voting Location Manager (VLM), so I felt the stakes were high to get it right. Would there be protests at the polling location? Would I have to deal with people wearing candidate T-shirts and hats or not wearing masks? Would there be a crash of election observers, whether official (scrutinizing our every move) or unofficial (that I would have to remove)? It turns out the answer to all three questions was “no”—and it was a fantastic day of civic engagement by PEOs and voters. There were well-engineered processes and policies, happy and patient enthusiasm, and good fortune along the way. This story is going to turn out okay, but it could have been much worse. Because of the complexity of the election day voting process, last year Franklin County started allowing PEOs to do some early setup on Monday evenings. The early setup started at 6 o’clock. I was so anxious to get it right that the day before I took the printout of the polling room dimensions from my VLM packet, scanned it into OmniGraffle on my computer, and designed a to-scale diagram of what I thought the best layout would be. The real thing only vaguely looked like this, but it got us started. What I imagined our polling place would look like We could set up tables, unpack equipment, hang signs, and other tasks that don’t involve turning on machines or breaking open packets of ballots. One of the early setup tasks was updating the voters’ roster on the electronic poll pads. As happened around the country, there was a lot of early voting activity in Franklin County, so the update file must have been massive. The electronic poll pads couldn’t handle the update; they hung at step 8-of-9 for over an hour. I called the Board of Elections and got ahold of someone in the equipment warehouse. We tried some of the simple troubleshooting steps, and he gave me his cell phone number to call back if it wasn’t resolved. By 7:30, everything was done except for the poll pad updates, and the other PEOs were wandering around. I think it was 8 o’clock when I said everyone could go home while the two Voting Location Deputies and I tried to get the poll pads working. I called the equipment warehouse and we hung out on the phone for hours…retrying the updates based on the advice of the technicians called in to troubleshoot. I even “went rogue” towards the end. I searched the web for the messages on the screen to see if anyone else had seen the same problem with the poll pads. The electronic poll pad is an iPad with a single, dedicated application, so I even tried some iPad reset options to clear the device cache and perform a hard reboot. Nothing worked—still stuck at step 8-of-9. The election office people sent us home at 10 o’clock. Even on the way out the door, I tried a rogue option: I hooked a portable battery to one of the electronic polling pads to see if the update would complete overnight and be ready for us the next day. It didn’t, and it wasn’t. 
Text from Board of Elections Polling locations in Ohio open at 6:30 in the morning, and PEOs must report to their sites by 5:30. So I was up at 4:30 for a quick shower and packing up stuff for the day. Early in the setup process, the Board of Elections sent a text that the electronic poll pads were not going to be used and to break out the “BUMPer Packets” to determine a voter’s eligibility to vote. At some point, someone told me what “BUMPer” stood for. I can’t remember, but I can imagine it is Back-Up-something-something. “Never had to use that,” the trainers told me, but it is there in case something goes wrong. Well, it is the year 2020, so was something going to go wrong? Fortunately, the roster judges and one of the voting location deputies tore into the BUMPer Packet and got up to speed on how to use it. It is an old fashioned process: the voter states their name and address, the PEO compares that with the details on the paper ledger, and then asks the voter to sign beside their name. With an actual pen…old fashioned, right? The roster judges had the process down to a science. They kept the queue of verified voters full waiting to use the ballot marker machines. The roster judges were one of my highlights of the day. And boy did the voters come. By the time our polling location opened at 6:30 in the morning, they were wrapped around two sides of the building. We were moving them quickly through the process: three roster tables for checking in, eight ballot-marking machines, and one ballot counter. At our peak capacity, I think we were doing 80 to 90 voters an hour. As good as we were doing, the line never seemed to end. The Franklin County Board of Elections received a grant to cover the costs of two greeters outside that helped keep the line orderly. They did their job with a welcoming smile, as did our inside greeter that offered masks and a squirt of hand sanitizer. Still, the voters kept back-filling that line, and we didn’t see a break until 12:30. The PEOs serving as machine judges were excellent. This was the first time that many voters had seen the new ballot equipment that Franklin County put in place last year. I like this new equipment: the ballot marker prints your choices on a card that it spits out. You can see and verify your choices on the card before you slide it into a separate ballot counter. That is reassuring for me, and I think for most voters, too. But it is new, and it takes a few extra moments to explain. The machine judges got the voters comfortable with the new process. And some of the best parts of the day were when they announced to the room that a first-time voter had just put their card into the ballot counter. We would all pause and cheer. The third group of PEOs at our location were the paper table judges. They handle all of the exceptions. Someone wants to vote with a pre-printed paper ballot rather than using a machine? To the paper table! The roster shows that someone requested an absentee ballot? That voter needs to vote a “provisional” ballot that will be counted at the Board of Elections office if the absentee ballot isn’t received in the mail. The paper table judges explain that with kindness and grace. In the wrong location? The paper table judges would find the correct place. The two paper table PEOs clearly had experience helping voters with the nuances of election processes. Rounding out the team were two voting location deputies (VLD). 
By law, a polling location can’t have a VLD and a voting location manager (VLM) of the same political party. That is part of the checks and balances built into the system. One VLD had been a VLM at this location, and she had a wealth of history and wisdom about running a smooth polling location. For the other VLD, this was his first experience as a precinct election officer, and he jumped in with both feet to do the visible and not-so-visible things that made for a smooth operation. He reminded me a bit of myself a year ago. My first PEO position was as a voting location deputy last November. The pair handled a challenging curbside voter situation where it wasn’t entirely clear if one of the voters in the car was sick. I’d be so lucky to work with them again. The last two hours of the open polls yesterday were dreadfully dull. After the excitement of the morning, we may have averaged a voter every 10 minutes for those last two hours. Everyone was ready to pack it in early and go home. (Polls in Ohio close at 7:30, so counting the hour early for setup and the half an hour for tear down, this was going to be a 14 to 15 hour day.) Over the last hour, I gave the PEOs little tasks to do. At one point, I said they could collect the barcode scanners attached to the ballot markers. We weren’t using them anyway because the electronic poll pads were not functional. Then, in stages (as it became evident that there was no final rush of voters), they could pack up one or two machines and put away tables. Our second to last voter was someone in medical scrubs that just got off their shift. I scared our last voter because she walked up to the roster table at 7:29:30. Thirty seconds later, I called out that the polls are closed (as I think a VLM is required to do), and she looked at me startled. (She got to vote, of course; that’s the rule.) She was our last voter; 799 voters in our precinct that day. Then our team packed everything up as efficiently as they had worked all day. We had put away the equipment and signs, done our final counts, closed out the ballot counter, and sealed the ballot bin. At 8:00, we were done and waving goodbye to our host facility’s office manager. One of the VLD rode along with me to the board of elections to drop off the ballots, and she told me of a shortcut to get there. We were among the first reporting results for Franklin County. I was home again by a quarter of 10—exhausted but proud. I’m so happy that I had something to do yesterday. After weeks of concern and anxiety for how the election was going to turn out, it was a welcome bit of activity to ensure the election was held safely and that voters got to have their say. It was certainly more productive than continually reloading news and election results pages. The anxiety of being put in charge of a polling location was set at ease, too. I’m proud of our polling place team and that the voters in our charge seemed pleased and confident about the process. Maybe you will find inspiration here. If you voted, hopefully it felt good (whether or not the result turned out as you wanted). If you voted for the first time, congratulations and welcome to the club (be on the look-out for the next voting opportunity…likely in the spring). If being a poll worker sounded like fun, get in touch with your local board of elections (here is information about being a poll worker in Franklin County). Democracy is participatory. You’ve got to tune in and show up to make it happen. 
Certificate of Appreciation Running an All-Online Conference with Zoom [post removed] This is an article draft that was accidentally published. I hope to work on a final version soon. If you really want to see it, I saved a copy on the Internet Archive Wayback Machine. With Gratitude for the NISO Ann Marie Cunningham Service Award During the inaugural NISO Plus meeting at the end of February, I was surprised and proud to receive the Ann Marie Cunningham Service award. Todd Carpenter, NISO’s executive director, let me know by tweet as I was not able to attend the conference. Pictured in that tweet is my co-recipient, Christine Stohn, who serves NISO with me as the co-chair of the Information Delivery and Interchange Topic Committee. This got me thinking about what NISO has meant to me. As I think back on it, my activity in NISO spans at least four employers and many hours of standard working group meetings, committee meetings, presentations, and ballot reviews. NISO Ann Marie Cunningham Service Award I did not know Ms Cunningham, the award’s namesake. My first job started when she was the NFAIS executive director in the early 1990s, and I hadn’t been active in the profession yet. I read her brief biography on the NISO website: The Ann Marie Cunningham Service award was established in 1994 to honor NFAIS members who routinely went above and beyond the normal call of duty to serve the organization. It is named after Ann Marie Cunningham who, while working with abstracting and information services such as Biological Abstracts and the Institute for Scientific Information (both now part of NISO-member Clarivate Analytics), worked tirelessly as an dedicated NFAIS volunteer. She ultimately served as the NFAIS Executive Director from 1991 to 1994 when she died unexpectedly. NISO is pleased to continue to present this award to honor a NISO volunteer who has shown the same sort of commitment to serving our organization. As I searched the internet for her name, I came across the proceedings of the 1993 NFAIS meeting, in which Ms Cunningham wrote the introduction with Wendy Wicks. These first sentences from some of the paragraphs of that introduction are as true today as they were then: In an era of rapidly expanding network access, time and distance no longer separate people from information. Much has been said about the global promise of the Internet and the emerging concept of linking information highways, to some people, “free” ways. What many in the networking community, however, seem to take for granted is the availability of vital information flowing on these high-speed links. I wonder what Ms Cunningham of 1993 would think of the information landscape today? Hypertext linking has certainly taken off, if not taken over, the networked information landscape. How that interconnectedness has improved with the adaptation of print-oriented standards and the creation of new standards that match the native capabilities of the network. In just one corner of that space, we have the adoption of PDF as a faithful print replica and HTML as a common tool for displaying information. In another corner, MARC has morphed into a communication format that far exceeds its original purpose of encoding catalog cards; we have an explosion of purpose-built metadata schemas and always the challenge of finding common ground in tools like Dublin Core and Schema.org. We’ve seen several generations of tools and protocols for encoding, distributing, and combining data in new ways to reach users. 
And still we strive to make it better…to more easily deliver a paper to its reader—a dataset to its next experimenter—an idea to be built upon by the next generation. It is that communal effort to make a better common space for ideas that drives me forward. To work in a community at the intersection of libraries, publishers, and service providers is an exciting and fulfilling place to be. I’m grateful to my employers that have given me the ability to participate while bringing the benefits of that connectedness to my organizations. I was not able to be at NISO Plus to accept the award in person, but I was so happy to be handed it by Jason Griffey of NISO about a week later during the Code4lib conference in Pittsburgh. What made that even more special was to learn that Jason created it on his own 3D printer. Thank you to the new NFAIS-joined-with-NISO community for honoring me with this service award. Tethering a Ubiquiti Network to a Mobile Hotspot I saw it happen. The cable-chewing device The contractor in the neighbor’s back yard with the Ditch Witch trencher burying a cable. I was working outside at the patio table and just about to go into a Zoom meeting. Then the internet dropped out. Suddenly, and with a wrenching feeling in my gut, I remembered where the feed line was buried between the house and the cable company’s pedestal in the right-of-way between the properties. Yup, he had just cut it. To be fair, the utility locator service did not mark my cable’s location, and he was working for a different cable provider than the one we use. (There are three providers in our neighborhood.) It did mean, though, that our broadband internet would be out until my provider could come and run another line. It took an hour of moping about the situation to figure out a solution, then another couple of hours to put it in place: an iPhone tethered to a Raspberry Pi that acted as a network bridge to my home network’s UniFi Security Gateway 3P. Network diagram with tethered iPhone A few years ago I was tired of dealing with spotty consumer internet routers and upgraded the house to UniFi gear from Ubiquiti. Rob Pickering, a college comrade, had written about his experience with the gear and I was impressed. It wasn’t a cheap upgrade, but it was well worth it. (Especially now with four people in the household working and schooling from home during the COVID-19 outbreak.) The UniFi Security Gateway has three network ports, and I was using two: one for the uplink to my cable internet provider (WAN) and one for the local area network (LAN) in the house. The third port can be configured as another WAN uplink or as another LAN port. And you can tell the Security Gateway to use the second WAN as a failover for the first WAN (or as load balancing the first WAN). So that is straightforward enough, but how do I get the Personal Hotspot on the iPhone to the second WAN port? That is where the Raspberry Pi comes in. The Raspberry Pi is a small computer with USB, ethernet, HDMI, and audio ports. The version I had laying around is a Raspberry Pi 2—an older model, but plenty powerful enough to be the network bridge between the iPhone and the home network. The toughest part was bootstrapping the operating system packages onto the Pi with only the iPhone Personal Hotspot as the network. That is what I’m documenting here for future reference. Bootstrapping the Raspberry Pi The Raspberry Pi runs its own operating system called Raspbian (a Debian/Linux derivative) as well as more mainstream operating systems.
I chose to use the Ubuntu Server for Raspberry Pi instead of Raspbian because I’m more familiar with Ubuntu. I tethered my MacBook Pro to the iPhone to download the Ubuntu 18.04.4 LTS image and follow the instructions for copying that disk image to the Pi’s microSD card. That allows me to boot the Pi with Ubuntu and a basic set of operating system packages. The Challenge: Getting the required networking packages onto the Pi It would have been really nice to plug the iPhone into the Pi with a USB-Lightning cable and have it find the tethered network. That doesn’t work, though. Ubuntu needs at least the usbmuxd package in order to see the tethered iPhone as a network device. That package isn’t a part of the disk image download. And of course I can’t plug my Pi into the home network to download it (see first paragraph of this post). My only choice was to tether the Pi to the iPhone over WiFi with a USB network adapter. And that was a bit of Ubuntu voodoo. Fortunately, I found instructions on configuring Ubuntu to use a WPA-protected wireless network (like the one that the iPhone Personal Hotspot is providing). In brief:

    sudo -i
    cd /root
    wpa_passphrase my_ssid my_ssid_passphrase > wpa.conf
    screen -q
    wpa_supplicant -Dwext -iwlan0 -c/root/wpa.conf
    <control-a> c
    dhclient -r
    dhclient wlan0

Explanation of lines:
1. Use sudo to get a root shell.
2. Change directory to root’s home.
3. Use the wpa_passphrase command to create a wpa.conf file. Replace my_ssid with the wireless network name provided by the iPhone (your iPhone’s name) and my_ssid_passphrase with the wireless network passphrase (see the “Wi-Fi Password” field in Settings -> Personal Hotspot).
4. Start the screen program (quietly) so we can have multiple pseudo terminals.
5. Run the wpa_supplicant command to connect to the iPhone wifi hotspot. We run this in the foreground so we can see the status/error messages; this program must continue running to stay connected to the wifi network.
6. Use the screen hotkey to create a new pseudo terminal. This is control-a followed by the letter c.
7. Use dhclient to clear out any DHCP network parameters.
8. Use dhclient to get an IP address from the iPhone over the wireless network.

Now I was at the point where I could install Ubuntu packages. (I ran ping www.google.com to verify network connectivity.) To install the usbmuxd and network bridge packages (and their prerequisites):

    apt-get install usbmuxd bridge-utils

If your experience is like mine, you’ll get an error back:

    couldn't get lock /var/lib/dpkg/lock-frontend

The Ubuntu Pi machine is now on the network, and the automatic process to install security updates is running. That locks the Ubuntu package registry until it finishes. That took about 30 minutes for me. (I imagine this varies based on the capacity of your tethered network and the number of security updates that need to be downloaded.) I monitored the progress of the automated process with the htop command and tried the apt-get command when it finished. If you are following along, now would be a good time to skip ahead to Configuring the UniFi Security Gateway if you haven’t already set that up. Turning the Raspberry Pi into a Network Bridge With all of the software packages installed, I restarted the Pi to complete the update:

    shutdown -r now

While it was rebooting, I pulled out the USB wireless adapter from the Pi and plugged in the iPhone’s USB cable. The Pi now saw the iPhone as eth1, but the network did not start until I went to the iPhone to say that I “Trust” the computer that it is plugged into.
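If you want a quick sanity check that the phone is really visible before setting up the bridge, something like the following should do it (the eth1 name is what my Pi used and may differ on yours; lsusb comes from the usbutils package):

    lsusb | grep -i apple      # the iPhone should be listed on the USB bus
    dmesg | grep -i ipheth     # the ipheth driver handles iPhone USB tethering
    ip link show eth1          # and the phone shows up as a network interface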
Once the iPhone was trusted, I ran these commands on the Ubuntu Pi:

    dhclient eth1
    brctl addbr iphonetether
    brctl addif iphonetether eth0 eth1
    brctl stp iphonetether on
    ifconfig iphonetether up

Explanation of lines:
1. Get an IP address from the iPhone over the USB interface.
2. Add a network bridge (the iphonetether is an arbitrary string; some instructions simply use br0 for the zero-ith bridge).
3. Add the two ethernet interfaces to the network bridge.
4. Turn on the Spanning Tree Protocol (I don’t think this is actually necessary, but it does no harm).
5. Bring up the bridge interface.

The bridge is now live! Thanks to Amitkumar Pal for the hints about using the Pi as a network bridge. More details about the bridge networking software are on the Debian Wiki. Note! I'm using a hardwired keyboard/monitor to set up the Raspberry Pi. I've heard from someone that was using SSH to run these commands, and the SSH connection would break off at brctl addif iphonetether eth0 eth1
Configuring the UniFi Security Gateway I have a UniFi Cloud Key, so I could change the configuration of the UniFi network with a browser. (You’ll need to know the IP address of the Cloud Key; hopefully you have that somewhere.) I connected to my Cloud Key at https://192.168.1.58:8443/ and clicked through the self-signed certificate warning. First I set up a second Wide Area Network (WAN—your uplink to the internet) for the iPhone Personal Hotspot: Settings -> Internet -> WAN Networks. Select “Create a New Network”:

    Network Name: Backup WAN
    IPv4 Connection Type: Use DHCP
    IPv6 Connection Types: Use DHCPv6
    DNS Server: 1.1.1.1 and 1.0.0.1 (CloudFlare’s DNS servers)
    Load Balancing: Failover only

The last selection is key…I wanted the gateway to only use this WAN interface as a backup to the main broadband interface. If the broadband comes back up, I want to stop using the tethered iPhone! Second, assign the Backup WAN to the LAN2/WAN2 port on the Security Gateway (Devices -> Gateway -> Ports -> Configure interfaces):

    Port WAN2/LAN2
    Network: WAN2
    Speed/Duplex: Autonegotiate

Apply the changes to provision the Security Gateway. After about 45 seconds, the Security Gateway failed over from “WAN iface eth0” (my broadband connection) to “WAN iface eth2” (my tethered iPhone through the Pi bridge). These showed up as alerts in the UniFi interface. Performance and Results So I’m pretty happy with this setup. The family has been running simultaneous Zoom calls and web browsing on the home network, and the performance has been mostly normal. Web pages do take a little longer to load, but whatever Zoom is using to dynamically adjust its bandwidth usage is doing quite well. This is chewing through the mobile data quota pretty fast, so it isn’t something I want to do every day. Knowing that this is possible, though, is a big relief. As a bonus, the iPhone is staying charged via the 1 amp power coming through the Pi. Managing Remote Conference Presenters with Zoom Bringing remote presenters into a face-to-face conference is challenging and fraught with peril. In this post, I describe a scheme using Zoom that had in-person attendees forgetting that the presenter was remote! The Code4Lib conference was this week, and with the COVID-19 pandemic breaking through, many individuals and institutions made decisions to not travel to Pittsburgh for the meeting. We had an unprecedented nine presentations that were brought into the conference via Zoom.
I was chairing the livestream committee for the conference (as I have done for several years—skipping last year), so it made the most sense for me to arrange a scheme for remote presenters. With the help of the on-site A/V contractor, we were able to pull this off with minimal requirements for the remote presenter. List of Requirements 2 Zoom Pro accounts 1 PC/Mac with video output, as if you were connecting an external monitor (the “Receiving Zoom” computer) 1 PC/Mac (the “Coordinator Zoom” computer) 1 USB audio interface Hardwired network connection for the Receiving Zoom computer (recommended) The Pro-level Zoom accounts were required because we needed to run a group call for longer than 40 minutes (to include setup time). And two were needed: one for the Coordinator Zoom machine and one for the dedicated Receiving Zoom machine. It would have been possible to consolidate the two Zoom Pro accounts and the two PC/Mac machines into one, but we had back-to-back presenters at Code4Lib, and I wanted to be able to help one remote presenter get ready while another was presenting. In addition to this equipment, the A/V contractor was indispensable in making the connection work. We fed the remote presenter’s video and audio from the Receiving Zoom computer to the contractor’s A/V switch through HDMI, and the contractor put the video on the ballroom projectors and audio through the ballroom speakers. The contractor gave us a selective audio feed of the program audio minus the remote presenter’s audio (so they wouldn’t hear themselves come back through the Zoom meeting). This becomes a little clearer in the diagram below. Physical Connections and Setup This diagram shows the physical connections between machines. The Audio Mixer and Video Switch were provided and run by the A/V contractor. The Receiving Zoom machine was the one that is connected to the A/V contractor’s Video Switch via an HDMI cable coming off the computer’s external monitor connection. In the Receiving Zoom computer’s control panel, we set the external monitor to mirror what was on the main monitor. The audio and video from the computer (i.e., the Zoom call) went out the HDMI cable to the A/V contractor’s Video Switch. The A/V contractor took the audio from the Receiving Zoom computer through the Video Switch and added it to the Audio Mixer as an input channel. From there, the audio was sent out to the ballroom speakers the same way audio from the podium microphone was amplified to the audience. We asked the A/V contractor to create an audio mix that includes all of the audio sources except the Receiving Zoom computer (e.g., in-room microphones) and plugged that into the USB Audio interface. That way, the remote presenter could hear the sounds from the ballroom—ambient laughter, questions from the audience, etc.—in their Zoom call. (Note that it was important to remove the remote presenter’s own speaking voice from this audio mix; there was a significant, distracting delay between the time the presenter spoke and the audio was returned to them through the Zoom call.) We used a hardwired network connection to the internet, and I would recommend that—particularly with tech-heavy conferences that might overflow the venue wi-fi. (You don’t want your remote presenter’s Zoom to have to compete with what attendees are doing.) Be aware that the hardwired network connection will cost more from the venue, and may take some time to get functioning since this doesn’t seem to be something that hotels often do. 
In the Zoom meeting, we unmuted the microphone and selected the USB Audio interface as the microphone input. As the Zoom meeting was connected, we made the meeting window full-screen so the remote presenter’s face and/or presentation were at the maximum size on the ballroom projectors. Setting Up the Zoom Meetings The two Zoom accounts came from the Open Library Foundation. (Thank you!) As mentioned in the requirements section above, these were Pro-level accounts. The two accounts were olf_host2@openlibraryfoundation.org and olf_host3@openlibraryfoundation.org. The olf_host2 account was used for the Receiving Zoom computer, and the olf_host3 account was used for the Coordinator Zoom computer. The Zoom meeting edit page looked like this: This is for the “Code4Lib 2020 Remote Presenter A” meeting with the primary host as olf_host2@openlibraryfoundation.org. Note these settings: A recurring meeting that ran from 8:00am to 6:00pm each day of the conference. Enable join before host is checked in case the remote presenter got on the meeting before I did. Record the meeting automatically in the cloud to use as a backup in case something goes wrong. Alternative Hosts is olf_host3@openlibraryfoundation.org The “Code4Lib 2020 Remote Presenter B” meeting was exactly the same except the primary host was olf_host3, and olf_host2 was added as an alternative host. The meetings were set up with each other as the alternative host so that the Coordinator Zoom computer could start the meeting, seamlessly hand it off to the Receiving Zoom computer, then disconnect. Preparing the Remote Presenter Remote presenters were given this information: Code4Lib will be using Zoom for remote presenters. In addition to the software, having the proper audio setup is vital for a successful presentation. Microphone: The best option is a headset or earbuds so a microphone is close to your mouth. Built-in laptop microphones are okay, but using them will make it harder for the audience to hear you. Speaker: A headset or earbuds are required. Do not use your computer’s built-in speakers. The echo cancellation software is designed for small rooms and cannot handle the delay caused by large ballrooms. You can test your setup with a test Zoom call. Be sure your microphone and speakers are set correctly in Zoom. Also, try sharing your screen on the test call so you understand how to start and stop screen sharing. The audience will see everything on your screen, so quit/disable/turn-off notifications that come from chat programs, email clients, and similar tools. Plan to connect to the Zoom meeting 30 minutes before your talk to work out any connection or setup issues. At the 30-minute mark before the remote presentation, I went to the ballroom lobby and connected to the designated Zoom meeting for the remote presenter using the Coordinator Zoom computer. I used this checklist with each presenter: Check presenter’s microphone level and sound quality (make sure headset/earbud microphone is being used!) Check presenter’s speakers and ensure there is no echo Test screen-sharing (start and stop) with presenter Remind presenter to turn off notifications from chat programs, email clients, etc. Remind the presenter that they need to keep track of their own time; there is no way for us to give them cues about timing other than interrupting them when their time is up The critical item was making sure the audio worked (that their computer was set to use the headset/earbud microphone and audio output). 
The result of all this preparation was excellent sound quality for the audience.

When the remote presenter was set on the Zoom meeting, I returned to the A/V table and asked a livestream helper to connect the Receiving Zoom computer to the remote presenter's Zoom meeting. At this point, the remote presenter could hear the ballroom audio of the speaker before them coming through the Receiving Zoom computer. I would then lock the Zoom meeting to prevent others from joining and interrupting the presenter (from the Zoom Participants panel, select More, then Lock Meeting). I hung out on the remote presenter's meeting on the Coordinator Zoom computer in case they had any last-minute questions. As the speaker in the ballroom was finishing up, I wished the remote presenter well and disconnected the Coordinator Zoom computer from the meeting. (I always selected Leave Meeting rather than End Meeting for All so that the Zoom meeting continued with the remote presenter and the Receiving Zoom computer.) As the remote presenter was being introduced—and the presenter would know because they could hear it in their Zoom meeting—the A/V contractor switched the video source for the ballroom projectors to the Receiving Zoom computer and unmuted the Receiving Zoom computer's channel on the Audio Mixer. At that point, the remote speaker was off and running!

Last Thoughts

This worked really well. Surprisingly well. So well that a few people commented that they were taken aback when they realized there was no one standing at the podium during the presentation.

I'm glad I had set up the two Zoom meetings. We had two cases where remote presenters were back-to-back. I was able to get the first remote presenter set up and ready on one Zoom meeting while preparing the second remote presenter on the other Zoom meeting. The most stressful part came when we disconnected the first presenter's Zoom meeting and quickly connected to the second presenter's. This was slightly awkward for the second remote presenter because they didn't hear their full introduction as it happened and had to jump right into their presentation. This could be solved by setting up a second Receiving Zoom computer, but that added complexity seemed too much for the benefit gained.

I would definitely recommend making this setup a part of the typical A/V preparations for future Code4Lib conferences. We don't know when an individual's circumstances (much less a worldwide pandemic) might cause a last-minute request for remote presentation capability, and the overhead of the setup is pretty minimal.

What is known about GetFTR at the end of 2019

In early December 2019, a group of publishers announced Get-Full-Text-Research, or GetFTR for short. There was a heck of a response on social media, and the response was—on the whole—not positive from my librarian-dominated corner of Twitter. For my early take on GetFTR, see my December 3rd blog post "Publishers going-it-alone (for now?) with GetFTR." As that post title suggests, I took the five founding GetFTR publishers to task on their take-it-or-leave-it approach. I think that is still a problem. To get you caught up, here is a list of other commentary.
- Roger Schonfeld's December 3rd "Publishers Announce a Major New Service to Plug Leakage" piece in The Scholarly Kitchen
- A tweet from Herbert Van de Sompel, the lead author of the OpenURL spec, on solving the appropriate copy problem
- The December 5th post "Get To Fulltext Ourselves, Not GetFTR." on the Open Access Button blog
- A Twitter thread on December 7th between @cshillum and @lisalibrarian on the positioning of GetFTR in relation to link resolvers, and an unanswered question about how GetFTR aligns with library interests
- A Twitter thread started by @TAC_NISO on December 9th looking for more information, with a link to an STM Association presentation added by @aarontay
- A tree of tweets starting from @mrgunn's "[I don't trust publishers to decide] is the crux of the whole thing." In particular, threads off that tweet include Jason Griffey of NISO saying he knew nothing about GetFTR and Bernhard Mittermaier's point about hidden motivations behind GetFTR
- A Twitter thread started by @aarontay on December 7th saying "GetFTR is bad for researchers/readers and librarians. It only benefits publishers, change my mind."
- Lisa Janicke Hinchliffe's December 10th "Why are Librarians Concerned about GetFTR?" in The Scholarly Kitchen; take note of the follow-up discussion in the comments
- A Twitter thread between @alison_mudditt and @lisalibrarian clarifying that PLOS is not on the Advisory Board, with some replies from @TAC_NISO as well
- Ian Mulvany's December 11th "thoughts on GetFTR" on ScholCommsProd
- GetFTR's December 11th "Updating the community" post on their website
- The Spanish Federation of Associations of Archivists, Librarians, Archaeologists, Museologists and Documentalists (ANABAD)'s December 12th "GetFTR: new publishers service to speed up access to research articles" (original in Spanish, Google Translate to English)
- A December 20th news entry from eContent Pro titled "What GetFTR Means for Journal Article Access"; my only quarrel is with this sentence: "Thus, GetFTR is a service where Academic articles are found and provided to you at absolutely no cost." No—if you are in academia, the cost is borne by your library even if you don't see it. But this seems to be a third-party service that isn't directly related to publishers or libraries, so perhaps they can be forgiven for missing that nuance.
- Wiley's Chemistry Views news post on December 26th, titled simply "Get Full Text Research (GetFTR)", is perhaps only notable for the sentence "Growing leakage has steadily eroded the ability of the publishers to monetize the value they create."

If you are looking for a short list of what to look at, I recommend these posts.

GetFTR's Community Update

On December 11—after the two posts I list below—an "Updating the Community" web page was posted to the GetFTR website. From a public relations perspective, it was…interesting.

"We are committed to being open and transparent"

This section goes on to say, "If the community feels we need to add librarians to our advisory group we will certainly do so and we will explore ways to ensure we engage with as many of our librarian stakeholders as possible." If the GetFTR leadership didn't get the indication between December 3 and December 12 that librarians feel strongly about being at the table, then I don't know what will convince them. And it isn't about being on the advisory group; it is about being seen and appreciated as important stakeholders in the research discovery process. I'm not sure who the "community" is in this section, but it is clear that librarians are—at best—an afterthought.
That is not the kind of "open and transparent" that is welcoming. Later on, in the Questions about library link resolvers section, is this sentence:

"We have, or are planning to, consult with existing library advisory boards that participating publishers have, as this enables us to gather views from a significant number of librarians from all over the globe, at a range of different institutions."

As I said in my previous post, I don't know why GetFTR is not engaging with existing cross-community (publisher/technology-supplier/library) organizations to have this discussion. It feels intentional, which colors the perception of what the publishers are trying to accomplish. To be honest, I don't think the publishers are using GetFTR to drive a wedge between library technology service providers (who are needed to make GetFTR a reality for libraries) and libraries themselves. But I can see how that interpretation could be made.

"Understandably, we have been asked about privacy."

I punted on privacy in my previous post, so let's talk about it here. It remains to be seen what is included in the GetFTR API request between the browser and the publisher site. Sure, it needs to include the DOI and a token that identifies the patron's institution. We can inspect that API request to ensure nothing else is included. But the fact that the design of GetFTR has the browser making the call to the publisher site means that the publisher site knows the IP address of the patron's browser, and the IP address can be considered personally identifiable information. This issue could be fixed by having the link resolver or the discovery layer software make the API request, and according to the Questions about library link resolvers section of the community update, this may be under consideration. So, yes, an auditable privacy policy and implementation is key for GetFTR.

"GetFTR is fully committed to supporting third-party aggregators"

This is good to hear. I would love to see more information published about this, including how discipline-specific repositories and institutional repositories can have their holdings represented in GetFTR responses.

My Take-a-ways

In the second-to-last paragraph: "Researchers should have easy, seamless pathways to research, on whatever platform they are using, wherever they are." That is a statement that I think every library could sign onto. This Updating the Community is a good start, but the project has dug a deep hole of trust and it hasn't reached level ground yet.

Lisa Janicke Hinchliffe's "Why are Librarians Concerned about GetFTR?"

Posted on December 10th in The Scholarly Kitchen, Lisa's piece outlines a series of concerns from a librarian perspective. I agree with some of these; others are not an issue in my opinion.

Librarian Concern: The Connection to Seamless Access

Many librarians have expressed a concern about how patron information can leak to the publisher through ill-considered settings at an institution's identity provider. Seamless Access can ease access control because it leverages a campus's single sign-on solution—something that a library patron is likely to be familiar with. If the institution's identity provider is overly permissive in the attributes about a patron that get transmitted to the publisher, then there is a serious risk of tying a user's research activity to their identity, and the bad things that come from that (patrons self-censoring their research paths, commoditization of patron activity, etc.).
I'm serving on a Seamless Access task force that is addressing this issue, and I think there are technical, policy, and education solutions to this concern. In particular, I think some sort of intermediate display of the attributes being transmitted to the publisher is most appropriate.

Librarian Concern: The Limited User Base Enabled

As Lisa points out, the population of institutions that can take advantage of Seamless Access, a prerequisite for GetFTR, is very small and weighted heavily towards well-resourced institutions. To the extent that projects like Seamless Access (spurred on by a desire for GetFTR-like functionality) help with the adoption of SAML-based infrastructure like Shibboleth, the whole academic community benefits from a shared authentication/identity layer that can be assumed to exist.

Librarian Concern: The Insertion of New Stumbling Blocks

Of the issues Lisa mentions here, I'm not concerned about users being redirected to their campus single sign-on system in multiple browsers on multiple machines. This is something we should be training users about: there is a single website to put your username/password into for whatever you are accessing at the institution. That a user might already be logged into the institution's single sign-on system in the course of doing other school work, and so never see a logon screen, is an attractive benefit of this system.

That said, it would be useful for an API call from a library's discovery layer to a publisher's GetFTR endpoint to be able to say, "This is my user. Trust me when I say that they are from this institution." If that were possible, then the Seamless Access Where-Are-You-From service could be bypassed for the GetFTR purpose of determining whether a user's institution has access to an article on the publisher's site. It would sure be nice if librarians were involved in the specification of the underlying protocols early on so these use cases could be offered.

Update

Lisa reached out on Twitter to say (in part): "Issue is GetFTR doesn't redirect and SA doesnt when you are IPauthenticated. Hence user ends up w mishmash of experience." I went back to read her Scholarly Kitchen post and realized I did not fully understand her point. If GetFTR is relying on a Seamless Access token to know which institution a user is coming from, then that token must get into the user's browser. The details we have seen about GetFTR don't address how that Seamless Access institution token is put in the user's browser if the user has not been to the Seamless Access select-your-institution portal. One such case is when the user is coming from an IP-address-authenticated computer on a campus network. Do the GetFTR indicators appear even when the Seamless Access institution token is not stored in the browser? If, at the publisher site, the GetFTR response also uses the institution's IP address table to determine entitlements, what does a user see when they have neither the Seamless Access institution token nor an institution IP address? And, to Lisa's point, how does one explain this disparity to users? Is the situation better if the GetFTR determination is made in the link resolver rather than in the user's browser?

Librarian Concern: Exclusion from Advisory Committee

See the previous section. That librarians are not at the table offering use cases and technical advice means that the developers are likely closing off options that meet library needs. Addressing those needs would ease the acceptance of the GetFTR project as mutually beneficial.
So an emphatic "AGREE!" with Lisa on her points in this section. Publishers—what were you thinking?

Librarian Concern: GetFTR Replacing the Library Link Resolver

Libraries and library technology companies are making significant investments in tools that ease the path from discovery to delivery. Would the library's link resolver benefit from a real-time API call to a publisher's service that determines the direct URL for a specific DOI? Oh, yes—that would be mighty beneficial. The library could put that link right at the top of a series of options that include a link to a version of the article in a Green Open Access repository, redirection to a content aggregator, one-click access to an interlibrary loan form, or even an option where the library purchases a copy of the article on behalf of the patron. (More likely, the link resolver would take the patron right to the article URL supplied by GetFTR, but the library link resolver needs to be in the loop to be able to offer the other options.)

My Take-a-ways

The patron is affiliated with the institution, and the institution (through the library) is subscribing to services from the publisher. The institution's library knows best what options are available to the patron (see the section above). Want to know why librarians are concerned? Because the publishers are inserting themselves as the arbiter of access to content, whether it is in the patron's best interest or not.

It is also useful to reinforce Lisa's closing paragraph:

"Whether GetFTR will act to remediate these concerns remains to be seen. In some cases, I would expect that they will. In others, they may not. Publishers' interests are not always aligned with library interests and they may accept a fraying relationship with the library community as the price to pay to pursue their strategic goals."

Ian Mulvany's "thoughts on GetFTR"

Ian's entire post from December 11th on ScholCommsProd is worth reading. I think it is an insightful look at the technology and its implications. Here are some specific comments:

Clarifying the relation between SeamlessAccess and GetFTR

There are a couple of things that I disagree with:

"OK, so what is the difference, for the user, between seamlessaccess and GetFTR? I think that the difference is the following - with seamless access you the user have to log in to the publisher site. With GetFTR if you are providing pages that contain DOIs (like on a discovery service) to your researchers, you can give them links they can click on that have been setup to get those users direct access to the content. That means as a researcher, so long as the discovery service has you as an authenticated user, you don't need to even think about logins, or publisher access credentials."

To the best of my understanding, this is incorrect. With SeamlessAccess, the user is not "logging into the publisher site." If the publisher site doesn't know who a user is, the user is bounced back to their institution's single sign-on service to authenticate. If the publisher site doesn't know where a user is from, it invokes the SeamlessAccess Where-Are-You-From service to learn which institution's single sign-on service is appropriate for the user. If a user follows a GetFTR-supplied link to a publisher site but doesn't have the necessary authentication token from the institution's single sign-on service, then they will be bounced back for the username/password and redirected to the publisher's site.
GetFTR signaling that an institution is entitled to view an article does not mean the user can get it without proving that they are a member of that institution.

What does this mean for Green Open Access

A key point that Ian raises is this:

"One example of how this could suck, lets imagine that there is a very usable green OA version of an article, but the publisher wants to push me to using some 'e-reader limited functionality version' that requires an account registration, or god forbid a browser exertion, or desktop app. If the publisher shows only this limited utility version, and not the green version, well that sucks."

Oh, yeah…that does suck, and it is because the library—not the publisher of record—is better positioned to know what is best for a particular user.

Will GetFTR be adopted?

Ian asks, "Will google scholar implement this, will other discovery services do so?" I do wonder if GetFTR is big enough to attract the attention of Google Scholar and Microsoft Research. My gut tells me "no": I don't think Google and Microsoft are going to add GetFTR buttons to their search results screens unless they are paid a lot. As for Google Scholar, it is more likely that Google would build something like GetFTR to get the analytics rather than rely on a publisher's version. I'm even more doubtful that the companies pushing GetFTR can convince discovery layer makers to embed GetFTR into their software. Since the two widely adopted discovery layers (in North America, at least) are also aggregators of journal content, I don't see the discovery-layer/aggregator companies devaluing their product by actively pushing users off their site.

My Take-a-ways

It is also useful to reinforce Ian's closing paragraph:

"I have two other recommendations for the GetFTR team. Both relate to building trust. First up, don't list orgs as being on an advisory board, when they are not. Secondly it would be great to learn about the team behind the creation of the Service. At the moment its all very anonymous."

Where Do We Stand?

Wow, I didn't set out to write 2,500 words on this topic. At the start, I was just taking some time to review everything that happened since this was announced at the start of December and see what sense I could make of it. It turned into a literature review of sorts. While GetFTR has some powerful backers, it also has some pretty big blockers:

- Can GetFTR help spur adoption of Seamless Access enough to convince big and small institutions to invest in identity provider infrastructure and single sign-on systems?
- Will GetFTR grab the interest of Google, Google Scholar, and Microsoft Research (where admittedly a lot of article discovery is already happening)?
- Will developers of discovery layers and link resolvers prioritize GetFTR implementation in their services?
- Will libraries find enough value in GetFTR to enable it in their discovery layers and link resolvers? Would libraries argue against GetFTR in learning management systems, faculty profile systems, and other campus systems if their own services cannot be included in GetFTR displays?

I don't know, but I think it is up to the principals behind GetFTR to make more inclusive decisions. The next step is theirs.

Publishers going-it-alone (for now?) with GetFTR

In early December 2019, a group of publishers announced Get-Full-Text-Research, or GetFTR for short. I read about this first in Roger Schonfeld's "Publishers Announce a Major New Service to Plug Leakage" piece in The Scholarly Kitchen, via Jeff Pooley's Twitter thread and blog post.
Details about how this works are thin, so I'm leaning heavily on Roger's description. I'm not as negative about this as Jeff, and I'm probably a little more opinionated than Roger. This is an interesting move by publishers, and—as the title of this post suggests—I am critical of the publishers' "go-it-alone" approach.

First, some disclosure might be in order. My background has me thinking of this in the context of how it impacts libraries and library consortia. For the past four years, I've been co-chair of the NISO Information Discovery and Interchange topic committee (and its predecessor, the "Discovery to Delivery" topic committee), so this is squarely within what I've been thinking about in the broader library-publisher professional space. I also traced the early development of RA21 and more recently am volunteering on the SeamlessAccess Entity Category and Attribute Bundles Working Group; that'll become more important a little further down this post. I was nodding along with Roger's narrative until I stopped short here:

"The five major publishing houses that are the driving forces behind GetFTR are not pursuing this initiative through one of the major industry collaborative bodies. All five are leading members of the STM Association, NISO, ORCID, Crossref, and CHORUS, to name several major industry groups. But rather than working through one of these existing groups, the houses plan instead to launch a new legal entity. While [Vice President of Product Strategy & Partnerships for Wiley Todd] Toler and [Senior Director, Technology Strategy & Partnerships for the American Chemical Society Ralph] Youngen were too politic to go deeply into the details of why this might be, it is clear that the leadership of the large houses have felt a major sense of mismatch between their business priorities on the one hand and the capabilities of these existing industry bodies. At recent industry events, publishing house CEOs have voiced extensive concerns about the lack of cooperation-driven innovation in the sector. For example, Judy Verses from Wiley spoke to this issue in spring 2018, and several executives did so at Frankfurt this fall. In both cases, long standing members of the scholarly publishing sector questioned if these executives perhaps did not realize the extensive collaborations driven through Crossref and ORCID, among others. It is now clear to me that the issue is not a lack of knowledge but rather a concern at the executive level about the perceived inability of existing collaborative vehicles to enable the new strategic directions that publishers feel they must pursue."

This is the publishers going it alone. To hear Roger describe it, they are going to create this web service that allows publishers to determine the appropriate copy for a patron and do it without input from the libraries. Librarians will just be expected to put this web service widget into their discovery services to get "colored buttons indicating that the link will take [patrons] to the version of record, an alternative pathway, or (presumably in rare cases) no access at all." (Let's set aside for the moment the privacy implications of having a fourth-party web service recording all of the individual articles that come up in a patron's search results.)
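To make that contrast concrete, here is a rough sketch of the architecture I would rather see: the library's link resolver or discovery layer makes the entitlement lookup server-side, so the publisher never sees the patron's IP address or search activity, and the library assembles its own ordered list of delivery options. GetFTR has not published an API specification, so the endpoint, parameters, response fields, and helper functions below are all hypothetical placeholders, not GetFTR's actual interface.

```python
# Hypothetical sketch only: the endpoint, parameters, and response shape are invented
# for illustration. The point is architectural: the entitlement lookup happens from the
# library's server, not from a publisher-run script in the patron's browser.
import requests

LOOKUP_ENDPOINT = "https://example.org/entitlement-lookup"  # hypothetical publisher endpoint
INSTITUTION_TOKEN = "example-institution-token"             # identifies the institution, not the patron


def lookup_green_oa(doi):
    """Stub: query an open-access index for a Green OA copy of the article."""
    return []


def lookup_aggregator(doi):
    """Stub: check licensed aggregator holdings in the library's knowledge base."""
    return []


def ill_form_url(doi):
    """Stub: a pre-filled interlibrary loan request form."""
    return f"https://library.example.edu/ill?doi={doi}"


def delivery_options(doi):
    """Return an ordered, library-controlled list of delivery options for a DOI."""
    options = []

    # Server-side entitlement check: the publisher sees the library's server, not the patron.
    resp = requests.get(
        LOOKUP_ENDPOINT,
        params={"doi": doi, "institution": INSTITUTION_TOKEN},
        timeout=3,
    )
    if resp.ok and resp.json().get("entitled"):
        options.append({"label": "Full text from publisher", "url": resp.json().get("url")})

    # The library layers in the alternatives it knows about: options a publisher-run
    # widget has no reason to surface.
    options.extend(lookup_green_oa(doi))
    options.extend(lookup_aggregator(doi))
    options.append({"label": "Request via interlibrary loan", "url": ill_form_url(doi)})
    return options
```

The key design choice is that the library service, not a publisher-controlled button, decides what the patron sees and in what order.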
Librarians will not get to decide the "alternative pathway" that is appropriate for the patron: "Some publishers might choose to provide access to a preprint or a read-only version, perhaps in some cases on some kind of metered basis." (Roger goes on to say that he "expect[s] publishers will typically enable some alternative version for their content, in which case the vast majority of scholarly content will be freely available through publishers even if it is not open access in terms of licensing." I'm not so confident.)

No, thank you. If publishers want to engage in technical work to enable libraries and others to build web services that determine the direct link to an article based on a DOI, then great. Libraries can build a tool that consumes that information as well as takes into account information about preprint services, open access versions, interlibrary loan, and other methods of access. But to ask libraries to accept this publisher-controlled access button in their discovery layers, their learning management systems, their scholarly profile services, and their other tools? That sounds destined for disappointment.

I am only somewhat encouraged by the fact that RA21 started out as a small, isolated collaboration of publishers before they brought in NISO and invited libraries to join the discussion. Did it mean that it slowed down deployment of RA21? Undoubtedly yes. Did persnickety librarians demand transparent discussions and decisions about privacy-related concerns, like what attributes the publisher would get about the patron in the Shibboleth-powered backchannel? Yes, but that was because the patrons weren't there to advocate for themselves. Will it likely mean wider adoption? I'd like to think so. Have publishers learned that forcing these kinds of technologies onto users without consultation is a bad idea? At the moment it would appear not.

Some of what publishers are seeking with GetFTR can be implemented with straight-up OpenURL or—at the very least—limited-scope additions to OpenURL (the Z39.88 open standard!); a sketch of such a link appears at the end of this post. That they didn't start with OpenURL, a robust existing standard, is both concerning and annoying. I'll be watching and listening for points of engagement, so I remain hopeful.

A few words about Jeff Pooley's five-step "laughably creaky and friction-filled effort" that is SeamlessAccess. Many of the steps Jeff describes are invisible and well-established technical protocols. What Jeff fails to take into account is the very visible and friction-filled effect of patrons accessing content beyond the boundaries of campus-recognized internet network addresses. Those patrons get stopped at step two with a "pay $35 please" message. I'm all for removing that barrier entirely by making all published content open access. It is folly to think, though, that researchers and readers can enforce an open access business model on all publishers, so solutions like SeamlessAccess will have a place. (Which is to say nothing of the benefit of inter-institutional resource collaboration opened up by a more widely deployed Shibboleth infrastructure powered by SeamlessAccess.)
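As promised above, here is a small illustration of the OpenURL point: a plain Z39.88-2004 key/encoded-value OpenURL that hands a DOI from a discovery layer to the library's own link resolver, which then decides the appropriate copy for the patron. The resolver base URL and source identifier are placeholders, not any particular institution's.

```python
# A plain Z39.88-2004 (KEV) OpenURL built from a DOI. No new publisher-run service
# is required; the resolver base URL and rfr_id below are placeholders.
from urllib.parse import urlencode

RESOLVER_BASE = "https://resolver.example.edu/openurl"  # placeholder link resolver


def openurl_for_doi(doi, referrer="info:sid/discovery.example.edu"):
    """Build a key/encoded-value OpenURL that passes a DOI to the link resolver."""
    params = {
        "url_ver": "Z39.88-2004",                     # OpenURL 1.0
        "url_ctx_fmt": "info:ofi/fmt:kev:mtx:ctx",    # KEV context object format
        "rft_val_fmt": "info:ofi/fmt:kev:mtx:journal",
        "rft.genre": "article",
        "rft_id": f"info:doi/{doi}",                  # the referent, identified by DOI
        "rfr_id": referrer,                           # which service generated the link
    }
    return f"{RESOLVER_BASE}?{urlencode(params)}"


# Example with an illustrative DOI
print(openurl_for_doi("10.1000/xyz123"))
```

The link resolver receiving this request is where the library can layer in Green Open Access copies, aggregator holdings, interlibrary loan, and any entitlement signal a publisher cares to provide.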
feeds-feedburner-com-1059 ---- None feeds-feedburner-com-1102 ---- None feeds-feedburner-com-1128 ---- None feeds-feedburner-com-1353 ---- HubLog by Alf Eatonsearch Continuous deployment of a web service on Cloud Run March 14, 2021 Creating and deploying a web service using Cloud Run's continuous deployment, GitHub integration and Cloud Build's buildpacks cloud runnode.jsgithubBuilding amd64 Docker images with arm64 (M1) macOS March 7, 2021 Using docker buildx bake to build Docker images for different system architectures docker"git scraping" data from the Office for National Statistics API March 7, 2021 Fetching and publishing regularly-updated data as a web service with GitHub Actions and Datasette github actiondatasettecsvsqliteDocker on a Raspberry Pi 400 December 14, 2020 Using armv8 Docker images on a Raspberry Pi 400 DockerRaspberry PiARMAn Express app as a web service in Cloud Functions July 6, 2020 Deploying a simple web service to Cloud Functions node.jscloud functionsAn Express app as a web service in Cloud Run July 5, 2020 Deploying a simple web service to Cloud Run node.jscloud runA single-author web app hosted on Cloud Run June 13, 2020 Developing, building and deploying a single-author web app blogjavascriptexpressnode.jscloud rungithubSending a raw HTTPS request May 1, 2020 Storing, editing and sending a multipart/form-data request over HTTPS Converting PDF to PNG or JPEG September 13, 2019 Tools and services for converting a page of a PDF to an image How to build a user interface April 4, 2019 The 5 steps of designing a software product May 20, 2018 Designing a user interface for moving data from one state to another OpenID Connect March 13, 2018 A summary of the OpenID Connect protocol and its usage for authentication in an SPA Serving a web application over HTTPS February 16, 2018 Using nginx and LetsEncrypt to serve a web application over HTTPS JANICE: a prototype re-implementation of JANE, using the Semantic Scholar Open Research Corpus January 19, 2018 Formatting a LaCie external drive for Time Machine January 18, 2018 Indexing Semantic Scholar's Open Research Corpus in Elasticsearch January 4, 2018 Building an Elasticsearch index of Semantic Scholar's Open Research Corpus dataset A single-user blog October 22, 2017 Building a simple blog using React and Firebase Recovering from a failed macOS High Sierra upgrade October 18, 2017 OAuth in a Chrome extension October 16, 2017 ES6 export/import August 14, 2017 Exporting/importing/re-exporting ES6 modules Styling and theming React Components August 10, 2017 Using CSS in JS to style and theme React components async is more than await April 20, 2017 Symfony Forms March 23, 2017 Symfony is best at allowing users to apply mutations to resources via HTML forms Polymer + Firebase Makefile October 17, 2016 A Makefile for deploying Polymer apps to Firebase Distributed Consensus April 1, 2016 What Aaron understood September 11, 2015 What colour is a tree? 
September 11, 2015 Collections of items in time and space Fetching Web Resources September 10, 2015 Using Resource and Collection interfaces to retrieve data from the web Quantifying journals September 10, 2015 Metrics for scoring and ranking journals It's a shame about Google Plus September 10, 2015 URLs for people Distributed Asynchronous Composable Resources September 10, 2015 Filling out data tables using promises and computed properties Access-Control-Allow-Origin: * April 21, 2015 Add the Access-Control-Allow-Origin: * header to the data you publish No More Documents April 19, 2015 Client-side XML validation in JavaScript April 18, 2015 Using an Emscripten port of xmllint to validate XML against a DTD in a web browser. Organising, building and deploying static web sites/applications March 1, 2015 Using Jekyll (remote or local) or Yeoman (local) to build, serve and deploy a GitHub Pages site or application Visualising political donations February 15, 2015 Using Tableau Public to visualise donations to UK political parties Force-directed tag clouds February 15, 2015 Using artists as the dark matter in a graph of tags, to visualise the thematic content of radio shows Exploring a personal Twitter network January 25, 2015 Using Gephi to create a network graph showing the most highly-connected Twitter friends of those I follow. Searching for mergeable tables January 12, 2015 Finding tabular data sets that can be merged, using URLs for data types UK Prospective Parliamentary Candidates January 4, 2015 The people who will be standing as candidates in the 2015 General Election Creating a map of Grade I listed buildings January 4, 2015 Filtering an Environment Agency Shapefile to create a custom map UK parliamentary constituencies January 3, 2015 Boundaries, names and codes of the UK's parliamentary constituencies The trouble with scientific software December 31, 2014 Scientific software is often opaque, and difficult to obtain and cite Archiving and displaying tweets with dat September 18, 2014 Don't just publish JSON-LD June 19, 2014 Publish plain, simple JSON, with a linked context document for consumers that want it vege-table: the data table that grows, with leaves May 16, 2014 The easiest, most resourceful way to harvest, explore and publish a collection of data. 
Line-oriented data formats February 26, 2014 Iterating Arrays February 20, 2014 JavaScript methods for iterating arrays Publishing research on the web January 27, 2014 Two examples of publishing code, data and a human-readable report jQuery Microdata January 13, 2014 A jQuery plugin for working with HTML Microdata Creating printable cards with HTML and CSS December 22, 2013 Use HTML and CSS to fill a printed card with content Post-humanist technology December 19, 2013 If you can't tell why a technology would be useful to you, it's for the robots Collecting article metrics with OpenRefine December 16, 2013 Using OpenRefine to collect article metrics data JSON templates December 16, 2013 Using JSON templates to describe objects and query by example JSON-LD December 12, 2013 Using context documents to map local property names to shared URLs CSV on the Web, with PHP December 12, 2013 Fetching, parsing and publishing CSV Publishing, Versioning and Persistence December 12, 2013 Some rules for publishing a resource online SELECT * FROM WEB December 11, 2013 OK Guha Describing Objects December 11, 2013 Using names and classes as shorthand for object properties Switching off HubMed's RSS and Atom feeds August 14, 2013 HubMed's RSS and Atom feeds are discontinued Web Components July 7, 2013 Using Web Components to define custom HTML elements Internet Surveillance June 10, 2013 Methods of gathering information from the internet. Citing Articles Within Articles March 2, 2013 HTML markup for inline citations in scholarly articles Open, Social, Academic Bookmarking: Save to App.net February 5, 2013 Using App.net's File API to create an open, personal reading library. HTML metadata for journal articles November 28, 2012 A summary of ontologies for describing journal articles Ten years of HubMed November 28, 2012 An overview of the ten years since HubMed was created Publishing a podcast using Google Drive (in theory) September 20, 2012 Generate a podcast feed for audio files stored on Google Drive, using Apps Script and Yahoo Pipes Publishing Articles Using Gists September 5, 2012 Introducing macrodocs.org, a client-side renderer for articles stored in Gists Music Seeds and More Like These August 17, 2012 Sources for music recommendation; querying by example Querying Data Sets using Google BigQuery August 17, 2012 Using Google Fusion Tables to provide an API to data files August 15, 2012 Resourceful Web Interfaces August 2, 2012 Classlessness June 26, 2012 A Resourceful Alternative to OAI-PMH June 4, 2012 Adding Files to Google Drive using PHP May 1, 2012 Working with the Harvard Library Bibliographic Dataset April 27, 2012 BBC Radio -> XSPF Bookmarklet March 12, 2012 How To Text Mine Open Access Documents February 22, 2012 Open Access Author Manuscripts in PubMed Central February 20, 2012 ISSN(L)s And Serial Title Abbreviations February 9, 2012 Extracting Text From A PDF Using Only Javascript November 18, 2011 Open Graph wins the Semantic Web September 29, 2011 Citing With URIs in Google Docs September 16, 2011 Client-Side PubMed Searching July 23, 2011 Capturing a manipulated web page with PhantomJS March 25, 2011 This Weblog In (Some) URLs March 6, 2011 A Modular System for Automatic Entity Extraction and Manual Annotation of Academic Papers February 3, 2011 Getting and Sending Binary Files with XMLHttpRequest December 15, 2010 AOTY 2010 November 18, 2010 ReCo: a music recommender October 18, 2010 Artists October 7, 2010 Creating a single file, lossless rip of a DVD chapter in Ubuntu August 22, 2010 
London Cycle Hire data/apps August 7, 2010 Writing Firefox Add-ons with the JetPack SDK July 31, 2010 UK Fuel Consumption for Energy Use July 1, 2010 Current UK Reservoir Stocks July 1, 2010 eCryptfs in Ubuntu (Lucid) June 27, 2010 Using STIX fonts with @font-face June 10, 2010 Inline annotations/formatting in HTML May 20, 2010 Command line Twitter authentication using the PECL OAuth library May 20, 2010 Automatically mounting a remote directory in Ubuntu using autofs + sshfs May 15, 2010 A Simple Hit Counter with Node.js and Redis May 13, 2010 Voting Correlation (UK General Election 2010) May 9, 2010 UK General Election 2010 May 8, 2010 Installing PHP 5.3 etc on Ubuntu Karmic (9.10) May 5, 2010 Maps at the British Library, and on the BBC May 4, 2010 mapstvBillions April 15, 2010 Archiving Timestamped Copies of Bookmarked Web Content March 30, 2010 A WSDL 2.0 description of the EUtils EFetch web service March 28, 2010 phpschemaxmlREST Web Services, XML and Data Typing March 27, 2010 phpxmlGoogle Bookmarks Lists March 24, 2010 googlelistsmapsA Solr index of Wikipedia on EC2/EBS March 17, 2010 ec2lucenesolrMapping XML Named Character References to Unicode Characters March 16, 2010 A Pipe for New Episodes in a BBC series March 8, 2010 bbce4xjavascriptpipesrdfxmlyahooyqlIndependent UK Record Labels on Spotify March 4, 2010 Adding Spotify links to BBC Radio playlists, via RDFa, using Greasemonkey and rdfQuery March 2, 2010 Indexing JSON data in MongoDB using PHP February 23, 2010 Showing Delicious bookmarks of pages within a domain February 19, 2010 ElasticSearch in PHP February 16, 2010 Describing REST APIs with HTML5 forms February 2, 2010 The Top Google Search Result for each Unicode Character January 22, 2010 Listing Unicode Characters January 22, 2010 Spotify Playlist: The Hype Machine Top 1000 Albums of 2009 January 21, 2010 On A Bus updated January 19, 2010 Using the Bing Maps Web Services in PHP January 19, 2010 Publishing Files using a Public Folder in Google Docs January 12, 2010 Web Applications January 11, 2010 Installing platform-specific applications January 11, 2010 Operating Systems and Application Launching January 11, 2010 An OS X Single Site Browser with HTML5 Storage Support? January 6, 2010 A Basic Web App With A Settings Page, Using jQTouch and PHP January 5, 2010 OpenURL + OpenSearch January 4, 2010 Map Overlays January 2, 2010 mapsThird-Party Cookies December 23, 2009 Spotify lookup and Playdar in AOTY December 10, 2009 aotyplaydarspotifyOpenSearch + YQL December 10, 2009 Importing GeoPlanet data into MySQL December 10, 2009 Semantic Assistants November 16, 2009 Text Mining November 16, 2009 SoyLatte: Java 1.6 for 32-bit OS X November 11, 2009 javaosxTransforming XML files with XSLT 2.0 and Saxon-HE on OS X, using an XML catalog October 26, 2009 xmlLatest NPG articles in PubMed Central October 22, 2009 Exploring PubChem via SPARQL October 7, 2009 Bacode October 7, 2009 Yahoo! 
APIs Terms of Use changed October 2, 2009 Using PubMed's autocomplete data in JQuery September 30, 2009 HTML template September 30, 2009 htmlSheevaPlug as a Torrent Seed Box September 19, 2009 Graphing weather time series data with Timetric September 10, 2009 dataweatherConverting PDF to PNG using ImageMagick or Ghostscript August 20, 2009 The Music Industry (version) August 11, 2009 QR code testing on the iPhone August 7, 2009 Embedding chemical structure information in image files August 6, 2009 applescriptchemistryUsing the Tesco API with PHP August 5, 2009 apiphpTopic Modelling with MALLET August 3, 2009 Travel with an iPhone August 2, 2009 iphonetravelMarking up a bibliographic reference with RDFa July 30, 2009 Entities in Scientific News Stories June 18, 2009 onabus.com June 16, 2009 Annotation of Scientific Articles June 14, 2009 annotationNow Playing in Songbird June 14, 2009 musicnow-playingsongbirdA Private Radio Archive June 14, 2009 notuberadioDealing with election results data June 11, 2009 Adding Bing search results to Google June 4, 2009 Extracting keyphrases from documents using MeSH terms and KEA June 1, 2009 Scraping with YQL Execute June 1, 2009 scrapingClustering documents with CLUTO May 28, 2009 Exploring an OAI-PMH repository May 26, 2009 oaiYahoo! PlaceMaker May 21, 2009 apilocationyahooFetching article citation counts from Web of Science May 21, 2009 apiPHP, DOM, DTDs and named entities May 20, 2009 phpxmlPHP, DOM and XML encodings May 20, 2009 phpxmlRecording video from a webcam in Ubuntu May 17, 2009 ubuntuvideoQuerying BBC programmes in a Talis data store May 15, 2009 bbcrdfuriOAI, YQL and JSON May 14, 2009 phpyqlWhat's the Unicode character for "irony"? May 7, 2009 Updating local copies of databases and ontologies May 6, 2009 Server-side DOM scraping with Javascript: options April 29, 2009 domjavascriptSolr/Lucene on EC2/EBS April 20, 2009 ec2lucenesolrInstalling CouchDB from source on OS X April 17, 2009 Everything? April 7, 2009 Playdar as an OpenURL resolver? 
April 3, 2009 audiocoinsopenurlplaydarresolutionGraph of new albums added to Spotify April 1, 2009 Analysing 'science' bookmarks in Delicious March 29, 2009 deliciousPosting shared items from Google Reader to Delicious March 29, 2009 deliciousphpResolving URLs with PHP March 29, 2009 phpFinding all occurrences of a UTF-8-encoded needle in a UTF-8-encoded haystack March 25, 2009 phpUsing YQL and Pipes to make a screensaver of The Big Picture March 25, 2009 pipesyqlPages tagged as 'science' on Delicious, by co-tags March 21, 2009 delicioussciencePopular pages tagged as 'science' on Delicious March 21, 2009 deliciousscienceSelecting Wikipedia articles by InChI March 20, 2009 chemistryinchirdfContent Hashing March 19, 2009 similarityYQL Open Data Tables March 16, 2009 scrapingyqlFestive 50 Spotify Playlists March 15, 2009 playlistsradioxspfTfL feeds March 13, 2009 Semantic/Scientific Authoring Add-ins for Microsoft Word March 13, 2009 publishingsemanticData, Science and Stories March 12, 2009 dataMusic Recipe March 12, 2009 musicDelicious Network Meme Tracker March 12, 2009 deliciousComparing similar articles and categorisation with Wikipedia March 12, 2009 Fetching articles from the NY Times API March 11, 2009 apiGuardian + Lucene = Similar Articles + Categorisation March 10, 2009 Guardian Open Platform March 10, 2009 apiCloudMade February 23, 2009 mapsAn open question to authors of text mining tools February 22, 2009 text-miningHTML + WMV -> XSPF + MP4 February 21, 2009 phpvideoAnalysing the ticTOCs collection of journal TOC feeds February 18, 2009 Freebase: Types, Topics, Timelines and Mentions February 7, 2009 freebaseontologyGoogle, jQuery and plugin loading February 3, 2009 googlejavascriptjqueryYouMomus February 2, 2009 BigMaps with Modest Maps January 31, 2009 mapsQuestion for a map January 31, 2009 mapsBigMaps with CutyCapt and Xvfb January 30, 2009 ecsstract: Scraping in XULRunner with JSON/CSS selectors January 29, 2009 Generating Standard Chemical Identifiers (Standard InChI) January 22, 2009 PubMed XML in eXist on OS X January 21, 2009 Difficult Album Titles of 2008 January 19, 2009 musicPrivacy online: prevent tracking using Adblock Plus' site-specific filters January 19, 2009 adblockprivacyExtracting a certificate/key pair from a Java keystore January 19, 2009 Spotified SXSW Catalog January 19, 2009 greasemonkeyspotifyDefining scraper mappings using CSS selectors January 19, 2009 An Annotated Timeline of U.S. Public Debt, using Google Spreadsheet and Google Calendar January 16, 2009 dataGenerative art in Second Life January 13, 2009 Installing an independent PHP 5.3 to run from the command line January 13, 2009 phpNotes on using the Ubuntu EC2 AMI January 13, 2009 Events! 
January 11, 2009 Radio Now January 6, 2009 iplayerradioUbuntu on EC2 January 5, 2009 Displaying new episodes from BBC iPlayer January 4, 2009 bbciplayertvZemanta API January 2, 2009 Making a Lucene index of Wikipedia for MoreLikeThis queries January 2, 2009 lucenephpwikipediaAlbums of the Year collages January 1, 2009 musicEnd-of-year TV (UK only) January 1, 2009 tvSkyrails December 19, 2008 graphnetworkvisualisationSpotification December 19, 2008 greasemonkeyUniProt / RDF / SPARQL December 19, 2008 rdfuniprotGetting a visitor's location (city) December 15, 2008 Browse My Privates December 10, 2008 Firefox 3.1, maxVersion for extensions December 10, 2008 firefoxAlbums of the Year 2008 December 10, 2008 nokeepalive December 9, 2008 POTAtoo December 6, 2008 greasemonkeySongbird links and bookmarks December 3, 2008 Libxml2, PHP and UTF-8 December 2, 2008 The 16 Most Interesting Regions in Second Life November 24, 2008 secondlifeOn A Bus November 21, 2008 iphonemapstransportSecond Life person pseudo-APIs November 17, 2008 Second Life region APIs November 13, 2008 secondlifeSecond Life BigMap November 11, 2008 Mouse coordinates bookmarklet November 11, 2008 bookmarkletEncoding AAC/MP4 audio files on OS X November 11, 2008 audioosxInline Wikipedia History, updated November 11, 2008 greasemonkeywikipediaIntrepid vs NVIDIA November 9, 2008 ubuntuNational Public Transport Data Repository data November 9, 2008 transportRoyal Mail PAF data November 9, 2008 dataTransport Direct API November 9, 2008 apiphptransportGetting Started in Second Life November 8, 2008 secondlifeJSONP, Google Spreadsheet security October 29, 2008 securityUIMA October 28, 2008 Minimal PHP script for downloading PubMed XML October 23, 2008 phppubmedMinimal PHP script for downloading PubMed XML (with error checking) October 23, 2008 phppubmedHuffduffer October 22, 2008 Who Cares About Open Access October 21, 2008 publishingscienceSecond Life: "Teleport to Camera Position" October 9, 2008 second lifeVideo Encoding Recommendations October 2, 2008 videoMaximise OS X windows with a keyboard shortcut September 26, 2008 Query Parameters in URIs September 26, 2008 Logout/Login CSRF September 24, 2008 Pure Data September 22, 2008 audiopuredataPlaylist Builder using Freebase Suggest September 22, 2008 freebasemetadataWeb Playlist Tool September 22, 2008 mediaplaylistsPreprints and Categorisation September 18, 2008 Creating a Freebase data view September 18, 2008 datafreebaseRasmus Lerdorf on PHP performance September 14, 2008 phpUbiquity PubMed search September 12, 2008 Audacity September 12, 2008 PHP, SimpleXML, XPath and namespaced attributes September 11, 2008 Removing 'for each' from Javascript examples September 10, 2008 PubMed JSON API September 8, 2008 apijavascriptjsonpubmedLinux music players: compilations and watching folders September 7, 2008 audiolinuxBBC AOD Filter Pipe September 4, 2008 audiobbcpipesGoPubMed export API September 4, 2008 Ubiquity commands September 2, 2008 National Rail bus service sparklines September 1, 2008 businfographictransportGmail MenuExtra SSB in Fluid August 29, 2008 Veodia August 28, 2008 second lifevideoPulseAudio resampling August 28, 2008 audioprojectM-pulseaudio August 27, 2008 audioubuntu,visualisationLondon: cycling and walking route maps August 26, 2008 London: Visitors bus map, mobile TFL August 26, 2008 PulseAudio voodoo August 24, 2008 audioubuntuUKPA negotiates a licence for commercial music podcasting August 23, 2008 podcastGeoNames NearbyWikipedia API August 21, 2008 
apijavascriptjquerylocationwikipediaAideRSS PostRank API August 21, 2008 apijqueryUK Postcode -> Bus Stop prototype August 20, 2008 maptransport400,000 bus stops August 20, 2008 locationphpFull-text feeds as a route around censorship August 20, 2008 feedsEPUB and Stanza August 20, 2008 epubliteraturepdfMendeley August 19, 2008 bibliographypdfCreating MapTube maps with Neighbourhood Statistics data August 19, 2008 mapsAn Amazon Wishlist Competition/Contest August 18, 2008 amazoncompetitionwishlistListen Later updated August 18, 2008 audiobbcextensionfirefoxMobile bus departures August 18, 2008 locationtransportUpload to Google Docs bookmarklet August 18, 2008 bookmarkletgoogleGrowl Alerts for Gmail Messages from Address Book August 18, 2008 applescriptgmailRadio 4 Comedy Feeds August 18, 2008 bbcradioSending a URL from Safari to Firefox August 18, 2008 applescriptosxWriting an Atom feed in PHP 5 August 13, 2008 atomphpHow to Share a Social Network August 12, 2008 portabilityprivacyLocatory August 11, 2008 Britain From Above August 10, 2008 bbctvMeta-TV August 8, 2008 tviPhone reader for Google Reader Starred Items August 8, 2008 feedsgoogleiphoneFree August 6, 2008 iphoneCOUNT/DISTINCT queries August 5, 2008 mysqlrdfxqueryManipulating Forms in Google Spreadsheets August 4, 2008 googlespreadsheetsTiddlyWiki zoomable interface August 4, 2008 tiddlywikiiPhone interface for Delicious Network August 4, 2008 deliciousiphoneMapping Statistics mini-presentation at BarCamb August 4, 2008 mapsDS Game Classics July 29, 2008 dsgamesLondon Age Distribution Maps Part 2 July 29, 2008 mapsLondon Age Distribution Maps July 29, 2008 mapsdata URI for a Google search box July 26, 2008 googleiphoneSecurity Email Addresses that are Black Holes July 26, 2008 securityBeware of the App July 26, 2008 iphonesecurityXMPP comments July 26, 2008 xmppUpcoming API (PHP5) July 24, 2008 apiphpSend PDFs from Skim to Gmail July 24, 2008 applescriptgmailpdfListen Direct /programmes July 17, 2008 bbcgreasemonkeyradioInstalling Java Advanced Imaging in Ubuntu Hardy July 12, 2008 javaubuntuConvert AMR files to WAV in Ubuntu Hardy July 9, 2008 audiolinuxubuntuNeighbourhood Statistics API July 4, 2008 apiphpsoapOrdnance Survey-based BigMap of the UK July 2, 2008 mapArtist -> BBC Radio Shows lookup June 24, 2008 bbcmusicphpradioSoul Bubbles June 22, 2008 gamesEssential Add-ons for Firefox 3 June 19, 2008 extensionsfirefoxChris Wetherell on Google Reader June 18, 2008 feedsgooglePod-U-Like June 17, 2008 app-enginepodcastsUsing Google to Fetch All of a Feed's Items June 17, 2008 apifeedsgooglephpFirefox, OpenSearch and Autocomplete June 14, 2008 firefoxopensearchOpenCalais API June 11, 2008 apiSkim: Open All With Papers June 9, 2008 applescriptosxpdfTumblr Auto-Pager June 9, 2008 greasemonkeyWebClipCountUpDown June 7, 2008 csssafariCreate a calendar from del.icio.us bookmarks June 6, 2008 calendardel.icio.usPod News June 5, 2008 podcastsBringing a publisher's content to the Life Science researcher May 29, 2008 presentationpublishingslidestext-miningThe rules of Web 3.0 May 28, 2008 Publications May 28, 2008 app-enginemedlineopensocialpublicationspythonOn the Rain-Slick Precipice of Darkness, from Penny Arcade May 25, 2008 gameOpenSocial terminology May 24, 2008 opensocialMy Speediest Gatherers updated May 21, 2008 del.icio.usphpUpgrading to Gmail May 11, 2008 emailWith or Without UIDs May 8, 2008 metadatapresentationsearchslidesxtechI'm Feeling Unlucky May 8, 2008 googlegreasemonkeyMeta Latest May 7, 2008 searchDealing with corrupt 
[HubLog post archive listing: titles, dates, and tags for the blog's entries, running from May 2008 back to January 2003. Recurring topics include HubMed and PubMed tools, citation and bibliographic metadata (OpenURL, unAPI, COinS, BibTeX, Zotero), Drupal, Firefox extensions and Greasemonkey scripts, bookmarklets, Atom/RSS feeds, XMPP, music tools (Amarok, Last.fm, Audioscrobbler, MusicBrainz, playlists, podcasts), OS X, Ubuntu and Linux, del.icio.us, TouchGraph visualisations, web security, and open access publishing.]
January 14, 2003 biomedicalknowledgeliteraturemanagementSafari, TouchGraph update January 11, 2003 touchgraphSFX Lookup bookmarklet January 10, 2003 bookmarkletsfx03-01-08: Perl scripts for organising PDFs January 9, 2003 acrobatperl03-01-03: Library Lookup ISSN bookmarklet January 9, 2003 bookmarkletlibrarylookup03-01-03: Experimental links January 9, 2003 citationdoi02-12-20: Gnutella P2P January 9, 2003 gnutellamagnetp2p02-12-16: BibTex output January 9, 2003 bibtexpubmed02-12-08: Endnote and RIS import filters January 9, 2003 endnoteexportpubmed02-12-04: Related Articles algorithm January 9, 2003 articlespubmedrelated02-12-03: LinkOut URLs January 9, 2003 fulltextlinkoutpubmed02-12-02: PubMed Javascript January 9, 2003 javascriptpubmed02-11-25: HubMed online. January 9, 2003 perlpubmedutilitiesxml feeds-feedburner-com-1367 ---- None feeds-feedburner-com-1383 ---- None feeds-feedburner-com-1396 ---- None feeds-feedburner-com-139 ---- None feeds-feedburner-com-1472 ---- None feeds-feedburner-com-1551 ---- None feeds-feedburner-com-1654 ---- None feeds-feedburner-com-1780 ---- None feeds-feedburner-com-1820 ---- None feeds-feedburner-com-1864 ---- None feeds-feedburner-com-2000 ---- None feeds-feedburner-com-2010 ---- None feeds-feedburner-com-2109 ---- None feeds-feedburner-com-2265 ---- None feeds-feedburner-com-2282 ---- None feeds-feedburner-com-2544 ---- None feeds-feedburner-com-2558 ---- None feeds-feedburner-com-2565 ---- None feeds-feedburner-com-2587 ---- None feeds-feedburner-com-2638 ---- None feeds-feedburner-com-2805 ---- None feeds-feedburner-com-2898 ---- None feeds-feedburner-com-2907 ---- None feeds-feedburner-com-2947 ---- None feeds-feedburner-com-2968 ---- None feeds-feedburner-com-3096 ---- None feeds-feedburner-com-31 ---- None feeds-feedburner-com-3212 ---- None feeds-feedburner-com-3225 ---- None feeds-feedburner-com-3374 ---- None feeds-feedburner-com-3463 ---- None feeds-feedburner-com-34 ---- None feeds-feedburner-com-3542 ---- None feeds-feedburner-com-3635 ---- None feeds-feedburner-com-3711 ---- None feeds-feedburner-com-389 ---- None feeds-feedburner-com-4034 ---- None feeds-feedburner-com-405 ---- What I Learned Today… What I Learned Today… Taking a Break I’m sure those of you who are still reading have noticed that I haven’t been updating this site much in the past few years. I was sharing my links with you all but now Delicious has started adding ads to that. I’m going to rethink how I can use this site effectively going forward. For […] Bookmarks for May 3, 2016 Today I found the following resources and bookmarked them on Delicious. Start A Fire Grow and expand your audience by recommending your content within any link you share Digest powered by RSS Digest Bookmarks for April 4, 2016 Today I found the following resources and bookmarked them on Delicious. Mattermost Mattermost is an open source, self-hosted Slack-alternative mBlock Program your app, Arduino projects and robots by dragging & dropping Fidus Writer Fidus Writer is an online collaborative editor especially made for academics who need to use citations and/or formulas. Beek Social network for […] Bookmarks for February 25, 2016 Today I found the following resources and bookmarked them on Delicious. Connfa Open Source iOS & Android App for Conferences & Events Paperless Scan, index, and archive all of your paper documents Foss2Serve Foss2serve promotes student learning via participation in humanitarian Free and Open Source Software (FOSS) projects. 
Disk Inventory X Disk Inventory X is […] Bookmarks for January 9, 2016 Today I found the following resources and bookmarked them on Delicious. Superpowers The open source, extensible, collaborative HTML5 2D+3D game maker Sequel Pro Sequel Pro is a fast, easy-to-use Mac database management application for working with MySQL databases. Digest powered by RSS Digest Bookmarks for December 11, 2015 Today I found the following resources and bookmarked them on Delicious. Open Broadcaster Software Free, open source software for live streaming and recording Digest powered by RSS Digest Bookmarks for November 22, 2015 Today I found the following resources and bookmarked them on Delicious. NumFOCUS Foundation NumFOCUS promotes and supports the ongoing research and development of open-source computing tools through educational, community, and public channels. Digest powered by RSS Digest Bookmarks for November 16, 2015 Today I found the following resources and bookmarked them on Delicious. Smore Smore makes it easy to design beautiful and effective online flyers and newsletters. Ninite Install and Update All Your Programs at Once Digest powered by RSS Digest Bookmarks for November 13, 2015 Today I found the following resources and bookmarked them on Delicious. VIM Adventures Learning VIM while playing a game Digest powered by RSS Digest Bookmarks for November 10, 2015 Today I found the following resources and bookmarked them on Delicious. Star Wars: Building a Galaxy with Code Digest powered by RSS Digest Bookmarks for October 31, 2015 Today I found the following resources and bookmarked them on Delicious. Open Food Facts Open Food Facts gathers information and data on food products from around the world. Digest powered by RSS Digest Bookmarks for October 27, 2015 Today I found the following resources and bookmarked them on Delicious. VersionPress WordPress meets Git, properly. Undo anything (including database changes), clone & merge your sites, maintain efficient backups, all with unmatched simplicity. Digest powered by RSS Digest Bookmarks for October 20, 2015 Today I found the following resources and bookmarked them on Delicious. SOGo Share your calendars, address books and mails in your community with a completely free and open source solution. Let your Mozilla Thunderbird/Lightning, Microsoft Outlook, Android, Apple iCal/iPhone and BlackBerry users collaborate using a modern platform. GitBook GitBook is a modern publishing toolchain. Making […] Bookmarks for October 19, 2015 Today I found the following resources and bookmarked them on Delicious. Discourse Discourse is the 100% open source discussion platform built for the next decade of the Internet. It works as a mailing list, a discussion forum, and a long-form chat room Digest powered by RSS Digest Bookmarks for September 28, 2015 Today I found the following resources and bookmarked them on Delicious. Zulip A group chat application optimized for software development teams Digest powered by RSS Digest Bookmarks for September 25, 2015 Today I found the following resources and bookmarked them on Delicious. iDoneThis Reply to an evening email reminder with what you did that day. The next day, get a digest with what everyone on the team got done. Digest powered by RSS Digest Bookmarks for September 22, 2015 Today I found the following resources and bookmarked them on Delicious. Vector Vector is a new, fully open source communication and collaboration tool we’ve developed that’s open, secure and interoperable. 
Based on the concept of rooms and participants, it combines a great user interface with all core functions we need (chat, file transfer, VoIP and […] Bookmarks for September 11, 2015 Today I found the following resources and bookmarked them on Delicious. Roundcube Free and Open Source Webmail Software Bolt Bolt is an open source Content Management Tool, which strives to be as simple and straightforward as possible. It is quick to set up, easy to configure, uses elegant templates, and above all: It’s a joy […] Bookmarks for September 10, 2015 Today I found the following resources and bookmarked them on Delicious. MadEye MadEye is a collaborative web editor backed by your filesystem. Digest powered by RSS Digest Bookmarks for September 6, 2015 Today I found the following resources and bookmarked them on Delicious. Gimlet Your library’s questions and answers put to their best use. Know when your desk will be busy. Everyone on your staff can find answers to difficult questions. Digest powered by RSS Digest Bookmarks for September 2, 2015 Today I found the following resources and bookmarked them on Delicious. Thimble by Mozilla Thimble is an online code editor that makes it easy to create and publish your own web pages while learning HTML, CSS & JavaScript. Google Coder a simple way to make web stuff on Raspberry Pi Digest powered by RSS Digest Bookmarks for August 23, 2015 Today I found the following resources and bookmarked them on Delicious. MediaGoblin MediaGoblin is a free software media publishing platform that anyone can run. You can think of it as a decentralized alternative to Flickr, YouTube, SoundCloud, etc. The Architecture of Open Source Applications A web whiteboard A Web Whiteboard is touch-friendly online whiteboard app […] Bookmarks for August 6, 2015 Today I found the following resources and bookmarked them on Delicious. Computer Science Learning Opportunities We have developed a range of resources, programs, scholarships, and grant opportunities to engage students and educators around the world interested in computer science. Digest powered by RSS Digest Bookmarks for August 3, 2015 Today I found the following resources and bookmarked them on Delicious. Pydio The mature open source alternative to Dropbox and box.net Digest powered by RSS Digest Bookmarks for July 23, 2015 Today I found the following resources and bookmarked them on Delicious. 
hylafax The world’s most advanced open source fax server Digest powered by RSS Digest feeds-feedburner-com-4092 ---- None feeds-feedburner-com-4179 ---- None feeds-feedburner-com-4232 ---- None feeds-feedburner-com-4330 ---- None feeds-feedburner-com-4354 ---- None feeds-feedburner-com-4356 ---- None feeds-feedburner-com-4386 ---- None feeds-feedburner-com-4399 ---- None feeds-feedburner-com-4413 ---- None feeds-feedburner-com-4502 ---- None feeds-feedburner-com-4552 ---- None feeds-feedburner-com-4684 ---- None feeds-feedburner-com-4718 ---- None feeds-feedburner-com-4815 ---- None feeds-feedburner-com-4819 ---- None feeds-feedburner-com-4909 ---- None feeds-feedburner-com-4915 ---- None feeds-feedburner-com-4919 ---- None feeds-feedburner-com-5344 ---- None feeds-feedburner-com-5357 ---- None feeds-feedburner-com-5408 ---- None feeds-feedburner-com-5455 ---- None feeds-feedburner-com-5456 ---- None feeds-feedburner-com-5594 ---- None feeds-feedburner-com-5610 ---- None feeds-feedburner-com-5675 ---- None feeds-feedburner-com-5786 ---- None feeds-feedburner-com-5871 ---- None feeds-feedburner-com-5993 ---- None feeds-feedburner-com-6084 ---- None feeds-feedburner-com-6133 ---- None feeds-feedburner-com-6161 ---- None feeds-feedburner-com-6178 ---- None feeds-feedburner-com-6184 ---- commonplace.net commonplace.net Data. The final frontier. Infrastructure for heritage institutions – ARK PID’s In the Digital Infrastructure program at the Library of the University of Amsterdam we have reached a first milestone. In my previous post in the Infrastructure for heritage institutions series, “Change of course“, I mentioned the coming implementation of ARK persistent identifiers for our collection objects. Since November 3, 2020, ARK PID’s are available for our university library Alma catalogue through the Primo user interface. Implementation of ARK PID’s for the other collection description systems […] Infrastructure for heritage institutions – change of course In July 2019 I published the first post about our planning to realise a “coherent and future proof digital infrastructure” for the Library of the University of Amsterdam. In February I reported on the first results. As frequently happens, since then the conditions have changed, and naturally we had to adapt the direction we are following to achieve our goals. In other words: a change of course, of course.  Projects  I will leave aside the […] Infrastructure for heritage institutions – first results In July 2019 I published the post Infrastructure for heritage institutions in which I described our planning to realise a “coherent and future proof digital infrastructure” for the Library of the University of Amsterdam. Time to look back: how far have we come? And time to look forward: what’s in store for the near future? Ongoing activities I mentioned three “currently ongoing activities”:  Monitoring and advising on infrastructural aspects of new projects Maintaining a structured dynamic overview […] Infrastructure for heritage institutions During my vacation I saw this tweet by LIBER about topics to address, as suggested by the participants of the LIBER 2019 conference in Dublin: It shows a word cloud (yes, a word cloud) containing a large number of terms. 
I list the ones I can read without zooming in (so the most suggested ones, I guess), more or less grouped thematically: Open science, Open data, Open access, Licensing, Copyrights, Linked open data, Open education, Citizen science, Scholarly communication, Digital humanities/DH, Digital scholarship, Research assessment, Research […] Ten years linked open data This post is the English translation of my original article in Dutch, published in META (2016-3), the Flemish journal for information professionals. Ten years after the term “linked data” was introduced by Tim Berners-Lee it appears to be time to take stock of the impact of linked data for libraries and other heritage institutions in the past and in the future. I will do this from a personal historical perspective, as a library technology professional, […] Maps, dictionaries and guidebooks Interoperability in heterogeneous library data landscapes Libraries have to deal with a highly opaque landscape of heterogeneous data sources, data types, data formats, data flows, data transformations and data redundancies, which I have earlier characterized as a “data maze”. The level and magnitude of this opacity and heterogeneity varies with the amount of content types and the number of services that the library is responsible for. Academic and national libraries are possibly dealing with more […] Standard deviations in data modeling, mapping and manipulation Or: Anything goes. What are we thinking? An impression of ELAG 2015 This year’s ELAG conference in Stockholm was one of many questions. Not only the usual questions following each presentation (always elicited in the form of yet another question: “Any questions?”). But also philosophical ones (Why? What?). And practical ones (What time? Where? How? How much?). And there were some answers too, fortunately. This is my rather personal impression of the event. For a […] Analysing library data flows for efficient innovation In my work at the Library of the University of Amsterdam I am currently taking a step forward by actually taking a step back from a number of forefront activities in discovery, linked open data and integrated research information towards a more hidden, but also more fundamental enterprise in the area of data infrastructure and information architecture. All for a good cause, for in the end a good data infrastructure is essential for delivering high […] Looking for data tricks in Libraryland IFLA 2014 Annual World Library and Information Congress Lyon – Libraries, Citizens, Societies: Confluence for Knowledge After attending the IFLA 2014 Library Linked Data Satellite Meeting in Paris I travelled to Lyon for the first three days (August 17-19) of the IFLA 2014 Annual World Library and Information Congress. This year’s theme “Libraries, Citizens, Societies: Confluence for Knowledge” was named after the confluence or convergence of the rivers Rhône and Saône where the city of […] Library Linked Data Happening On August 14 the IFLA 2014 Satellite Meeting ‘Linked Data in Libraries: Let’s make it happen!’ took place at the National Library of France in Paris. Rurik Greenall (who also wrote a very readable conference report) and I had the opportunity to present our paper ‘An unbroken chain: approaches to implementing Linked Open Data in libraries; comparing local, open-source, collaborative and commercial systems’.
In this paper we do not go into reasons for libraries to […] feeds-feedburner-com-6309 ---- None feeds-feedburner-com-6319 ---- None feeds-feedburner-com-6462 ---- None feeds-feedburner-com-6496 ---- None feeds-feedburner-com-6549 ---- Dan Cohen Dan Cohen Vice Provost, Dean, and Professor at Northeastern University When We Look Back on 2020, What Will We See? It is far too early to understand what happened in this historic year of 2020, but not too soon to grasp what we will write that history from: data—really big data, gathered from our devices and ourselves. Sometimes a new technology provides an important lens through which a historical event is recorded, viewed, and remembered. […] More than THAT “Less talk, more grok.” That was one of our early mottos at THATCamp, The Humanities and Technology Camp, which started at the Roy Rosenzweig Center for History and New Media at George Mason University in 2008. It was a riff on “Less talk, more rock,” the motto of WAAF, the hard rock station in Worcester, Massachusetts. And […] Humane Ingenuity: My New Newsletter With the start of this academic year, I’m launching a new newsletter to explore technology that helps rather than hurts human understanding, and human understanding that helps us create better technology. It’s called Humane Ingenuity, and you can subscribe here. (It’s free, just drop your email address into that link.) Subscribers to this blog know […] Engagement Is the Enemy of Serendipity Whenever I’m grumpy about an update to a technology I use, I try to perform a self-audit examining why I’m unhappy about this change. It’s a helpful exercise since we are all by nature resistant to even minor alterations to the technologies we use every day (which is why website redesign is now a synonym […] On the Response to My Atlantic Essay on the Decline in the Use of Print Books in Universities I was not expecting—but was gratified to see—an enormous response to my latest piece in The Atlantic, “The Books of College Libraries Are Turning Into Wallpaper,” on the seemingly inexorable decline in the circulation of print books on campus. I’m not sure that I’ve ever written anything that has generated as much feedback, commentary, and […] What’s New Season 2 Wrap-up With the end of the academic year at Northeastern University, the library wraps up our What’s New podcast, an interview series with researchers who help us understand, in plainspoken ways, some of the latest discoveries and ideas about our world. This year’s slate of podcasts, like last year’s, was extraordinarily diverse, ranging from the threat […] When a Presidential Library Is Digital I’ve got a new piece over at The Atlantic on Barack Obama’s prospective presidential library, which will be digital rather than physical. This has caused some consternation. We need to realize, however, that the Obama library is already largely digital: The vast majority of the record his presidency left behind consists not of evocative handwritten […] Robin Sloan’s Fusion of Technology and Humanity When Roy Rosenzweig and I wrote Digital History 15 years ago, we spent a lot of time thinking about the overall tone and approach of the book. 
It seemed to us that there were, on the one hand, a lot of our colleagues in professional history who were adamantly opposed to the use of digital […] Presidential Libraries and the Digitization of Our Lives Buried in the recent debates (New York Times, Chicago Tribune, The Public Historian) about the nature, objectives, and location of the Obama Presidential Center is the inexorable move toward a world in which virtually all of the documentation about our lives is digital. To make this decades-long shift—now almost complete—clear, I made the following infographic […] Kathleen Fitzpatrick’s Generous Thinking Generosity and thoughtfulness are not in abundance right now, and so Kathleen Fitzpatrick‘s important new book, Generous Thinking: A Radical Approach to Saving the University, is wholeheartedly welcome. The generosity Kathleen seeks relates to lost virtues, such as listening to others and deconstructing barriers between groups. As such, Generous Thinking can be helpfully read alongside […] feeds-feedburner-com-6560 ---- None feeds-feedburner-com-6640 ---- None feeds-feedburner-com-6661 ---- None feeds-feedburner-com-6753 ---- Library Hat Library Hat http://www.bohyunkim.net/blog/ Blockchain: Merits, Issues, and Suggestions for Compelling Use Cases * This post was also published in ACRL TechConnect.*** Blockchain holds a great potential for both innovation and disruption. The adoption of blockchain also poses certain risks, and those risks will need to be addressed and mitigated before blockchain becomes mainstream. A lot of people have heard of blockchain at this point. But many are […] Taking Diversity to the Next Level ** This post was also published in ACRL TechConnect on Dec. 18, 2017.*** Getting Minorities on Board I recently moderated a panel discussion program titled “Building Bridges in a Divisive Climate: Diversity in Libraries, Archives, and Museums.”1 Participating in organizing this program was an interesting experience. During the whole time, I experienced my perspective constantly shifting […] From Need to Want: How to Maximize Social Impact for Libraries, Archives, and Museums At the NDP at Three event organized by IMLS yesterday, Sayeed Choudhury on the “Open Scholarly Communications” panel suggested that libraries think about return on impact in addition to return on investment (ROI). He further elaborated on this point by proposing a possible description of such impact. His description was that when an object or […] How to Price 3D Printing Service Fees ** This post was originally published in ACRL TechConnect on May. 22, 2017.*** Many libraries today provide 3D printing service. But not all of them can afford to do so for free. While free 3D printing may be ideal, it can jeopardize the sustainability of the service over time. Nevertheless, many libraries tend to worry […] Post-Election Statements and Messages that Reaffirm Diversity These are statements and messages sent out publicly or internally to re-affirm diversity, equity, and inclusion by libraries or higher ed institutions. I have collected these – some myself and many others through my fellow librarians. Some of them were listed on my blog post, “Finding the Right Words in Post-Election Libraries and Higher Ed.” […] Finding the Right Words in Post-Election Libraries and Higher Ed ** This post was originally published in ACRL TechConnect on Nov. 15, 2016.*** This year’s election result has presented a huge challenge to all of us who work in higher education and libraries.
Usually, libraries, universities, and colleges do not comment on presidential election result and we refrain from talking about politics at work. But […] Say It Out Loud – Diversity, Equity, and Inclusion I usually and mostly talk about technology. But technology is so far away from my thought right now. I don’t feel that I can afford to worry about Internet surveillance or how to protect privacy at this moment. Not that they are unimportant. Such a worry is real and deserves our attention and investigation. But […] Cybersecurity, Usability, Online Privacy, and Digital Surveillance ** This post was originally published in ACRL TechConnect on May. 9, 2016.*** Cybersecurity is an interesting and important topic, one closely connected to those of online privacy and digital surveillance. Many of us know that it is difficult to keep things private on the Internet. The Internet was invented to share things with others […] Three Recent Talks of Mine on UX, Data Visualization, and IT Management I have been swamped at work and pretty quiet here in my blog. But I gave a few talks recently. So I wanted to share those at least. I presented about how to turn the traditional library IT department and its operation that is usually behind the scene into a more patron-facing unit at the recent American Library Association Midwinter […] Near Us and Libraries, Robots Have Arrived ** This post was originally published in ACRL TechConnect on Oct. 12, 2015.*** The movie, Robot and Frank, describes the future in which the elderly have a robot as their companion and also as a helper. The robot monitors various activities that relate to both mental and physical health and helps Frank with various house chores. […] feeds-feedburner-com-6828 ---- None feeds-feedburner-com-6846 ---- None feeds-feedburner-com-7031 ---- None feeds-feedburner-com-7110 ---- None feeds-feedburner-com-7189 ---- None feeds-feedburner-com-7223 ---- None feeds-feedburner-com-7283 ---- The Code4Lib Journal The Code4Lib Journal Editorial Resuming our publication schedule Managing an institutional repository workflow with GitLab and a folder-based deposit system Institutional Repositories (IR) exist in a variety of configurations and in various states of development across the country. Each organization with an IR has a workflow that can range from explicitly documented and codified sets of software and human workflows, to ad hoc assortments of methods for working with faculty to acquire, process and load items into a repository. The University of North Texas (UNT) Libraries has managed an IR called UNT Scholarly Works for the past decade but has until recently relied on ad hoc workflows. Over the past six months, we have worked to improve our processes in a way that is extensible and flexible while also providing a clear workflow for our staff to process submitted and harvested content. Our approach makes use of GitLab and its associated tools to track and communicate priorities for a multi-user team processing resources. We paired this Web-based management with a folder-based system for moving the deposited resources through a sequential set of processes that are necessary to describe, upload, and preserve the resource. This strategy can be used in a number of different applications and can serve as a set of building blocks that can be configured in different ways. This article will discuss which components of GitLab are used together as tools for tracking deposits from faculty as they move through different steps in the workflow. 
Likewise, the folder-based workflow queue will be presented and described as implemented at UNT, and examples for how we have used it in different situations will be presented. Customizing Alma and Primo for Home & Locker Delivery Like many Ex Libris libraries in Fall 2020, our library at California State University, Northridge (CSUN) was not physically open to the public during the 2020-2021 academic year, but we wanted to continue to support the research and study needs of our over 38,000 university students and 4,000 faculty and staff. This article will explain our Alma and Primo implementation to allow for home mail delivery of physical items, including policy decisions, workflow changes, customization of request forms through labels and delivery skins, customization of Alma letters, a Python solution to add the “home” address type to patron addresses to make it all work, and will include relevant code samples in Python, XSL, CSS, XML, and JSON. In Spring 2021, we will add the on-site locker delivery option in addition to home delivery, and this article will include new system changes made for that option. GaNCH: Using Linked Open Data for Georgia’s Natural, Cultural and Historic Organizations’ Disaster Response In June 2019, the Atlanta University Center Robert W. Woodruff Library received a LYRASIS Catalyst Fund grant to support the creation of a publicly editable directory of Georgia’s Natural, Cultural and Historical Organizations (NCHs), allowing for quick retrieval of location and contact information for disaster response. By the end of the project, over 1,900 entries for NCH organizations in Georgia were compiled, updated, and uploaded to Wikidata, the linked open data database from the Wikimedia Foundation. These entries included directory contact information and GIS coordinates that appear on a map presented on the GaNCH project website (https://ganch.auctr.edu/), allowing emergency responders to quickly search for NCHs by region and county in the event of a disaster. In this article we discuss the design principles, methods, and challenges encountered in building and implementing this tool, including the impact the tool has had on statewide disaster response after implementation. Archive This Moment D.C.: A Case Study of Participatory Collecting During COVID-19 When the COVID-19 pandemic brought life in Washington, D.C. to a standstill in March 2020, staff at DC Public Library began looking for ways to document how this historic event was affecting everyday life. Recognizing the value of first-person accounts for historical research, staff launched Archive This Moment D.C. to preserve the story of daily life in the District during the stay-at-home order. Materials were collected from public Instagram and Twitter posts submitted through the hashtag #archivethismomentdc. In addition to social media, creators also submitted materials using an Airtable webform set up for the project and through email. Over 2,000 digital files were collected. This article will discuss the planning, professional collaboration, promotion, selection, access, and lessons learned from the project; as well as the technical setup, collection strategies, and metadata requirements. In particular, this article will include a discussion of the evolving collection scope of the project and the need for clear ethical guidelines surrounding privacy when collecting materials in real-time. 
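The first abstract in this Code4Lib Journal listing describes a folder-based deposit queue in which each submitted item moves through a sequence of processing folders. A minimal sketch of that general pattern, assuming hypothetical stage names and paths rather than UNT's actual configuration, might look like this:

```python
"""Minimal sketch of a folder-based deposit queue, loosely modeled on the
workflow described in the UNT institutional repository abstract above.
Stage names and paths are illustrative assumptions, not UNT's setup."""
from pathlib import Path
import shutil

# Hypothetical sequential processing stages for a submitted item.
STAGES = ["01_submitted", "02_described", "03_uploaded", "04_preserved"]
QUEUE_ROOT = Path("ir-queue")


def init_queue() -> None:
    """Create one directory per stage so staff can see the whole pipeline."""
    for stage in STAGES:
        (QUEUE_ROOT / stage).mkdir(parents=True, exist_ok=True)


def advance(item_name: str) -> Path:
    """Move a deposit folder from its current stage to the next one."""
    for current, nxt in zip(STAGES, STAGES[1:]):
        src = QUEUE_ROOT / current / item_name
        if src.exists():
            dest = QUEUE_ROOT / nxt / item_name
            shutil.move(str(src), str(dest))
            return dest
    raise FileNotFoundError(f"{item_name} not found in any non-final stage")


if __name__ == "__main__":
    init_queue()
    # Simulate a new faculty deposit arriving in the first stage.
    (QUEUE_ROOT / STAGES[0] / "deposit-0001").mkdir(exist_ok=True)
    print(advance("deposit-0001"))  # -> ir-queue/02_described/deposit-0001
```

Because the queue is just directories on disk, staff can also move items by hand, which is part of the appeal of the folder-based approach the abstract describes.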
Advancing ARKs in the Historical Ontology Space This paper presents the application of Archival Resource Keys (ARKs) for persistent identification and resolution of concepts in historical ontologies. Our use case is the 1910 Library of Congress Subject Headings (LCSH), which we have converted to the Simple Knowledge Organization System (SKOS) format and will use for representing a corpus of historical Encyclopedia Britannica articles. We report on the steps taken to assign ARKs in support of the Nineteenth-Century Knowledge Project, where we are using the HIVE vocabulary tool to automatically assign subject metadata from both the 1910 LCSH and the contemporary LCSH faceted, topical vocabulary to enable the study of the evolution of knowledge. Considered Content: a Design System for Equity, Accessibility, and Sustainability The University of Minnesota Libraries developed and applied a principles-based design system to their Health Sciences Library website. With the design system at its center, the revised site was able to achieve accessible, ethical, inclusive, sustainable, responsible, and universal design. The final site was built with elegantly accessible semantic HTML-focused code on Drupal 8 with highly curated and considered content, meeting and exceeding WCAG 2.1 AA guidance and addressing cognitive and learning considerations through the use of plain language, templated pages for consistent page-level organization, and no hidden content. As a result, the site better supports all users regardless of their abilities, attention level, mental status, reading level, and reliability of their internet connection, all of which are especially critical now as an elevated number of people experience crises, anxieties, and depression. Robustifying Links To Combat Reference Rot Links to web resources frequently break, and linked content can change at unpredictable rates. These dynamics of the Web are detrimental when references to web resources provide evidence or supporting information. In this paper, we highlight the significance of reference rot, provide an overview of existing techniques and their characteristics to address it, and introduce our Robust Links approach, including its web service and underlying API. Robustifying links offers a proactive, uniform, and machine-actionable way to combat reference rot. In addition, we discuss our reasoning and approach aimed at keeping the approach functional for the long term. To showcase our approach, we have robustified all links in this article. Machine Learning Based Chat Analysis The BYU library implemented a Machine Learning-based tool to perform various text analysis tasks on transcripts of chat-based interactions between patrons and librarians. These text analysis tasks included estimating patron satisfaction and classifying queries into various categories such as Research/Reference, Directional, Tech/Troubleshooting, Policy/Procedure, and others. An accuracy of 78% or better was achieved for each category. This paper details the implementation details and explores potential applications for the text analysis tool. Always Be Migrating At the University of California, Los Angeles, the Digital Library Program is in the midst of a large, multi-faceted migration project. This article presents a narrative of migration and a new mindset for technology and library staff in their ever-changing infrastructure and systems. 
This article posits that migration from system to system should be integrated into normal activities so that it is not a singular event or major project, but so that it is a process built into the core activities of a unit. Editorial: For Pandemic Times Such as This A pandemic changes the world and changes libraries. Open Source Tools for Scaling Data Curation at QDR This paper describes the development of services and tools for scaling data curation services at the Qualitative Data Repository (QDR). Through a set of open-source tools, semi-automated workflows, and extensions to the Dataverse platform, our team has built services for curators to efficiently and effectively publish collections of qualitatively derived data. The contributions we seek to make in this paper are as follows: 1. We describe ‘human-in-the-loop’ curation and the tools that facilitate this model at QDR; 2. We provide an in-depth discussion of the design and implementation of these tools, including applications specific to the Dataverse software repository, as well as standalone archiving tools written in R; and 3. We highlight the role of providing a service layer for data discovery and accessibility of qualitative data. Keywords: Data curation; open-source; qualitative data From Text to Map: Combining Named Entity Recognition and Geographic Information Systems This tutorial shows readers how to leverage the power of named entity recognition (NER) and geographic information systems (GIS) to extract place names from text, geocode them, and create a public-facing map. This process is highly useful across disciplines. For example, it can be used to generate maps from historical primary sources, works of literature set in the real world, and corpora of academic scholarship. In order to lead the reader through this process, the authors work with a 500 article sample of the COVID-19 Open Research Dataset Challenge (CORD-19) dataset. As of the date of writing, CORD-19 includes 45,000 full-text articles with metadata. Using this sample, the authors demonstrate how to extract locations from the full-text with the spaCy library in Python, highlight methods to clean up the extracted data with the Pandas library, and finally teach the reader how to create an interactive map of the places using ArcGIS Online. The processes and code are described in a manner that is reusable for any corpus of text. Using Integrated Library Systems and Open Data to Analyze Library Cardholders The Harrison Public Library in Westchester County, New York operates two library buildings in Harrison: The Richard E. Halperin Memorial Library Building (the library’s main building, located in downtown Harrison) and a West Harrison branch location. As part of its latest three-year strategic plan, the library sought to use existing resources to improve understanding of its cardholders at both locations. To do so, we needed to link the circulation data in our integrated library system, Evergreen, to geographic data and demographic data. We decided to build a geodemographic heatmap that incorporated all three aforementioned types of data. Using Evergreen, American Community Survey (ACS) data, and Google Maps, we plotted each cardholder’s residence on a map, added census boundaries (called tracts) and our town’s borders to the map, and produced summary statistics for each tract detailing its demographics and the library card usage of its residents. In this article, we describe how we acquired the necessary data and built the heatmap.
We also touch on how we safeguarded the data while building the heatmap, which is an internal tool available only to select authorized staff members. Finally, we discuss what we learned from the heatmap and how libraries can use open data to benefit their communities. Update OCLC Holdings Without Paying Additional Fees: A Patchwork Approach Accurate OCLC holdings are vital for interlibrary loan transactions. However, over time weeding projects, replacing lost or damaged materials, and human error can leave a library with a catalog that is no longer reflected through OCLC. While OCLC offers reclamation services to bring poorly maintained collections up-to-date, the associated fee may be cost prohibitive for libraries with limited budgets. This article will describe the process used at Austin Peay State University to identify, isolate, and update holdings using OCLC Collection Manager queries, MarcEdit, Excel, and Python. Some portions of this process are completed using basic coding; however, troubleshooting techniques will be included for those with limited previous experience. Data reuse in linked data projects: a comparison of Alma and Share-VDE BIBFRAME networks This article presents an analysis of the enrichment, transformation, and clustering used by vendors Casalini Libri/@CULT and Ex Libris for their respective conversions of MARC data to BIBFRAME. The analysis considers the source MARC21 data used by Alma and then the enrichment and transformation of MARC21 data from Share-VDE partner libraries. The clustering of linked data into a BIBFRAME network is a key outcome of data reuse in linked data projects and fundamental to the improvement of the discovery of library collections on the web and within search systems. CollectionBuilder-CONTENTdm: Developing a Static Web ‘Skin’ for CONTENTdm-based Digital Collections Unsatisfied with customization options for CONTENTdm, librarians at the University of Idaho Library have been using a modern static web approach to creating digital exhibit websites that sit in front of the digital repository. This "skin" is designed to provide users with new pathways to discover and explore collection content and context. This article describes the concepts behind the approach and how it has developed into an open source, data-driven tool called CollectionBuilder-CONTENTdm. The authors outline the design decisions and principles guiding the development of CollectionBuilder, and detail how a version is used at the University of Idaho Library to collaboratively build digital collections and digital scholarship projects. Automated Collections Workflows in GOBI: Using Python to Scrape for Purchase Options The NC State University Libraries has developed a tool for querying GOBI, our print and ebook ordering vendor platform, to automate monthly collections reports. These reports detail purchase options for missing or long-overdue items, as well as popular items with multiple holds. GOBI does not offer an API, forcing staff to conduct manual title-by-title searches that previously took up to 15 hours per month. To make this process more efficient, we wrote a Python script that automates title searches and the extraction of key data (price, date of publication, binding type) from GOBI. This tool can gather data for hundreds of titles in half an hour or less, freeing up time for other projects. This article will describe the process of creating this script, as well as how it finds and selects data in GOBI.
It will also discuss how these results are paired with NC State’s holdings data to create reports for collection managers. Lastly, the article will examine obstacles that were experienced in the creation of the tool and offer recommendations for other organizations seeking to automate collections workflows. Testing remote access to e-resources with CodeceptJS At the Badische Landesbibliothek Karlsruhe (BLB) we offer a variety of e-resources with different access requirements. On the one hand, there is free access to open access material, no matter where you are. On the other hand, there are e-resources that you can only access when you are in the rooms of the BLB. We also offer e-resources that you can access from anywhere, but you must have a library account for authentication to gain access. To test the functionality of these access methods, we have created a project to automatically test the entire process from searching our catalogue, selecting a hit, logging in to the provider's site and checking the results. For this we use the End 2 End Testing Framework CodeceptJS. Editorial An abundance of information sharing. Leveraging Google Drive for Digital Library Object Storage This article will describe a process at the University of Kentucky Libraries for utilizing an unlimited Google Drive for Education account for digital library object storage. For a number of recent digital library projects, we have used Google Drive for both archival file storage and web derivative file storage. As a part of the process, a Google Drive API script is deployed in order to automate the gathering of Google Drive object identifiers. Also, a custom Omeka plugin was developed to allow for referencing web deliverable files within a web publishing platform via object linking and embedding. For a number of new digital library projects, we have moved toward a small VM approach to digital library management where the VM serves as a web front end but not a storage node. This has necessitated alternative approaches to storing web addressable digital library objects. One option is the use of Google Drive for storing digital objects. An overview of our approach is included in this article as well as links to open source code we adopted and more open source code we produced. Building a Library Search Infrastructure with Elasticsearch This article discusses our implementation of an Elastic cluster to address our search, search administration and indexing needs, how it integrates in our technology infrastructure, and finally takes a close look at the way that we built a reusable, dynamic search engine that powers our digital repository search. We cover the lessons learned with our early implementations and how to address them to lay the groundwork for a scalable, networked search environment that can also be applied to alternative search engines such as Solr. How to Use an API Management platform to Easily Build Local Web Apps Setting up an API management platform like DreamFactory can open up a lot of possibilities for potential projects within your library. With an automatically generated restful API, the University Libraries at Virginia Tech have been able to create applications for gathering walk-in data and reference questions, public polling apps, feedback systems for service points, data dashboards and more. This article will describe what an API management platform is, why you might want one, and the types of potential projects that can quickly be put together by your local web developer.
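The Elasticsearch abstract above describes building a reusable search layer that powers a digital repository. A minimal sketch of indexing and querying records over Elasticsearch's REST API, assuming a local node at http://localhost:9200 and a hypothetical index name rather than the authors' actual cluster, might look like this:

```python
"""Minimal sketch of indexing and searching repository records in
Elasticsearch via its REST API. The node URL, index name, and fields
are assumptions for illustration, not the article's configuration."""
import requests

ES = "http://localhost:9200"   # assumed local Elasticsearch node
INDEX = "digital-repository"   # hypothetical index name


def index_record(doc_id: str, record: dict) -> None:
    """Add or replace one document; refresh so it is searchable immediately."""
    r = requests.put(f"{ES}/{INDEX}/_doc/{doc_id}",
                     params={"refresh": "true"}, json=record)
    r.raise_for_status()


def search(text: str) -> list:
    """Run a simple match query against the title field and return the hits."""
    query = {"query": {"match": {"title": text}}}
    r = requests.post(f"{ES}/{INDEX}/_search", json=query)
    r.raise_for_status()
    return r.json()["hits"]["hits"]


if __name__ == "__main__":
    index_record("001", {"title": "Sanborn fire insurance maps", "year": 1912})
    print([hit["_source"]["title"] for hit in search("maps")])
```

The same query DSL can be sent from any client, which is one reason a thin REST-based layer like this ports reasonably well to other engines such as Solr.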
Git and GitLab in Library Website Change Management Workflows Library websites can benefit from a separate development environment and a robust change management workflow, especially when there are multiple authors. This article details how the Oakland University William Beaumont School of Medicine Library uses Git and GitLab in a change management workflow with a serverless development environment for their website development team. Git tracks changes to the code, allowing changes to be made and tested in a separate branch before being merged back into the website. GitLab adds features such as issue tracking and discussion threads to Git to facilitate communication and planning. Adoption of these tools and this workflow has dramatically improved the organization and efficiency of the OUWB Medical Library web development team, and it is the hope of the authors that by sharing our experience with them, others may benefit as well. Experimenting with a Machine Generated Annotations Pipeline The UCLA Library reorganized its software developers into focused subteams with one, the Labs Team, dedicated to conducting experiments. In this article we describe our first attempt at conducting a software development experiment, in which we attempted to improve our digital library’s search results with metadata from cloud-based image tagging services. We explore the findings and discuss the lessons learned from our first attempt at running an experiment. Leveraging the RBMS/BSC Latin Place Names File with Python To answer the relatively straightforward question “Which rare materials in my library catalog were published in Venice?” requires an advanced knowledge of geography, language, orthography, alphabet graphical changes, cataloging standards, transcription practices, and data analysis. The imprint statements of rare materials transcribe place names more faithfully as they appear on the piece itself, such as Venetus or Venetiae, rather than a recognizable and contemporary form of place name, such as Venice, Italy. Rare materials catalogers recognize this geographic discoverability and selection issue and solve it with a standardized solution. To add consistency and normalization to imprint locations, rare materials catalogers utilize hierarchical place names to create a special imprint index. However, this normalized and contemporary form of place name is often missing from legacy bibliographic records. This article demonstrates using a traditional rare materials cataloging aid, the RBMS/BSC Latin Place Names File, with programming tools, Jupyter Notebook and Python, to retrospectively populate a special imprint index for 17th-century rare materials. This methodology enriched 1,487 MAchine Readable Cataloging (MARC) bibliographic records with hierarchical place names (MARC 752 fields) as part of a small pilot project. This article details a partially automated solution to this geographic discoverability and selection issue; however, a human component is still ultimately required to fully optimize the bibliographic data. Tweeting Tennessee’s Collections: A Case Study of a Digital Collections Twitterbot Implementation This article demonstrates how a Twitterbot can be used as an inclusive outreach initiative that breaks down the barriers between the web and the reading room to share materials with the public. These resources include postcards, music manuscripts, photographs, cartoons and any other digitized materials.
Once in place, Twitterbots allow physical materials to converge with the technical and social space of the Web. Twitterbots are ideal for busy professionals because they allow librarians to make meaningful impressions on users without requiring a large time investment. This article covers the recent implementation of a digital collections bot (@UTKDigCollBot) at the University of Tennessee, Knoxville (UTK), and provides documentation and advice on how you might develop a bot to highlight materials at your own institution. Building Strong User Experiences in LibGuides with Bootstrapr and Reviewr With nearly fifty subject librarians creating LibGuides, the LibGuides Management Team at Notre Dame needed a way to both empower guide authors to take advantage of the powerful functionality afforded by the Bootstrap framework native to LibGuides, and to ensure new and extant library guides conformed to brand/identity standards and the best practices of user experience (UX) design. To accomplish this, we developed an online handbook to teach processes and enforce styles; a web app to create Twitter Bootstrap components for use in guides (Bootstrapr); and a web app to radically speed the review and remediation of guides, as well as better communicate our changes to guide authors (Reviewr). This article describes our use of these three applications to balance empowering guide authors against usefully constraining them to organizational standards for user experience. We offer all of these tools as FOSS under an MIT license so that others may freely adapt them for use in their own organization. IIIF by the Numbers The UCLA Library began work on building a suite of services to support IIIF for their digital collections. The services perform image transformations and delivery as well as manifest generation and delivery. The team was unsure about whether they should use local or cloud-based infrastructure for these services, so they conducted some experiments on multiple infrastructure configurations and tested them in scenarios with varying dimensions. Trust, But Verify: Auditing Vendor-Supplied Accessibility Claims Despite a long-overdue push to improve the accessibility of our libraries’ online presences, much of what we offer to our patrons comes from third party vendors: discovery layers, OPACs, subscription databases, and so on. We can’t directly affect the accessibility of the content on these platforms, but rely on vendors to design and test their systems and report on their accessibility through Voluntary Product Accessibility Templates (VPATS). But VPATs are self-reported. What if we want to verify our vendors’ claims? We can’t thoroughly test the accessibility of hundreds of vendor systems, can we? In this paper, we propose a simple methodology for spot-checking VPATs. Since most websites struggle with the same accessibility issues, spot checking particular success criteria in a library vendor VPAT can tip us off to whether the VPAT as a whole can be trusted. Our methodology combines automated and manual checking, and can be done without any expensive software or complex training. What’s more, we are creating a repository to share VPAT audit results with others, so that we needn’t all audit the VPATs of all our systems. 
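The "Trust, But Verify" abstract above proposes spot-checking a handful of success criteria rather than auditing an entire VPAT. A minimal sketch of the automated half of such a spot check, assuming a placeholder URL and only two illustrative checks (images without alt text and unlabelled form inputs), might look like this; a real audit would pair checks like these with manual testing:

```python
"""Minimal sketch of an automated accessibility spot check in the spirit of
the VPAT-auditing abstract above. The URL is a placeholder and the two checks
are illustrative, not a full WCAG evaluation."""
import requests
from bs4 import BeautifulSoup


def spot_check(url: str) -> dict:
    html = requests.get(url, timeout=30).text
    soup = BeautifulSoup(html, "html.parser")

    # WCAG 1.1.1: every <img> should carry an alt attribute (even if empty).
    missing_alt = [img for img in soup.find_all("img") if img.get("alt") is None]

    # WCAG 1.3.1 / 3.3.2: visible inputs should have a <label for=...> or aria-label.
    labelled_ids = {lab.get("for") for lab in soup.find_all("label")}
    unlabelled = [
        inp for inp in soup.find_all("input")
        if inp.get("type") not in ("hidden", "submit", "button")
        and inp.get("id") not in labelled_ids
        and not inp.get("aria-label")
    ]
    return {"images_missing_alt": len(missing_alt),
            "unlabelled_inputs": len(unlabelled)}


if __name__ == "__main__":
    print(spot_check("https://example.org/vendor-search"))  # placeholder URL
```

If a vendor's VPAT claims full support for these criteria and even a quick check like this finds violations, that is a signal the rest of the VPAT deserves closer scrutiny.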
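Similarly, the "From Text to Map" tutorial earlier in this listing extracts place names with spaCy before geocoding them. A minimal sketch of that first step, assuming the standard en_core_web_sm model (installed with `python -m spacy download en_core_web_sm`) and an invented sample sentence rather than the CORD-19 corpus the authors use, might look like this:

```python
"""Minimal sketch of the named entity recognition step from the
"From Text to Map" tutorial: pull place names out of free text with spaCy
so they can later be geocoded and mapped. Sample text is invented."""
from collections import Counter

import spacy

nlp = spacy.load("en_core_web_sm")


def extract_places(text: str) -> Counter:
    """Count GPE (countries, cities, states) and LOC entities in the text."""
    doc = nlp(text)
    return Counter(ent.text for ent in doc.ents if ent.label_ in {"GPE", "LOC"})


if __name__ == "__main__":
    sample = ("Samples were collected in Wuhan and later analysed by teams "
              "in Atlanta and Geneva.")
    print(extract_places(sample))  # e.g. Counter({'Wuhan': 1, 'Atlanta': 1, 'Geneva': 1})
```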
feeds-feedburner-com-7338 ---- None feeds-feedburner-com-7357 ---- None feeds-feedburner-com-7360 ---- None feeds-feedburner-com-7426 ---- None feeds-feedburner-com-7445 ---- None feeds-feedburner-com-7453 ---- Zotero Zotero Collect, organize, cite, and share your research Move Zotero Citations Between Google Docs, Word, and LibreOffice Last year, we added Google Docs integration to Zotero, bringing to Google Docs the same powerful citation functionality — with support for over 9,000 citation styles — that Zotero offers in Word and LibreOffice. Today we’re adding a feature that lets you move documents between Google Docs and Word or LibreOffice while preserving active Zotero citations. […] Retracted item notifications with Retraction Watch integration Zotero can now help you avoid relying on retracted publications in your research by automatically checking your database and documents for works that have been retracted. We’re providing this service in partnership with Retraction Watch, which maintains the largest database of retractions available, and we’re proud to help sustain their important work. How It Works […] Scan Books into Zotero from Your iPhone or iPad Zotero makes it easy to collect research materials with a single click as you browse the web, but what do you do when you want to add a real, physical book to your Zotero library? If you have an iPhone or iPad running iOS 12, you can now save a book to Zotero just by […] Zotero Comes to Google Docs We’re excited to announce the availability of Zotero integration with Google Docs, joining Zotero’s existing support for Microsoft Word and LibreOffice. The same powerful functionality that Zotero has long offered for traditional word processors is now available for Google Docs. You can quickly search for items in your Zotero library, add page numbers and other […] Improved PDF retrieval with Unpaywall integration As an organization dedicated to developing free and open-source research tools, we care deeply about open access to scholarship. With the latest version of Zotero, we’re excited to make it easier than ever to find PDFs for the items in your Zotero library. While Zotero has always been able to download PDFs automatically as you […] Introducing ZoteroBib: Perfect bibliographies in minutes We think Zotero is the best tool for almost anyone doing serious research, but we know that a lot of people — including many students — don’t need all of Zotero’s power just to create the occasional bibliography. Today, we’re introducing ZoteroBib, a free service to help people quickly create perfect bibliographies. Powered by the same technology […] Zotero 5.0.36: New PDF features, faster citing in large documents, and more The latest version of Zotero introduces some major improvements for PDF-based workflows, a new citing mode that can greatly speed up the use of the word processor plugin in large documents, and various other improvements and bug fixes. New PDF features Improved PDF metadata retrieval While the “Save to Zotero” button in the Zotero Connector […] Zotero 5.0 and Firefox: Frequently Asked Questions In A Unified Zotero Experience, we explained the changes introduced in Zotero 5.0 that affect Zotero for Firefox users. See that post for a full explanation of the change, and read on for some additional answers. What’s changing? 
Zotero 5.0 is available only as a standalone program, and Zotero 4.0 for Firefox is being replaced […] New Features for Chrome and Safari Connectors We are excited to announce major improvements to the Zotero Connectors for Chrome and Safari. Chrome The Zotero Connector for Chrome now includes functionality that was previously available only in Zotero for Firefox. Automatic Institutional Proxy Detection Many institutions provide a way to access electronic resources while you are off-campus by signing in to a […] A Unified Zotero Experience Since the introduction of Zotero Standalone in 2011, Zotero users have had two versions to choose from: the original Firefox extension, Zotero for Firefox, which provides deep integration into the Firefox user interface, and Zotero Standalone, which runs as a separate program and can be used with any browser. Starting with the release of Zotero […] feeds-feedburner-com-7472 ---- None feeds-feedburner-com-7538 ---- None feeds-feedburner-com-7642 ---- None feeds-feedburner-com-7745 ---- None feeds-feedburner-com-7753 ---- None feeds-feedburner-com-7770 ---- None feeds-feedburner-com-7775 ---- None feeds-feedburner-com-7879 ---- None feeds-feedburner-com-7884 ---- None feeds-feedburner-com-7912 ---- None feeds-feedburner-com-7967 ---- None feeds-feedburner-com-796 ---- None feeds-feedburner-com-8173 ---- None feeds-feedburner-com-8217 ---- None feeds-feedburner-com-8311 ---- None feeds-feedburner-com-8326 ---- None feeds-feedburner-com-8419 ---- None feeds-feedburner-com-8459 ---- None feeds-feedburner-com-8480 ---- Free Range Librarian Free Range Librarian K.G. Schneider's blog on librarianship, writing, and everything else (Dis)Association I have been reflecting on the future of a national association I belong to that has struggled with relevancy and with closing the distance between itself and its members, has distinct factions that differ on fundamental matters of values, faces declining national and chapter membership, needs to catch up on the technology curve, has sometimes […] I have measured out my life in Doodle polls You know that song? The one you really liked the first time you heard it? And even the fifth or fifteenth? But now your skin crawls when you hear it? That’s me and Doodle. In the last three months I have filled out at least a dozen Doodle polls for various meetings outside my organization. […] Memento DMV This morning I spent 40 minutes in the appointment line at the Santa Rosa DMV to get my license renewed and converted to REAL ID, but was told I was “too early” to renew my license, which expires in September, so I have to return after I receive my renewal notice. I could have converted […] An Old-Skool Blog Post I get up early these days and get stuff done — banking and other elder-care tasks for my mother, leftover work from the previous day, association or service work. A lot of this is writing, but it’s not writing. I have a half-dozen unfinished blog posts in WordPress, and even more in my mind. I […] Keeping Council Editorial note: Over half of this post was composed in July 2017. At the time, this post could have been seen as politically neutral (where ALA is the political landscape I’m referring to) but tilted toward change and reform. Since then, Events Have Transpired. I revised this post in November, but at the time hesitated […] What burns away We are among the lucky ones. We did not lose our home. We did not spend day after day evacuated, waiting to learn the fate of where we live. We never lost power or Internet. 
We had three or four days where we were mildly inconvenienced because PG&E wisely turned off gas to many neighborhoods, […] Neutrality is anything but “We watch people dragged away and sucker-punched at rallies as they clumsily try to be an early-warning system for what they fear lies ahead.” — Unwittingly prophetic me, March, 2016. Sometime after last November, I realized something very strange was happening with my clothes. My slacks had suddenly shrunk, even if I hadn’t washed them. After […] MPOW in the here and now I have coined a few biblioneologisms in my day, but the one that has had the longest legs is MPOW (My Place of Work), a convenient, mildly-masking shorthand for one’s institution. For the last four years I haven’t had the bandwidth to coin neologisms, let alone write about MPOW*. This silence could be misconstrued. I […] Questions I have been asked about doctoral programs About six months ago I was visiting another institution when someone said to me, “Oh, I used to read your blog, BACK IN THE DAY.” Ah yes, back in the day, that Pleistocene era when I wasn’t working on a PhD while holding down a big job and dealing with the rest of life’s shenanigans. […] A scholar’s pool of tears, Part 2: The pre in preprint means not done yet Note, for two more days, January 10 and 11, you (as in all of you) have free access to my article, To be real: Antecedents and consequences of sexual identity disclosure by academic library directors. Then it drops behind a paywall and sits there for a year. When I wrote Part 1 of this blog […] feeds-feedburner-com-8611 ---- None feeds-feedburner-com-861 ---- None feeds-feedburner-com-8646 ---- None feeds-feedburner-com-8690 ---- None feeds-feedburner-com-8727 ---- None feeds-feedburner-com-8744 ---- None feeds-feedburner-com-8789 ---- Hanging Together Hanging Together the OCLC Research blog Dutch round table session on next generation metadata: Think bigger than NACO and WorldCat Thanks to Ellen Hartman, OCLC, for translating the original English-language blog post. On March 8, 2021, a Dutch-language round table discussion was organized as part of the OCLC … The post Dutch round table session on next generation metadata: Think bigger than NACO and WorldCat appeared first on Hanging Together. Recognizing bias in research data – and research data management As the COVID pandemic grinds on, vaccinations are top of mind. A recent article published in JAMA Network Open examined whether vaccination clinical trials over the last decade adequately represented … The post Recognizing bias in research data – and research data management appeared first on Hanging Together. Accomplishments and priorities for the OCLC Research Library Partnership With 2021 well underway, the OCLC Research Library Partnership is as active as ever. We are heartened by the positive feedback and engagement our Partners have provided in response to … The post Accomplishments and priorities for the OCLC Research Library Partnership appeared first on Hanging Together. Dutch round table on next generation metadata: think bigger than NACO and WorldCat As part of the OCLC Research Discussion Series on Next Generation Metadata, this blog post reports back from the Dutch language round table discussion held on March 8, 2021. (A Dutch … The post Dutch round table on next generation metadata: think bigger than NACO and WorldCat appeared first on Hanging Together.
Third English round table on next generation metadata: investing in the utility of authorities and identifiers Thanks to George Bingham, UK Account Manager at OCLC, for contributing this post as part of the Metadata Series blog posts. As part of the OCLC Research Discussion Series on Next Generation Metadata, this blog post reports back … The post Third English round table on next generation metadata: investing in the utility of authorities and identifiers appeared first on Hanging Together. Spanish-language round table on next generation metadata: managing researcher identities is the top priority Many thanks to Francesc García Grimau, OCLC, for translating this blog post, which was originally in English. As part of the OCLC Research Discussion Series … The post Spanish-language round table on next generation metadata: managing researcher identities is the top priority appeared first on Hanging Together. Making strategic choices about library collaboration in RDM Academic libraries are responding to a host of disruptions – emerging technologies, changing user expectations, evolving research and learning practices, economic pressures, and of course, the COVID-19 pandemic. While these … The post Making strategic choices about library collaboration in RDM appeared first on Hanging Together. Spanish round table on next generation metadata: managing researcher identities is top of mind As part of the OCLC Research Discussion Series on Next Generation Metadata, this blog post reports back from the Spanish language round table discussion held on March 8, 2021. (A Spanish … The post Spanish round table on next generation metadata: managing researcher identities is top of mind appeared first on Hanging Together. French round table on next generation metadata: the challenge is managing multiple scales in concert Thanks to Arnaud Delivet, OCLC, for translating the original English-language article. This blog post reports back on the French-language round table organized by OCLC Research … The post French round table on next generation metadata: the challenge is managing multiple scales in concert appeared first on Hanging Together. German-language round table on next generation metadata: formats, contexts, and gaps Many thanks to Petra Löffel, OCLC, for translating this originally English-language blog post. As part of the discussion series on next generation metadata, this blog post reports back from the German round table … The post German-language round table on next generation metadata: formats, contexts, and gaps appeared first on Hanging Together.
feeds-feedburner-com-8820 ---- None feeds-feedburner-com-8867 ---- None feeds-feedburner-com-8983 ---- None feeds-feedburner-com-9113 ---- None feeds-feedburner-com-9292 ---- None feeds-feedburner-com-9338 ---- None feeds-feedburner-com-9428 ---- None feeds-feedburner-com-9453 ---- None feeds-feedburner-com-9570 ---- None feeds-feedburner-com-9678 ---- None feeds-feedburner-com-9885 ---- None feeds-feedburner-com-9931 ---- None feeds-feedburner-com-9997 ---- None feeds-fiander-info-4620 ---- Rapid Communications Rapid Communications Rapid, but irregular, communications from the frontiers of Library Technology Mac OS vs Emacs: Getting on the right (exec) PATH Finding ISBNs in the the digits of π Software Upgrades and The Parable of the Windows Using QR Codes in the Library A Manifesto for the Library I'm a Shover and Maker! LITA Tears Down the Walls A (Half) Year in Books The Desk Set Drinking Game July Book a Month Challenge: Independence June Book a Month Challenge: Knowledge Anthony Hope and the Triumph of the Public Domain May Book a Month Challenge: Mother Eric S. Raymond on Proprietary ILSs One Big Library Unconference in Toronto April Book A Month Challenge: Beauty Thinking About Dates on To-Do List Web Sites The Most Important Programming Language I've Learned Building Systems that Support Librarians Book A Month Challenge for March: Craft Social Aggregators On Keeping a Reading Journal BAM Challenge: Heart Where the Users Are My Top Technology Trends Slides feeds-pinboard-in-8651 ---- Pinboard (items tagged code4lib) https://pinboard.in/t:code4lib/ (400) https://twitter.com/rudokemper/status/1371454887721119748/photo/1 2021-03-23T15:36:56+00:00 https://twitter.com/rudokemper/status/1371454887721119748/photo/1 bsscdt RT @rudokemper: Floored and honored to have been invited to give a keynote for the #c4l21 #code4lib conference next Monday. I can't wait to share about our work building open-source tech for communities to map oral histories, and how my journey started in the library + archive space! @code4lib c4l21 code4lib https://twitter.com/ https://pinboard.in/u:bsscdt/b:393a9fefac65/ Untitled (https://d1keuthy5s86c8.cloudfront.net/static/ems/upload/files/code4lib21_discogs_blacklight.pdf) 2021-03-23T05:00:37+00:00 https://d1keuthy5s86c8.cloudfront.net/static/ems/upload/files/code4lib21_discogs_blacklight.pdf rybesh RT @sf433: Really happy to share, “Dynamic Integration of Discogs Data within a Blacklight Catalog” From now on I’m going to ask myself, “Can this talk be a poster?” #code4lib code4lib https://twitter.com/ https://pinboard.in/u:rybesh/b:731426d5f14f/ The Code4Lib Journal – Advancing ARKs in the Historical Ontology Space 2021-03-10T18:19:28+00:00 https://journal.code4lib.org/articles/15608 geephroh code4lib digitallibraries digitalpreservation data ontology identifiers digitalhumanities ark computationalarchivalscience cas archives journalarticle https://pinboard.in/ https://pinboard.in/u:geephroh/b:60093e26caf8/ The Code4Lib Journal – Managing an institutional repository workflow with GitLab and a folder-based deposit system 2021-02-16T00:56:37+00:00 https://journal.code4lib.org/articles/15650 aarontay Managing an institutional repository workflow with GitLab and a folder-based deposit system by Whitney R. Johnson-Freeman, @vphill, and Kristy K. Phillips #code4lib Journal issue 50. 
code4lib https://twitter.com/ https://pinboard.in/u:aarontay/b:95dfc9c36cda/ LISTSERV 16.5 - CODE4LIB Archives 2020-09-29T12:04:57+00:00 https://lists.clir.org/cgi-bin/wa?A2=CODE4LIB;e2bc9365.2009 miaridge RT @kiru: I forgot to post the call earlier: The Code4Lib Journal () is looking for volunteers to join its editorial committee. Deadline: 12 Oct. #code4lib code4lib https://twitter.com/ https://pinboard.in/u:miaridge/b:e26e92731fb6/ 20 - C4L [5] Future Role of Libraries in Researcher Workflows - Google Slides 2020-03-11T00:13:42+00:00 https://t.co/JCoE2mVhD5 elibtronic research-lifecycle code4lib publish scholarly-communication https://pinboard.in/u:elibtronic/b:7282952b4f7a/ Twitter 2020-02-18T09:20:53+00:00 https://twitter.com/i/web/status/1229697282284625920 aarontay New issue of the The #Code4Lib Journal published. Some terrific looking papers, including a review of PIDs for heri… Code4Lib https://twitter.com/ https://pinboard.in/u:aarontay/b:8525b50b475d/ (500) https://journal.code4lib.org/ 2020-02-18T08:24:34+00:00 https://journal.code4lib.org/ miaridge RT @kiru: I am very happy to announce the publication of the @Code4Lib Journal issue #47: webscraping… 47 code4lib https://twitter.com/ https://pinboard.in/u:miaridge/b:8f5c33d4d11c/ The Code4Lib Journal – COLUMN: We Love Open Source Software. No, You Can’t Have Our Code 2019-12-09T23:24:08+00:00 https://journal.code4lib.org/articles/527 pfhyper Librarians are among the strongest proponents of open source software. Paradoxically, libraries are also among the least likely to actively contribute their code to open source projects. This article identifies and discusses six main reasons this dichotomy exists and offers ways to get around them. Code4Lib library LIBT opensource finalproject https://pinboard.in/ https://pinboard.in/u:pfhyper/b:4da9d5a48b61/ The Code4Lib Journal – Barriers to Initiation of Open Source Software Projects in Libraries 2019-12-09T23:20:43+00:00 https://journal.code4lib.org/articles/10665 pfhyper Libraries share a number of core values with the Open Source Software (OSS) movement, suggesting there should be a natural tendency toward library participation in OSS projects. However Dale Askey’s 2008 Code4Lib column entitled “We Love Open Source Software. No, You Can’t Have Our Code,” claims that while libraries are strong proponents of OSS, they are unlikely to actually contribute to OSS projects. He identifies, but does not empirically substantiate, six barriers that he believes contribute to this apparent inconsistency. In this study we empirically investigate not only Askey’s central claim but also the six barriers he proposes. In contrast to Askey’s assertion, we find that initiation of and contribution to OSS projects are, in fact, common practices in libraries. However, we also find that these practices are far from ubiquitous; as Askey suggests, many libraries do have opportunities to initiate OSS projects, but choose not to do so. Further, we find support for only four of Askey’s six OSS barriers. Thus, our results confirm many, but not all, of Askey’s assertions. Code4Lib library LIBT opensource finalproject https://pinboard.in/ https://pinboard.in/u:pfhyper/b:74f337d2e129/ Twitter 2019-11-07T05:59:14+00:00 https://twitter.com/i/web/status/1191993029948780545 jbfink RT @kiru: The #Code4Lib Journal's issue 46 (2019/4) has been just published: . 
Worldcat Search API, Go… Code4Lib https://twitter.com/ https://pinboard.in/u:jbfink/b:d0cd0f6754e5/ Twitter 2019-11-01T15:40:51+00:00 https://twitter.com/i/web/status/1190292574008987648 jbfink RT @mjingle: Who's excited for the next #code4lib conference?! It will be in Pittsburgh, PA from March 8-11. Is your org interes… code4lib https://twitter.com/ https://pinboard.in/u:jbfink/b:14defc6eb027/ Attempto Project 2019-09-13T09:31:25+00:00 http://attempto.ifi.uzh.ch/site/ blebo nlp basic cnl computationalLinguistics controlledLanguage controlled_language code4lib compsci english knowledgeRepresentation https://pinboard.in/u:blebo/b:5a5b84f3a2fd/ Twitter 2019-08-22T22:54:45+00:00 https://twitter.com/i/web/status/1164566585371066368 danbri When our grandchildren ask about the Great #code4lib IRC Battle of the Tisane, we will serve them both tea and coff… code4lib https://twitter.com/ https://pinboard.in/u:danbri/b:3ce9a224628e/ Code4Lib 2019 Recap – bloggERS! 2019-07-23T17:38:41+00:00 https://saaers.wordpress.com/2019/04/02/code4lib-2019-recap/ geephroh code4lib digitallibraries research saa archives https://pinboard.in/ https://pinboard.in/u:geephroh/b:232421afd001/ Digital Technologies Development Librarian | NC State University Libraries 2019-07-09T15:54:39+00:00 https://www.lib.ncsu.edu/jobs/ehra/dtdl2019 cdmorris We're hiring a Digital Technologies Development Librarian @ncsulibraries ! #job #libjobs #code4lib #dlf #libtech dlf libtech code4lib job libjobs https://twitter.com/ https://pinboard.in/u:cdmorris/b:cf25e0f15239/ Twitter 2019-07-03T13:01:26+00:00 https://twitter.com/i/web/status/1146403575649787904 jbfink 3) All the men who want to preserve the idea of a #Code4Lib discussion space as one that's free of such topics as s… Code4Lib https://twitter.com/ https://pinboard.in/u:jbfink/b:d2f274738572/ Google Refine cheat sheet (code4lib) 2019-05-31T23:23:19+00:00 https://code4libtoronto.github.io/2018-10-12-access/GoogleRefineCheatSheets.pdf Psammead openRefine code4lib how-to cheatsheet https://pinboard.in/ https://pinboard.in/u:Psammead/b:d34452c7d709/ Untitled (https://www.youtube.com/watch?v=ICbLVnCHpnw) 2019-05-31T19:41:08+00:00 https://www.youtube.com/watch?v=ICbLVnCHpnw cdmorris Code4Lib Southeast happening today! Live stream starting at 9:30am eastern. 
#code4libse2019 #code4lib code4libse2019 code4lib https://twitter.com/ https://pinboard.in/u:cdmorris/b:d06090cf849c/ Twitter 2019-04-12T16:27:34+00:00 https://twitter.com/i/web/status/1116739648724897792 lbjay It occurs to me the #code4lib statement of support for Chris Bourg, , offers a better model… code4lib https://twitter.com/ https://pinboard.in/u:lbjay/b:d8424d01c06f/ GitHub - code4lib/c4l18-keynote-statement: Code4Lib Community Statement in Support of Chris Bourg 2019-04-12T16:27:34+00:00 https://github.com/code4lib/c4l18-keynote-statement lbjay It occurs to me the #code4lib statement of support for Chris Bourg, , offers a better model… code4lib https://twitter.com/ https://pinboard.in/u:lbjay/b:80b4ef487c08/ Twitter 2019-03-01T18:42:32+00:00 https://twitter.com/i/web/status/1101553322773770240 jbfink Now that the #code4lib Discord is up & running, I'm contemplating leaving Slack overall, with exception for plannin… code4lib https://twitter.com/ https://pinboard.in/u:jbfink/b:c5d0f0ddd90d/ (429) https://twitter.com/palcilibraries/status/1098658932589965312/photo/1 2019-02-22T03:01:16+00:00 https://twitter.com/palcilibraries/status/1098658932589965312/photo/1 cdmorris Talking privacy and RA21 at #c4l19 with Dave Lacy from @TempleLibraries #code4lib c4l19 code4lib https://twitter.com/ https://pinboard.in/u:cdmorris/b:9f144c1c99f8/ SCOPE: An access interface for DIPs from Archivematica 2019-02-21T00:55:32+00:00 https://github.com/CCA-Public/dip-access-interface sdellis archives code4lib https://pinboard.in/ https://pinboard.in/u:sdellis/b:1489ef99d5c6/ Review, Appraisal and Triage of Mail (RATOM) 2019-02-21T00:48:05+00:00 http://ratom.web.unc.edu/ sdellis archives code4lib https://pinboard.in/ https://pinboard.in/u:sdellis/b:5cdd23154090/ National Web Privacy Forum - MSU Library | Montana State University 2019-02-20T21:36:08+00:00 http://www.lib.montana.edu/privacy-forum/ sdellis privacy analytics code4lib https://pinboard.in/ https://pinboard.in/u:sdellis/b:0b1957db96e2/ The Code4Lib Journal 2019-01-16T14:25:26+00:00 https://journal.code4lib.org/ ratledge Code4lib Library_Technology Journal Journals_Code4Lib https://pinboard.in/ https://pinboard.in/u:ratledge/b:8a9f4c764b97/ Code4Lib | We are developers and technologists for libraries, museums, and archives who are dedicated to being a diverse and inclusive community, seeking to share ideas and build collaboration. 2018-12-05T14:35:01+00:00 https://code4lib.org/ ratledge Code4lib https://pinboard.in/ https://pinboard.in/u:ratledge/b:113cfc93ccb3/ Twitter 2018-11-15T09:00:52+00:00 https://twitter.com/i/web/status/1062993826913099781 verwinv Ne'er had the pleasure to attend #Code4lib myself ... but if you're thinking about it but can't afford to go - ther… Code4lib https://twitter.com/ https://pinboard.in/u:verwinv/b:f42046813ceb/ Twitter 2018-07-26T23:19:49+00:00 https://twitter.com/justindlc/status/1022612508979355649/photo/1 LibrariesVal RT @justindlc: Pre-conference meetup at Ormsby's for Code4Lib Southeast 2018! #code4libse2018 #code4lib code4lib code4libse2018 https://twitter.com/ https://pinboard.in/u:LibrariesVal/b:465c39ad24b0/ Twitter 2018-05-26T00:10:08+00:00 https://twitter.com/i/web/status/1000167164471529477 jbfink Thanks @lydia_zv @redlibrarian and Jolene (are you on Twitter, I can find you?) for a great #code4lib day! 
It was… code4lib https://twitter.com/ https://pinboard.in/u:jbfink/b:64faa19e4bad/ Twitter 2018-05-11T14:55:10+00:00 https://twitter.com/i/web/status/994954070707273728 jbfink My slides and speakers notes from #code4lib #c4ln18 on Ursula Franklin's "Real World of Technology" (which I really… code4lib c4ln18 https://twitter.com/ https://pinboard.in/u:jbfink/b:a2ed9a40fc54/ Twitter 2018-05-10T09:27:02+00:00 https://twitter.com/i/web/status/994509105006956544 jbfink In an unfortunate timing, it appears the code4lib wiki is down the first day of #code4lib North - there's a cache o… code4lib https://twitter.com/ https://pinboard.in/u:jbfink/b:099edcfb623c/ Twitter 2018-05-08T11:57:24+00:00 https://twitter.com/i/web/status/993775574291230720 jbfink RT @kiru: Just off the (word)press: the #Code4Lib Journal issue 40 is available: . Great articles writ… Code4Lib https://twitter.com/ https://pinboard.in/u:jbfink/b:db3c0bb083a8/ The Code4Lib Journal 2018-05-08T11:57:24+00:00 http://journal.code4lib.org/ jbfink RT @kiru: Just off the (word)press: the #Code4Lib Journal issue 40 is available: . Great articles writ… Code4Lib https://twitter.com/ https://pinboard.in/u:jbfink/b:9be441405213/ Twitter 2018-03-22T18:28:45+00:00 https://twitter.com/GitWishes/status/976754075164438528 lbjay this is all of #code4lib working on @bot4lib circa 2012. code4lib https://twitter.com/ https://pinboard.in/u:lbjay/b:3e91697b52b4/ Twitter 2018-03-19T19:07:46+00:00 https://twitter.com/gmcharlt/status/975810223842713601 danbri This is fabulous news for the cultural heritage open source world. Big ups to @code4lib and @CLIRDLF! #code4lib code4lib https://twitter.com/ https://pinboard.in/u:danbri/b:8cbe22ff2f58/ Twitter 2018-03-11T20:39:02+00:00 https://twitter.com/i/web/status/972894218237743105 miaridge RT @achdotorg: We too co-sign the #code4lib Community Statement in Support of @mchris4duke. We continue to admire an honor our col… code4lib https://twitter.com/ https://pinboard.in/u:miaridge/b:cf4f6d5494e3/ code4lib/c4l18-keynote-statement: Code4Lib Community Statement in Support of Chris Bourg 2018-03-10T00:33:33+00:00 https://github.com/code4lib/c4l18-keynote-statement jbfink code4lib github https://pinboard.in/ https://pinboard.in/u:jbfink/b:12610b3f6bd6/ Code4Lib Community Statement in Support of Chris Bourg | c4l18-keynote-statement 2018-03-09T22:50:57+00:00 https://code4lib.github.io/c4l18-keynote-statement/ wragge RT @CLIRDLF: We’re proud to stand with the #code4lib community in support of #c4l18 keynoter @mchris4duke: code4lib c4l18 https://twitter.com/ https://pinboard.in/u:wragge/b:d81e2b3e7158/ Matthew Reidsma : Auditing Algorithms 2018-02-20T16:41:34+00:00 https://matthew.reidsrow.com/talks/206 malantonio
    Talks about libraries, technology, and the Web by Matthew Reidsma.
    algorithms bias search libraries technology code4lib code4lib-2018 https://pinboard.in/u:malantonio/b:7dd04c469f56/ For the love of baby unicorns: My Code4Lib 2018 Keynote | Feral Librarian 2018-02-19T17:49:48+00:00 https://chrisbourg.wordpress.com/2018/02/14/for-the-love-of-baby-unicorns-my-code4lib-2018-keynote/ petej code4lib diversity technology libraries inclusion mansplaining https://pinboard.in/ https://pinboard.in/u:petej/b:18d1e6f30875/ JIRA for archives - Google Slides 2018-02-15T14:37:38+00:00 https://docs.google.com/presentation/d/1uwYWg04-nT6Qjm-j5HAAvsoH88iKzUCAX0eFBNLcy34/edit#slide=id.g306a7ccaec_0_0 malantonio see https://youtu.be/4cNo3SERnXI?t=1h45m28s for presentation code4lib code4lib-2018 libraries work-life https://pinboard.in/u:malantonio/b:5fc7b215e268/ Twitter 2018-02-07T10:53:30+00:00 https://twitter.com/justin_littman/status/960859481914605568/photo/1 aarontay RT @justin_littman: Peer review of my #code4lib poster on "Where to get Twitter data for academic research." code4lib https://twitter.com/ https://pinboard.in/u:aarontay/b:c54955c97e7d/ Availability Calendar - Kalorama Guest House 2018-01-16T17:55:55+00:00 https://secure.rezovation.com/Reservations/AvailabilityCalendar.aspx?s=UT57fw2WiD skorasaurus KALORAMA GUEST HOUSE CODE4LIB https://pinboard.in/ https://pinboard.in/u:skorasaurus/b:10f300ea6594/ (429) https://twitter.com/i/web/status/941746243352563712 2017-12-20T21:22:21+00:00 https://twitter.com/i/web/status/941746243352563712 DocDre RT @nowviskie: ICYMI: #Code4Lib 2018 registration is open! @mmsubram & @mchris4duke to keynote, reception in the Great Hall… Code4Lib https://twitter.com/ https://pinboard.in/u:DocDre/b:9e19136f92cb/ (429) https://twitter.com/freethefiles/status/938843684572889090/photo/1 2017-12-07T18:52:31+00:00 https://twitter.com/freethefiles/status/938843684572889090/photo/1 verwinv Yay! I'm presenting at #code4lib. And I can say hello to Walter Forsberg, @hbmcd4 and @cristalyze! code4lib https://twitter.com/ https://pinboard.in/u:verwinv/b:86bf904d3371/ (429) https://twitter.com/i/web/status/938488557911576576 2017-12-06T19:21:23+00:00 https://twitter.com/i/web/status/938488557911576576 verwinv Registration for #code4lib is now open! And its being held in #WashingtonDC where our #MemoryLab is - so come visit… WashingtonDC code4lib MemoryLab https://twitter.com/ https://pinboard.in/u:verwinv/b:19769bc2fa8c/ code4lib 2018 - Washington, D.C. 2017-11-13T23:02:58+00:00 http://2018.code4lib.org/ verwinv Last day to vote #code4lib 2018 program! don't forget 😓! code4lib https://twitter.com/ https://pinboard.in/u:verwinv/b:1efcaa1db5a7/ 2018 Presentation Voting Survey 2017-10-23T19:49:45+00:00 https://www.surveymonkey.com/r/c4l2018-presentations verwinv vote #code4lib proposals rather than the presenters. new anonymity feature! check it: Got until 11/13 code4lib https://twitter.com/ https://pinboard.in/u:verwinv/b:81a15e672b49/ LODLAM Challenge Winners 2017-06-29T14:06:06+00:00 https://summit2017.lodlam.net/2017/06/29/lodlam-challenge-winners/ miaridge RT @LODLAM: #LODLAM Challenge prize winners congrats to DIVE+ (Grand) & WarSampo (Open data) teams #DH #musetech #code4lib DH musetech LODLAM code4lib https://twitter.com/ https://pinboard.in/u:miaridge/b:c6429902bd26/ JobBoard 2017-05-11T16:33:41+00:00 https://jobs.code4lib.org/ lbjay Some heroes don't wear capes, y'all. 
back online and and better than ever thanks to @ryanwick and @_cb_ #code4lib code4lib https://twitter.com/ https://pinboard.in/u:lbjay/b:a7f06f02b03e/ Digital Technologies Development Librarian | NCSU Libraries 2017-05-08T12:56:52+00:00 https://www.lib.ncsu.edu/jobs/ehra/digital-technologies-development-librarian jbfink RT @ronallo: Job opening: Digital Technologies Development Librarian @ncsulibraries #code4lib #libtechwomen Know someone? libtechwomen code4lib https://twitter.com/ https://pinboard.in/u:jbfink/b:3a5951bff6fd/ Who's Using IPFS in Libraries, Archives and Museums - Communities / Libraries, Archives and Museums - discuss.ipfs.io 2017-04-19T20:32:44+00:00 https://discuss.ipfs.io/t/whos-using-ipfs-in-libraries-archives-and-museums/130 sdellis career ipfs libraries code4lib https://pinboard.in/ https://pinboard.in/u:sdellis/b:df848f7bc65b/ Scott W. H. Young on Twitter: "Slides for my talk on participatory design with underrepresented populations. Thank you, #c4l17 :) https://t.co/rVS2Zdv25u" 2017-04-02T17:47:47+00:00 https://twitter.com/hei_scott/status/839523334744236033 brainwane refers to my Code4Lib keynote on empathy & UX yay Code4Lib https://pinboard.in/ https://pinboard.in/u:brainwane/b:4c2ef624cde4/ Twitter 2017-03-23T12:45:58+00:00 https://twitter.com/i/web/status/844892979965890560 lbjay Have not read the full report but based on the abstract seems useful to those involved in the #code4lib incorporati… code4lib https://twitter.com/ https://pinboard.in/u:lbjay/b:96f82f0b17b3/ ResistanceIsFertile - Google Drive 2017-03-09T18:06:41+00:00 https://drive.google.com/drive/folders/0B74oOQcTdnHjMy1WN003ZW5HTXc pmhswe code4lib harlow keynote https://pinboard.in/u:pmhswe/b:1760658453c2/ ResistanceIsFertile - Google Drive 2017-03-09T17:22:43+00:00 https://drive.google.com/drive/folders/0B74oOQcTdnHjMy1WN003ZW5HTXc markpbaggett code4lib harlow keynote https://pinboard.in/ https://pinboard.in/u:markpbaggett/b:cffeeb1e58e6/ Google Drive CMS 2017-03-09T16:08:39+00:00 https://www.drivecms.xyz/ jju webdev programming tech 2017 Code4Lib https://pinboard.in/u:jju/b:f9af0e34a8a0/ Code4Lib | Docker Presentation - Google Slides 2017-03-08T19:48:57+00:00 https://docs.google.com/presentation/d/12P1pR3p67dXIKXJWE5_sHa-RSktax-hzquo-Ffz-TH0/edit#slide=id.p markpbaggett code4lib docker https://pinboard.in/ https://pinboard.in/u:markpbaggett/b:bd340aec487e/ Best Catalog Results Page Ever 2017-03-08T19:05:25+00:00 https://www.dropbox.com/s/jbxe4jpbdck874z/deibel-c4l17-best-ever.pptx markpbaggett code4lib accessibility presentation https://pinboard.in/ https://pinboard.in/u:markpbaggett/b:56f2b0fea47a/ Participatory User Experience Design with Underrepresented Populations: A Model for Disciplined Empathy 2017-03-08T18:09:13+00:00 http://2017.code4lib.org/talks/Participatory-User-Experience-Design-with-Underrepresented-Populations-A-Model-for-Disciplined-Empathy brainwane Am honored & humbled to see #c4l17 Glad my talk/article was helpful! Wish I were at #code4lib to thank you in person c4l17 code4lib https://twitter.com/ https://pinboard.in/u:brainwane/b:9bf7ebd61d5d/ Twitter 2017-02-01T14:02:05+00:00 https://twitter.com/i/web/status/826792743464792065 bsscdt Why don't you join us in the #libux slack? 
Sign yourself up: #litaux #ux #code4lib… ux libux litaux code4lib https://twitter.com/ https://pinboard.in/u:bsscdt/b:961f3bd08a75/ Untitled (http://libux.co/slack?utm_content=buffer0f822&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer) 2017-02-01T14:02:05+00:00 http://libux.co/slack bsscdt Why don't you join us in the #libux slack? Sign yourself up: #litaux #ux #code4lib… ux libux litaux code4lib https://twitter.com/ https://pinboard.in/u:bsscdt/b:839a04bf9612/ Twitter 2016-12-23T18:11:41+00:00 https://twitter.com/jschneider/status/812360040082456576/photo/1 jcarletonoh Ten Principles for User Protection: #code4lib #privacy #ISCHOOLUI ISCHOOLUI privacy code4lib https://twitter.com/ https://pinboard.in/u:jcarletonoh/b:3bf57dea160b/ Technology in Hostile States: Ten Principles for User Protection | The Tor Blog 2016-12-23T18:11:41+00:00 https://blog.torproject.org/blog/technology-hostile-states-ten-principles-user-protection jcarletonoh Ten Principles for User Protection: #code4lib #privacy #ISCHOOLUI ISCHOOLUI privacy code4lib https://twitter.com/ https://pinboard.in/u:jcarletonoh/b:729712aebf8a/ Analyzing MARC with MicroXPath, part 1 - U. Ogbuji on the 1s & 2sies 2016-11-03T18:45:01+00:00 http://uogbuji.tumblr.com/post/152693143951/analyzing-marc-with-microxpath-part-1#_=_ uche Analyzing MARC with MicroXPath, part 1 #XML #XPath #libraries #code4lib XPath XML libraries code4lib https://twitter.com/ https://pinboard.in/u:uche/b:18f29fca2a83/ Library Technology Jobs 2016-11-03T12:34:08+00:00 http://librarytechnology.org/jobs/ jbfink RT @yo_bj: 2/2 For the #code4lib, #lita, and #mashcat crowds, keep an eye out on for #libtech jobs. libtech mashcat lita code4lib https://twitter.com/ https://pinboard.in/u:jbfink/b:ad699235bba2/ 2017 Keynote Speakers Nominations - Code4Lib 2016-10-11T16:10:19+00:00 http://wiki.code4lib.org/2017_Keynote_Speakers_Nominations verwinv Do you know who should keynote #Code4Lib 2017? Help us out: #c4l17 Code4Lib c4l17 https://twitter.com/ https://pinboard.in/u:verwinv/b:fe39b74c928e/ Library of Congress LCCN Permalink sh2016001442 2016-09-15T13:02:28+00:00 https://lccn.loc.gov/sh2016001442 anneheathen RT @JulieSwierczek: #code4lib #c4l16 - "Black Lives Matter movement" is now a SUBJECT HEADING. . Catalogers, make sure you USE IT! c4l16 code4lib https://twitter.com/ https://pinboard.in/u:anneheathen/b:3de84d0358ff/ femsom-org-3527 ---- Hawa Feminist Coalition – Coalition of Young Feminists in Somalia Skip to content News Home     |     Hawa Feminist Coalition Coalition of Young Feminists in Somalia EMAIL info@femsom.org CALL NOW 907 483965 Donate Menu Home About Us Our Vision Our Mission Our Team Our Members Our Governance Structure Our Work Advocacy & Awareness Rising Leadership Development & Empowerment Collective Action and Feminist Movement Building Publications Blog Join Us Contact Us Coalition of young feminists working to promote the safety, equality, justice, rights and dignity of girls and young women in Somalia. Join Us! We strive to providing a brighter future for our sisters in Somalia. Join Us! We do mobilize collective action and meaningful ways of working with each other…. Join Us! 
Home Who We Are About Us Hawa Feminist Coalition was founded by young feminists, all under the age of 35, in 2018 with the aim of promoting the safety, equality, justice, rights and dignity of girls and young women in Somalia, where women and girls bear an unequal brunt of hardships exacerbated by poverty, conflict, and religious and cultural limitations that promote strict male authority. Read More Our Vision Somalia where gender equality is achieved and women and girls enjoy all their rights and live in dignity. Read More Our Mission Mobilisation of Somali young women and girls for the achievement of gender equality and the realisation of women’s and girls’ rights at all levels, so that they enjoy all their rights and live in dignity. Read More We are a coalition of young feminists, all under the age of 35, standing for the promotion of the safety, equality, justice, rights and dignity of girls and young women in Somalia. JOIN US! If you are interested in being part of a collective feminist movement, join us! What We Do Collective Action & Feminist Movement Building We mobilize and strengthen feminist-based collective actions and practice meaningful ways of working with each other to be visible, strong and diverse enough to result in concrete and sustainable change in achieving gender equality in Somalia. Read More Leadership Development & Empowerment We provide capacity development and empowerment for our members and feminist grassroots groups to build stronger grassroots movements with the confidence, information, skills and strategies they need to bring dramatic changes in norms, laws, policies and practices toward achieving gender equality in Somalia. Read More Advocacy & Awareness Rising We use the influence of art, music, culture, poetry, social media, and feminist activism to promote the safety, equality, justice, rights and dignity of girls, young women and other marginalized groups. Read More latest news Statement: Hawa Feminist Coalition condemns the killing of two sisters in Mogadishu Hawa Feminist Coalition condemns the deaths of Fahdi Adow Abdi and Faiza Adow Abdi in Mogadishu on the night of April 22, 2021, after a mortar landed in their house ... Read More Hawa Feminist Coalition advocates the promotion of sex-disaggregated data in commemoration of Open Data Day 2021 In commemoration of Open Data Day, an annual celebration held all over the world in the first week of March every year, Hawa Feminist Coalition organized an online ... Read More Join us in promoting sex-disaggregated data in commemoration of Open Data Day 2021 Open Data Day is an annual celebration of open data all over the world. Groups from around the world create local events on the day where they will use open ... Read More Awareness raising on the rise of domestic violence amid the COVID-19 health crisis in Puntland Gender-based violence (GBV) increases during every type of emergency – whether economic crises, conflict or disease outbreaks. Pre-existing toxic social norms and gender inequalities, economic and social stress caused by ... Read More STATEMENT: Hawa Feminist Coalition condemns arbitrary arrest of Daljir journalists for reporting on GBV cases We understand that reporting on these topics is a difficult task, and we appreciate the media’s commitment to doing so with integrity. We strongly condemn the arbitrary arrest of these journalists ... Read More STATEMENT: Call for Immediate Action on the Rape and Murder of an Innocent Girl in Bosaso Very Sad!
The body of a young girl who had been horrifically murdered was found lying on a Bosaso street. The young girl appears to have been raped and then murdered, her face beaten terribly, ... Read More Our Team Jawahir A. Mohamed Executive Director Mariam M. Hussein Head of Operations Kowsar Abdisalam Guled Head of Programs Linda S. Mohamed Membership Officer Learn More Our Partners Who we are Coalition of Young Feminists in Somalia, all under the age of 35, working to promote the safety, equality, justice, rights and dignity of girls and young women in Somalia. Our Office Hawa Feminist Coalition HQ Laanta Hawada, Airport Road 500 Bosaso Puntland Somalia Email: info@femsom.org Tel: +252 907 483965 Join Us! If you are a young girl under the age of 35 interested in being part of a collective feminist movement working to promote the safety, equality, justice, rights and dignity of girls and young women in Somalia, join us. forum2021-diglib-org-4568 ---- Home - DLF Forum 2021 Skip to content Home About Code of Conduct CoC Reporting Form Thank You Resources News Affiliated Events Learn@DLF NDSA’s #DigiPres21 Sponsors Sponsorship Opportunities Registration CFP A world-class marketplace of ideas for digital GLAM practitioners since 1999 What's the DLF Forum? DLF programs stretch year-round, but we are perhaps best known for our signature event, the annual DLF Forum. The DLF Forum welcomes digital library, archives, and museum practitioners from member institutions and beyond—for whom it serves as a meeting place, marketplace, and congress. Learn about the event and plan to attend Attend our Affiliated Events! NDSA's Digital Preservation 2021 November 4 Digital Preservation is the annual conference of the National Digital Stewardship Alliance. DigiPres is expected to be a crucial venue for intellectual exchange, community-building, development of best practices, and national-level agenda-setting in the field. Learn@DLF November 8-10 Now in its fourth year, Learn@DLF returns in 2021 with engaging, hands-on sessions where attendees will gain experience with new tools and resources, exchange ideas, and develop and share expertise with fellow community members, as well as short tutorials about specific tools, techniques, workflows, or concepts. Make an Impact! Sponsor the DLF Forum and NDSA's #DigiPres21 What makes the DLF Forum great? After nearly 14 years of academic library experience and subsequently participating in no less than 25 conferences, I can say that the DLF Forum was the most progressive and enlightening conference that I have ever attended. It was downright empowering. Ana Ndumu 2017 DLF Forum Fellow The thoughtful way the experience was designed was due to the efforts of the organizers...As a first-time participant, I am grateful to have been able to participate in this year’s virtual Forum and look forward to continuing to learn from the DLF community! Betsy Yoon 2020 DLF Forum Community Journalist Forum Updates 2021 DLF Forum, DigiPres, and Learn@DLF Calls for Proposals April 8, 2021 We’re delighted to share that it’s CFP season for CLIR’s annual events. Based on community feedback, we’ve made the decision… Read more Want Forum news?
Subscribe to our newsletter to stay informed! forum2021-diglib-org-5902 ---- Call for Proposals - DLF Forum 2021 Call for Proposals 2021 DLF Forum & Learn@DLF Call for Proposals CLIR’s Digital Library Federation invites proposals for the 2021 DLF Forum (November 1-3) and Learn@DLF (November 8-10), our workshop series, both held online this year. A separate call will be issued for Digital Preservation 2021, the annual conference of the NDSA (November 4). The Forum is a meeting place, a marketplace, and a congress for digital library practitioners from DLF member institutions and the broader community. Now that our events will take place virtually for a second time, we look forward to new and better ways to come together—as always, with community at the center. Therefore, our guiding focus for this year’s Forum is sustaining our community. Relentless innovation, disruptive change, and constant demands on our time and energy rarely allow for a pause to assess how we got here. Sustenance comes in many forms and while it allows for growth, it is also an end in itself. How can we then shift our focus to prioritize the sustaining and nurturing of ourselves and our communities while still pushing for greater openness and inclusivity? Pervasive racism persists and contributes to wrenching inequalities in the United States, especially among our Black, Indigenous, and People of Color (BIPOC) communities. CLIR has long recognized this inequity; diversity, social justice, and broad access to cultural heritage have been integral to our mission. In 2021, we reaffirm our commitment to pursuing greater equity and justice throughout the DLF Forum, working with our entire community toward an inclusivity that prizes the chorus of diverse voices needed for systemic change. As such, the planning committee will again prioritize submissions from BIPOC people and people working at Historically Black Colleges and Universities (HBCUs) and other BIPOC-centered libraries, archives, and museums. We therefore have self-identification options in the proposal submission form. For all events, we encourage proposals from DLF members and non-members; regulars and newcomers; digital library practitioners and those in adjacent fields such as institutional research and educational technology; and students, early-career professionals and senior staff alike. Proposals to more than one event are permitted, though please submit different proposals for each. Our Events The DLF Forum will take place Monday, November 1 through Wednesday, November 3, 2021. Digital Preservation 2021: Embracing Digitality will take place on Thursday, November 4, 2021.
More information on that event can be found here: https://ndsa.org/conference/ Learn@DLF is a series of workshops offered the week after the DLF Forum, November 8-10, 2021. About Presenting Accepted presentations and panels will be delivered via pre-recorded video. This format allows for flexible watch times and speeds, captioning, and avoids many technical challenges. Videos must be submitted by Wednesday, September 15. Presenters will receive support in the form of tutorials, resources, and individual assistance. Presenters will be expected to be in attendance and available during their presentation time for live Q&A (chat-based or video, format TBD). To make space for as many voices as possible, individuals may present only once on the Forum program. The DLF Forum is explicitly designed to enact and support the DLF community’s values, and we strive to create a safe, accessible, welcoming, and inclusive event that reflects our Code of Conduct. Submissions & Evaluation Based on community feedback and the work of our Program Committee, we welcome submissions geared toward a practitioner audience that: Clearly engage with DLF’s mission of advancing research, learning, social justice, and the public good through the creative design and wise application of digital library technologies Activate and inspire participants to think, make, and do Engage people from different backgrounds, experience levels, and disciplines Include clear take-aways that participants can implement in their own work Submission Formats Sessions are invited in the following lengths and formats: At the DLF Forum, November 1-3: 45-minute Panels: A panel discussion of three to four speakers on a unified topic, with an emphasis on the discussion. A maximum of four speakers is allowed per submission. Proposals with representative and inclusive speaker involvement will be favored by the committee, and all-male-identifying panels will not be accepted. The main goals of the panel format at the DLF Forum are to bring together diverse perspectives on a topic and to encourage a community discussion of panelists’ approaches or findings. 15-minute Presentations: A presentation by one to two speakers on a single topic or project. A maximum of two speakers is allowed per submission. Presentations will be grouped by the program committee based on overarching themes or ideas. 5-minute Lightning Talks: High-profile, high-energy lightning talks held in plenary, with the opportunity to point attendees to contact information and additional materials online. No more than two speakers are allowed per submission. 25-minute Birds of a Feather (BOAF) Sessions: Working on a project on which you’d like feedback? Have a question you want to ponder with other interested people? New this year, 25-minute BOAF sessions are live video discussion sections where folks can discuss a topic of the proposer’s choice. These are roundtables where ideas can be shared and questions can be asked in the spirit of shared knowledge.   At Learn@DLF, November 8-10: 90-minute Workshops: Live, in-depth, hands-on training sessions on specific tools, techniques, workflows, or concepts. All workshop organizers are asked to provide details on technology needed, participant proficiency level, and learning outcomes for participants. Workshops must be interactive and inclusive, and the strongest proposals will demonstrate this clearly. Interested in presenting something longer? Consider submitting a ‘part I’ (morning session) and ‘part II’ (afternoon session). 
10-15-minute Tutorials: Pre-recorded training sessions or demonstrations between 10 and 15 minutes in length about specific tools, techniques, workflows, or concepts. Proposal Requirements Proposal title Submission format and event: Varies by event First and last names, organizational affiliations, and email addresses for all authors / presenters Abstract (50 words max) Proposal (250 words max for all formats except for panels and workshops, up to 500 words) Five keywords for your proposal Submit using our online system: bit.ly/2021CLIRcfps. Submit your proposal THE DEADLINE FOR ALL PROPOSALS IS MONDAY, MAY 17, 2021, AT 11:59PM EASTERN TIME. As in previous years, all submissions will be peer reviewed. Broader DLF community input will also be solicited through an open community voting process, which will inform the Program Committee’s final decisions. Selected presenters will be notified over the summer and will have a minimum of four weeks to prepare their recordings. We are still looking for sponsors for this year’s events! If you or someone you know may be interested, check out our sponsorship opportunities or contact us. Questions? You can reach us at forum@diglib.org. forum2021-diglib-org-7292 ---- Learn@DLF - DLF Forum 2021 Join us for Learn@DLF November 8-10, 2021 To cultivate creative training and professional development opportunities stemming from our past three successful DLF Forum Pre-Conferences as well as our series of video tutorials from last year’s first-ever virtual DLF Forum, we are excited to host Learn@DLF the week immediately following the DLF Forum and NDSA’s Digital Preservation 2021 on Monday-Wednesday, November 8-10, 2021. Stay tuned for updates on Learn@DLF offerings. Share your experiences on Twitter with #LearnAtDLF! forum2021-diglib-org-9655 ---- Home - DLF Forum 2021
freerangelibrarian-com-7953 ---- Free Range Librarian › K.G. Schneider's blog on librarianship, writing, and everything else About Free Range Librarian Comment guidelines Writing: Clips & Samples (Dis)Association Monday, May 27, 2019 Walking two roses to their new home, where they would be planted in the front yard.
I have been reflecting on the future of a national association I belong to that has struggled with relevancy and with closing the distance between itself and its members, has distinct factions that differ on fundamental matters of values, faces declining national and chapter membership, needs to catch up on the technology curve, has sometimes problematic vendor relationships, struggles with member demographics and diversity,  and has an uneven and sometimes conflicting national message and an awkward at best relationship with modern communications; but represents something important that I believe in and has a spark of vitality that is the secret to its future. I am not, in fact, writing about the American Library Association, but the American Rose Society.  Most readers of Free Range Librarian associate me with libraries, but the rose connection may be less visible. I’ve grown roses in nine places I’ve lived in the last thirty-plus years, starting with roses planted in front of a rental house in Clovis, New Mexico, when I was stationed at Cannon Air Force Base in the 1980s, and continuing in pots or slices of garden plots as I moved around the world and later, the United States. Basically, if I had an outdoor spot to grow in, I grew roses, either in-ground or in pots, whether it was a slice of sunny backyard in Wayne, New Jersey, a tiny front garden area in Point Richmond, California, a sunny interior patio in our fake Eichler rental in Palo Alto, or a windy, none-too-sunny, and cold (but still much-appreciated) deck in our rental in San Francisco. When Sandy and I bought our sweet little house in Santa Rosa, part of the move involved rolling large garden pots on my Radio Flyer from our rental two blocks away. Some of you know I’m an association geek, an avocation that has waxed as the years have progressed. I join associations because I’m from a generation where that’s done, but another centripetal pull for staying and being involved is that associations, on their own, have always interested me. It’s highly likely that a long time ago, probably when I was stationed in New Mexico and, later, Germany (the two duty stations where I had the ability to grow roses), that I was a member of the American Rose Society for two or three years. I infer this because I accumulated, then later recycled, their house magazine, American Rose, and I also have vague memories of receiving the annual publication, Handbook for Selecting Roses. Early this year I joined the Redwood Empire Rose Society and a few weeks after that joined the American Rose Society. I joined the local society because I was eager to plant roses in our new home’s garden and thought this would be a way to tap local expertise, and was won over by the society’s programming, a range of monthly educational events that ranged from how to sharpen pruning shears to the habits and benefits of bees (a program where the audience puffed with pride, because roses--if grown without toxic chemical intervention–are highly beneficial bee-attracting pollen plants). I joined the national society less out of need than because I was curious about what ARS had to offer to people like me who are rose-lovers but average gardeners, and I was also inquisitive about how the society had (or had not) repositioned itself over the years. My own practices around rose gardening have gradually changed, reflecting broader societal trends. Thirty years ago, I was an unwitting cog in the agricultural-industrial rose complex. 
I planted roses that appealed to my senses — attractive, repeat-blooming, and fragrant — and then managed their ability to grow and produce flowers not only through providing the two things all roses need to grow– sun and water — but also through liberal applications of synthetic food and toxic pest and disease products. The roses I purchased were bred for the most part with little regard for their ability to thrive without toxic intervention or for their suitability for specific regions. Garden by garden, my behavior changed. I slowly adopted a “thrive or die” mantra. If a rose could not exist without toxic chemical interventions, then it did not belong in my garden, and I would, in rosarian parlance, “shovel-prune” it and replace it with a rose that could succeed with sun, water, good organic food and amendments, and an occasional but not over-fussy attention. Eventually, as I moved toward organic gardening and became more familiar with sustainability in general, I absorbed the message that roses are plants, and the soil they grow in is like the food I put in my body: it influences their health. So I had the garden soil tested this winter while I was moving and replacing plants, digging holes that were close to two feet wide and deep. Based on the test results, I adjusted the soil accordingly: I used organic soil sulphur to lower the ph, dug in slow-release nitrogen in the form of feathermeal, and bathed the plants in a weak solution of organic liquid manganese. As I now do every spring, when it warmed up a bit I also resumed my monthly treatment of fish fertilizer, and this year, based on local rose advice, in a folksier vein dressed all the bushes with organic worm castings and alfalfa, both known to have good fertilizing capabilities. Alfalfa also has a lot of trace nutrients we know less about but appear to be important. Princesse Charlene de Monaco, hybrid tea rose bred by Meilland Guess what? Science is real! Nearly all of the rose bushes are measurably larger and more vigorous. Carding Mill, a David Austin rose, went from a medium shrub to a flowering giant. New roses I planted this spring, such as Grand Dame and Pinkerbelle, are growing much more vigorously than last year’s new plantings. Some of this is due to the long, gloomy, wet winter, which gave roses opportunities to snake their long roots deeper into the good soil we have in Sonoma County; my friends are reporting great spring flushes this year. But roses planted even in the last six weeks, such as Princesse Charlene de Monaco and Sheila’s Perfume, are taking off like a rocket, so it’s not just the rain or the variety. (You do not need to do all this to grow roses that will please you and your garden visitors, including bees and other beneficial insects. I enjoy the process. The key thing is that nearly all of my roses are highly rated for disease resistance and nearly all are reported to grow well in our region.) Science–under attack in our national conversations–is also an area of conflict within the ARS. Presidents of the ARS have three-year terms, and the previous president, Pat Shanley, was an advocate of sustainable rose growing. She spoke and wrote about the value of organic gardening, and championed selecting varieties that do not require toxic intervention to thrive. The theme of the 2018 American Rose Annual was “Roses are for Everyone,” and this Annual is a fascinating look at the sustainable-gardening wing of the ARS. 
Most of the articles emphasized the value of what Paul Zimmerman, a rose evangelist, calls “garden roses,” flowers that everyday people like you and me can grow and enjoy. The message in this Annual is reinforced by recent books by longtime rose advocates and ARS members, such as Peter Kukielski’s Roses without Chemicals and Zimmerman’s Everyday Roses, books I highly recommend for library collections as well as personal use. (Roses without Chemicals is a book I use when I wake up at odd hours worried about things, because it is beautifully written and photographed and the roses are listed alphabetically.) Now the ARS has a new president, Bob Martin, a longtime exhibitor, who in editorials has promoted chemical intervention for roses. “And yes Virginia we do spray our roses,” he wrote in the March/April “First Word” editorial in American Rose, the house organ of the ARS. “As does nearly every serious rose exhibitor and those who want their rose bushes to sustainably produce the most beautiful blooms [emphasis mine].” American Rose does not appear to publish letters to the editor. There is no section listed for letters that I can find in any recent issue, and the masthead only lists a street address for “member and subscription correspondence.” Otherwise, I would write a short letter protesting the misuse of the term “sustainably,” as well as the general direction of this editorial. I am a rose amateur, and make no bones about it. But I know that equating chemical spraying with sustainability is, hands-down, fake news. It’s one thing to soak roses in toxins and call it a “health maintenance” program, as he does in this article. That’s close to the line but not over it, since he’s from the exhibitors’ wing of ARS. But it’s just plain junk science to claim that there is anything connected to sustainability about this approach. I also can’t imagine that this “toxins forever” message is attracting new ARS members or encouraging them to renew. It feels disconnected from what motivates average gardeners like me to grow roses today (to enjoy them in their gardens) and from how they want to grow them today (in a manner that honors the earth). Frankly, one of the happiest moments in my garden last year was not from personal enjoyment of the flowers or even the compliments of neighbors and passers-by, but when I saw bees doing barrel-rolls in the stamens of my roses, knowing that I was helping, not hurting, their survival. The vast majority of people buying and planting roses these days have no idea there is a single-plant society dedicated to this plant, much less that this society believes it understands their motivations for and interest in roses. My environmental scan of the literature and the quantities of roses provided by garden stores make me suspect that many people buy roses based on a mix of personal recommendations, marketing guidance (what the vendors are promoting), and what they remember from their family gardens. (I would love to learn there had been market research in this area; vendors may have taken this up.) For average gardeners, their memories include roses such as Peace and Mr. Lincoln, which were bred in the middle of the last century, when the focus was not on disease resistance but on producing the hourglass hybrid tea shape that became the de facto standard for exhibiting.
We can get sentimental about roses from the late 20th century, but many of these varieties also helped perpetuate the idea that roses are hard to grow, despite the many varieties that grew just fine for thousands of years (or in the case of Excellenz von Schubert, which I planted this year, 110 years and counting). Market persuasion continues today; vendors tempt buyers through savvy marketing plans such as the Downton Abbey rose series from Weeks or David Austin’s persistent messaging about “English” roses. Note — I own a lovely rose from the Downton Abbey line, Violet’s Pride, that is quite the garden champ, and have three David Austin roses (Carding Mill, Munstead Wood, and Gentle Hermione). I’m just noting market behavior. It is well-documented in rose literature that the rose that seems to have shaken the ARS to the core is the Knockout series, which introduced maintenance-free roses to a generation short on time and patience and increasingly invested in sustainable practices throughout their lives, including their gardens. Again, smart marketing was part of the formula, because there always have been sustainable roses, and some companies, such as Kordes, moved to disease-resistant hybridizing decades ago. But the Knockout roses were promoted as an amazing breakthrough. (It may help to know that new varieties of roses have 20-year patents during which propagation is legal only through license. I don’t begrudge hybridizers their income, given how much work–sometimes thousands of seedlings–goes into producing a single good rose, but this does factor into how and why roses are marketed.) You don’t need a certificate as a master gardener or membership in a rose society to grow Knockout roses or newer competitors such as the Oso Easy line. You don’t really need to know anything about roses at all, other than roses grow in sun, not shade, and appreciate water. You also don’t need to spray Knockout roses with powerful fungicides to prevent blackspot and mildew. Regardless of the public’s reaction to easy-to-grow roses, the rose world’s reception of the Knockout rose was mixed, to use an understatement. Though the Knockout rose was the 2004 ARS members’ choice rose, rumblings abounded, and Knockout was even blamed in popular literature as a vector for the rose rosette virus (RRV), though this was later debunked. Fifty years ago RRV was observed in a number of rose varieties, long before the Knockout rose appeared. (This mite-spread virus was deliberately spread in the United States to control a pest rose, Rosa multiflora, which was itself introduced before anyone realized what havoc it would wreak.) Again, I’m no scientist, but I would think the appearance of RRV in “domesticated” roses was inevitable, regardless of which rose variety was first identified by name as carrying this disease. Rose hybridizing is now catching up with the public’s interests and the wider need for roses with strong disease resistance. Rose companies prominently tout disease resistance and many new varieties can be grown toxin-free. I selected Princesse Charlene de Monaco in part because it medaled as best hybrid tea in the 2018 Biltmore International Rose Trials, for which roses must perform well in terms of vigor and disease resistance as well as aesthetic qualities. There were companies such as Kordes that walked this walk before it was fashionable, but in typical change-adoption fashion, other vendors are adapting their own practices, because the market is demanding it.
But association leadership is driven by different goals than that of for-profit companies. A colleague of mine, after sharing his support for my successful run for ALA Executive Board, commented that it takes expertise to run a $50 million organization–skills not everyone has in equal abundance. My further reflection is that the kind of leadership we need at any one time is also unique to that moment, though–with absolutely no aspersions on our current crop of excellent leaders in ALA–historically, we have not always selected leadership for either general expertise or current needs, an issue hardly unique to ARS or ALA. So I watch the ARS seesaw. As just one more example, recently I read, within the same ARS email newsletter, an article touting the value of lacewings for insect management, followed by an article about the value of chemical interventions that I know are toxic to beneficial insects. These aren’t just contradictory ideas; they are contradictory values, contradictory messages, and contradictory branding. And these conflicting messages are evident even before we look at the relationship between the national association and local societies (organized differently than ALA chapters but with similar intent). If I had to deduce the current priorities for ARS from its magazine, website, and email newsletters, the top priority would be the renovation of the ARS garden in Shreveport. The plan to update the 84-year-old “national rosarium” makes sense, if you like rose gardens, but it sounds more like a call to the passionate few than to the general public. It’s hard to infer other priorities when website sections such as “Cyber Rosarian” invite members to ask questions that then go unanswered for over a year. The section called “Endorsed Products” is its own conflicted mix of chemical interventions, artificial fertilizers, and organic rose food. The website section on rose preservation–a goal embedded in the ARS mission statement, “The American Rose Society exists to promote the culture, preservation and appreciation of the Rose”–is a blank page with a note it is under construction. A section with videos by Paul Zimmerman is useful, but the rose recommendations by district are incomplete, and also raise the issue that ARS districts are organized geopolitically, not by climate. A rose suited for the long dry summers of Sonoma County may not do as well in Maui. The ARS “Modern Roses” database has value, listing over 37,000 cultivars. But if I want insight into a specific rose, I use Helpmefind.com, which despite its generic name and rustic interface is the de facto go-to site for rose information, questions, and discussion, often in the context of region, climate, and approaches to sustainability. I pay a small annual fee for premium access, in part to get HMF’s extra goodies (advanced search and access to lineage information) but primarily because this site gives me value and I want to support their work. Though I couldn’t find data on the ARS website for membership numbers in national, district, or local societies, I intuit membership overall is declining. It is in our local society, where despite great programming in a region where many people grow roses, I am one of the younger members. Again, there are larger forces at work with association membership, but pointing to those forces and then doing business as usual is a recipe for slow death. Interestingly, the local rose society is aware of its challenges and interested in what it might mean to reposition itself for survival.
Most recently, we founded a Facebook group that anyone could join (look for Redwood Empire Rose Society). But the society doesn’t have very much time, and a Facebook group isn’t a magic bullet. To loop back to ALA for a moment: I can remember when the response to concerns about membership decline was that the library field was contracting as a whole and association membership was also less popular in general. But these days, ALA is invested in moving past these facts and asking, what then? ALA is willing to change to survive. And I believe that is why ALA will be around 100 years from now, assuming we continue to support human life on this continent. As I ponder all this, deep in my association geekiness, I’m left with these questions: if the ARS can’t save itself, who will be there for the roses? Will the ad hoc, de facto green-garden rosarians form a new society, will they simply soldier on as a loose federation, or will the vendors determine the future of roses? Have rose societies begun talking about strategic redirection, consolidation, and other new approaches? Does the ARS see itself as a change leader? Where does the ARS see itself in 25 years? Am I just a naive member in the field, totally missing the point, or is there something to what I’m observing, outside the palace walls? I’ve been writing this off and on for months. It’s Memorial Day and it’s now light enough outside to wander into our front yard, pruners and deadheading bucket in hand, iPhone in my pocket so I can share what bloomed while I slept. Over time I changed how I grow roses, but not why I grow roses. Somewhere in there is an insight, but it’s time to garden. Bookmark to: Filed in Uncategorized | | Comments Off on (Dis)Association I have measured out my life in Doodle polls Wednesday, April 10, 2019 You know that song? The one you really liked the first time you heard it? And even the fifth or fifteenth? But now your skin crawls when you hear it? That’s me and Doodle. In the last three months I have filled out at least a dozen Doodle polls for various meetings outside my organization. I complete these polls at work, where my two-monitor setup means I can review my Outlook calendar while scrolling through a Doodle poll with dozens of date and time options. I don’t like to inflict Doodle polls on our library admin because she has her hands full enough, including managing my real calendar. I have largely given up on earmarking dates on my calendar for these polls, and I just wait for the inevitable scheduling conflicts that come up. Some of these polls have so many options I would have absolutely no time left on my calendar for work meetings, many of which need to be scheduled on fairly short notice. Not only that, I gird my loins for the inevitable “we can’t find a date, we’re Doodling again” messages that mean once again, I’m going to spend 15 minutes checking my calendar against a Doodle poll. I understand the allure of Doodle; when I first “met” Doodle, I was in love. At last, a way to pick meeting dates without long, painful email threads! But we’re now deep into the Tragedy of the Doodle Commons, with no relief in sight. Here are some Doodle ideas–you may have your own to toss in. First, when possible, before Doodling, I ask for blackout dates. That narrows the available date/time combos and helps reduce the “we gotta Doodle again” scenarios. Second, if your poll requires more than a little right-scrolling, reconsider how many options you’re providing.
A poll with 40 options might as well be asking me to block out April. And I can’t do that. Third, I have taken exactly one poll where the pollster chose to suppress other people’s responses, and I hope to never see that again. There is a whole gaming side to Doodling in which early respondents get to drive the dates that are selected, and suppressing others’ responses eliminates that capability. Plus I want to know who has and hasn’t responded, and yes, I may further game things when I have that information. Also, if you don’t have to Doodle, just say no. Bookmark to: Filed in Uncategorized | | Comments (4) Memento DMV Saturday, March 30, 2019 This morning I spent 40 minutes in the appointment line at the Santa Rosa DMV to get my license renewed and converted to REAL ID, but was told I was “too early” to renew my license, which expires in September, so I have to return after I receive my renewal notice. I could have converted to REAL ID today, but I would still need to return to renew my license, at least as it was explained to me, and I do hope that was correct. [Image: Wellcome Collection, CC BY 4.0, https://wellcomecollection.org/works/m8wh2kmc] But–speaking as a librarian, and therefore from a profession steeped in resource management–I predict chaos in 2020 if DMV doesn’t rethink their workflow. We’re 18 months out from October 2020, the point at which people will not be able to board domestic flights if they don’t have a REAL ID or a valid passport, or another (and far less common) substitute. Then again, California DMV is already in chaos. Their longtime leader retired, the replacement lasted 32 days, and their new leader has been there ca. 60 days. Last year featured the license renewal debacle, which I suspect impacted the man standing behind me. He said he was there to apply for his license again because he never received the one he applied for last fall. And California is one of 10 states that still need a REAL ID extension because its DMV didn’t have it together on time. Indeed, I was on the appointment line, and nearly everyone in that line was on their second visit to DMV for the task they were trying to accomplish, and not for lack of preparation on their part. Some of that was due to various DMV crises, and some of it is baked into DMV processes. Based on how their current policies were explained to me today at Window 13, I should never have been on that line in the first place; somewhere, in the online appointment process, the DMV should have prevented me from completing that task. I needlessly took up staff time at DMV. But the bigger problem is a system that gets in its own way, like libraries that lock book drops during the day to force users to enter the libraries to return books. With me standing there at Window 13 with my online appointment, my license, and my four types of ID, the smart thing to do would be to complete the process and get me out of the pipeline of REAL ID applicants–or any other DMV activity. But that didn’t happen. And I suspect I’m just one drop in a big, and overflowing, bucket. I suppose an adroit side move is to ensure your passport is current, but I hope we don’t reach the point where we need a passport to travel in our own country. Bookmark to: Filed in Uncategorized | | Comments Off on Memento DMV An Old-Skool Blog Post Friday, March 29, 2019 I get up early these days and get stuff done — banking and other elder-care tasks for my mother, leftover work from the previous day, association or service work. A lot of this is writing, but it’s not writing.
I have a half-dozen unfinished blog posts in WordPress, and even more in my mind. I map them out and they are huge topics, so then I don’t write them. But looking back at the early days of this blog — 15 years ago! — I didn’t write long posts. I still wrote long-form for other media, but my blog posts were very much in the moment. So this is an old-skool post designed to ease me back in the writing habit. I’ll strive for twice a week, which is double the output of the original blogger, Samuel Johnson. I’ll post for 15 minutes and move on to other things. I am an association nerd, and I spend a lot of time thinking about associations of all kinds, particularly the American Library Association, the American Homebrewers Association, the American Rose Society, the Redwood Empire Rose Society, the local library advisory boards, my church, and our neighborhood association. Serving on the ALA Steering Committee on Organizational Effectiveness, I’m reminded of a few indelible truths. One is that during the change management process you need to continuously monitor the temperature of the association you’re trying to change and in the words of one management pundit, keep fiddling with the thermostat. An association didn’t get that big or bureaucratic overnight, and it’s not going to get agile overnight, either. Another is that the same people show up in each association, and–more interesting to me–stereotypes are not at play in determining who the change agents are. I had a great reminder of that 20 years ago, when I served as the library director for one of those tiny Barbie Dream libraries in upstate New York, and I led the migration from a card catalog to a shared system in a consortium. Too many people assumed that the library staff–like so many employees in these libraries, all female, and nearly all older women married to retired spouses–would be resistant to this change. In fact, they loved this change. They were fully on board with the relearning process and they were delighted and proud that they were now part of a larger system where they could not only request books from 30 other libraries but sometimes even lend books as well from our wee collection. There were changes they and the trustees resisted, and that was a good lesson too, but the truism of older women resisting technology was dashed against the rocks of reality. My 15 minutes are up. I am going in early today because I need to print things, not because I am an older woman who fears technology but because our home printer isn’t working and I can’t trust that I’ll have seatback room on my flight to Chicago to open my laptop and read the ALA Executive Board manual electronically, let alone annotate it or mark it up. I still remember the time I was on a flight, using my RPOD (Red Pen of Death, a fine-point red-ink Sharpie) to revise an essay, and the passenger next to me turned toward me wide-eyed and whispered, “Are you a TEACHER?” Such is the power of RPOD, an objective correlative that can immediately evoke the fear of correction from decades ago. Bookmark to: Filed in American Liberry Ass'n, Association Nerd | | Comments (1) Keeping Council Saturday, January 20, 2018 Editorial note: Over half of this post was composed in July 2017. At the time, this post could have been seen as politically neutral (where ALA is the political landscape I’m referring to) but tilted toward change and reform. Since then, Events Have Transpired. 
I revised this post in November, but at the time hesitated to post it because Events Were Still Transpiring. Today, in January 2018, I believe even more strongly in what I write here, but take note that the post didn’t have a hidden agenda when I wrote it, and, except where noted, it still reflects my thoughts from last July, regardless of ensuing events. My agendas tend to be fairly straightforward. — KGS   Original Post, in which Councilors are Urged to Council Edits in 2018 noted with bolding. As of July 2017, I am back on ALA Council for my fifth (non-consecutive) term since joining the American Library Association in 1991. In June I attended Council Orientation, and though it was excellent–the whole idea that Councilors would benefit from an introduction to the process is a beneficial concept that emerged over the last two decades–it did make me reflect on what I would add if there had been a follow-on conversation with sitting Councilors called “sharing the wisdom.” I was particularly alerted to that by comments during Orientation which pointed up a traditional view of the Council process where ALA’s largest governing body is largely inactive for over 350 days a year, only rousing when we prepare to meet face to face. Take or leave what I say here, or boldly contradict me, but it does come from an abundance of experience. You are a Councilor year-round Most newly-elected Councilors “take their seats” immediately after the annual conference following their election — a factoid with significance. Council, as a body, struggles with being a year-round entity that takes action twice a year during highly-condensed meetings during a conference with many other things happening. I have written about this before, in a dryly wonky post from 2012 that also addresses Council’s composition and the role of chapters. I proposed that Council meet four times a year, in a solstice-and-equinox model. Two of those meetings (the “solstice” meetings) could  be online. (As far back as 2007 I was hinting around about the overhead and carbon footprint of Midwinter.) I doubt Midwinter will go to an online format even within the next decade–it’s a moneymaker for ALA, if less so than before, and ALA’s change cycle is glacial–but the proposal was intended to get people thinking about how Council does, and doesn’t, operate. In lieu of any serious reconsideration of Council, here are some thoughts. First, think of yourself as a year-round Councilor, even if you do not represent a constituency such as a state chapter or a division that meets and takes action outside of ALA. Have at least a passing familiarity with the ALA Policy Manual. Bookmark it and be prepared to reference it. Get familiar with ALA’s financial model through the videos that explain things such as the operating agreement. Read and learn about ALA. Share news. Read the reports shared on the list, and post your thoughts and your questions. Think critically about what you’re reading. It’s possible to love your Association, believe with your heart that it has a bright future, and still raise your eyebrows about pat responses to budget questions, reassurances that membership figures and publishing revenue will rebound, and glib responses about the value of units such as the Planning and Budget Assembly. Come to Council prepared. Read everything you can in advance, speak with other Councilors, and apply solid reflection, and research if needed, before you finish packing for your trip. 
Preparation requires an awareness that you will be deluged with reading just as you are struggling to button up work at your library and preparing to be away for nearly a week, so skimming is essential. I focus on issues where I know I can share expertise, and provide input when I can. Also, I am proud we do memorial resolutions and other commemorations but I don’t dwell on them in advance unless I have helped write them or had close familiarity with the people involved. Fee, Fie, Foe, Forum Coming prepared to Council is one of those values Council has struggled with. When I looked at the Council list for the week prior to Annual 2017, the only conversation was a discussion about the relocation of the Council Forum meeting room from one hotel to another, complete with an inquiry asking if ALA could rent a special bus to tote Councilors to and from the Forum hotel. Council Forum is an informal convening that has taken place for decades to enable Council to discuss resolutions and other actions outside of the strictures of parliamentary procedure. It meets three times during ALA, in the evening, and though it is optional, I agree with the Councilor who noted that important work happens at this informal gathering. I am conflicted about Forum. It allows substantive discussion about key resolutions to happen outside of the constrictive frameworks of parliamentary procedure. Forum is also well-run, with volunteer Councilors managing the conversation. But Forum also appears to have morphed into a substitute for reading and conversation in advance. It also means that Councilors have to block out yet more time to do “the work of the Association,” which in turn takes us away from other opportunities during the few days we are together as an Association. I don’t say this to whine about the sacrifice of giving up dinners and networking with ALA colleagues, though those experiences are important to me, but rather to point out that Forum as a necessary-but-optional Council activity takes a silo–that Brobdingnagian body that is ALA Council–and further silos it. That can’t be good for ALA. As Councilors, we benefit from cross-pollination with the work of the Association. Resolved: To tread lightly with resolutions New Councilors, and I was one of them once, are eager to solve ALA’s problems by submitting resolutions. Indeed, there are new Councilors who see resolutions as the work of Council, and there have been round tables and other units that clearly saw their work as generating reams of lightly-edited, poorly-written resolutions just prior to and during the conference. There are at least three questions to ask before submitting a resolution (other than memorial and other commemorative resolutions): Can the resolution itself help solve a problem? Has it been coordinated with the units and people involved in the issue it addresses? Is it clear and well-written? There are other questions worth considering, such as, if the issue this resolution proposed to address cropped up a month after Council met, would you still push it online with your Council colleagues, or ask the ALA Executive Board to address it? Which is another way to ask, is it important? Tread lightly with Twitter Overall, since coming through the stress of living through the Santa Rosa fires, I’m feeling weary, and perhaps wary, of social media. Though I appreciate the occasional microbursts taking on idiots insulting libraries and so on, right now much of social media feels at once small and overwrought.
If I seem quieter on social media, that’s true. (But I have had more conversations with neighbors and area residents during and after the fires than I have since we moved to Santa Rosa in early 2015, and those convos are the real thing.) More problematically, as useful as Twitter can be for following real-world issues–including ALA–Twitter also serves as a place where people go to avoid the heavy lifting involved with crucial conversations. I find I like #alacouncil Twitter best when it is gently riffing on itself or amplifying action that the larger ALA body would benefit from hearing about. [the following, to the end of this post, is all new content] I like #alacouncil Twitter least when it is used as a substitute for authentic conversation, used to insult other Councilors, or otherwise deployed to undermine the discourse taking place in the meatware world. Twitter is also particularly good at the unthinking pile-on, and many people have vulnerabilities in this area that are easily exploited. Sometimes those pile-ons hit me close to home, as happened a little over a year ago. Other times these pile-ons serve only to amuse the minx in me, such as when a Famous Author (™) recently scolded me for “trafficking in respectability politics” because I was recommending a list of books written by writers from what our fearless leader calls “s–thole countries.” Guilty as charged! Indeed, I have conducted two studies where a major theme was “Do I look too gay?” I basically have a Ph.D. in respectability politics. And like all writers–including Famous Author (™)–I traffic in them. I chuckled and walked on by. Walking on by, on Twitter, takes different forms. As an administrator, I practice a certain pleasant-but-not-sugary facial expression that stays on my face regardless of what’s going on in my head. I’m not denying my emotions, which would be the sugary face; I’m managing them. It’s a kind of discipline that also helps me ford difficult conversations, in which the discipline of managing my face also helps me manage my brain. The equivalent of my Admin Face for #alacouncil Twitter is to exercise the mute button. I have found it invaluable. People don’t know they are muted (or unmuted). If only real life had mute buttons–can you imagine how much better some meetings would be if you could click a button and the person speaking would be silenced, unaware that you couldn’t hear them? Everyone wins. But that aside, I have yet to encounter a situation on Twitter when–for me–muting was the wrong call. It’s as if you stepped off the elevator and got away from that person smacking gum. Another car will be along momentarily. My last thought on this post has to do with adding the term “sitting” before Councilors in the first part of this post. When I was not on Council I tried very hard not to be “that” former Councilor who is always kibitzing behind the scenes, sending Councilors messages about how things should be and how, in the 1960s, ALA did something bad and therefore we can never vote online because nobody knows how to find ALA Connect and it’s all a nefarious plot hatched by the ALA President, his dimwitted sycophants, and the Executive Board, and why can’t MY division have more representation because after all we’re the 800-pound gorilla (ok, I just got political, but you’ll note I left out anything about what should or should not be required for a Very Special Job).
Yes, once in a while I sent a note if I thought it was helpful, the way some of my very ALA-astute friends will whisper in my ear about policy and process I may be unfamiliar with. Michael Golrick, a very connected ALA friend of mine, must have a third brain hemisphere devoted to the ALA policy manual and bylaws. And during a time when I was asking a lot of questions about the ALA budget (boiling down to one question: who do you think you’re fooling?), I was humbled by the pantheon of ALA luminaries whispering in my ear, providing encouragement as well as crucial guidance and information. But when I am no longer part of something, I am mindful that things can and should change and move on, and that I may not have enough information to inform that change. We don’t go to ALA in horse-and-buggies any more, but we conduct business as if we do, and when we try to change that, the fainting couches are rolled out and the smelling salts waved around as if we had, say, attempted to change the ALA motto, which is, I regret to inform you, “The best reading, for the largest number, at the least cost”–and yes, attempts to change that have been defeated. My perennial question is, if you were starting an association today, how would it function? If the answer is “as it did in 1893” (when that motto was adopted), perhaps your advice on a current situation is less salient than you fancy. You may succeed at what you’re doing, but that doesn’t make you right. And with that, I go off to Courthouse Square today to make exactly that point about events writ much, much larger, and of greater significance, than our fair association. But I believe how we govern makes a difference, and I believe in libraries and library workers, and I believe in ALA. Especially today. Bookmark to: Filed in American Liberry Ass'n, Librarianship | | Comments (2) What burns away Thursday, November 16, 2017 We are among the lucky ones. We did not lose our home. We did not spend day after day evacuated, waiting to learn the fate of where we live. We never lost power or Internet. We had three or four days where we were mildly inconvenienced because PG&E wisely turned off gas to many neighborhoods, but we showered at the YMCA and cooked on an electric range we had been planning to upgrade to gas later this fall (and just did, but thank you, humble Frigidaire electric range, for being there to let me cook out my anxiety). We kept our go-bags near the car, and then we kept our go-bags in the car, and then, when it seemed safe, we took them out again. That, and ten days of indoor living and wearing masks when we went out, was all we went through. But we all bear witness. The Foreshadowing It began with a five-year drought that crippled forests and baked plains, followed by a soaking-wet winter and a lush spring that crowded the hillsides with greenery. Summer temperatures hit records several times, and the hills dried out as they always do right before autumn, but this time unusually thick with parched foliage and growth. The air in Santa Rosa was hot and dry that weekend, an absence of humidity you could snap between your fingers. In the southwest section of the city, where we live, nothing seemed unusual. Like many homes in Santa Rosa our home does not have air conditioning, so for comfort’s sake I grilled our dinner, our 8-foot backyard fence buffering any hint of the winds gathering speed northeast of us. We watched TV and went to bed early.
Less than an hour later one of several major fires would be born just 15 miles east of where we slept. Reports vary, but accounts agree it was windy that Sunday night, with wind speeds ranging between 35 and 79 miles per hour, and a gust northwest of Santa Rosa reaching nearly 100 miles per hour. If the Diablo winds were not consistently hurricane-strength, they were exceptionally fast, hot, and dry, and they meant business. A time-lapse map of 911 calls shows the first reports of downed power lines and transformers coming in around 10 pm. The Tubbs fire was named for a road that was in turn named for a 19th-century winemaker who lived in a house in Calistoga that burned to the ground in an eerily similar fire in 1964. In three hours this fire sped 12 miles southwest, growing in size and intent as it gorged on hundreds and then thousands of homes in its way, breaching city limits and expeditiously laying waste to 600 homes in the Fountaingrove district before it tore through the Journey’s End mobile home park, then reared back on its haunches and leapt across a six-lane divided section of Highway 101, whereupon it gobbled up big-box stores and fast food restaurants flanking Cleveland Avenue, a business road parallel to the highway. Its swollen belly, fat with miles of fuel, dragged over the area and took out buildings in the random manner of fires. Kohl’s and Kmart were totaled and Trader Joe’s was badly damaged, while across the street from Kmart, JoAnn Fabrics was untouched. The fire demolished one Mexican restaurant, hopscotched over another, and feasted on a gun shop before turning its ravenous maw toward the quiet middle-class neighborhood of Coffey Park, making short work of thousands more homes. Santa Rosa proper is itself only 41 square miles, approximately 13 miles north-south and 9 miles east-west, including the long tail of homes flanking the Annadel mountains. By the time Kohl’s was collapsing, the “wildfire” was less than 4 miles from our home. I woke up around 2 am, which I tend to do a lot anyway. I walked outside and smelled smoke, saw people outside their homes looking around, and went on Twitter and Facebook. There I learned of a local fire, forgotten by most in the larger conflagration, but duly noted in brief by the Press Democrat: a large historic home at 6th and Pierson burned to the ground, possibly from a downed transformer, and the fire licked the edge of the Santa Rosa Creek Trail for another 100 feet. Others in the West End have reported the same experience of reading about the 6th Street house fire on social media and struggling to reconcile the reports of this fire with reports of panic and flight from areas north of us and videos of walls of flame. At 4 am I received a call that the university had activated its Emergency Operations Center and I asked if I should report in. I showered and dressed, packed a change of clothes in a tote bag, threw my bag of important documents in my purse, and drove south on my usual route to work, Petaluma Hill Road. The hills east of the road flickered with fire, the road itself was packed with fleeing drivers, and halfway to campus I braked at 55 mph when a massive buck sprang inches in front of my car, not running in that “oops, is this a road?” way deer usually cross lanes of traffic but yawing to and fro, its eyes wide. I still wonder, was it hurt or dying. As I drove onto campus I thought, the cleaning crew. I parked at the Library and walked through the building, already permeated with smoky air.
I walked as quietly as I could, so that if they were anywhere in the building I would hear them. As I walked through the silent building I wondered, is this the last time I will see these books? These computers? The new chairs I’m so proud of? I then went to the EOC and found the cleaning crew had been accounted for, which was a relief. At Least There Was Food And Beer A few hours later I went home. We had a good amount of food in the house, but like many of us who were part of this disaster but not immediately affected by it, I decided to stock up. The entire Santa Rosa Marketplace–Costco, Trader Joe’s, Target–on Santa Rosa Avenue was closed, and Oliver’s had a line of people outside waiting to get in. I went to the “G&G Safeway”–the one that took over a down-at-the-heels family market known as G&G and turned it into a spiffy market with a wine bar, no less–and it was without power, but open for business and, thanks to a backup system, able to take ATM cards. I had emergency cash on me but was loath to use it until I had to. Sweating through an N95 mask I donned to protect my lungs, I wheeled my cart through the dark store, selecting items that would provide protein and carbs if we had to stuff them in our go-bags, but also fresh fruit and vegetables, dairy and eggs–things I thought we might not see for a while, depending on how the disaster panned out. (Note, we do already have emergency food, water, and other supplies.) The cold case for beer was off-limits–Safeway was trying to retain the cold in its freezer and fridge cases in case it could save the food–but there was a pile of cases of Lagunitas Lil Sumpin Sumpin on sale, so that went home with me too, along with a couple of bottles of local wine. And with one wild interlude, for most of the rest of the time we stayed indoors with the windows closed. I sent out email updates and made phone calls, kept my phone charged and read every Nixle alert, and people at work checked in with one another. My little green library emergency contact card stayed in my back pocket the entire time. We watched TV and listened to the radio, including extraordinary local coverage by KSRO, the Little Station that Could; patrolled newspapers and social media; and rooted for Sheriff Rob, particularly after his swift smack-down of a bogus, Breitbart-fueled report that an undocumented person had started the fires. Our home was unoccupied for a long time before we moved in this September, possibly up to a decade, while it was slowly but carefully upgraded. The electric range was apparently an early purchase; it was a line long discontinued by Frigidaire, with humble electric coils. But it had been unused until we arrived, and was in perfect condition. If an electric range could express gratitude for finally being useful, this one did. I used it to cook homey meals: pork loin crusted with Smithfield bacon; green chili cornbread; and my sui generis meatloaf, so named because every time I make it, I grind and add meat scraps from the freezer for a portion of the meat mixture. (It would be several weeks before I felt comfortable grilling again.) We cooked. We stirred. We sauteed. We waited. On Wednesday, we had to run an errand. To be truthful, it was an Amazon delivery purchased that Saturday, when the world was normal, and sent to an Amazon locker at the capacious Whole Foods at Coddington Mall, a good place to send a package until the mall closes down because the northeast section of the city is out of power and threatened by a massive wildfire.
By Wednesday, Whole Foods had reopened, and after picking up my silly little order–a gadget that holds soda cans in the fridge–we drove past Russian River Brewing Company and saw it was doing business, so we had salad and beer for lunch, because it’s a luxury to have beer at lunch and the fires were raging and it’s so hard to get seating there nights and weekends, when I have time to go there, but there we were. We asked our waiter how he was doing, and he said he was fine but he motioned to the table across from ours, where a family was enjoying pizza and beer, and he said they had lost their homes. There were many people striving for routine during the fires, and to my surprise, even the city planning office returned correspondence regarding some work we have planned for our new home, offering helpful advice on the permitting process required for minor improvements for homes in historic districts. Because it turns out developers and engineers could serenely ignore local codes and build entire neighborhoods in Santa Rosa in areas known to be vulnerable to wildfire; but to replace bare dirt with a little white wooden picket fence, or to restore front windows from 1950s-style plate glass to double-hung wooden windows with mullions–projects intended to reinstate our house to its historic accuracy, and to make it more welcoming–requires a written justification of the project, accompanying photos, “Proposed Elevations (with Landscape Plan IF you are significantly altering landscape) (5 copies),” five copies of a paper form, a Neighborhood Context and Vicinity Map provided by the city, and a check for $346, followed by “8-12 weeks” before a decision is issued. The net result of this process is like the codes about not building on ridges, though much less dangerous; most people ignore the permitting process, so that the historic set piece that is presumably the goal is instead rife with anachronisms. And of course, first I had to bone up on the residential building code and the historic district guidelines, which contradict one another on key points, and because the permitting process is poorly documented I have an email traffic thread rivaling in word count Byron’s letters to his lovers. But the planning people are very pleasant, and we all seemed to take comfort in plodding through the administrivia of city bureaucracy as if we were not all sheltering in place, masks over our noses and mouths, go-bags in our cars, while fires raged just miles from their office and our home. The Wild Interlude, or, I Have Waited My Entire Career For This Moment Regarding the wild interlude, the first thing to know about my library career is that nearly everywhere I have gone where I have had the say-so to make things happen, I have implemented key management. That mishmosh of keys in  a drawer, the source of so much strife and arguments, becomes an orderly key locker with numbered labels. It doesn’t happen overnight, because keys are control and control is political and politics are what we tussle about in libraries because we don’t have that much money, but it happens. Sometimes I even succeed in convincing people to sign keys out so we know who has them. Other times I convince people to buy a locker with a keypad so we sidestep the question of where the key to the key locker is kept. But mostly, I leave behind the lockers, and, I hope, an appreciation for lockers. 
I realize it’s not quite as impressive as founding the Library of Alexandria, and it’s not what people bring up when I am introduced as a keynote speaker, and I have never had anyone ask for a tour of my key lockers nor have I ever been solicited to write a peer-reviewed article on key lockers. However unheralded, it’s a skill. My memory insists it was Tuesday, but the calendar says it was late Monday night when I received a call that the police could not access a door to an area of the library where we had high-value items. It would turn out that this was a rogue lock, installed sometime soon after the library opened in 2000, that unlike others did not have a master registered with the campus, an issue we have since rectified. But in any event, the powers that be had the tremendous good fortune to contact the person who has been waiting her entire working life to prove beyond doubt that KEY LOCKERS ARE IMPORTANT. After a brief internal conversation with myself, I silently nixed the idea of offering to walk someone through finding the key. I said I knew where the key was, and I could be there in twenty minutes to find it. I wasn’t entirely sure this was the case, because as obsessed as I am with key lockers, this year I have been preoccupied with things such as my deanly duties, my doctoral degree completion, national association work, our home purchase and household move, and the selection of geegaws like our new gas range (double oven! center griddle!). This means I had not spent a lot of time perusing this key locker’s manifest. So there was an outside chance I would have to find the other key, located somewhere in another department, which would require a few more phone calls. I was also in that liminal state between sleep and waking; I had been asleep for two hours after being up since 2 am, and I would have agreed to do just about anything. Within minutes I was dressed and again driving down Petaluma Hill Road, still busy with fleeing cars. The mountain ridges to the east of the road roiled with flames, and I gripped the steering wheel, watching for more animals bolting from fire. Once in the library, now sour with smoke, I ran up the stairs into my office suite and to the key locker, praying hard that the key I sought was in it. My hands shook. There it was, its location neatly labeled by the key czarina who with exquisite care had overseen the organization of the key locker. The me who lives in the here-and-now profusely thanked past me for my legacy of key management, with a grateful nod to the key czarina as well. What a joy it is to be able to count on people! Items were packed up, and off they rolled. After a brief check-in at the EOC, home I went, to a night of “fire sleep”–waking every 45 minutes to sniff the air and ask, is fire approaching?–a type of sleep I would have for the next ten days, and occasionally even now. How we speak to one another in the here and now Every time Sandy and I interact with people, we ask, how are you. Not, hey, how are ya, where the expected answer is “fine, thanks” even if you were just turned down for a mortgage or your mother died. But no, really, how are you. Like, fire-how-are-you. And people usually tell you, because everyone has a story.
Answers range from: I’m ok, I live in Petaluma or Sebastopol or Bodega Bay (in SoCo terms, far from the fire), to I’m ok but I opened my home to family/friends/people who evacuated or lost their homes; or, I’m ok but we evacuated for a week; or, as the guy from Home Depot said, I’m ok and so is my wife, my daughter, and our 3 cats, but we lost our home. Sometimes they tell you and they change the subject, and sometimes they stop and tell you the whole story: when they first smelled smoke, how they evacuated, how they learned they did or did not lose their home. Sometimes they have before-and-after photos they show you. Sometimes they slip it in between other things, like our cat sitter, who mentioned that she lost her apartment in Fountaingrove and her cat died in the fire but in a couple of weeks she would have a home and she’d be happy to cat-sit for us. Now, post-fire, we live in that tritest of phrases, a new normal. The Library opened that first half-day back, because I work with people who, like me, believe that during disasters libraries should be the first buildings open and the last to close. I am proud to report the Library also housed NomaCares, a resource center for those at our university affected by the fire. That first Friday back we held our Library Operations meeting, and we shared our stories, and that was hard but good. But we also resumed regular activity, and soon the study tables and study rooms were full of students, meetings were convened, work was resumed, and the gears of life turned. But the gears turned forward, not back. Because there is no way back. I am a city mouse, and part of moving to Santa Rosa was our decision to live in a highly citified section, which turned out to be a lucky call. But my mental model of city life has been forever twisted by this fire. I drive on 101 just four miles north of our home, and there is the unavoidable evidence of a fire boldly leaping into an unsuspecting city. I go to the fabric store, and I pass twisted blackened trees and a gun store totaled that first night. I drive to and from work with denuded hills to my east a constant reminder. But that’s as it should be. Even if we sometimes need respite from those reminders–people talk about taking new routes so they won’t see scorched hills and devastated neighborhoods–we cannot afford to forget. Sandy and I have moved around the country in our 25 years together, and we have seen clues everywhere that things are changing and we need to take heed. People like to lapse into the old normal, but it is not in our best interests to do so. All of our stories are different. But we share a collective loss of innocence, and we can never return to where we were. We can only move forward, changed by the fire, changed forever. Bookmark to: Filed in Santa Rosa Living | | Comments Off on What burns away Neutrality is anything but Saturday, August 19, 2017 “We watch people dragged away and sucker-punched at rallies as they clumsily try to be an early-warning system for what they fear lies ahead.” — Unwittingly prophetic me, March 2016. [Sheet cake photo by Flickr user Glane23, CC BY 2.0] Sometime after last November, I realized something very strange was happening with my clothes. My slacks had suddenly shrunk, even if I hadn’t washed them. After months of struggling to keep myself buttoned into my clothes, I gave up and purchased slacks and jeans one size larger. I call them my T***p Pants. This post is about two things.
It is about the lessons librarians are learning in this frightening era about the nuances and qualifications shadowing our deepest core values–an era so scary that quite a few of us, as Tina Fey observed, have acquired T***p Pants. And it’s also some advice, take it or leave it, on how to “be” in this era. I suspect many librarians have had the same thoughts I have been sharing with a close circle of colleagues. Most librarians take pride in our commitment to free speech. We see ourselves as open to all viewpoints. But in today’s new normal, we have seen that even we have limits. This week, the ACRL Board of Directors put out a statement condemning the violence in Charlottesville. That was the easy part. The Board then stated, “ACRL is unwavering in its long-standing commitment to free exchange of different viewpoints, but what happened in Charlottesville was not that; instead, it was terrorism masquerading as free expression.” You can look at what happened in Charlottesville and say there was violence “from many sides,” some of it committed by “very fine people” who just happen to be Nazis surrounded by their own private militia of heavily-armed white nationalists. Or you can look at Charlottesville and see terrorism masquerading as free expression, where triumphant hordes descended upon a small university town under the guise of protecting some lame-ass statue of an American traitor, erected sixty years after the end of the Civil War, not coincidentally during a very busy era for the Klan. Decent people know the real reason the Nazis were in Charlottesville: to tell us they are empowered and emboldened by our highest elected leader. There is no middle ground. You can’t look at Charlottesville and see everyday people innocently exercising First Amendment rights. As I and many others have argued for some time now, libraries are not neutral. Barbara Fister argues, “we stand for both intellectual freedom and against bigotry and hate, which means some freedoms are not countenanced.” She goes on to observe, “we don’t have all the answers, but some answers are wrong.” It follows that if some answers are wrong, so are some actions. In these extraordinary times, I found myself for the first time ever thinking the ACLU had gone too far; that there is a difference between an unpopular stand and a stand that is morally unjustifiable. So I was relieved when the national ACLU concurred with its three Northern California chapters that “if white supremacists march into our towns armed to the teeth and with the intent to harm people, they are not engaging in activity protected by the United States Constitution. The First Amendment should never be used as a shield or sword to justify violence.” But I was also sad, because once again, our innocence has been punctured and our values qualified. Every asterisk we put after “free speech” is painful. It may be necessary and important pain, but it is painful all the same. Many librarians are big-hearted people who like to think that our doors are open to everyone and that all viewpoints are welcome, and that enough good ideas, applied frequently, will change people. And that is actually very true, in many cases, and if I didn’t think it was true I would conclude I was in the wrong profession. But we can’t change people who don’t want to be changed. Listen to this edition of The Daily, a podcast from the New York Times, where American fascists plan their activities. These are not people who are open to reason.
As David Lankes wrote, “there are times when a community must face the fact that parts of that community are simply antithetical to the ultimate mission of a library.” We urgently need to be as one voice as a profession around these issues. I was around for–was part of–the “filtering wars” of the 1990s, when libraries grappled with the implications of the Internet bringing all kinds of content into libraries, which also challenged our core values. When you’re hand-selecting the materials you share with your users, you can pretend you’re open to all points of view. The Internet challenged that pretense, and we struggled and fought, and were sometimes divided by opportunistic outsiders. We are fortunate to have strong ALA leadership this year. The ALA Board and President came up swinging on Tuesday with an excellent presser that stated unequivocally that “the vile and racist actions and messages of the white supremacist and neo-Nazi groups in Charlottesville are in stark opposition to the ALA’s core values,” a statement that (in the tradition of ensuring chapters speak first) followed a strong statement from our Virginia state association.  ARL also chimed in with a stemwinder of a statement.  I’m sure we’ll see more. But ALA’s statement also describes the mammoth horns of the library dilemma. As I wrote colleagues, “My problem is I want to say I believe in free speech and yet every cell in my body resists the idea that we publicly support white supremacy by giving it space in our meeting rooms.” If you are in a library institution that has very little likelihood of exposure to this or similar crises, the answers can seem easy, and our work appears done. But for more vulnerable libraries, it is crucial that we are ready to speak with one voice, and that we be there for those libraries when they need us. How we get there is the big question. I opened this post with an anecdote about my T***p pants, and I’ll wrap it up with a concern. It is so easy on social media to leap in to condemn, criticize, and pick apart ideas. Take this white guy, in an Internet rag, the week after the election, chastising people for not doing enough.  You know what’s not enough? Sitting on Twitter bitching about other people not doing enough. This week, Siva Vaidhyanathan posted a spirited defense of a Tina Fey skit where she addressed the stress and anxiety of these political times.  Siva is in the center of the storm, which gives him the authority to state an opinion about a sketch about Charlottesville. I thought Fey’s skit was insightful on many fronts. It addressed the humming anxiety women have felt since last November (if not earlier). It was–repeatedly–slyly critical of inaction: “love is love, Colin.” It even had a Ru Paul joke. A lot of people thought it was funny, but then the usual critics came out to call it naive, racist, un-funny, un-woke, advocating passivity, whatever. We are in volatile times, and there are provocateurs from outside, but also from inside. Think. Breathe. Step away from the keyboard. Take a walk. Get to know the mute button in Twitter and the unfollow feature in Facebook. Pull yourself together and think about what you’re reading, and what you’re planning to say. Interrogate your thinking, your motives, your reactions. I’ve read posts by librarians deriding their peers for creating subject guides on Charlottesville, saying instead we should be punching Nazis. Get a grip. First off, in real life, that scenario is unlikely to transpire. 
You, buried in that back cubicle in that library department, behind three layers of doors, are not encountering a Nazi any time soon, and if you did, I recommend fleeing, because that wackdoodle is likely accompanied by a trigger-happy militiaman carrying a loaded gun. (There is an entire discussion to be had about whether violence to violence is the politically astute response, but that’s for another day.) Second, most librarians understand that their everyday responses to what is going on in the world are not in and of themselves going to defeat the rise of fascism in America. But we are information specialists and it’s totally wonderful and cool to respond to our modern crisis with information, and we need to be supportive and not go immediately into how we are all failing the world. Give people a positive framework for more action, not scoldings for not doing enough. In any volatile situation, we need to slow the eff down and ask how we’re being manipulated and to what end; that is a lesson the ACLU just learned the hard way. My colleague Michael Stephens is known for saying, “speak with a human voice.” I love his advice, and I would add, make it the best human voice you have. We need one another, more than we know.   Bookmark to: Filed in Intellectual Freedom, Librarianship | | Comments (2) MPOW in the here and now Sunday, April 9, 2017 Sometimes we have monsters and UFOs, but for the most part it’s a great place to work I have coined a few biblioneologisms in my day, but the one that has had the longest legs is MPOW (My Place of Work), a convenient, mildly-masking shorthand for one’s institution. For the last four years I haven’t had the bandwidth to coin neologisms, let alone write about MPOW*. This silence could be misconstrued. I love what I do, and I love where I am. I work with a great team on a beautiful campus for a university that is undergoing a lot of good change. We are just wrapping up the first phase of a visioning project to help our large, well-lit building serve its communities well for the decades to come. We’re getting ready to join the other 22 CSU libraries on OneSearch, our first-ever unified library management system. We have brought on some great hires, thrown some great events (the last one featured four Black Panthers talking about their life work — wow!). With a new dean (me) and a changing workforce, we are developing our own personality. It’s all good… and getting better The Library was doing well when I arrived, so my job was to revitalize and switch it up. As noted in one of the few posts about MPOW, the libraries in my system were undergoing their own reassessment, and that has absorbed a fair amount of our attention, but we continue to move forward. Sometimes it’s the little things. You may recall I am unreasonably proud of the automated table of contents I generated for my dissertation, and I also feel that way about MPOW’s slatwall book displays, which in ten areas beautifully market new materials in spaces once occupied by prison-industry bookcases or ugly carpet and unused phones (what were the phones for? Perhaps we will never know). The slatwall was a small project that was a combination of expertise I brought from other libraries, good teamwork at MPOW, and knowing folks. The central problem was answered quickly by an email to a colleague in my doctoral program (hi, Cindy!) who manages public libraries where I saw the displays I thought would be a good fit. 
The team selected the locations, a staff member with an eye for design recommended the color, everyone loves it, and the books fly off the shelves. If there is any complaining, it is that we need more slatwall. Installed slatwall needs to wait until we know if we are moving/removing walls as part of our building improvements. A bigger holdup is that we need to hire an Access Services Manager, and really, anything related to collections needs the insight of a collections librarian. People… who need people… But we had failed searches for both these positions… in the case of collections, twice. *cue mournful music* We have filled other positions with great people now doing great things, and are on track to fill more positions, but these two, replacing people who have retired, are frustrating us. The access services position is a managerial role, and the collections librarian is a tenure-track position. Both offer a lot of opportunity. We are relaunching both searches very soon (I’ll post a brief update when that happens), and here’s my pitch. If you think you might qualify for either position, please apply. Give yourself the benefit of the doubt. If you know someone who would be a good fit for either position, ask them to apply. I recently mentored someone who was worried about applying to a position. “Will that library hold it against me if I am not qualified?” The answer is of course not!  (And if they do, well, you dodged that bullet!) I have watched far too many people self-select out of positions they were qualified for (hrrrrmmmm particularly one gender…). Qualification means expertise + capacity + potential. We expect this to be a bit of a stretch to you. If a job is really good, most days will have a “fake it til you make it” quality. This is also not a “sink or swim” institution. If it ever was, those days are in the dim past, long before I arrived. The climate is positive. People do great things and we do our best to support them. I see our collective responsibility as an organization as to help one another succeed. Never mind me and my preoccupation with slatwall (think of it as something to keep the dean busy and happy, like a baby with a binky). We are a great team, a great library, on a great campus, and we’re a change-friendly group with a minimum of organizational issues, and I mean it. I have worked enough places to put my hand on a Bible and swear to that. It has typical organizational challenges, and it’s a work in progress… as are we all. The area is crazily expensive, but it’s also really beautiful and so convenient for any lifestyle. You like city? We got city. You like suburb, or ocean, or mountain, or lake? We got that! Anyway, that’s where I am with MPOW: I’m happy enough, and confident enough, to use this blog post to BEG YOU OH PLEASE HELP US FILL THESE POSITIONS. The people who join us will be glad you did. ### *   Sidebar: the real hilarity of coining neologisms is that quite often someone, generally of a gender I do not identify with, will heatedly object to the term, as happened in 2004 when I coined the term biblioblogosphere. Then, as I noted in that post from 2012, others will defend it. That leads me to believe that creating new words is the linguistic version of lifting one’s hind leg on a tree. 
Bookmark to: Filed in Uncategorized | | Comments (1) Questions I have been asked about doctoral programs Wednesday, March 29, 2017 About six months ago I was visiting another institution when someone said to me, “Oh, I used to read your blog, BACK IN THE DAY.” Ah yes, back in the day, that Pleistocene era when I wasn’t working on a PhD while holding down a big job and dealing with the rest of life’s shenanigans. So now the PhD is done–I watched my committee sign the signature page, two copies of it, even, before we broke out the champers and celebrated–and here I am again. Not blogging every day, as I did once upon a time, but still freer to put virtual pen to electronic paper as the spirit moves me. I have a lot to catch up on–for example, I understand there was an election last fall, and I hear it may not have gone my way–but the first order of business is to address the questions I have had from library folk interested in doctoral programs. Note that my advice is not directed at librarians whose goal is to become faculty in LIS programs. Dropping Back In One popular question comes from people who had dropped out of doctoral programs. Could they ever be accepted into a program again? I’m proof there is a patron saint for second chances. I spent one semester in a doctoral program in 1995 and dropped out for a variety of reasons–wrong time, wrong place, too many life events happening. At the time, I felt that dropping out was the academic equivalent of You’ll Never Eat Lunch In This Town Again, but part of higher education is a series of head games, and that was one of them. The second time around, I had a much clearer idea of what I wanted from a program and what kind of program would work for me, and I had the confluence of good timing and good luck. The advice Tom Galvin gave me in 1999, when Sandy and I were living in Albany and when Tom–a longtime ALA activist and former ALA Exec Director–was teaching at SUNY Albany, still seems sound: you can drop out of one program and still find your path back to a doctorate, just don’t drop out of two programs. I also have friends who suffered through a semester or two, then decided it wasn’t for them. When I started the program, I remember thinking “I need this Ph.D. because I could never get a job at, for example, X without it.” Then I watched as someone quite accomplished, with no interest in ever pursuing even a second masters, was hired at X. There is no shame in deciding the cost/benefit analysis isn’t there for you–though I learned, through this experience, that I was in the program for other, more sustainable reasons. Selecting Your Program I am also asked what program to attend. To that my answer is, unless you are very young and can afford to go into, and hopefully out of, significant amounts of debt, pick the program that is most affordable and allows you to continue working as a professional (though if you are at a point in life when you can afford to take a couple years off and get ‘er done, more power to you). That could be a degree offered by your institution or in cooperation with another institution, or otherwise at least partially subsidized. I remember pointing out to an astonished colleague that the Ed.D. he earned for free (plus many Saturdays of sweat equity) was easily worth $65,000, based on the tuition rate at his institution. Speaking of which, I get asked about Ph.D. versus Ed.D. This can be a touchy question. 
My take: follow the most practical and affordable path available to you that gets you the degree you will be satisfied with and that will be the most useful to you in your career. But whether Ed.D. or Ph.D., it’s still more letters after your name than you had before you started. Where Does It Hurt? What’s the hardest part of a doctoral program? For me, that was a two-way tie between the semester coursework and the comprehensive exams. The semester work was challenging because it couldn’t be set aside or compartmentalized. The five-day intensives were really seven days for me as I had to fly from the Left Coast to Boston. The coursework had deadlines that couldn’t be put aside during inevitable crises. The second semester was the hardest, for so many reasons, not the least of which is that once I had burned off the initial adrenaline, the finish line seemed impossibly far away; meanwhile, the tedium of balancing school and work was settling in, and I was floundering in alien subjects I was struggling to learn long-distance. Don’t get me wrong, the coursework was often excellent: managing in a political environment, strategic finance, human resources, and other very practical and interesting topics. But it was a bucket o’ work, and when I called a colleague with a question about chair manufacturers (as one does) and heard she was mired in her second semester, I immediately informed her This Too Shall Pass. Ah, the comprehensive exams. I would say I shall remember them always, except they destroyed so much of my frontal lobe, that will not be possible. The comps required memorizing piles of citations–authors and years, with salient points–to regurgitate during two four-hour closed-book tests.  I told myself afterwards that the comps helped me synthesize major concepts in grand theory, which is a dubious claim but at least made me feel better about the ordeal. A number of students in my program helped me with comps. My favorite memory is of colleague Gary Shaffer, who called me from what sounded like a windswept city corner to offer his advice. I kept hearing this crinkling sound. The crinkling became louder. “Always have your cards with you,” Gary said. He had brought a sound prop: the bag of index cards he used to constantly drill himself. I committed myself to continuous study until done, helped by partnering with my colleague Chuck in long-distance comps prep. We didn’t study together, but we compared timelines and kept one another apprised of our progress. You can survive a doctoral program without a study buddy, but whew, is it easier if you have one. Comps were an area where I started with old tech–good old paper index cards–and then asked myself, is this how it’s done these days? After research, I moved on to electronic flashcards through Quizlet. When I wasn’t flipping through text cards on my phone, iPad, or computer, I was listening to the cards on my phone during my run or while driving around running errands. Writing != Not Writing So about that dissertation. It was a humongous amount of work, but the qualifying paper that preceded it and the coursework and instruction in producing dissertation-quality research gave me the research design skills I needed to pull it off. Once I had the data gathered, it was just a lot of writing. This, I can do. Not everyone can. Writing is two things (well, writing is many things, but we’ll stick with two for now): it is a skill, and it is a discipline. If you do not have those two things, writing will be a third thing: impossible. 
Here is my method. It’s simple. You schedule yourself, you show up, and you write. You do not talk about how you are going to write, unless you are actually going to write. You do not tweet that you are writing (because then you are tweeting, not writing). You do not do other things and feel guilty because you are not writing. (If you do other things, embrace them fully.) I would write write write write write, at the same chair at the same desk (really, a CostCo folding table) facing the same wall with the same prompts secured to the wall with painter’s tape that on warm days would loosen, requiring me to crawl under my “desk” to retrieve the scattered papers, which on many days was pretty much my only form of exercise. Then I would write write write write write some more, on weekends, holiday breaks, and the occasional “dissercation day,” as I referred to vacation days set aside for this purpose. Dissercation Days had the added value that  I was very conscious I was using vacation time to write, so I didn’t procrastinate–though in general I find procrastinating at my desk a poor use of time; if I’m going to procrastinate, let me at least get some fresh air. People will advise you when and how to write. A couple weekends ago I was rereading Stephen King’s On Writing–now that I can read real books again–in which King recommends writing every day. If that works for you, great. What worked for me was using weekends, holidays, or vacation days; writing early in the day, often starting as early as 4 am; taking a short exercise break or powering through until mid-afternoon; and then stopping no later than 4 pm, many times more like 2 pm if I hadn’t stopped by then. When I tried to write on weekday mornings, work would distract me. Not actual tasks, but the thought of work. It would creep into my brain and then I would feel the urgent need to see if the building consultant had replied to my email or if I had the agenda ready for the program and marketing meeting. It also takes me about an hour to get into a writing groove, so by the time the words were flowing it was time to get ready for work. As for evenings, a friend of mine observed that I’m a lark, not an owl. The muse flees me by mid-afternoon. (This also meant I saved the more chore-like tasks of writing for the afternoon.) The key is to find your own groove and stick to it. If your groove isn’t working, maybe it’s not your groove after all. Do not take off too much time between writing sessions. I had to do that a couple of times for six to eight weeks each time, during life events such as household moves and so on, and it took some revisiting to reacquaint myself with my writing (which was Stephen King’s main, and excellent, point in his recommendation to write daily). Even when I was writing on a regular basis I often spent at least an hour at the start of the weekend rereading my writing from page 1 to ensure that my most recent writing had a coherent flow of reasoning and narrative and that the writing for that day would be its logical descendant. Another universal piece of advice is to turn off the technology. I see people tweeting “I’m writing my dissertation right now” and I think, no you aren’t. I used a Mac app called Howler timer to give me writing sieges of 45, 60, 75, or 90 minutes, depending on my degree of focus for that day, during which all interruptions–email, Facebook, Twitter, etc.–were turned off. Twitter and Facebook became snack breaks, though I timed those snacks as well. 
I had favorite Pandora stations to keep me company and drown out ambient noise, and many, many cups of herbal tea. Technology Will Save Us All A few technical notes about technology and doctoral programs. With the exception of the constant allure of social networks and work email, it’s a good thing. I used Khan Academy and online flash cards to study for the math portion of the GRE. As noted earlier, I used Quizlet for my comps, in part because this very inexpensive program not only allowed me to create digital flashcards but also read them aloud to me on my iPhone while I exercised or ran errands. I conducted interviews using FaceTime with an inexpensive plug-in, Call Recorder, that effortlessly produced digital recordings, from which the audio files could be easily split out. I then emailed the audio files to Valerie, my transcriptionist, who lives several thousand miles away but always felt as if she were in the next room, swiftly and flawlessly producing transcripts. I used Dedoose, a cloud-based analytical product, to mark up the narratives, and with the justifiable paranoia of any doctoral student, exported the output to multiple secure online locations. I dimly recall life before such technology, but cannot fathom operating in such a world again, or how much longer some of the tasks would have taken. I spent some solid coin on things like paying a transcriptionist, but when I watch friends struggling to transcribe their own recordings, I have no regrets. There are parts of my dissertation I am exceptionally proud of, but I admit particular pride for my automatically-generated table of contents, just one of many skills I learned through YouTube (spoiler alert: the challenge is not marking up the text, it’s changing the styles to match your requirements. Word could really use a style set called Just Times Roman Please). And of course, there were various library catalogs and databases, and hundreds of e-journals to plumb, activity I accomplished as far away from your typical “library discovery layer” as possible. You can take Google Scholar away from me when you pry it from my cold, dead hands. I also plowed through a lot of print books, and many times had to do backflips to get the book in that format. Journal articles work great in e-format (though I do have a leaning paper pillar of printed journal articles left over from comps review and classes). Books, not so much. I needed to have five to fifteen books simultaneously open during a writing session, something ebooks are lame at. I don’t get romantic about the smell of paper blah blah blah, but when I’m writing, I need my tools in the most immediately accessible format possible, and for me that is digital for articles and paper for books. Nothing Succeeds Like Success Your cohort can be very important, and indeed I remember all of them with fondness but one with particular gratitude. Nevertheless, you alone will cross the finish line. I was unnerved when one member of our cohort dropped out after the first semester, but I shouldn’t have been. Doctoral student attrition happens throughout the academy, no less so in LibraryLand. Like the military, or marriage, you really have no idea what it’s like until you’re in it, and it’s not for everyone. 
It should be noted that the program I graduated from has graduated, or will graduate, nearly all of the students who made it past the first two semesters, which in turn is most of the people who entered the program in its short but glorious life–another question you should investigate while looking at programs. It turned out that for a variety of reasons that made sense, the cohort I was in was the last for this particular doctoral program. That added a certain pressure since each class was the last one to ever be offered, but it also encouraged me to keep my eyes on the prize. I also, very significantly, had a very supportive committee, and most critically, I fully believed they wanted me to succeed. I also had a very supportive spouse, with whom I racked up an infinity of backlogged honey-dos and I-owe-you-for-this promises. Regarding success and failure, at the beginning of the program, I asked if anyone had ever failed out of the program. The answer was no, everyone who left self-selected. I later asked the same question regarding comps: had anyone failed comps? The answer was that a student or two had retaken a section of comps in order to pass, but no one had completely failed (and you got one do-over if that happened). These were crucial questions for me. It also helped me to reflect on students who had bigger jobs, or were also raising kids, or otherwise were generally worse off than me in the distraction department. If so-and-so, with the big Ivy League job, or so-and-so, with the tiny infant, could do it, couldn’t I? (There is a fallacy inherent here that more prestigious schools are harder to administer, but it is a fallacy that comforted me many a day.) Onward I am asked what I will “do” with my Ph.D. In higher education, a doctorate is the expected degree for administrators, and indeed, the news of my successful doctoral defense was met with comments such as “welcome to the club.” So, mission accomplished. Also, I have a job I love, but having better marketability is never a bad idea, particularly in a political moment that can best be described as volatile and unpredictable. I can consult. I can teach (yes, I already could teach, but now more fancy-pants). I could make a reservation at a swanky bistro under the name Dr. Oatmeal and only half of that would be a fabrication. The world is my oyster! Frankly, I did not enter the program with the idea that I would gain skills and develop the ability to conduct doctoral-quality research (I was really shooting for the fancy six-sided tam), but that happened and I am pondering what to do with this expertise. I already have the joy of being pedantic, if only quietly to myself. Don’t tell me you are writing a “case study” unless it has the elements of a case study, not to mention the components of any true research design. Otherwise it’s just anecdata. And of course, when it comes to owning the area of LGBTQ leadership in higher education, I am totally M.C. Hammer: u can’t touch this! I would not mind being part of the solution for addressing the dubious quality of so much LIS “research.” LibraryLand needs more programs such as the Institute for Research Design in Librarianship to address the sorry fact that basic knowledge of the fundamentals of producing industry-appropriate research is in most cases not required for a masters degree in library science, which at least for academic librarianship, given the student learning objectives we claim to support, is absurd. 
I also want to write a book, probably continuing the work I have been doing with documenting the working experiences of LGBTQ librarians. But first I need to sort and purge my home office, revisit places such as Hogwarts and Narnia, and catch up on some of those honey-dos and I-owe-you-for-this promises. And buy a six-sided tam. Bookmark to: Filed in Uncategorized | | Comments (2) A scholar’s pool of tears, Part 2: The pre in preprint means not done yet Tuesday, January 10, 2017 Note, for two more days, January 10 and 11, you (as in all of you) have free access to my article, To be real: Antecedents and consequences of sexual identity disclosure by academic library directors. Then it drops behind a paywall and sits there for a year. When I wrote Part 1 of this blog post in late September, I had keen ambitions of concluding this two-part series by discussing “the intricacies of navigating the liminal world of OA that is not born OA; the OA advocacy happening in my world; and the implications of the publishing environment scholars now work in.” Since then, the world, and my priorities, have changed. My goals are to prevent nuclear winter and lead our library to its first significant building upgrades since it opened close to 20 years ago. But at some point I said on Twitter, in response to a conversation about posting preprints, that I would explain why I won’t post a preprint of To be real. And the answer is very simple: because what qualifies as a preprint for Elsevier is a draft of the final product that presents my writing before I incorporated significant stylistic guidance from the second reviewer, and that’s not a version of the article I want people to read. In the pre-Elsevier draft, as noted before, my research is present, but it is overshadowed by clumsy style decisions that Reviewer 2 presented far more politely than the following summary suggests: quotations that were too brief; rushing into the next thought without adequately closing out the previous thought; failure to loop back to link the literature review to the discussion; overlooking a chance to address the underlying meaning of this research; and a boggy conclusion. A crucial piece of advice from Reviewer 2 was to use pseudonyms or labels to make the participants more real. All of this advice led to a final product, the one I have chosen to show the world. That’s really all there is to it. It would be better for the world if my article were in an open access publication, but regardless of where it is published, I as the author choose to share what I know is my best work, not my work in progress. The OA world–all sides of it, including those arguing against OA–has some loud, confident voices with plenty of “shoulds,” such as the guy (and so many loud OA voices are male) who on a discussion list excoriated an author who was selling self-published books on Amazon by saying “people who value open access should praise those scholars who do and scorn those scholars who don’t.” There’s an encouraging approach! Then there are the loud voices announcing the death of OA when a journal’s submissions drop, followed by the people who declare all repositories are Potemkin villages, and let’s not forget the fellow who curates a directory of predatory OA journals that is routinely cited as an example of what’s wrong with scholarly publishing. I keep saying, the scholarly-industrial complex is broken. 
I’m beyond proud that the Council of Library Deans for the California State University–my 22 peers–voted to encourage and advocate for open access publishing in the CSU system. I’m also excited that my library has its first scholarly communications librarian who is going to bat on open access and open educational resources and all other things open–a position that in consultation with the library faculty I prioritized as our first hire in a series of retirement/moving-on faculty hires. But none of that translates to sharing work I consider unfinished. We need to fix things in scholarly publishing and there is no easy, or single, path. And there are many other things happening in the world right now. I respect every author’s decision about what they will share with the world and when and how they will share it. As for my decision–you have it here. Bookmark to: Filed in Uncategorized | | Comments Off on A scholar’s pool of tears, Part 2: The pre in preprint means not done yet
futurearchives-blogspot-com-8781 ---- futureArch, or the future of archives... 
Monday, 5 September 2016 This blog is no longer being updated But you will find posts on some of our digital archives work here: http://blogs.bodleian.ox.ac.uk/archivesandmanuscripts/category/activity/digital-archives/ Posted by Susan Thomas at 17:33 No comments: Thursday, 31 October 2013 Born Digital: Guidance for Donors, Dealers, and Archival Repositories Today CLIR published a report which is designed to provide guidance on the acquisition of archives in a digital world. The report provides recommendations for donors and dealers, and for repository staff, based on the experiences of archivists and curators at ten repositories in the UK and US, including the Bodleian. You can read it here: http://www.clir.org/pubs/reports/pub159 Posted by Susan Thomas at 17:49 No comments: Labels: acquisitions, dealers, donors, guidance, scoping, sensitivity review, transfers Thursday, 31 January 2013 Digital Preservation: What I wish I knew before I started The Digital Preservation Coalition (DPC) and Archives and Records Association event ‘Digital Preservation: What I wish I knew before I started, 2013’ took place at Birkbeck College, London on 24 January 2013. A half-day conference, it brought together a group of leading specialists in the field to discuss the challenges of digital collecting. William Kilbride kicked off events with his presentation ‘What’s the problem with digital preservation’. He looked at the traditional, or in his words “bleak”, approach that is too often characterised by data loss. William suggested we need to create new approaches, such as understanding the actual potential and value of output; data loss is not the issue if there is no practical case for keeping or digitising material. Some key challenges facing digital archivists were also outlined and it was argued that impediments such as obsolescence issues and storage media failure are a problem bigger than one institution, and collaboration across the profession is paramount. Helen Hockx-Yu discussed how the British Library is collaborating with other institutions to archive websites of historical and cultural importance through the UK Web Archive. Interestingly, web archiving at the British Library is now a distinct business unit with a team of eight people. Like William, Helen also emphasised how useful it is to share experiences and work together, both internally and externally. Next, Dave Thompson, Digital Curator at the Wellcome Library, stepped up with a lively presentation entitled ‘So You Want to go Digital’. For Dave, it is “not all glamour, metadata and preservation events”, which he illustrated with an example of his diary for the week. He then looked at the planning side of digital preservation, arguing that if digital preservation is going to work, not only are we required to be creative, but we need to be sure what we are doing is sustainable. Dave highlighted some key lessons from his career thus far: 1. We must be willing to embrace change. 2. Data preservation is not solely an exercise in technology but requires engagement with data and consumers. 3. Little things we do every day in the workplace are essential to efficient digital preservation, including backup, planning, IT infrastructure, maintenance and virus checking. 4. It needs to be easy to do and within our control, otherwise the end product is not preservation. 5. Continued training is essential so we can make the right decisions in appraisal, arrangement, context, description and preservation. 6. 
We must understand copyright access. Patricia Sleeman, Digital Archivist at University of London Computer Centre then highlighted a selection of practical skills that should underpin how we move forward with digital preservation. For instance, she stressed that information without context is meaningless and has little value without the appropriate metadata. Like the other speakers, she suggested planning is paramount, and before we start a project we must look forward and learn about how we will finish it. As such, project management is an essential tool, including the ability to understand budgets. Adrian Brown from the Parliamentary Archives continued with his presentation 'A Day in the Life of a Digital Archivist'. His talk was a real eye-opener on just how busy and varied the role is. A typical day for Adrian might involve talking to information owners about possible transfers, ingesting and cataloguing new records into the digital repository, web archiving, providing demos to various groups, drafting preservation policies and developing future requirements such as building software, software testing and preservation planning. No room to be bored here! Like Dave Thompson, Adrian noted that while there are more routine tasks such as answering emails and endless meetings, the rewards from being involved in a new and emerging discipline far outweigh the more mundane moments. We then heard from Simon Rooks from the BBC Multi-Media Archive who described the varied roles at his work (I think some of the audience were feeling quite envious here!). In keeping with the theme of the day, Simon reflected on his career path. Originally trained as a librarian, he argued that he would have benefited immensely as a digital archivist if he had learnt the key functions of an archivist’s role early on. He emphasised how the same archival principles (intake, appraisal and selection, cataloguing, access etc.) underpin our practices, whether records are paper or digital, and whether we are in archives or records management. These basic functions help to manage many of the issues concerning digital content. Simon added that the OAIS functional model is an approach that has encouraged multi-disciplinary team-work amongst those working at the BBC. After some coffee there followed a Q&A session, which proved lively and engaging. A lot of ground was covered including how appropriate it is to distinguish 'digital archivists' from 'archivists'. We also looked at issues of cost modelling and it was suggested that while we need to articulate budgets better, we should perhaps be less obsessed with costs and focus on the actual benefits and return of investment from projects. There was then some debate about what students should expect from undertaking the professional course. Most agreed that it is simply not enough to have the professional qualification, and continually acquiring new skill sets is essential. A highly enjoyable afternoon then, with some thought-provoking presentations, which were less about the techie side of digital preservation, and more a valuable lesson on the planning and strategies involved in managing digital assets. Communications, continued learning and project planning were central themes of the day, and importantly, that we should be seeking to build something that will have value and worth. 
Posted by Anonymous at 10:42 No comments: Tuesday, 13 November 2012 Transcribe at the arcHIVE I do worry from time to time that textual analogue records will come to suffer from their lack of searchability when compared with their born-digital peers. For those records that have been digitised, crowd-sourcing transcription could be an answer. A rather neat example of just that is the arcHIVE platform from the National Archives of Australia. arHIVE is a pilot from NAA's labs which allows anyone to contribute to the transcription of records. To get started they have chosen a selection of records from their Brisbane office which are 'known to be popular'. Not too many of them just yet, but at this stage I guess they're just trying to prove the concept works. All the items have been OCR-ed, and users can choose to improve or overwrite the results from the OCR process. There are lots of nice features here, including the ability to choose documents by a difficulty rating (easy, medium or hard) or by type (a description of the series by the looks of it). The competitive may be inspired by the presence of a leader board, while the more collaborative may appreciate the ability to do as much as you can, and leave the transcription for someone else to finish up later. You can register for access to some features, but you don't have to either. Very nice. Posted by Susan Thomas at 09:37 No comments: Labels: crowdsourcing, searchability, transcription Friday, 19 October 2012 Atlas of digital damages An Atlas of digital damage has been created on Flickr, which will provide a handy resource for illustrating where digital preservation has failed. Perhaps 'failed' is a little strong. In some cases the imperfection may be an acceptable trade off. A nice, and useful, idea. Contribute here. Posted by Susan Thomas at 17:48 No comments: Labels: corruption, damage Saturday, 13 October 2012 DayOfDigitalArchives 2012 Yesterday was Day of Digital Archives 2012! (And yes, I'm a little late posting...) This 'Day' was initiated last year to encourage those working with digital archives to use social media to raise awareness of digital archives: "By collectively documenting what we do, we will be answering questions like: What are digital archives? Who uses them? How are they created and managed? Why are they important?" . So in that spirit, here is a whizz through my week. Coincidentally not only does this week include the Day of Digital Archives but it's also the week that the Digital Preservation Coalition (or DPC) celebrated its 10th birthday. On Monday afternoon I went to the reception at the House of Lords to celebrate that landmark anniversary. A lovely event, during which the shortlist for the three digital preservation awards was announced. It's great to see three award categories this time around, including one that takes a longer view: 'the most outstanding contribution to digital preservation in the last decade'. That's quite an accolade. On the train journey home from the awards I found some quiet time to review a guidance document on the subject of acquiring born-digital materials. There is something about being on a train that puts my brain in the right mode for this kind of work. Nearing its final form, this guidance is the result of a collaboration between colleagues from a handful of archive repositories. The document will be out for further review before too long, and if we've been successful in our work it should prove helpful to creators, donors, dealers and repositories. 
Part of Tuesday I spent reviewing oral history guidance drafted by a colleague to support the efforts of Oxford Medical Alumni in recording interviews with significant figures in the world of Oxford medicine. Oral histories come to us in both analogue and digital formats these days, and we try to digitise the former as and when we can. The development of the guidance is in the context of our Saving Oxford Medicine initiative to capture important sources for the recent history of medicine in Oxford. One of the core activities of this initiative is survey work, and it is notable that many archives surveyed include plenty of digital material. Web archiving is another element of the 'capturing' work that the Saving Oxford Medicine team has been doing, and you can see what has been archived to-date via Archive-It, our web archiving service provider. Much of Wednesday morning was given over to a meeting of our building committee, which had very little to do with digital archives! In the afternoon, however, we were pleased to welcome visitors from MIT - Nancy McGovern and Kari Smith. I find visits like these are one of the most important ways of sharing information, experiences and know-how, and as always I got a lot out of it. I hope Nancy and Kari did too! That same afternoon, colleagues returned from a trip to London to collect another tranche of a personal archive. I'm not sure if this instalment contains much in the way of digital material, but previous ones have included hundreds of floppies and optical media, some zip discs and two hard disks. Also arriving on Wednesday, some digital Library records courtesy of our newly retired Executive Secretary; these supplement materials uploaded to BEAM (our digital archives repository) last week. On Thursday, I found some time to work with developer Carl Wilson on our SPRUCE-funded project. Becky Nielsen (our recent trainee, now studying at Glasgow) kicked off this short project with Carl, following on from her collaboration with Peter May at a SPRUCE mashup in Glasgow. I'm picking up some of the latter stages of testing and feedback work now Becky's started her studies. The development process has been an agile one with lots of chat and testing. I've found this very productive - it's motivating to see things evolving, and to be able to provide feedback early and often. For now you can see what's going on at github here, but this link will likely change once we settle on a name that's more useful than 'spruce-beam' (doesn't tell you much, does it?! Something to do with trees...) One of the primary aims of this tool is to facilitate collection analysis, so we know better what our holdings are in terms of format and content. We expect that it will be useful to others, and there will be more info. on it available soon. Friday was more SPRUCE work with Carl, among other things. Also a few meetings today - one around funding and service models for digital archiving, and a meeting of the Bodleian's eLegal Deposit Group (where my special interest is web archiving). The curious can read more about e-legal deposit at the DCMS website.  One fun thing that came out of the day was that the Saving Oxford Medicine team decided to participate in a Women in Science wikipedia editathon. This will be hosted by the Radcliffe Science Library on 26 October as part of a series of 'Engage' events on social media organised by the Bodleian and the University's Computing Services. 
It's fascinating to contemplate how the range and content of Wikipedia articles change over time, something a web archive would facilitate perhaps.  For more on working with digital archives, go take a look at the great posts at the Day of Digital Archives blog! Posted by Susan Thomas at 19:45 No comments: Labels: acquisition, collection analysis, DayofDigArc, DODA12, dpc, mashup, SPRUCE, webarchiving Friday, 8 June 2012 Sprucing up the TikaFileIdentifier As it's International Archives Day tomorrow, I thought it would be nice to quickly share some news of a project we are working on, which should help us (and others!) to carry out digital preservation work a little bit more efficiently. Following the SPRUCE mashup I attended in April, we are very pleased to be one of the organizations granted a SPRUCE Project funding award, which will allow us to 'spruce' up the TikaFileIdentifier tool. (Paul has written more about these funding awards on the OPF site.) TikaFileIdentifier is the tool which was developed at the mashup to address a problem several of us were having extracting metadata from batches of files, in our case within ISO images. Due to the nature of the mashup event the tool is still a bit rough around the edges, and this funding will allow us to improve on it. We aim to create a user interface and a simpler install process, and carry out performance improvements. Plus, if resources allow, we hope to scope some further functionality improvements. This is really great news, as with the improvements that this funding allows us to make, the TikaFileIdentifier will provide us with better metadata for our digital files more efficiently than our current system of manually checking each file in a disk image. Hopefully the simpler user interface and other improvements means that other repositories will want to make use of it as well; I certainly think it will be very useful! Posted by Rebecca Nielsen at 17:18 No comments: Labels: metadata, SPRUCE, TikaFileIdentifier Friday, 20 April 2012 SPRUCE Mashup: 16th-18th April 2012 Earlier this week I attended a 3 day mashup event in Glasgow, organised as part of the SPRUCE project.  SPRUCE aims to enable Higher Education Institutions to address preservation gaps and articulate the business case of digital preservation, and the mashup serves as a way to bring practitioners and developers together to work on these problems. Practitioners took along a collection which they were having issues with, and were paired off with a developer who could work on a tool to provide a solution.  Day 1 After some short presentations on the purpose of SPRUCE and the aims of the mashup, the practitioners presented some lightning talks on our collections and problems. These included dealing with email attachments, preserving content off Facebook, software emulation, black areas in scanned images, and identifying file formats with incorrect extensions, amongst others. I took along some disk images, as we find it very time-consuming to find out date ranges, file types and content of the files in the disk image, and we wanted a more efficient way to get this metadata. More information on the collections and issues presented can be found at the wiki. After a short break for coffee (and excellent cakes and biscuits) we were sorted into small groups of collection owners and developers to discuss our issues in more detail. 
In my group this led to conversations about natural language processing, and the possibilities of using predefined subjects to identify files as being about a particular topic, which we thought could be really helpful, but somewhat impossible to create in a couple of days! We were then allocated our developers. As there were a few of us with problems with file identification, we were assigned to the same developer, Peter May from the BL. The day ended with a short presentation from William Kilbride on the value of digital collections and Neil Beagrie's benefits framework. Day 2 The developers were packed off to another room to work on coding, while we collection owners started to look into the business case for digital preservation. We used Beagrie’s framework to consider the three dimensions of benefits (direct or indirect, near- or long-term, and internal or external), as they apply to our institutions. When we reported back, it was interesting to see how different organisations benefit in different ways. We also looked at various stakeholders and how important or influential they are to digital preservation. Write ups of these sessions are also available at the wiki.   The developers came back at several points throughout the day to share their progress with us, and by lunchtime the first solution had been found! The first steps to solving our problem were being made; Peter had found a program, Apache Tika, which can parse a file and extract metadata (it can also identify the content type of files with incorrect extensions), and had written a script so that it could work through a directory of files, and output the information into a CSV spreadsheet. This was a really promising start, especially due to the amount of metadata that could potentially be extracted (provided it exists within the file), and the ability to identify file types with incorrect extensions. Day 3 We had another catch up with the developers and their overnight progress. Peter had written a script that took the information from the CSV file and summarised it into one row, so that it fits into the spreadsheets we use at BEAM. Unfortunately, mounting the ISO image to check it with Apache Tika was slightly more complicated than anticipated, so our disk images couldn't be checked this way without further work. While the developers set about finalizing their solutions, we continued to work on the business case, doing a skills gap analysis to consider whether our institutions had the skills and resources to carry out digital preservation. Reporting back, we had a very interesting discussion on skills gaps within the broader archives sector, and the need to provide digital preservation training to students as well as existing professionals. We then had to prepare an ‘elevator pitch’ for those occasions when we find ourselves in a lift with senior management, which neatly brought together all the things we had discussed, as we had to explain the specific benefits of digital preservation to our institution and our goals in about a minute.  To wrap up the developers presented their solutions, which solved many of the problems we had arrived with. A last minute breakthrough in mounting ISO images using  WinCDEmu and running scripts on them meant that we are able to use the Tika script on our disk images. However, because we were so short on time, there are still some small problems that need addressing. 
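To make that workflow concrete, here is a rough sketch of the kind of script described above: walk a directory of files (for a disk image, the mount point created with WinCDEmu or a loop mount), run Apache Tika over each file, write per-file metadata to a CSV, and finish with a one-row summary of file count, formats and date range. This is not the actual TikaFileIdentifier code; it is an illustration assuming the tika-python bindings (which need Java available), and the metadata keys and the /mnt/disk_image path are examples only.

import csv
import os
from collections import Counter

from tika import parser  # tika-python runs Apache Tika behind the scenes


def tika_report(root_dir, csv_path):
    """Walk root_dir, extract Tika metadata per file, write a CSV, return a summary row."""
    rows = []
    for dirpath, _dirs, filenames in os.walk(root_dir):
        for name in filenames:
            path = os.path.join(dirpath, name)
            meta = parser.from_file(path).get("metadata") or {}
            rows.append({
                "path": path,
                # field names vary by format; these are illustrative
                "content_type": str(meta.get("Content-Type", "unknown")),
                "created": str(meta.get("dcterms:created", "")),
                "modified": str(meta.get("dcterms:modified", "")),
            })

    # one row per file, as in the first script described in the post
    with open(csv_path, "w", newline="") as out:
        writer = csv.DictWriter(out, fieldnames=["path", "content_type", "created", "modified"])
        writer.writeheader()
        writer.writerows(rows)

    # a single summary row of the kind used for collection spreadsheets:
    # how many files, which formats, and the date range they span
    types = Counter(r["content_type"] for r in rows)
    dates = sorted(d for r in rows for d in (r["created"], r["modified"]) if d)
    return {
        "files": len(rows),
        "formats": dict(types),
        "earliest": dates[0] if dates else "",
        "latest": dates[-1] if dates else "",
    }


print(tika_report("/mnt/disk_image", "tika_metadata.csv"))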
I'm really happy with our solution, and I was very impressed by all the developers and how much they were able to get done in such a short space of time. I felt that this event was a very useful way to get thinking about the business case for what we do, and to get to see what other people within the sector are doing and what problems they are facing. It was also really helpful as a non-techie to get to talk with developers and get an idea of what it is possible to build tools to do (and get them made!). I would definitely recommend this type of event – in fact, I’d love to go along again if I get the opportunity! Posted by Rebecca Nielsen at 15:52 2 comments: Monday, 26 March 2012 Media Recognition: DV part 3 DVCAM (encoding) Type: Digital videotape cassette encoding Introduced: 1996 Active: Yes, but few new camcorders are being produced. Cessation: - Capacity: 184 minutes (large), 40 minutes (MiniDV). Compatibility: DVCAM is an enhancement of the widely adopted DV format, and uses the same encoding. Cassettes recorded in DVCAM format can be played back in DVCAM VTRs (Video Tape Recorders), newer DV VTRs (made after the introduction of DVCAM), and DVCPRO VTRs, as long as the correct settings are specified (this resamples the signal to 4:1:1). DVCAM can also be played back in compatible HDV players. Users: Professional / Industrial. File Systems: - Common Manufacturers: Sony, Ikegami. DVCAM is Sony’s enhancement of the DV format for the professional market. DVCAM uses the same encoding as DV, although it records ‘locked’ rather than ‘unlocked’ audio. It also differs from DV as it has a track width of 15 microns and a tape speed of 28.215 mm/sec to make it more robust. Any DV cassette can contain DVCAM format video, but some are sold with DVCAM branding on them. Recognition DVCAM labelled cassettes come in large (125.1 x 78 x 14.6 mm) or MiniDV (66 x 48 x 12.2mm) sizes. Tape width is ¼”. Large cassettes are used in editing and recording decks, while the smaller cassettes are used in camcorders. They are marked with the DVCAM logo, usually in the upper-right hand corner.  HDV (encoding) Type: Digital videotape cassette encoding Introduced: 2003 Active: Yes, although industry experts do not expect many new HDV products. Cessation: - Capacity: 1 hour (MiniDV), up to 4.5 hours (large) Compatibility: Video is recorded in the popular MPEG-2 video format. Files can be transferred to computers without loss of quality using an IEEE 1394 connection. There are two types of HDV, HDV 720p and HDV 1080, which are not cross-compatible. HDV can be played back in HDV VTRs. These are often able to support other formats such as DV and DVCAM. Users: Amateur/Professional File Systems: - Common Manufacturers: Format developed by JVC, Sony, Canon and Sharp. Unlike the other DV enhancements, HDV uses MPEG-2 compression rather than DV encoding. Any DV cassette can contain HDV format video, but some are sold with HDV branding on them.  There are two different types of HDV: HDV 720p (HD1, made by JVC) and HDV 1080 (HD2, made by Sony and Canon). HDV 1080 devices are not generally compatible with HDV 720p devices. The type of HDV used is not always identified on the cassette itself, as it depends on the camcorder used rather than the cassette. Recognition  HDV is a tape only format which can be recorded on normal DV cassettes. Some MiniDV cassettes with lower dropout rates are indicated as being for HDV, either with text or the HDV logo. These are not essential for recording HDV video.  
Posted by Rebecca Nielsen at 14:52 No comments: Labels: digital video, DVCAM, HDV, media recoginition, video Media Recognition: DV part 2 DV (encoding) Type: Digital videotape cassette encoding Introduced: 1995 Active: Yes, but tapeless formats such as MPEG-1, MPEG-2 and MPEG-4 are becoming more popular. Cessation: - Capacity: MiniDV cassettes can hold up to 80/120 minutes SP/LP. Medium cassette size can hold up to 3.0/4.6 hrs SP/LP. Files sizes can be up to 1GB per 4 minutes of recording. Compatibility: DV format is widely adopted. Cassettes recorded in the DV format can be played back on DVCAM, DVCPRO and HDV replay devices. However, LP recordings cannot be played back in these machines. Users: DV is aimed at a consumer market – may also be used by ‘prosumer’ film makers. File Systems: - Common Manufacturers: A consortium of over 60 manufacturers including Sony, Panasonic, JVC, Canon, and Sharp. DV has a track width of 10 microns and a tape speed of 18.81mm/sec. It can be found on any type of DV cassette, regardless of branding, although most commonly it is the format used on MiniDV cassettes.  Recognition DV cassettes are usually found in the small size, known as MiniDV. Medium size (97.5 × 64.5 × 14.6 mm) DV cassettes are also available, although these are not as popular as MiniDV. DV cassettes are labelled with the DV logo. DVCPRO (encoding) Type: Digital videotape cassette encoding Introduced: 1995 (DVCPRO), 1997 (DVCPRO 50), 2000 (DVCPRO HD) Active: Yes, but few new camcorders are being produced. Cessation: - Capacity: 126 minutes (large), 66 minutes (medium). Compatibility: DVCPRO is an enhancement of the widely adopted DV format, and uses the same encoding. Cassettes recorded in DVCPRO format can be played back only in DVCPRO Video Tape Recorders (VTRs) and some DVCAM VTRs. Users: Professional / Industrial; designed for electronic news gathering File Systems: - Common Manufacturers: Panasonic, also Philips, Ikegami and Hitachi. DVCPRO is Panasonic’s enhancement of the DV format, which is aimed at a professional market. DVCPRO uses the same encoding as DV, but it features ‘locked’ audio, and uses 4:1:1 sampling instead of 4:2:0. It has an 18 micron track width, and a tape speed of 33.82 mm/sec which makes it more robust. DVCPRO uses Metal Particle (MP) tape rather than Metal Evaporate( ME) to improve durability. DVCPRO 50 and DVCPRO HD are further developments of DVCPRO, which use the equivalent of 2 or 4 DV codecs in parallel to increase the video data rate. Any DV cassette can contain DVCPRO format video, but some are sold with DVCPRO branding on them. Recognition DVCPRO branded cassettes come in medium (97.5 × 64.5 × 14.6mm) or large (125 × 78 × 14.6mm) cassette sizes. The medium size is for use in camcorders, and the large size in editing and recording decks. DVCPRO 50 and DVCPRO HD branded cassettes are extra-large cassettes (172 x 102 x 14.6mm). Tape width is ¼”. DVCPRO labelled cassettes have different coloured tape doors depending on their type; DVCPRO has a yellow tape door, DVCPRO50 has a blue tape door, and DVCPRO HD has a red tape door. Images of DVCPRO cassettes are available at the Panasonic website. Posted by Rebecca Nielsen at 14:31 No comments: Labels: digital video, DV, DVCPRO, media recoginition, video Media Recognition: DV part 1 DV can be used to refer to both a digital tape format, and a codec for digital video. DV tape usually carries video encoded with the DV codec, although it can hold any type of data. 
The DV format was developed in the mid 1990s by a consortium of video manufacturers, including Sony, JVC and Panasonic, and quickly became the de facto standard for home video production after introduction in 1995. Videos are recorded in .dv or .dif formats, or wrapped in an AVI, QuickTime or MXF container. These can be easily transferred to a computer with no loss of data over an IEEE 1394 (Fire Wire) connection. DV tape is ¼ inch (6.35mm) wide. DV cassettes come in four different sizes: Small, also known as MiniDV (66 x 48 x 12.2 mm), medium (97.5 × 64.5 × 14.6 mm), large (125.1 x 78 x 14.6 mm), and extra-large (172 x 102 x 14.6 mm). MiniDV is the most popular cassette size. DV cassettes can be encoded with one of four formats; DV, DVCAM, DVCPRO, or HDV. DV is the original encoding, and is used in consumer devices. DVCPRO and DVCAM were developed by Panasonic and Sony respectively as an enhancement of DV, and are aimed at a professional market. The basic encoding algorithm is the same as with DV, but a higher track width (18 and 15 microns versus DV’s 10 micron track width) and faster tape speed means that these formats are more robust and better suited to professional users. HDV is a high-definition variant, aimed at professionals and consumers, which uses MPEG-2 compression rather than the DV format. Depending on the recording device, any of the four DV encodings can be recorded on any size DV cassette. However, due to different recording speeds, the formats are not always backwards compatible. A cassette recorded in an enhanced format, such as HDV, DVCAM or DVCPRO, will not play back on a standard DV player. Also, as they are supported by different companies, there are some issues with playing back a DVCPRO cassette on DVCAM equipment, and vice versa. Although all DV cassette sizes can record any format of DV, some are marketed specifically as being of a certain type; e.g. DVCAM. The guide below looks at some of the most common varieties of DV cassette that might be encountered, and the encodings that may be used with them. It is important to remember that any type of encoding may be found on any kind of cassette, depending on what system the video was recorded on. MiniDV (cassette) Type: Digital videotape cassette Introduced: 1995 Active: Yes, but is being replaced in popularity by hard disk and flash memory recording. At the International Consumer Electronics Show 2011 no camcorders were presented which record on tape. Cessation: - Capacity: Up to 80 minutes SP / 120 minutes LP, depending on the tape used; 60/90 minutes SP/LP is standard. This can also depend on the encoding used (see further entries). Files sizes can be up to 1GB per 4 minutes of recording. Compatibility: DV file format is widely adopted. Requires Fire Wire (IEEE 1394) port for best transfer. Users: Consumer and ‘Prosumer’ film makers, some professionals. File Systems: - Common Manufacturers: A consortium of over 60 manufacturers including Sony, Panasonic, JVC, Canon, and Sharp MiniDV refers to the size of the cassette; as noted above, it can come with any encoding. As a consumer format they generally use DV encoding. DVCAM and HDV cassettes also come in MiniDV size. MiniDV is the most popular DV cassette, and is used for consumer and semi-professional (‘prosumer’) recordings due to its high quality. Recognition These cassettes are the small cassette size, measuring 66 x 48 x 12.2mm. Tape width is ¼”. 
They carry the MiniDV logo, as seen below: Posted by Rebecca Nielsen at 13:03 No comments: Labels: digital video, DV, media recoginition, MiniDV, video Monday, 30 January 2012 Digital Preservation: What I Wish I Knew Before I Started Tuesday 24th January, 2012 Last week I attended a student conference, hosted by the Digital Preservation Coalition, on what digital preservation professionals wished they had known before they started. The event covered a great deal of the challenges faced by those involved in digital preservation, and the skills required to deal with these challenges. The similarities between traditional archiving and digital preservation were highlighted at the beginning of the afternoon, when Sarah Higgins translated terms from the OAIS model into more traditional ‘archive speak’. Dave Thompson also emphasized this connection, arguing that digital data “is just a new kind of paper”, and that trained archivists already have 85-90% of the skills needed for digital preservation. Digital preservation was shown to be a human rather than a technical challenge. Adrian Brown argued that much of the preservation process (the "boring stuff") can be automated. Dave Thompson stated that many of the technical issues of digital preservation, such as migration, have been solved, and that the challenge we now face is to retain the context and significance of the data. The point made throughout the afternoon was that you don’t need to be a computer expert in order to carry out effective digital preservation. The urgency of intervention was another key lesson for the afternoon. As William Kilbride put it; digital preservation won’t do itself, won’t go away, and we shouldn't wait for perfection before we begin to act. Access to data in the future is not guaranteed without input now, and digital data is particularly intolerant to gaps in preservation. Andrew Fetherstone added to this argument, noting that doing something is (usually) better than doing nothing, and that even if you are not in a position to carry out the whole preservation process, it is better to follow the guidelines as far as you can, rather than wait and create a backlog. The scale of digital preservation was another point illustrated throughout the afternoon. William Kilbride suggested that the days of manual processing are over, due to the sheer amount of digital data being created (estimated to reach 35ZB by 2020!). He argued that the ability to process this data is more important to the future of digital preservation than the risks of obsolescence. The impossibility of preserving all of this data was illustrated by Helen Hockx-Yu, who offered the statistic the the UK Web Archive and National Archives Web Archive combined have archived less than 1% of UK websites. Adrian Brown also pointed out that as we move towards dynamic, individualised content on the web, we must decide exactly what the information is that we are trying to preserve. During the Q&A session, it was argued that the scale of digital data means that we have to accept that we can’t preserve everything, that not everything needs to be preserved, and that there will be data loss. The importance of collaboration was another theme which was repeated by many speakers. Collaboration between institutions on a local, national and even international level was encouraged, as by sharing solutions to problems and implementing common standards we can make the task of digital preservation easier. 
This is only a selection of the points covered in a very engaging afternoon of discussion. Overall, the event showed that, despite the scale of the task, digital preservation needn't be a frightening prospect, as archivists already have many of the necessary skills. The DPC have uploaded the slides used during the event, and the event was also live-tweeted, using the hashtag #dpc_wiwik, if you are interested in finding out more. Posted by Rebecca Nielsen at 09:41 1 comment: Labels: http://www.blogger.com/img/blank.gif Tuesday, 18 October 2011 What is ‘The Future of the Past of the Web’? ‘The Future of the Past of the Web’, Digital Preservation Coalition Workshop British Library, 7 October 2011 Chrissie Webb and Liz McCarthy In his keynote address to this event – organised by the Digital Preservation Coalition , the Joint Information Systems Committee and the British Library – Herbert van der Sompel described the purpose of web archiving as combating the internet’s ‘perpetual now’. Stressing the importance to researchers of establishing the ‘temporal context’ of publications and information, he explained how the framework of his Memento Project uses a ‘ timegate’ implemented via web plugins to show what a resource was like at a particular date in the past. There is a danger, however, that not enough is being archived to provide the temporal context; for instance, although DOIs provide stable documents, the resources they link to may disappear (‘link rot’). The Memento Project Firefox plugin uses a sliding timeline (here, just below the Google search box) to let users choose an archived date A session on using web archives picked up on the theme of web continuity in a presentation by The National Archives on the UK Government Web Archive, where a redirection solution using open source software helps tackle the problems that occur when content is moved or removed and broken links result. Current projects are looking at secure web archiving, capturing internal (e.g. intranet) sources, social media capture and a semantic search tool that helps to tag ‘unstructured’ material. In a presentation that reinforced the reason for the day’s ‘use and impact’ theme, Eric Meyer of the Oxford Internet Institute wondered whether web archives were in danger of becoming the ‘dusty archives’ of the future, contrasting their lack of use with the mass digitisation of older records to make them accessible. Is this due to a lack of engagement with researchers, their lack of confidence with the material or the lingering feeling that a URL is not a ‘real’ source? Archivists need to interrupt the momentum of ‘learned’ academic behaviour, engaging researchers with new online material and developing archival resources in ways that are relevant to real research – for instance, by helping set up mechanisms for researchers to trigger archiving activity around events or interests, or making more use of server logs to help them understand use of content and web traffic. One of the themes of the second session on emerging trends was the shift from a ‘page by page’ approach to the concept of ‘data mining’ and large scale data analysis. Some of the work being done in this area is key to addressing the concerns of Eric Meyer’s presentation; it has meant working with researchers to determine what kinds and sources of data they could really use in their work. Representatives of the UK Web Archive and the Internet Archive described their innovations in this field, including visualisation and interactive tools. 
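Going back to the Memento 'timegate' idea described above, the datetime negotiation it relies on can be seen with a few lines of Python. This is a minimal sketch only, using the requests library and the Internet Archive's public timegate as an example endpoint (my own choice of endpoint, not one discussed at the workshop).

import requests

# Memento datetime negotiation (RFC 7089): ask a TimeGate for the capture of a
# resource closest to the date given in the Accept-Datetime request header.
target = "http://www.bodleian.ox.ac.uk/"
timegate = "https://web.archive.org/web/" + target  # assumed public TimeGate

response = requests.get(
    timegate,
    headers={"Accept-Datetime": "Fri, 07 Oct 2011 12:00:00 GMT"},
)
print(response.url)                              # URL of the selected memento
print(response.headers.get("Memento-Datetime"))  # when that capture was made

A browser plugin like Memento's simply wraps this negotiation up behind a timeline slider.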
Archiving social networks was also a major theme, and Wim Peters outlined the challenges of the ARCOMEM project, a collaboration between Sheffield and Hanover Universities that is tackling the problems of archiving ‘community memory’ through the social web, confronting extremely diverse and volatile content of varying quality for which future demand is uncertain. Richard Davis of the University of London Computer Centre spoke about the BlogForever project, a multi-partner initiative to preserve blogs, while Mark Williamson of Hanzo Archives spoke about web archiving from a commercial perspective, noting that companies are very interested in preserving the research opportunities online information offers. The final panel session raised the issue of the changing face of the internet, as blogs replace personal websites and social media rather than discrete pages are used to create records of events. The notion of ‘web pages’ may eventually disappear, and web archivists must be prepared to manage the dispersed data that will take (and is taking) their place. Other points discussed included the need for advocacy and better articulation of the demand for web archiving (proposed campaign: ‘Preserve!: Are you saving your digital stuff?’), duplication and deduplication of content, the use of automated selection for archiving and the question of standards. Posted by lizrosemccarthy at 13:40 No comments: Labels: Future of the Past of the Web, webarchives, workshop Older Posts Home Subscribe to: Posts (Atom) What's the futureArch blog? A place for sharing items of interest to those curating hybrid archives & manuscripts. Legacy computer bits wanted! At Bodleian Electronic Archives and Manuscripts (BEAM) we are always on the lookout for older computers, disk drives, technical manuals and software that can help us recover digital archives. If you have any such stuff that you would be willing to donate, please contact susan.thomas@bodleian.ox.ac.uk. Examples of items in our wish list include: an Apple Mac Macintosh Classic II Computer, a Wang PC 200/300 series, as well as myriad legacy operating system and word-processing software. 
futurelab-mx-5627 ---- The future is now! | Future Lab. We are Future Lab, the community of the future. We develop technology and share knowledge. We work for the future we want to see. Follow us on Facebook. We develop science- and technology-based projects: whether we get hands-on with the development ourselves or contribute through mentoring, we love being able to innovate, to help develop new technologies, to take them to different parts of Mexico, and to present our projects at events with companies working in the field. We share knowledge and promote education in technology: through workshops, talks, conferences and event appearances we share technical knowledge and the culture around building technology. We connect people and build community: we love empowering our community, supporting the great minds of the future who come to us, and linking them with whoever can help them reach their potential. "Our vision at Future Lab is to build our future, share knowledge and create the connections that help our community." Rodolfo Ferro, Co-founder of Future Lab. Come and see everything we are doing! Future Lab on Facebook. © 2020 Future Lab. galencharlton-com-3162 ---- Meta Interchange – Libraries, computing, metadata, and more. Trading for images Posted: 23 February 2020 Categories: Libraries, Patron Privacy Let's search a Koha catalog for something that isn't at all controversial: What you search for in a library catalog ought to be only between you and the library — and that, only briefly, as the library should quickly forget. Of course, between "ought" and "is" lies the Devil and his details. Let's poke around with Chrome's DevTools:
Hit Control-Shift-I (on Windows)
Switch to the Network tab.
Hit Control-R to reload the page and get a list of the HTTP requests that the browser makes.
We get something like this: There's a lot to like here: every request was made using HTTPS rather than HTTP, and almost all of the requests were made to the Koha server. (If you can't trust the library catalog, who can you trust? Well… that doesn't have an answer as clear as we would like, but I won't tackle that question here.)
However, the two cover images on the result's page come from Amazon:

https://images-na.ssl-images-amazon.com/images/P/0974458902.01.TZZZZZZZ.jpg
https://images-na.ssl-images-amazon.com/images/P/1849350949.01.TZZZZZZZ.jpg

What did I trade in exchange for those two cover images? Let's click on the request and see:

:authority: images-na.ssl-images-amazon.com
:method: GET
:path: /images/P/0974458902.01.TZZZZZZZ.jpg
:scheme: https
accept: image/webp,image/apng,image/*,*/*;q=0.8
accept-encoding: gzip, deflate, br
accept-language: en-US,en;q=0.9
cache-control: no-cache
dnt: 1
pragma: no-cache
referer: https://catalog.libraryguardians.com/cgi-bin/koha/opac-search.pl?q=anarchist
sec-fetch-dest: image
sec-fetch-mode: no-cors
sec-fetch-site: cross-site
user-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.116 Safari/537.36

Here's what was sent when I used Firefox:

Host: images-na.ssl-images-amazon.com
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:73.0) Gecko/20100101 Firefox/73.0
Accept: image/webp,*/*
Accept-Language: en-US,en;q=0.5
Accept-Encoding: gzip, deflate, br
Connection: keep-alive
Referer: https://catalog.libraryguardians.com/cgi-bin/koha/opac-search.pl?q=anarchist
DNT: 1
Pragma: no-cache

Amazon also knows what my IP address is. With that, it doesn't take much to figure out that I am in Georgia and am clearly up to no good; after all, one look at the Referer header tells all. Let's switch over to using Google Book's cover images:

https://books.google.com/books/content?id=phzFwAEACAAJ&printsec=frontcover&img=1&zoom=5
https://books.google.com/books/content?id=wdgrJQAACAAJ&printsec=frontcover&img=1&zoom=5

This time, the request headers are in Chrome:

:authority: books.google.com
:method: GET
:path: /books/content?id=phzFwAEACAAJ&printsec=frontcover&img=1&zoom=5
:scheme: https
accept: image/webp,image/apng,image/*,*/*;q=0.8
accept-encoding: gzip, deflate, br
accept-language: en-US,en;q=0.9
cache-control: no-cache
dnt: 1
pragma: no-cache
referer: https://catalog.libraryguardians.com/
sec-fetch-dest: image
sec-fetch-mode: no-cors
sec-fetch-site: cross-site
user-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.116 Safari/537.36
x-client-data: CKO1yQEIiLbJAQimtskBCMG2yQEIqZ3KAQi3qsoBCMuuygEIz6/KAQi8sMoBCJe1ygEI7bXKAQiNusoBGKukygEYvrrKAQ==

and in Firefox:

Host: books.google.com
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:73.0) Gecko/20100101 Firefox/73.0
Accept: image/webp,*/*
Accept-Language: en-US,en;q=0.5
Accept-Encoding: gzip, deflate, br
Connection: keep-alive
Referer: https://catalog.libraryguardians.com/
DNT: 1
Pragma: no-cache
Cache-Control: no-cache

On the one hand… the Referer now contains only the base URL of the catalog. I believe this is due to a difference in how Koha figures out the correct image URL. When using Amazon for cover images, the ISBN of the title is normalized and used to construct a URL for an <img> tag. Koha doesn't currently set a Referrer-Policy, so the default of no-referrer-when-downgrade is used and the full referrer is sent. Google Book's cover image URLs cannot be directly constructed like that, so a bit of JavaScript queries a web service and gets back the image URLs, and for reasons that are unclear to me at the moment, doesn't send the full URL as the referrer. (Cover images from OpenLibrary are fetched in a similar way, but the full Referer header is sent.)
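For illustration, the Amazon cover URLs above are built from little more than a normalised ISBN. Here is a rough sketch of that construction (a simplification: Koha's actual normalisation also handles things like ISBN-13 to ISBN-10 conversion):

def amazon_cover_url(isbn):
    # Strip hyphens and spaces; the URL pattern comes from the requests shown above.
    normalized = isbn.replace("-", "").replace(" ", "")
    return ("https://images-na.ssl-images-amazon.com/images/P/%s.01.TZZZZZZZ.jpg"
            % normalized)

print(amazon_cover_url("0-9744589-0-2"))
# https://images-na.ssl-images-amazon.com/images/P/0974458902.01.TZZZZZZZ.jpg

Since that URL lands in a plain <img> tag on the results page, the browser happily attaches the full search URL as the Referer.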
As a side note, the x-client-data header sent by Chrome to books.google.com is… concerning. There are some relatively simple things that can be done to limit leaking the full referring URL to the likes of Google and Amazon, including Setting the Referrer-Policy header via web server configuration or meta tag to something like origin or origin-when-cross-origin. Setting referrerpolicy for
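As a sketch of the header-setting approach just mentioned, here is a tiny WSGI middleware that stamps a Referrer-Policy header onto every response. It is purely illustrative: Koha itself is a Perl application, so in practice you would set this in the front-end web server configuration or a meta tag rather than in Python.

class ReferrerPolicyMiddleware:
    # Add a Referrer-Policy header to every response passing through a WSGI app.
    def __init__(self, app, policy="origin-when-cross-origin"):
        self.app = app
        self.policy = policy

    def __call__(self, environ, start_response):
        def _start_response(status, headers, exc_info=None):
            headers = list(headers) + [("Referrer-Policy", self.policy)]
            return start_response(status, headers, exc_info)
        return self.app(environ, _start_response)

With that (or the equivalent one-line web server directive) in place, the cover image requests would carry at most the catalog's origin rather than the full search URL.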
    """ % graph_data open(output, "w").write(html) Copy lines Copy permalink View git blame Reference in new issue Go © 2021 GitHub, Inc. Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. github-com-3751 ---- GitHub - lostRSEs/escape-room: Escape room: Translating between RSEs and Arts & Humanities Researchers Skip to content Sign up Sign up Why GitHub? Features → Mobile → Actions → Codespaces → Packages → Security → Code review → Project management → Integrations → GitHub Sponsors → Customer stories→ Team Enterprise Explore Explore GitHub → Learn and contribute Topics → Collections → Trending → Learning Lab → Open source guides → Connect with others The ReadME Project → Events → Community forum → GitHub Education → GitHub Stars program → Marketplace Pricing Plans → Compare plans → Contact Sales → Education → In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this organization All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Sign in Sign up Sign up {{ message }} lostRSEs / escape-room Notifications Star 1 Fork 0 Escape room: Translating between RSEs and Arts & Humanities Researchers lostrses.github.io/escape-room/ CC-BY-4.0 License 1 star 0 forks Star Notifications Code Issues 18 Pull requests 0 Actions Projects 1 Security Insights More Code Issues Pull requests Actions Projects Security Insights main Switch branches/tags Branches Tags Nothing to show {{ refName }} default View all branches Nothing to show {{ refName }} default View all tags 7 branches 0 tags Go to file Code Clone HTTPS GitHub CLI Use Git or checkout with SVN using the web URL. Work fast with our official CLI. Learn more. Open with GitHub Desktop Download ZIP Launching GitHub Desktop If nothing happens, download GitHub Desktop and try again. Go back Launching GitHub Desktop If nothing happens, download GitHub Desktop and try again. Go back Launching Xcode If nothing happens, download Xcode and try again. Go back Launching Visual Studio If nothing happens, download the GitHub extension for Visual Studio and try again. Go back Latest commit   Git stats 115 commits Files Permalink Failed to load latest commit information. Type Name Latest commit message Commit time docs     CODE_OF_CONDUCT.md     CONTRIBUTING.md     LICENSE     README.md     View code AHA: An Arts and Humanities Adventure! Welcome! What is AHA? Problem Solution README.md AHA: An Arts and Humanities Adventure! Welcome! Welcome to ⭐AHA: An Arts and Humanities Adventure!⭐ What is AHA? AHA: An Arts and Humanities Adventure is an interactive game to help 'translate' concepts from computer science, for researchers in the arts and humanities. For researchers in the arts and humanities: this game aims to help you understand some of the ideas, concepts (and jargon) that your research software engineering colleagues have been using. For research software engineers: this will help you explain the ideas and concepts that you use in your work to people who do not have a computer science background. We hope that playing this game will help RSEs and arts and humanities reserachers work together better and build research software that helps advance research in artss and humanities! 
This project began at a hackday run as part of Software Sustainability Institute's Collaborations Workshop 2021. There is a proof-of-concept web version of the game now online! You can see the source for that website in the docs folder. Problem Researchers in the Arts & Humanities can benefit greatly from research software, but often don’t have the kind of background in formally-structured design that a physicist or engineer does. This can make developing research software for them challenging- particularly when A&H problems are often defined in ways that are very different from how computational problems are defined. We want to help researchers in A&H and RSEs to communicate better, so that they can collaborate on building research software more easily. Using gamified versions of boring and dry training materials for software development, we want to make learning about software development fun and accessible. Solution Virtual escape room: Solve a set of connected puzzles to escape the virtual game room. In the course of solving the puzzles, the participants will learn key concepts from research software development. Our pitch: develop the Part 1 of this escape room series: Theme: Gamified activities to learn the meaning of common jargon words. E.g. API, Object, function, Sprint, version, Agile, automation The escape room will be themed around learning to translate an alien language (Software development) expressed in an unusual way, so that the unfamiliar concepts can be understood in the context of our work. For example: which of these flow diagrams is the correct one? What analogy of a RSE concept can we find in humanities? Format: Online, can use existing websites or a GitHub repository with questions and clues to find information. Learning journey. Aim: The aim is to encourage participants to look for information and find out resources about software development practices and RSE related concepts themselves as they find answers to solve the puzzles. Outcome of the escape room activity: participants are familiar with 4 concepts/jargon words usually used by software developers. Participants are now in a better position to work/interact with Research Software Engineers- or to go on and learn to become digital humanities developers themselves. Potential topics and set of activities for escape rooms for part 2 onwards (not proposed for this pitch, but idea for future collaboration): Set a repo to teach GitHub / version control (create with long history, ask people to find who did what, and on what days) Give a project goal that required chunking down one goal into different tasks and create clues (Agile development) Create puzzles to teach reproducibility Use interesting data table to teach about dataframe and coding using pandas Use a visualization tool or shiny app to solve different puzzles About Escape room: Translating between RSEs and Arts & Humanities Researchers lostrses.github.io/escape-room/ Resources Readme License CC-BY-4.0 License Contributors 5 © 2021 GitHub, Inc. Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. github-com-3945 ---- Pull requests · frictionlessdata/frictionless-py · GitHub Skip to content Sign up Sign up Why GitHub? 
Features → Mobile → Actions → Codespaces → Packages → Security → Code review → Project management → Integrations → GitHub Sponsors → Customer stories→ Team Enterprise Explore Explore GitHub → Learn and contribute Topics → Collections → Trending → Learning Lab → Open source guides → Connect with others The ReadME Project → Events → Community forum → GitHub Education → GitHub Stars program → Marketplace Pricing Plans → Compare plans → Contact Sales → Education → In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this organization All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Sign in Sign up Sign up {{ message }} frictionlessdata / frictionless-py Notifications Star 393 Fork 74 Code Issues 68 Pull requests 0 Actions Projects 0 Security Insights More Code Issues Pull requests Actions Projects Security Insights Labels 9 Milestones 0 Labels 9 Milestones 0 New pull request New 0 Open 322 Closed 0 Open 322 Closed Author Filter by author author: Filter by this user Label Filter by label Use alt + click/return to exclude labels. Projects Filter by project Milestones Filter by milestone Reviews Filter by reviews No reviews Review required Approved review Changes requested Assignee Filter by who’s assigned Sort Sort by Newest Oldest Most commented Least commented Recently updated Least recently updated Most reactions 👍 👎 😄 🎉 😕 ❤️ 🚀 👀 There aren’t any open pull requests. You could search all of GitHub or try an advanced search. ProTip! Mix and match filters to narrow down what you’re looking for. © 2021 GitHub, Inc. Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. github-com-4265 ---- twarc/urls.py at main · DocNow/twarc · GitHub Skip to content Sign up Sign up Why GitHub? Features → Mobile → Actions → Codespaces → Packages → Security → Code review → Project management → Integrations → GitHub Sponsors → Customer stories→ Team Enterprise Explore Explore GitHub → Learn and contribute Topics → Collections → Trending → Learning Lab → Open source guides → Connect with others The ReadME Project → Events → Community forum → GitHub Education → GitHub Stars program → Marketplace Pricing Plans → Compare plans → Contact Sales → Education → In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this organization All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Sign in Sign up Sign up {{ message }} DocNow / twarc Notifications Star 1k Fork 214 Code Issues 53 Pull requests 0 Actions Projects 0 Wiki Security Insights More Code Issues Pull requests Actions Projects Wiki Security Insights Permalink main Switch branches/tags Branches Tags Nothing to show {{ refName }} default View all branches Nothing to show {{ refName }} default View all tags twarc/utils/urls.py / Jump to Code definitions No definitions found in this file. Code navigation not available for this commit Go to file Go to file T Go to line L Go to definition R Copy path Copy permalink     Cannot retrieve contributors at this time executable file 19 lines (16 sloc) 461 Bytes Raw Blame Open with Desktop View raw View blame #!/usr/bin/env python3 """ Print out the URLs in a tweet json stream. 
""" from __future__ import print_function import json import fileinput for line in fileinput.input(): tweet = json.loads(line) for url in tweet["entities"]["urls"]: if 'unshortened_url' in url: print(url['unshortened_url']) elif url.get('expanded_url'): print(url['expanded_url']) elif url.get('url'): print(url['url']) Copy lines Copy permalink View git blame Reference in new issue Go © 2021 GitHub, Inc. Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. github-com-428 ---- twarc/wordcloud.py at main · DocNow/twarc · GitHub Skip to content Sign up Sign up Why GitHub? Features → Mobile → Actions → Codespaces → Packages → Security → Code review → Project management → Integrations → GitHub Sponsors → Customer stories→ Team Enterprise Explore Explore GitHub → Learn and contribute Topics → Collections → Trending → Learning Lab → Open source guides → Connect with others The ReadME Project → Events → Community forum → GitHub Education → GitHub Stars program → Marketplace Pricing Plans → Compare plans → Contact Sales → Education → In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this organization All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Sign in Sign up Sign up {{ message }} DocNow / twarc Notifications Star 1k Fork 214 Code Issues 53 Pull requests 0 Actions Projects 0 Wiki Security Insights More Code Issues Pull requests Actions Projects Wiki Security Insights Permalink main Switch branches/tags Branches Tags Nothing to show {{ refName }} default View all branches Nothing to show {{ refName }} default View all tags twarc/utils/wordcloud.py / Jump to Code definitions No definitions found in this file. 
Code navigation not available for this commit Go to file Go to file T Go to line L Go to definition R Copy path Copy permalink     Cannot retrieve contributors at this time executable file 116 lines (98 sloc) 3.96 KB Raw Blame Open with Desktop View raw View blame #!/usr/bin/env python from __future__ import print_function import re import sys import json import fileinput def main(): try: from urllib import urlopen # Python 2 except ImportError: from urllib.request import urlopen # Python 3 MAX_WORDS = 100 word_counts = {} stop_words = set(["a","able","about","across","actually","after","against","agreed","all","almost","already","also","am","among","an","and","any","anyone","anyway","are","as","at","be","because","been","being","between","but","by","can","cannot","come","could","dear","did","do","does","either","else","ever","every","for","from","get","getting","got","had","has","have","he","her","here","hers","hey","hi","him","his","how","however","i","i'd","i'll","i'm","if","in","into","is","isnt","isn't","it","its","just","kind","last","latest","least","let","like","likely","look","make","may","me","might","more","most","must","my","neither","new","no","nor","not","now","of","off","often","on","only","or","other","our","out","over","own","part","piece","play","put","putting","rather","real","really","said","say","says","she","should","simply","since","so","some","than","thanks","that","that's","thats","the","their","them","then","there","these","they","they're","this","those","tis","to","too","try","twas","us","use","used","uses","via","wants","was","way","we","well","were","what","when","where","which","while","who","whom","why","will","with","would","yet","you","your","you're","youre"]) for line in fileinput.input(): try: tweet = json.loads(line) except: pass for word in text(tweet).split(' '): word = word.lower() word = word.replace(".", "") word = word.replace(",", "") word = word.replace("...", "") word = word.replace("'", "") word = word.replace(":", "") word = word.replace("(", "") word = word.replace(")", "") if len(word) < 3: continue if len(word) > 15: continue if word in stop_words: continue if word[0] in ["@", "#"]: continue if re.match('https?', word): continue if word.startswith("rt"): continue if not re.match('^[a-z]', word, re.IGNORECASE): continue word_counts[word] = word_counts.get(word, 0) + 1 sorted_words = list(word_counts.keys()) sorted_words.sort(key = lambda x: word_counts[x], reverse=True) top_words = sorted_words[0:MAX_WORDS] words = [] count_range = word_counts[top_words[0]] - word_counts[top_words[-1]] + 1 size_ratio = 100.0 / count_range for word in top_words: size = int(word_counts[word] * size_ratio) + 15 words.append({ "text": word, "size": size }) wordcloud_js = urlopen('https://raw.githubusercontent.com/jasondavies/d3-cloud/master/build/d3.layout.cloud.js').read() output = """ twarc wordcloud """ % (wordcloud_js.decode('utf8'), json.dumps(words, indent=2)) sys.stdout.write(output) def text(t): if 'full_text' in t: return t['full_text'] return t['text'] if __name__ == "__main__": main() Copy lines Copy permalink View git blame Reference in new issue Go © 2021 GitHub, Inc. Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. github-com-4737 ---- GitHub - softwaresaved/habeas-corpus: A corpus of research software used in COVID-19 research. 
Skip to content Sign up Sign up Why GitHub? Features → Mobile → Actions → Codespaces → Packages → Security → Code review → Project management → Integrations → GitHub Sponsors → Customer stories→ Team Enterprise Explore Explore GitHub → Learn and contribute Topics → Collections → Trending → Learning Lab → Open source guides → Connect with others The ReadME Project → Events → Community forum → GitHub Education → GitHub Stars program → Marketplace Pricing Plans → Compare plans → Contact Sales → Education → In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this organization All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Sign in Sign up Sign up {{ message }} softwaresaved / habeas-corpus Notifications Star 4 Fork 3 A corpus of research software used in COVID-19 research. MIT License 4 stars 3 forks Star Notifications Code Issues 12 Pull requests 0 Actions Projects 0 Security Insights More Code Issues Pull requests Actions Projects Security Insights main Switch branches/tags Branches Tags Nothing to show {{ refName }} default View all branches Nothing to show {{ refName }} default View all tags 6 branches 0 tags Go to file Code Clone HTTPS GitHub CLI Use Git or checkout with SVN using the web URL. Work fast with our official CLI. Learn more. Open with GitHub Desktop Download ZIP Launching GitHub Desktop If nothing happens, download GitHub Desktop and try again. Go back Launching GitHub Desktop If nothing happens, download GitHub Desktop and try again. Go back Launching Xcode If nothing happens, download Xcode and try again. Go back Launching Visual Studio If nothing happens, download the GitHub extension for Visual Studio and try again. Go back Latest commit   Git stats 73 commits Files Permalink Failed to load latest commit information. Type Name Latest commit message Commit time R     data     docs     notebooks     .gitignore     Habeas Corpus logo.png     LICENSE     README.md     postBuild     requirements.txt     View code Habeas Corpus Contributing ✏️ Project roadmap 🏁 Licensing Acknowledgements 👪 References 📚 README.md Habeas Corpus This is work done during the hack day at Collaborations Workshop 2021, to create a corpus of research software used for COVID-19 and coronavirus-related research that will be useful in a number of ways to the research software sustainability community around the Software Sustainability Institute. This is based on and extends the "CORD-19 Software Mentions" dataset published by the Chan Zuckerberg Institute (doi: https://doi.org/10.5061/dryad.vmcvdncs0). Contributing ✏️ Habeas Corpus is a collaborative project and we welcome suggestions and contributions. We hope one of the invitations below works for you, but if not, please let us know! 🏃 I'm busy, I only have 1 minute Tell a friend about the project! ⏳ I've got 5 minutes - tell me what I should do Suggest ideas for how you would like to use Habeas Corpus 💻 I've got a few hours to work on this Take a look at the issues and see if there are any you can contribute to Create an analysis using the data and let us know about it 🎉 I really want to help increase the community Organise a hackday to use or improve Habeas Corpus Please open a GitHub issue to suggest a new idea or let us know about bugs. Project roadmap 🏁 For tasks to work on in the near future, please see open Issues. 
For the bigger picture, please check and contribute to plan.md Licensing Software code and notebooks from this project are licensed under the open source MIT license. Project documentation and images are licensed under CC BY 4.0. Data produced by this project in the data/outputs directory is licensed under CC0. Other data included in this project from other sources remains licensed under its original license. Acknowledgements 👪 This project originated as part of the Collaborations Workshop 2021. It was based on an original idea by Neil Chue Hong (@npch) and Stephan Druskat (@sdruskat), incorporated ideas and feedback from Michelle Barker, Daniel S. Katz, Shoaib Sufi, Carina Haupt and Callum Rollo, and was developed by Alexander Konovalov (@alex-konovalov), Hao Ye (@ha0ye), Louise Chisholm (@LouiseChisholm), Mark Turner (@MarkLTurner), Neil Chue Hong (@npch), Sammie Buzzard (@sammiebuzzard), and Stephan Druskat (@sdruskat). The data is derived from the "CORD-19 Software Mentions" dataset published by Alex D Wade and Ivana Williams from the Chan Zuckerberg Initiative and released under a CC0 license. References 📚 Softcite dataset v1.0: Du, C., Cohoon, J., Lopez, P., & Howison, J. (forthcoming). Softcite Dataset: A Dataset of Software Mentions in Biomedical and Economic Research Publications. Journal of the Association for Information Science and Technology. DOI: 10.1002/asi.24454. CORD-19 Software Mentions Software in the Scientific Literature: Problems with Seeing, Finding, and Using Software Mentioned in the Biology Literature Introducing the PID Graph About A corpus of research software used in COVID-19 research. Topics research-software Resources Readme License MIT License Releases No releases published Packages 0 No packages published Contributors 8 Languages Jupyter Notebook 99.4% Other 0.6% © 2021 GitHub, Inc. Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. github-com-5521 ---- GitHub - elichad/software-twilight: Software end of project plans Skip to content Sign up Sign up Why GitHub? Features → Mobile → Actions → Codespaces → Packages → Security → Code review → Project management → Integrations → GitHub Sponsors → Customer stories→ Team Enterprise Explore Explore GitHub → Learn and contribute Topics → Collections → Trending → Learning Lab → Open source guides → Connect with others The ReadME Project → Events → Community forum → GitHub Education → GitHub Stars program → Marketplace Pricing Plans → Compare plans → Contact Sales → Education → In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this user All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Sign in Sign up Sign up {{ message }} elichad / software-twilight Notifications Star 0 Fork 0 Software end of project plans View license 0 stars 0 forks Star Notifications Code Issues 2 Pull requests 0 Actions Projects 0 Security Insights More Code Issues Pull requests Actions Projects Security Insights main Switch branches/tags Branches Tags Nothing to show {{ refName }} default View all branches Nothing to show {{ refName }} default View all tags 8 branches 0 tags Go to file Code Clone HTTPS GitHub CLI Use Git or checkout with SVN using the web URL. Work fast with our official CLI. Learn more. 
Open with GitHub Desktop Download ZIP Launching GitHub Desktop If nothing happens, download GitHub Desktop and try again. Go back Launching GitHub Desktop If nothing happens, download GitHub Desktop and try again. Go back Launching Xcode If nothing happens, download Xcode and try again. Go back Launching Visual Studio If nothing happens, download the GitHub extension for Visual Studio and try again. Go back Latest commit   Git stats 51 commits Files Permalink Failed to load latest commit information. Type Name Latest commit message Commit time __pycache__     .replit     CODEofCONDUCT.md     CONTRIBUTING.md     LICENSE.md     README.md     backend.py     decisions.py     environment.yml     index.ipynb     questionnaire.md     test_data.py     twilight_date_example.svg     twilight_plan_example.svg     View code software-twilight License Introduction Available badges Question themes Running Design Question format Customization of UI Further resources Known issues README.md This work is licensed under a Creative Commons Attribution 4.0 International License. software-twilight Software end of project plans License This project is licensed under the CC-BY license. You are free to: Share — copy and redistribute the material in any medium or format Adapt — remix, transform, and build upon the material for any purpose, even commercially. The licensor cannot revoke these freedoms as long as you follow the license terms. The full text of the license can be found here. Introduction Development of software under a fixed-term project should consider several aspects of ongoing support after the project's end. There are two main eventualities: the software's development abruptly ends; there is some end-user support, although there will be no new feature development. Each of these presents a problem. Ending support reduces the sustainability of the environment, while ongoing maintenance requires the dedication of further resources. Under the software twilight plan, the project's developer will be aware of necessary considerations. This repository is intended to be used to assess and guide a project maintainer in plans for the software's end of life. We provide a tool to be used, during the active development phase, by a project maintainer to assess and certify support plans for the project once it will no longer be actively developed. On completion of a short questionnaire the user is offered a badge to add to the repository to signal to the community when, and how, the software will go gentle into its good night. Available badges We have two badges, as examples, which look look like and mean the following: - we have a (good) plan - twilight is coming up at the specified time Question themes The tool covers a number of themes, including: potential funding for ongoing development required levels of future support deployment infrastructure required size of user community size of maintainer group status of ongoing contact with main developer(s)/development group Running Design The tool is designed in three parts: The front-end is designed with Jupyer Notebooks. It uses Jupyter Widgets, appmode package and mybinder.org to display automatically the notebook cells as a web app. The questions and answers are populated by the backend, that provides the appropriate next question based on the answer to the previous one, following a decision tree, until there are no more (relevant) questions to ask. 
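To make that flow concrete, here is a minimal runnable sketch of a decision-tree walk. The Question shape anticipates the decisions.py format described below, but the class, the example questions (drawn from the question themes above) and the loop are simplified stand-ins rather than the project's actual code.

class Question:
    # A question's text plus a mapping from answer text to the id of the next
    # question; None means a decision has been reached.
    def __init__(self, text, answers):
        self.text = text
        self.answers = answers

decision_tree = {
    1: Question("Is there potential funding for ongoing development?", {"Yes": 2, "No": 3}),
    2: Question("Is there an active maintainer group?", {"Yes": None, "No": 3}),
    3: Question("Will end users still need some level of support?", {"Yes": None, "No": None}),
}

def walk(tree, answer_fn, start=1):
    # Follow the tree from `start`, collecting (question, answer) pairs until
    # an answer maps to None.
    current, trail = start, []
    while current is not None:
        question = tree[current]
        answer = answer_fn(question)
        trail.append((question.text, answer))
        current = question.answers[answer]
    return trail

# Canned answers stand in for the interactive Jupyter widget front end.
canned = {
    "Is there potential funding for ongoing development?": "No",
    "Is there an active maintainer group?": "Yes",
    "Will end users still need some level of support?": "Yes",
}
print(walk(decision_tree, lambda q: canned[q.text]))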
Finally, all the answers are processed and one or more badges informing on the end-of-life status of the project are provided in the form of markdown text. A summary of the answers is also provided. This text can be easily pasted into the project README file. Question format The decision tree is populated from the file decisions.py. This file has quite customizable entries in the format described below. This is initially represented by a serialized Python dictionary. We have a Python object Question which has attributes for the question text and a dictionary for the answers (and links to each answer's follow-up question). Our input file is like: decision_tree = { 1: Question("Is this a question?", {"Yes": 2, "No", 3}), 2: Question("Is it a good question?", {"Yes": None, "No", 3}), 3: Question("Really!?", {"Yes": None, "No": None}) } decision_tree is an object with (contiguous, [1,n]?) numeric identifier and a Question object with question text and answer dictionary. The answer dictionary keys are answer text (diplayed) and the value the link to the question to follow. None is used to indicate that a decision will be reached with this answer. In this prototype there is no full decision tree. We indicate the path to follow by placing non-supported answers in parentheses. Customization of UI If the UI can be readily customized, we describe here that. Further resources Here we list related resources which may be of interest to the developer of a sustainable project. FAIRness, etc. Known issues This is a proof of concept. It is far from complete. We have a desire that the following features be implemented: Improved decision tree input (not deserialization) Complete decision tree Final badge choice and design About Software end of project plans Resources Readme License View license Releases No releases published Packages 0 No packages published Contributors 5 Languages Jupyter Notebook 56.1% Python 43.9% © 2021 GitHub, Inc. Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. github-com-5855 ---- twarc/tags.py at main · DocNow/twarc · GitHub Skip to content Sign up Sign up Why GitHub? Features → Mobile → Actions → Codespaces → Packages → Security → Code review → Project management → Integrations → GitHub Sponsors → Customer stories→ Team Enterprise Explore Explore GitHub → Learn and contribute Topics → Collections → Trending → Learning Lab → Open source guides → Connect with others The ReadME Project → Events → Community forum → GitHub Education → GitHub Stars program → Marketplace Pricing Plans → Compare plans → Contact Sales → Education → In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this organization All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Sign in Sign up Sign up {{ message }} DocNow / twarc Notifications Star 1k Fork 214 Code Issues 53 Pull requests 0 Actions Projects 0 Wiki Security Insights More Code Issues Pull requests Actions Projects Wiki Security Insights Permalink main Switch branches/tags Branches Tags Nothing to show {{ refName }} default View all branches Nothing to show {{ refName }} default View all tags twarc/utils/tags.py / Jump to Code definitions No definitions found in this file. 
Code navigation not available for this commit Go to file Go to file T Go to line L Go to definition R Copy path Copy permalink     Cannot retrieve contributors at this time executable file 16 lines (13 sloc) 378 Bytes Raw Blame Open with Desktop View raw View blame #!/usr/bin/env python from __future__ import print_function import json import fileinput import collections counts = collections.Counter() for line in fileinput.input(): tweet = json.loads(line) for tag in tweet['entities']['hashtags']: t = tag['text'].lower() counts[t] += 1 for tag, count in counts.most_common(): print("%5i %s" % (count, tag)) Copy lines Copy permalink View git blame Reference in new issue Go © 2021 GitHub, Inc. Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. github-com-5998 ---- marcedit_xslt_files/homosaurus_xml.xsl at master · reeset/marcedit_xslt_files · GitHub Skip to content Sign up Sign up Why GitHub? Features → Mobile → Actions → Codespaces → Packages → Security → Code review → Project management → Integrations → GitHub Sponsors → Customer stories→ Team Enterprise Explore Explore GitHub → Learn and contribute Topics → Collections → Trending → Learning Lab → Open source guides → Connect with others The ReadME Project → Events → Community forum → GitHub Education → GitHub Stars program → Marketplace Pricing Plans → Compare plans → Contact Sales → Education → In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this user All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Sign in Sign up Sign up {{ message }} reeset / marcedit_xslt_files Notifications Star 20 Fork 2 Code Issues 0 Pull requests 0 Actions Projects 0 Security Insights More Code Issues Pull requests Actions Projects Security Insights Permalink master Switch branches/tags Branches Tags Nothing to show {{ refName }} default View all branches Nothing to show {{ refName }} default View all tags marcedit_xslt_files/homosaurus_xml.xsl Go to file Go to file T Go to line L Copy path Copy permalink     Cannot retrieve contributors at this time 132 lines (119 sloc) 4.77 KB Raw Blame Open with Desktop View raw View blame 00596nz a2200217n 4500 210101 " |||anznnbab||||||||||||||a|||||||d homosaurus Copy lines Copy permalink View git blame Reference in new issue Go © 2021 GitHub, Inc. Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. github-com-643 ---- Issues · DocNow/twarc · GitHub Skip to content Sign up Sign up Why GitHub? 
DocNow / twarc: Issues (53 open, 214 closed; 8 labels, 0 milestones)

Override keys #442, opened Apr 24, 2021 by edsu
Error Message after running Twarc command #441, opened Apr 22, 2021 by osemele (12)
Counts and basic statistics [plugins] #440, opened Apr 21, 2021 by igorbrigadir (2)
Progress bar [v2] #437, opened Apr 16, 2021 by igorbrigadir (2)
youtubedl.py #433, opened Apr 13, 2021 by ameliameyer (2)
tweets.py #429, opened Apr 8, 2021 by ameliameyer (8)
wall.py #419, opened Mar 30, 2021 by ameliameyer (6)
Plugin for ActivityStreams? [plugins] #412, opened Mar 24, 2021 by edsu (4)
Document Common V2 usecase: Crawl archive tweets, flatten, export to CSV [plugins, v2] #411, opened Mar 23, 2021 by igorbrigadir (11)
Thread [v2] #404, opened Mar 8, 2021 by edsu (8)
Retweets [v2] #403, opened Mar 8, 2021 by edsu (1)
Support Batch Compliance Endpoints [v2] #399, opened Mar 4, 2021 by igorbrigadir
foaf.py #392, opened Feb 24, 2021 by ameliameyer (1)
Make sure the rate limit decorator works appropriately for the new monthly tweet cap [v2] #391, opened Feb 23, 2021 by SamHames (3)
TWARC Utilities #387, opened Feb 19, 2021 by shamreeza (5)
sqlite schema [v2] #379, opened Feb 16, 2021 by edsu (1)
An example to run twarc as a Kafka producer #374, opened Feb 2, 2021 by rongpenl (3)
deleted.py #373, opened Feb 1, 2021 by ameliameyer (20)
How are accent marks handled? #366, opened Dec 2, 2020 by cgb37 (1)
Keep getting "Please run the command "twarc configure" to get started" after updating OS to Big Sur #364, opened Nov 24, 2020 by lalkulaib (2)
Got "MissingKeys" error using app-only auth #362, opened Nov 21, 2020 by JiA1996 (7)
Temporal and Spatial Query #361, opened Nov 6, 2020 by eo4929
Can't track hashtags with '#' in the 'filter' query #359, opened Oct 25, 2020 by glocalglocal (3)
Support for providing reply_count #356, opened Oct 22, 2020 by jasco (5)
UnicodeDecodeError when running utils in window #343, opened Sep 3, 2020 by juanulload
github-com-7303 ---- Home · DocNow/twarc Wiki · GitHub
DocNow / twarc Wiki: Home. Ed Summers edited this page Apr 7, 2021 · 7 revisions

🐦 🐍 💾 Welcome to the twarc wiki. We mostly use this space to organically share ideas for how to develop and use twarc. In practice this wiki is a place for documentation about the design and use of twarc that doesn't fit comfortably into a discrete issue ticket or the current documentation. Sometimes these pages graduate into the official documentation that is available on ReadTheDocs. However, there is no requirement for wiki pages to be written with the goal of integrating them into the official documentation.

Please feel empowered to add new pages, it's a wiki! You can send a pull request, or if you prefer, create an issue to request the ability to edit directly. If you'd like to have your page migrated into the official documentation, or think it warrants changes to the code, please open an issue to let us know.

Pages (4): Home; End to End Example Twitter Study; twarc2 Design; Working with v2 Tweet Formats

github-com-7750 ---- GitHub - KnowledgeCaptureAndDiscovery/somef-github-action
KnowledgeCaptureAndDiscovery / somef-github-action (Apache-2.0 License, 3 stars, 0 forks, 4 branches, 1 tag, 36 commits). Files: .github/workflows, Dockerfile, LICENSE, README.md, action.yml, entrypoint.sh.

SOMEF GitHub Action

This action uses SOMEF to generate a codemeta.json file and meet the recommendations from howfairis.

Basic usage

In its most basic usage, the GitHub action only uses SOMEF to generate a codemeta.json file:
on: [push]

jobs:
  somef_job:
    runs-on: ubuntu-latest
    name: Run SOMEF
    steps:
      # Checks-out your repository under $GITHUB_WORKSPACE, so your job can access it
      - name: Checkout repo
        uses: actions/checkout@v2
      # Use SOMEF to generate codemeta.json
      - name: Somef with repo-url input
        uses: KnowledgeCaptureAndDiscovery/somef-github-action@main
        with:
          repo-url: "https://github.com/${{ github.repository }}"

Advanced workflow

A more advanced workflow uses the howfairis and Create Pull Request actions to create a howfairis badge and send a pull request with the generated codemeta.json file if necessary:

on: [push]

jobs:
  somef_job:
    runs-on: ubuntu-latest
    name: Test somef
    steps:
      # Checks-out your repository under $GITHUB_WORKSPACE, so your job can access it
      - name: Checkout repo
        uses: actions/checkout@v2
      # Run howfairis
      - name: fair-software
        uses: fair-software/howfairis-github-action@0.1.0
        with:
          MY_REPO_URL: "https://github.com/${{ github.repository }}"
      # Use SOMEF to generate codemeta.json
      - name: Somef with repo-url input
        uses: KnowledgeCaptureAndDiscovery/somef-github-action@main
        with:
          repo-url: "https://github.com/${{ github.repository }}"
      # Create a PR
      - name: Create Pull Request
        uses: peter-evans/create-pull-request@v3.8.2
        with:
          title: Generating codemeta template
          commit-message: Add codemeta.json template
          committer: GitHub
          author: ${{ github.actor }} <${{ github.actor }}@users.noreply.github.com>
          labels: automated pr
          branch: add-codemeta

github-com-7851 ---- twarc/expansions.py at main · DocNow/twarc · GitHub
DocNow / twarc: twarc/twarc/expansions.py (207 lines; defines extract_includes, flatten and expand_payload):

"""
This module contains a list of the known Twitter V2+ API expansions and fields
for each expansion, and a function for "flattening" a result set, including
all expansions inline
"""

from collections import defaultdict

EXPANSIONS = [
    "author_id",
    "in_reply_to_user_id",
    "referenced_tweets.id",
    "referenced_tweets.id.author_id",
    "entities.mentions.username",
    "attachments.poll_ids",
    "attachments.media_keys",
    "geo.place_id",
]

USER_FIELDS = [
    "created_at",
    "description",
    "entities",
    "id",
    "location",
    "name",
    "pinned_tweet_id",
    "profile_image_url",
    "protected",
    "public_metrics",
    "url",
    "username",
    "verified",
    "withheld",
]

TWEET_FIELDS = [
    "attachments",
    "author_id",
    "context_annotations",
    "conversation_id",
    "created_at",
    "entities",
    "geo",
    "id",
    "in_reply_to_user_id",
    "lang",
    "public_metrics",
    # "non_public_metrics",  # private
    # "organic_metrics",  # private
    # "promoted_metrics",  # private
    "text",
    "possibly_sensitive",
    "referenced_tweets",
    "reply_settings",
    "source",
    "withheld",
]

MEDIA_FIELDS = [
    "duration_ms",
    "height",
    "media_key",
    "preview_image_url",
    "type",
    "url",
    "width",
    # "non_public_metrics",  # private
    # "organic_metrics",  # private
    # "promoted_metrics",  # private
    "public_metrics",
]

POLL_FIELDS = ["duration_minutes", "end_datetime", "id", "options", "voting_status"]

PLACE_FIELDS = [
    "contained_within",
    "country",
    "country_code",
    "full_name",
    "geo",
    "id",
    "name",
    "place_type",
]

EVERYTHING = {
    "expansions": ",".join(EXPANSIONS),
    "user.fields": ",".join(USER_FIELDS),
    "tweet.fields": ",".join(TWEET_FIELDS),
    "media.fields": ",".join(MEDIA_FIELDS),
    "poll.fields": ",".join(POLL_FIELDS),
    "place.fields": ",".join(PLACE_FIELDS),
}

# For endpoints focused on user objects such as looking up users and followers.
# Not all of the expansions are available for these endpoints.
USER_EVERYTHING = {
    "expansions": "pinned_tweet_id",
    "tweet.fields": ",".join(TWEET_FIELDS),
    "user.fields": ",".join(USER_FIELDS),
}


def extract_includes(response, expansion, _id="id"):
    if "includes" in response and expansion in response["includes"]:
        return defaultdict(
            lambda: {},
            {include[_id]: include for include in response["includes"][expansion]},
        )
    else:
        return defaultdict(lambda: {})


def flatten(response):
    """
    Flatten the response. Expects an entire page response from the API
    (data, includes, meta). Defaults: Return empty objects for things missing
    in includes. Doesn't modify tweets, only adds extra data.
    """
    # Users extracted both by id and by username for expanding mentions
    includes_users = defaultdict(
        lambda: {},
        {
            **extract_includes(response, "users", "id"),
            **extract_includes(response, "users", "username"),
        },
    )
    # Media is by media_key, not id
    includes_media = extract_includes(response, "media", "media_key")
    includes_polls = extract_includes(response, "polls")
    includes_places = extract_includes(response, "places")
    # Tweets in includes will themselves be expanded
    includes_tweets = extract_includes(response, "tweets")
    # Errors are returned but unused here for now
    includes_errors = extract_includes(response, "errors")

    def expand_payload(payload):
        """
        Recursively step through an object and sub objects and append extra data.
        Can be applied to any tweet, list of tweets, sub object of tweet etc.
        """
        # Don't try to expand on primitive values, return strings as is:
        if isinstance(payload, (str, bool, int, float)):
            return payload
        # expand list items individually:
        elif isinstance(payload, list):
            payload = [expand_payload(item) for item in payload]
            return payload
        # Try to expand on dicts within dicts:
        elif isinstance(payload, dict):
            for key, value in payload.items():
                payload[key] = expand_payload(value)

            if "author_id" in payload:
                payload["author"] = includes_users[payload["author_id"]]

            if "in_reply_to_user_id" in payload:
                payload["in_reply_to_user"] = includes_users[payload["in_reply_to_user_id"]]

            if "media_keys" in payload:
                payload["media"] = list(
                    includes_media[media_key] for media_key in payload["media_keys"]
                )

            if "poll_ids" in payload and len(payload["poll_ids"]) > 0:
                poll_id = payload["poll_ids"][-1]  # only ever 1 poll per tweet.
payload["poll"] = includes_polls[poll_id] if "geo" in payload and "place_id" in payload["geo"]: place_id = payload["geo"]["place_id"] payload["geo"] = {**payload["geo"], **includes_places[place_id]} if "mentions" in payload: payload["mentions"] = list( {**referenced_user, **includes_users[referenced_user["username"]]} for referenced_user in payload["mentions"] ) if "referenced_tweets" in payload: payload["referenced_tweets"] = list( {**referenced_tweet, **includes_tweets[referenced_tweet["id"]]} for referenced_tweet in payload["referenced_tweets"] ) if "pinned_tweet_id" in payload: payload["pinned_tweet"] = includes_tweets[payload["pinned_tweet_id"]] return payload # First, expand the included tweets, before processing actual result tweets: for included_id, included_tweet in extract_includes(response, "tweets").items(): includes_tweets[included_id] = expand_payload(included_tweet) # Now flatten the list of tweets or an individual tweet if "data" in response: response["data"] = expand_payload(response["data"]) # Add the __twarc metadata to each tweet if it's a result set if "__twarc" in response and isinstance(response["data"], list): for tweet in response["data"]: tweet["__twarc"] = response["__twarc"] return response Copy lines Copy permalink View git blame Reference in new issue Go © 2021 GitHub, Inc. Terms Privacy Security Status Docs Contact GitHub Pricing API Training Blog About You can’t perform that action at this time. You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. github-com-8456 ---- GitHub - DocNow/twarc-ids: A plugin for twarc2 to extract tweet ids from tweet JSON. Skip to content Sign up Sign up Why GitHub? Features → Mobile → Actions → Codespaces → Packages → Security → Code review → Project management → Integrations → GitHub Sponsors → Customer stories→ Team Enterprise Explore Explore GitHub → Learn and contribute Topics → Collections → Trending → Learning Lab → Open source guides → Connect with others The ReadME Project → Events → Community forum → GitHub Education → GitHub Stars program → Marketplace Pricing Plans → Compare plans → Contact Sales → Education → In this repository All GitHub ↵ Jump to ↵ No suggested jump to results In this repository All GitHub ↵ Jump to ↵ In this organization All GitHub ↵ Jump to ↵ In this repository All GitHub ↵ Jump to ↵ Sign in Sign up Sign up {{ message }} DocNow / twarc-ids Notifications Star 1 Fork 0 A plugin for twarc2 to extract tweet ids from tweet JSON. MIT License 1 star 0 forks Star Notifications Code Issues 0 Pull requests 0 Actions Projects 0 Security Insights More Code Issues Pull requests Actions Projects Security Insights main Switch branches/tags Branches Tags Nothing to show {{ refName }} default View all branches Nothing to show {{ refName }} default View all tags 1 branch 0 tags Go to file Code Clone HTTPS GitHub CLI Use Git or checkout with SVN using the web URL. Work fast with our official CLI. Learn more. Open with GitHub Desktop Download ZIP Launching GitHub Desktop If nothing happens, download GitHub Desktop and try again. Go back Launching GitHub Desktop If nothing happens, download GitHub Desktop and try again. Go back Launching Xcode If nothing happens, download Xcode and try again. Go back Launching Visual Studio If nothing happens, download the GitHub extension for Visual Studio and try again. Go back Latest commit   Git stats 15 commits Files Permalink Failed to load latest commit information. 
github-com-8456 ---- GitHub - DocNow/twarc-ids: A plugin for twarc2 to extract tweet ids from tweet JSON.
DocNow / twarc-ids (MIT License, 1 star, 0 forks, 1 branch, 0 tags, 15 commits). Files: test-data, .gitignore, LICENSE, README.md, setup.cfg, setup.py, test_twarc_ids.py, twarc_ids.py.

twarc-ids

This module is a simple example of how to create a plugin for twarc. It uses click-plugins to extend the main twarc command and to manage the command line options.

First you need to install twarc and this plugin:

pip install twarc
pip install twarc-ids

Now you can collect data using the core twarc utility:

twarc search blacklivesmatter > tweets.jsonl

You now have a new subcommand, ids, supplied by twarc-ids:

twarc ids tweets.jsonl > ids.txt

It's good practice to include some tests for your module. See test_twarc_ids.py for an example. You can run it directly with pytest or using:

python setup.py test

When creating your setup.py make sure you don't forget the entry_points magic so that twarc will find your plugin when it is installed!
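The README describes the plugin mechanism but does not reproduce the plugin source here, so the following is a rough sketch of what a click-based ids command and its registration might look like. The command body and the "twarc.plugins" entry-point group name are assumptions for illustration, not code taken from the repository.

# Hypothetical sketch of a twarc2 plugin command built with click, in the
# spirit of twarc_ids.py described above.
import json
import click


@click.command()
@click.argument("infile", type=click.File("r"), default="-")
@click.argument("outfile", type=click.File("w"), default="-")
def ids(infile, outfile):
    """Print the id of each tweet in a file of line-oriented tweet JSON."""
    for line in infile:
        tweet = json.loads(line)
        click.echo(tweet["id"], file=outfile)

# In setup.py the command would be registered so twarc can discover it, e.g.
# entry_points={"twarc.plugins": ["ids = twarc_ids:ids"]}  (assumed group name).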
github-com-8970 ---- GitHub - robintw/CW-ideas: Hack day project from CW21 working on collating and analysing collaborative ideas and hack day projects from previous Collaborations Workshops
robintw / CW-ideas (MIT License, 1 star, 0 forks, 3 branches, 0 tags, 124 commits). Website: robintw.github.io/cw-ideas/. Files: .github/workflows, archetypes, content, static, themes/PaperMod, CONTRIBUTING.md, LICENSE, README.md, config.yml.

Exploring previous Collaborations Workshop ideas (CW-ideas)

This is the repo for a hack day project from Collaborations Workshop 2021 which aims to explore previous ideas from Collaborations Workshops and provide them in an easily browseable and searchable form. A live version of the website is hosted at https://robintw.github.io/CW-ideas/. The repo consists of markdown versions of the collaborative ideas and hackday pitches, plus code to host a website to view them. To contribute to the repository, either by adding new ideas from previous CWs or by contributing to the code to view the ideas, please see the contributing guide. This repository is licensed under the MIT license, and all the ideas themselves are CC-BY (this is mentioned at the bottom of each idea). The team creating this was Mario Antonioletti, Heather Turner and Robin Wilson.

Building locally

The repository is automatically built and deployed on every push, but if you want to build locally for testing or debugging purposes, follow the instructions below:
Install Hugo.
In the root of the repo, run hugo server.
The site will be built and served on localhost; see the command-line output for the full URL.

Task split during the hack day

Heather Turner: The brains behind the idea
Robin Wilson: The technical guru
Mario Antonioletti: The plodder with superpowers

Tasks divided orthogonally:
Conversion of past google doc proposals to markdown (Mario and Robin)
Configuring and setting up Hugo (Robin and Heather)
Provisioning a GitHub repo (Robin)

Hack day presentation: Available here
github-com-9494 ---- twarc/deletes.py at main · DocNow/twarc · GitHub
DocNow / twarc: twarc/utils/deletes.py (executable file, 187 lines):

#!/usr/bin/env python3

"""
This program assumes that you are feeding it tweet JSON data for tweets that
have been deleted. It will use the metadata and the API to analyze why each
tweet appears to have been deleted. Note that lookups are based on user id,
so may give different results than looking up a user by screen name.
"""

import json
import fileinput
import collections
import requests
import twarc
import argparse
import logging

USER_OK = "USER_OK"
USER_DELETED = "USER_DELETED"
USER_PROTECTED = "USER_PROTECTED"
USER_SUSPENDED = "USER_SUSPENDED"

TWEET_OK = "TWEET_OK"
TWEET_DELETED = "TWEET_DELETED"
# You have been blocked by the user.
TWEET_BLOCKED = "TWEET_BLOCKED"

RETWEET_DELETED = "RETWEET_DELETED"
ORIGINAL_TWEET_DELETED = "ORIGINAL_TWEET_DELETED"
ORIGINAL_TWEET_BLOCKED = "ORIGINAL_TWEET_BLOCKED"
ORIGINAL_USER_DELETED = "ORIGINAL_USER_DELETED"
ORIGINAL_USER_PROTECTED = "ORIGINAL_USER_PROTECTED"
ORIGINAL_USER_SUSPENDED = "ORIGINAL_USER_SUSPENDED"

t = twarc.Twarc()


def main(files, enhance_tweet=False, print_results=True):
    counts = collections.Counter()
    for count, line in enumerate(fileinput.input(files=files)):
        if count % 10000 == 0:
            logging.info("processed {:,} tweets".format(count))
        tweet = json.loads(line)
        result = examine(tweet)
        if enhance_tweet:
            tweet['delete_reason'] = result
            print(json.dumps(tweet))
        else:
            print(tweet_url(tweet), result)
        counts[result] += 1
    if print_results:
        for result, count in counts.most_common():
            print(result, count)


def examine(tweet):
    user_status = get_user_status(tweet)

    # Go with user status first (suspended, protected, deleted)
    if user_status != USER_OK:
        return user_status
    else:
        retweet = tweet.get('retweeted_status', None)
        tweet_status = get_tweet_status(tweet)

        # If not a retweet and tweet deleted, then tweet deleted.
        if tweet_status == TWEET_OK:
            return TWEET_OK
        elif retweet is None or tweet_status == TWEET_BLOCKED:
            return tweet_status
        else:
            rt_status = examine(retweet)
            if rt_status == USER_DELETED:
                return ORIGINAL_USER_DELETED
            elif rt_status == USER_PROTECTED:
                return ORIGINAL_USER_PROTECTED
            elif rt_status == USER_SUSPENDED:
                return ORIGINAL_USER_SUSPENDED
            elif rt_status == TWEET_DELETED:
                return ORIGINAL_TWEET_DELETED
            elif rt_status == TWEET_BLOCKED:
                return ORIGINAL_TWEET_BLOCKED
            elif rt_status == TWEET_OK:
                return RETWEET_DELETED
            else:
                raise Exception("Unexpected retweet status %s for %s" % (rt_status, tweet['id_str']))


users = {}


def get_user_status(tweet):
    user_id = tweet['user']['id_str']
    if user_id in users:
        return users[user_id]

    url = "https://api.twitter.com/1.1/users/show.json"
    params = {"user_id": user_id}

    # USER_DELETED: 404 and {"errors": [{"code": 50, "message": "User not found."}]}
    # USER_PROTECTED: 200 and user object with "protected": true
    # USER_SUSPENDED: 403 and {"errors":[{"code":63,"message":"User has been suspended."}]}

    result = USER_OK
    try:
        resp = t.get(url, params=params, allow_404=True)
        user = resp.json()
        if user['protected']:
            result = USER_PROTECTED
    except requests.exceptions.HTTPError as e:
        try:
            resp_json = e.response.json()
        except json.decoder.JSONDecodeError:
            raise e
        if e.response.status_code == 404 and has_error_code(resp_json, 50):
            result = USER_DELETED
        elif e.response.status_code == 403 and has_error_code(resp_json, 63):
            result = USER_SUSPENDED
        else:
            raise e

    users[user_id] = result
    return result


tweets = {}


def get_tweet_status(tweet):
    id = tweet['id_str']
    if id in tweets:
        return tweets[id]

    # USER_SUSPENDED: 403 and {"errors":[{"code":63,"message":"User has been suspended."}]}
    # USER_PROTECTED: 403 and {"errors":[{"code":179,"message":"Sorry, you are not authorized to see this status."}]}
    # TWEET_DELETED: 404 and {"errors":[{"code":144,"message":"No status found with that ID."}]}
    #                or {"errors":[{"code":34,"message":"Sorry, that page does not exist."}]}

    url = "https://api.twitter.com/1.1/statuses/show.json"
    params = {"id": id}

    result = TWEET_OK
    try:
        t.get(url, params=params, allow_404=True)
    except requests.exceptions.HTTPError as e:
        try:
            resp_json = e.response.json()
        except json.decoder.JSONDecodeError:
            raise e
        if e.response.status_code == 404 and has_error_code(resp_json, (34, 144)):
            result = TWEET_DELETED
        elif e.response.status_code == 403 and has_error_code(resp_json, 63):
            result = USER_SUSPENDED
        elif e.response.status_code == 403 and has_error_code(resp_json, 179):
            result = USER_PROTECTED
        elif e.response.status_code == 401 and has_error_code(resp_json, 136):
            result = TWEET_BLOCKED
        else:
            raise e

    tweets[id] = result
    return result


def tweet_url(tweet):
    return "https://twitter.com/%s/status/%s" % (
        tweet['user']['screen_name'], tweet['id_str'])


def has_error_code(resp, code):
    if isinstance(code, int):
        code = (code, )
    for error in resp['errors']:
        if error['code'] in code:
            return True
    return False


if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument('--enhance', action='store_true',
                        help='Enhance tweet with delete_reason and output enhanced tweet.')
    parser.add_argument('--skip-results', action='store_true',
                        help='Skip outputting delete reason summary')
    parser.add_argument('files', metavar='FILE', nargs='*',
                        help='files to read, if empty, stdin is used')
    args = parser.parse_args()
    main(args.files if len(args.files) > 0 else ('-',),
         enhance_tweet=args.enhance,
         print_results=not args.skip_results and not args.enhance)
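As a rough illustration, the reason codes above can also be used programmatically. This sketch assumes the utils directory is importable, that twarc v1.1 credentials are configured (examine() calls the API), and an invented input file name.

# Hypothetical direct use of examine() and tweet_url() from deletes.py.
# Assumes utils/ is on sys.path, twarc v1.1 credentials are configured,
# and "deleted_tweets.jsonl" is an invented file name.
import json
from deletes import examine, tweet_url

with open("deleted_tweets.jsonl") as f:
    for line in f:
        tweet = json.loads(line)
        print(tweet_url(tweet), examine(tweet))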
github-com-9542 ---- GitHub - DocNow/twarc: A command line tool (and Python library) for archiving Twitter JSON
DocNow / twarc (MIT License, 1k stars, 214 forks, 4 branches, 99 tags, 1,260 commits). Latest commit: edsu, "Merge pull request #443 from DocNow/install-docs-mac-clarifications" (Clarifications to Mac install instructions), 5ebd0ef, Apr 26, 2021.
Files:
.github/workflows (add a message to the slack notification, Apr 8, 2021)
docs (Update README.md, Apr 26, 2021)
twarc (Pagination fix, Apr 25, 2021)
utils (commit on every insert is slow when writing to a usb thumbdrive appar…, Feb 27, 2021)
.gitignore (Retweets changes, Jun 25, 2020)
.readthedocs.yaml (moving readthedocs back here; refs #421, Apr 7, 2021)
LICENSE (it easier this way I think, Apr 12, 2021)
MANIFEST.in (add docs/README.md to manifest so setup.py can read it, Apr 7, 2021)
README.md (instructions for running mkdocs locally, Apr 12, 2021)
mkdocs.yml (fix docs edit links, Apr 7, 2021)
requirements-mkdocs.txt (moving readthedocs back here; refs #421, Apr 7, 2021)
requirements.txt (moving readthedocs back here; refs #421, Apr 7, 2021)
setup.cfg (small fixes to tests for python3, and a new version, Sep 15, 2016)
setup.py (read version from version.py, Apr 7, 2021)
test_twarc.py (and then there 100, Mar 27, 2021)
test_twarc2.py (Pagination fix, Apr 25, 2021)

twarc

Collect data at the command line from the Twitter API (v1.1 and v2). Read the documentation. Ask questions in Slack or Matrix.

Contributing

Documentation

The documentation is managed at ReadTheDocs. If you would like to improve the documentation you can edit the Markdown files in docs or add new ones, then send a pull request and we can add it. To view your documentation locally you should be able to:

pip install -r requirements-mkdocs.txt
mkdocs serve
open http://127.0.0.1:8000/

If you prefer, you can create a page on the wiki to workshop the documentation, and then when/if you think it's ready to be merged with the documentation, create an issue. Please feel free to create whatever documentation is useful in the wiki area.

Code

If you are interested in adding functionality to twarc or fixing something that's broken, here are the steps to setting up your development environment:

git clone https://github.com/docnow/twarc
cd twarc
pip install -r requirements.txt

Create a .env file that includes Twitter App keys to use during testing:

BEARER_TOKEN=CHANGEME
CONSUMER_KEY=CHANGEME
CONSUMER_SECRET=CHANGEME
ACCESS_TOKEN=CHANGEME
ACCESS_TOKEN_SECRET=CHANGEME

Now run the tests:

python setup.py test

Add your code and some new tests, and send a pull request!

Releases: 99 (latest v2.0.8, Apr 25, 2021). Used by 146. Contributors: 51. Languages: Python 100.0%.
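Since the README describes twarc as a Python library as well as a command line tool, a minimal sketch of library use with the v1.1 client follows, mirroring how utils/deletes.py above instantiates twarc.Twarc(); the query string is only an example and credentials are assumed to be configured.

# Minimal sketch of using twarc as a library (v1.1 client), mirroring how
# utils/deletes.py above constructs twarc.Twarc(). Assumes credentials have
# been set up with `twarc configure`; the query is only an example.
import twarc

t = twarc.Twarc()
for tweet in t.search("blacklivesmatter"):
    print(tweet["id_str"])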
github-com-9574 ---- twarc/utils at main · DocNow/twarc · GitHub
DocNow / twarc: twarc/utils/ (latest commit: edsu, "commit on every insert is slow when writing to a usb thumbdrive apparently", 3dd7635, Feb 27, 2021). Files:
auth_timing.py, deduplicate.py, deleted.py, deleted_users.py, deletes.py, embeds.py, emojis.py, extractor.py, filter_date.py, filter_users.py, flakey.py, foaf.py, gender.py, geo.py, geofilter.py, geojson.py, json2csv.py, media2warc.py, media_urls.py, network.py, noretweets.py, oembeds.py, remove_limit.py, retweets.py, search.py, sensitive.py, sort_by_id.py, source.py, tags.py, times.py, twarc-archive.py, tweet.py, tweet_compliance.py, tweet_text.py, tweet_urls.py, tweetometer.py, tweets.py, unshrtn.py, urls.py, users.py, validate.py, wall.py, wayback.py, webarchives.py, wordcloud.py, youtubedl.py

github-com-9699 ---- GitHub - dokempf/credit-all
dokempf / credit-all (MIT License, 0 stars, 0 forks, 1 branch, 0 tags, 37 commits). Files: creditall, .all-contributorsrc, .gitignore, CodeOfConduct.md, Credit-all.odp, LICENSE.md, MANIFEST.in, README.md, Sandstrom2021.jpg, setup.py.

Welcome! Thanks for visiting Credit All! 😁

In this document you can find lots of information about this project. You can just scroll down or use the quick links below for each section: What is this project about and why is it important? The problem. The solution. Installation. Who are we? What does this project need? We need you! How can you get involved? Get in touch. Thank you.

What is this project about and why is it important?

There is no one size fits all system for capturing all of the contributions during different research projects. This could be a scientific research project, a software development project or an open-source community project. We think it is important that all contributions are recorded and therefore everyone is given credit for their work more fairly.

The problem

Current systems that attribute contributions to authors in academic outputs do not include all of the jobs/roles/tasks that are encompassed in research projects. The current problems include:
Capturing all roles on a project.
Capturing all tasks within those roles.
How to convert this into the actual authorship or contributions list that can be used for project outputs.
How this list can be presented.
The solution

This project takes inspiration from Malin Sandström's lightning talk at the Software Sustainability Institute's Collaborations Workshop 2021, in which she proposed combining the current contribution approaches.

[Slide from Malin Sandström's SSI talk]

In this project, we propose to:
Expand current lists to be more inclusive, using current systems such as CRediT, INRIA, BIDS Contributors.
Develop a tool to record these contributions during the project, such as within a GitHub repository; we have adapted the All Contributors bot for our tool.
Develop a way that this can be shown on academic papers: lists, table, cinema title page? (Look at e.g. the Brainhack paper with 100+ authors and Living with Machines.)

Installation

You can install the command line tool using pip:

python -m pip install git+git://github.com/dokempf/credit-all.git

Who are we?

In alphabetical order:
Daisy Perry (Writing a code of conduct, Curating data)
Dominic Kempf (Initial ideas of the project, Writing new code, Writing documentation about the code)
Emma Karoune (Initial ideas of the project, Curating data)
Malin Sandström (Initial ideas of the project, Curating data)

What does this project need? We need you!

Please review our list of tasks and tell us if something needs to be added. Spot a bug and tell us about it! Suggest new ways that our contributions list can be presented. If you have any feedback on the work that is going on, then please get in contact.

How can you get involved?

If you think you can help in any way or just want to suggest something currently not in the project, then please check out the contributor's guidelines. Please note that it's very important to maintain a positive and supportive environment for everyone who wants to participate. When you join as a collaborator, you must follow the code of conduct in all interactions both on and offline.

Get in touch

Please feel free to get in touch with our team: ekaroune@googlemail.com

Thank you

Thanks for taking the time to read this project page and do please get involved.

github-com-9789 ---- EIPs/eip-721.md at master · ethereum/EIPs · GitHub
ethereum / EIPs: EIPs/EIPS/eip-721.md (447 lines, 29.7 KB; 13 contributors). Latest commit: MicahZoltu, "Adds rule to EIP-1 that references to other EIPs must use relative path format and the first reference must be linked. (#2947)", 15f61ed, Sep 29, 2020.

Contents: Simple Summary, Abstract, Motivation, Specification, Caveats, Rationale, Backwards Compatibility, Test Cases, Implementations, References, Copyright.

eip: 721
title: ERC-721 Non-Fungible Token Standard
author: William Entriken, Dieter Shirley, Jacob Evans, Nastassia Sachs
discussions-to: https://github.com/ethereum/eips/issues/721
type: Standards Track
category: ERC
status: Final
created: 2018-01-24
requires: 165

Simple Summary

A standard interface for non-fungible tokens, also known as deeds.

Abstract

The following standard allows for the implementation of a standard API for NFTs within smart contracts. This standard provides basic functionality to track and transfer NFTs. We considered use cases of NFTs being owned and transacted by individuals as well as consignment to third party brokers/wallets/auctioneers ("operators"). NFTs can represent ownership over digital or physical assets. We considered a diverse universe of assets, and we know you will dream up many more:
Physical property: houses, unique artwork
Virtual collectables: unique pictures of kittens, collectable cards
"Negative value" assets: loans, burdens and other responsibilities

In general, all houses are distinct and no two kittens are alike. NFTs are distinguishable and you must track the ownership of each one separately.
Motivation A standard interface allows wallet/broker/auction applications to work with any NFT on Ethereum. We provide for simple ERC-721 smart contracts as well as contracts that track an arbitrarily large number of NFTs. Additional applications are discussed below. This standard is inspired by the ERC-20 token standard and builds on two years of experience since EIP-20 was created. EIP-20 is insufficient for tracking NFTs because each asset is distinct (non-fungible) whereas each of a quantity of tokens is identical (fungible). Differences between this standard and EIP-20 are examined below. Specification The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119. Every ERC-721 compliant contract must implement the ERC721 and ERC165 interfaces (subject to "caveats" below): pragma solidity ^0.4.20; /// @title ERC-721 Non-Fungible Token Standard /// @dev See https://eips.ethereum.org/EIPS/eip-721 /// Note: the ERC-165 identifier for this interface is 0x80ac58cd. interface ERC721 /* is ERC165 */ { /// @dev This emits when ownership of any NFT changes by any mechanism. /// This event emits when NFTs are created (`from` == 0) and destroyed /// (`to` == 0). Exception: during contract creation, any number of NFTs /// may be created and assigned without emitting Transfer. At the time of /// any transfer, the approved address for that NFT (if any) is reset to none. event Transfer(address indexed _from, address indexed _to, uint256 indexed _tokenId); /// @dev This emits when the approved address for an NFT is changed or /// reaffirmed. The zero address indicates there is no approved address. /// When a Transfer event emits, this also indicates that the approved /// address for that NFT (if any) is reset to none. event Approval(address indexed _owner, address indexed _approved, uint256 indexed _tokenId); /// @dev This emits when an operator is enabled or disabled for an owner. /// The operator can manage all NFTs of the owner. event ApprovalForAll(address indexed _owner, address indexed _operator, bool _approved); /// @notice Count all NFTs assigned to an owner /// @dev NFTs assigned to the zero address are considered invalid, and this /// function throws for queries about the zero address. /// @param _owner An address for whom to query the balance /// @return The number of NFTs owned by `_owner`, possibly zero function balanceOf(address _owner) external view returns (uint256); /// @notice Find the owner of an NFT /// @dev NFTs assigned to zero address are considered invalid, and queries /// about them do throw. /// @param _tokenId The identifier for an NFT /// @return The address of the owner of the NFT function ownerOf(uint256 _tokenId) external view returns (address); /// @notice Transfers the ownership of an NFT from one address to another address /// @dev Throws unless `msg.sender` is the current owner, an authorized /// operator, or the approved address for this NFT. Throws if `_from` is /// not the current owner. Throws if `_to` is the zero address. Throws if /// `_tokenId` is not a valid NFT. When transfer is complete, this function /// checks if `_to` is a smart contract (code size > 0). If so, it calls /// `onERC721Received` on `_to` and throws if the return value is not /// `bytes4(keccak256("onERC721Received(address,address,uint256,bytes)"))`. 
/// @param _from The current owner of the NFT /// @param _to The new owner /// @param _tokenId The NFT to transfer /// @param data Additional data with no specified format, sent in call to `_to` function safeTransferFrom(address _from, address _to, uint256 _tokenId, bytes data) external payable; /// @notice Transfers the ownership of an NFT from one address to another address /// @dev This works identically to the other function with an extra data parameter, /// except this function just sets data to "". /// @param _from The current owner of the NFT /// @param _to The new owner /// @param _tokenId The NFT to transfer function safeTransferFrom(address _from, address _to, uint256 _tokenId) external payable; /// @notice Transfer ownership of an NFT -- THE CALLER IS RESPONSIBLE /// TO CONFIRM THAT `_to` IS CAPABLE OF RECEIVING NFTS OR ELSE /// THEY MAY BE PERMANENTLY LOST /// @dev Throws unless `msg.sender` is the current owner, an authorized /// operator, or the approved address for this NFT. Throws if `_from` is /// not the current owner. Throws if `_to` is the zero address. Throws if /// `_tokenId` is not a valid NFT. /// @param _from The current owner of the NFT /// @param _to The new owner /// @param _tokenId The NFT to transfer function transferFrom(address _from, address _to, uint256 _tokenId) external payable; /// @notice Change or reaffirm the approved address for an NFT /// @dev The zero address indicates there is no approved address. /// Throws unless `msg.sender` is the current NFT owner, or an authorized /// operator of the current owner. /// @param _approved The new approved NFT controller /// @param _tokenId The NFT to approve function approve(address _approved, uint256 _tokenId) external payable; /// @notice Enable or disable approval for a third party ("operator") to manage /// all of `msg.sender`'s assets /// @dev Emits the ApprovalForAll event. The contract MUST allow /// multiple operators per owner. /// @param _operator Address to add to the set of authorized operators /// @param _approved True if the operator is approved, false to revoke approval function setApprovalForAll(address _operator, bool _approved) external; /// @notice Get the approved address for a single NFT /// @dev Throws if `_tokenId` is not a valid NFT. /// @param _tokenId The NFT to find the approved address for /// @return The approved address for this NFT, or the zero address if there is none function getApproved(uint256 _tokenId) external view returns (address); /// @notice Query if an address is an authorized operator for another address /// @param _owner The address that owns the NFTs /// @param _operator The address that acts on behalf of the owner /// @return True if `_operator` is an approved operator for `_owner`, false otherwise function isApprovedForAll(address _owner, address _operator) external view returns (bool); } interface ERC165 { /// @notice Query if a contract implements an interface /// @param interfaceID The interface identifier, as specified in ERC-165 /// @dev Interface identification is specified in ERC-165. This function /// uses less than 30,000 gas. /// @return `true` if the contract implements `interfaceID` and /// `interfaceID` is not 0xffffffff, `false` otherwise function supportsInterface(bytes4 interfaceID) external view returns (bool); } A wallet/broker/auction application MUST implement the wallet interface if it will accept safe transfers. /// @dev Note: the ERC-165 identifier for this interface is 0x150b7a02. 
A wallet/broker/auction application MUST implement the wallet interface if it will accept safe transfers.

/// @dev Note: the ERC-165 identifier for this interface is 0x150b7a02.
interface ERC721TokenReceiver {
    /// @notice Handle the receipt of an NFT
    /// @dev The ERC721 smart contract calls this function on the recipient
    ///  after a `transfer`. This function MAY throw to revert and reject the
    ///  transfer. Return of other than the magic value MUST result in the
    ///  transaction being reverted.
    ///  Note: the contract address is always the message sender.
    /// @param _operator The address which called `safeTransferFrom` function
    /// @param _from The address which previously owned the token
    /// @param _tokenId The NFT identifier which is being transferred
    /// @param _data Additional data with no specified format
    /// @return `bytes4(keccak256("onERC721Received(address,address,uint256,bytes)"))`
    ///  unless throwing
    function onERC721Received(address _operator, address _from, uint256 _tokenId, bytes _data) external returns(bytes4);
}

The metadata extension is OPTIONAL for ERC-721 smart contracts (see "caveats", below). This allows your smart contract to be interrogated for its name and for details about the assets which your NFTs represent.

/// @title ERC-721 Non-Fungible Token Standard, optional metadata extension
/// @dev See https://eips.ethereum.org/EIPS/eip-721
///  Note: the ERC-165 identifier for this interface is 0x5b5e139f.
interface ERC721Metadata /* is ERC721 */ {
    /// @notice A descriptive name for a collection of NFTs in this contract
    function name() external view returns (string _name);

    /// @notice An abbreviated name for NFTs in this contract
    function symbol() external view returns (string _symbol);

    /// @notice A distinct Uniform Resource Identifier (URI) for a given asset.
    /// @dev Throws if `_tokenId` is not a valid NFT. URIs are defined in RFC
    ///  3986. The URI may point to a JSON file that conforms to the "ERC721
    ///  Metadata JSON Schema".
    function tokenURI(uint256 _tokenId) external view returns (string);
}

This is the "ERC721 Metadata JSON Schema" referenced above.

{
    "title": "Asset Metadata",
    "type": "object",
    "properties": {
        "name": {
            "type": "string",
            "description": "Identifies the asset to which this NFT represents"
        },
        "description": {
            "type": "string",
            "description": "Describes the asset to which this NFT represents"
        },
        "image": {
            "type": "string",
            "description": "A URI pointing to a resource with mime type image/* representing the asset to which this NFT represents. Consider making any images at a width between 320 and 1080 pixels and aspect ratio between 1.91:1 and 4:5 inclusive."
        }
    }
}

The enumeration extension is OPTIONAL for ERC-721 smart contracts (see "caveats", below). This allows your contract to publish its full list of NFTs and make them discoverable.

/// @title ERC-721 Non-Fungible Token Standard, optional enumeration extension
/// @dev See https://eips.ethereum.org/EIPS/eip-721
///  Note: the ERC-165 identifier for this interface is 0x780e9d63.
interface ERC721Enumerable /* is ERC721 */ {
    /// @notice Count NFTs tracked by this contract
    /// @return A count of valid NFTs tracked by this contract, where each one of
    ///  them has an assigned and queryable owner not equal to the zero address
    function totalSupply() external view returns (uint256);

    /// @notice Enumerate valid NFTs
    /// @dev Throws if `_index` >= `totalSupply()`.
    /// @param _index A counter less than `totalSupply()`
    /// @return The token identifier for the `_index`th NFT,
    ///  (sort order not specified)
    function tokenByIndex(uint256 _index) external view returns (uint256);

    /// @notice Enumerate NFTs assigned to an owner
    /// @dev Throws if `_index` >= `balanceOf(_owner)` or if
    ///  `_owner` is the zero address, representing invalid NFTs.
    /// @param _owner An address where we are interested in NFTs owned by them
    /// @param _index A counter less than `balanceOf(_owner)`
    /// @return The token identifier for the `_index`th NFT assigned to `_owner`,
    ///  (sort order not specified)
    function tokenOfOwnerByIndex(address _owner, uint256 _index) external view returns (uint256);
}

Caveats

The 0.4.20 Solidity interface grammar is not expressive enough to document the ERC-721 standard. A contract which complies with ERC-721 MUST also abide by the following:

Solidity issue #3412: The above interfaces include explicit mutability guarantees for each function. Mutability guarantees are, in order from weak to strong: payable, implicit nonpayable, view, and pure. Your implementation MUST meet the mutability guarantee in this interface and you MAY meet a stronger guarantee. For example, a payable function in this interface may be implemented as nonpayable (no state mutability specified) in your contract. We expect a later Solidity release will allow your stricter contract to inherit from this interface, but a workaround for version 0.4.20 is that you can edit this interface to add stricter mutability before inheriting from your contract.

Solidity issue #3419: A contract that implements ERC721Metadata or ERC721Enumerable SHALL also implement ERC721. ERC-721 implements the requirements of interface ERC-165.

Solidity issue #2330: If a function is shown in this specification as external then a contract will be compliant if it uses public visibility. As a workaround for version 0.4.20, you can edit this interface to switch to public before inheriting from your contract.

Solidity issues #3494, #3544: Use of this.*.selector is marked as a warning by Solidity; a future version of Solidity will not mark this as an error.

If a newer version of Solidity allows the caveats to be expressed in code, then this EIP MAY be updated and the caveats removed; such an update will be equivalent to the original specification.
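To make the mutability and visibility caveats concrete, here is a hypothetical implementation fragment. The contract name, the storage layout, and the simplified checks are illustrative assumptions only; it is not a complete ERC-721 implementation. It declares functions that the interface marks `external payable` or `external view` with `public` visibility and stronger mutability, which the caveats above permit.

pragma solidity ^0.4.20;

/// Hypothetical fragment, not a complete implementation: it only illustrates
/// that `public` may replace `external` (Solidity issue #2330) and that a
/// stronger mutability guarantee is allowed (Solidity issue #3412).
contract CaveatsExample {
    mapping(uint256 => address) internal tokenOwner;
    mapping(uint256 => address) internal tokenApproval;

    /// Declared `external payable` in the ERC721 interface; implementing it as
    /// `public` and nonpayable is still compliant.
    function approve(address _approved, uint256 _tokenId) public {
        require(msg.sender == tokenOwner[_tokenId]);
        tokenApproval[_tokenId] = _approved;
        // a complete implementation would also emit the Approval event
    }

    /// Declared `external view` in the ERC721 interface; `public view` is compliant.
    function ownerOf(uint256 _tokenId) public view returns (address) {
        address owner = tokenOwner[_tokenId];
        require(owner != address(0));
        return owner;
    }
}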
Rationale

There are many proposed uses of Ethereum smart contracts that depend on tracking distinguishable assets. Examples of existing or planned NFTs are LAND in Decentraland, the eponymous punks in CryptoPunks, and in-game items using systems like DMarket or EnjinCoin. Future uses include tracking real-world assets, like real estate (as envisioned by companies like Ubitquity or Propy). It is critical in each of these cases that these items are not "lumped together" as numbers in a ledger, but instead each asset must have its ownership individually and atomically tracked. Regardless of the nature of these assets, the ecosystem will be stronger if we have a standardized interface that allows for cross-functional asset management and sales platforms.

"NFT" Word Choice

"NFT" was satisfactory to nearly everyone surveyed and is widely applicable to a broad universe of distinguishable digital assets. We recognize that "deed" is very descriptive for certain applications of this standard (notably, physical property).

Alternatives considered: distinguishable asset, title, token, asset, equity, ticket

NFT Identifiers

Every NFT is identified by a unique uint256 ID inside the ERC-721 smart contract. This identifying number SHALL NOT change for the life of the contract. The pair (contract address, uint256 tokenId) will then be a globally unique and fully-qualified identifier for a specific asset on an Ethereum chain. While some ERC-721 smart contracts may find it convenient to start with ID 0 and simply increment by one for each new NFT, callers SHALL NOT assume that ID numbers have any specific pattern to them, and MUST treat the ID as a "black box". Also note that an NFT MAY become invalid (be destroyed). Please see the enumeration functions for a supported enumeration interface.

The choice of uint256 allows a wide variety of applications because UUIDs and sha3 hashes are directly convertible to uint256.

Transfer Mechanism

ERC-721 standardizes a safe transfer function safeTransferFrom (overloaded with and without a bytes parameter) and an unsafe function transferFrom. Transfers may be initiated by:

The owner of an NFT
The approved address of an NFT
An authorized operator of the current owner of an NFT

Additionally, an authorized operator may set the approved address for an NFT. This provides a powerful set of tools for wallet, broker and auction applications to quickly use a large number of NFTs.

The transfer and accept functions' documentation only specifies conditions when the transaction MUST throw. Your implementation MAY also throw in other situations. This allows implementations to achieve interesting results:

Disallow transfers if the contract is paused — prior art, CryptoKitties deployed contract, line 611 (a sketch of this pattern appears after this section)
Blacklist certain addresses from receiving NFTs — prior art, CryptoKitties deployed contract, lines 565, 566
Disallow unsafe transfers — transferFrom throws unless _to equals msg.sender or countOf(_to) is non-zero or was non-zero previously (because such cases are safe)
Charge a fee to both parties of a transaction — require payment when calling approve with a non-zero _approved if it was previously the zero address, refund payment if calling approve with the zero address if it was previously a non-zero address, require payment when calling any transfer function, require transfer parameter _to to equal msg.sender, require transfer parameter _to to be the approved address for the NFT
Read only NFT registry — always throw from safeTransferFrom, transferFrom, approve and setApprovalForAll

Failed transactions will throw, a best practice identified in ERC-223, ERC-677, ERC-827 and OpenZeppelin's implementation of SafeERC20.sol. ERC-20 defined an allowance feature; this caused a problem when an allowance was set and then later modified to a different amount, as in OpenZeppelin issue #438. In ERC-721, there is no allowance because every NFT is unique; the quantity is none or one. Therefore we receive the benefits of ERC-20's original design without the problems that were discovered later.

Creation of NFTs ("minting") and destruction of NFTs ("burning") are not included in the specification. Your contract may implement these by other means. Please see the event documentation for your responsibilities when creating or destroying NFTs.

We questioned if the operator parameter on onERC721Received was necessary. In all cases we could imagine, if the operator was important then that operator could transfer the token to themself and then send it -- then they would be the from address. This seems contrived because we consider the operator to be a temporary owner of the token (and transferring to themself is redundant). When the operator sends the token, it is the operator acting of their own accord, NOT the operator acting on behalf of the token holder. This is why the operator and the previous token owner are both significant to the token recipient.

Alternatives considered: only allow two-step ERC-20 style transaction, require that transfer functions never throw, require all functions to return a boolean indicating the success of the operation.
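As an illustration of the "MAY also throw in other situations" latitude, here is a hypothetical sketch of the paused-transfer pattern mentioned in the list above. The contract name, the `paused` flag, and the administrator logic are illustrative assumptions; only the transferFrom signature comes from the standard, and the approval and operator checks are omitted to keep the sketch short.

pragma solidity ^0.4.20;

/// Hypothetical sketch, not a complete ERC-721 implementation: it shows only
/// how an implementation MAY add an extra throw condition (here, a pause flag)
/// on top of the conditions the standard requires.
contract PausedTransferExample {
    address public admin;                        // illustrative administrator
    bool public paused;                          // when true, transfers revert
    mapping(uint256 => address) internal tokenOwner;

    modifier whenNotPaused() {
        require(!paused);
        _;
    }

    function PausedTransferExample() public {
        admin = msg.sender;
    }

    function setPaused(bool _paused) external {
        require(msg.sender == admin);
        paused = _paused;
    }

    /// Same signature as the standard's transferFrom; simplified here to
    /// owner-only transfers (no approvals or operators).
    function transferFrom(address _from, address _to, uint256 _tokenId) external payable whenNotPaused {
        require(tokenOwner[_tokenId] == _from);
        require(msg.sender == _from);
        require(_to != address(0));
        tokenOwner[_tokenId] = _to;
        // a complete implementation would also clear approvals and emit Transfer
    }
}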
ERC-165 Interface

We chose Standard Interface Detection (ERC-165) to expose the interfaces that an ERC-721 smart contract supports.

A future EIP may create a global registry of interfaces for contracts. We strongly support such an EIP and it would allow your ERC-721 implementation to implement ERC721Enumerable, ERC721Metadata, or other interfaces by delegating to a separate contract.

Gas and Complexity (regarding the enumeration extension)

This specification contemplates implementations that manage a few NFTs as well as arbitrarily large numbers of NFTs. If your application is able to grow then avoid using for/while loops in your code (see CryptoKitties bounty issue #4). These indicate your contract may be unable to scale and gas costs will rise over time without bound.

We have deployed a contract, XXXXERC721, to Testnet which instantiates and tracks 340282366920938463463374607431768211456 different deeds (2^128). That's enough to assign every IPv6 address to an Ethereum account owner, or to track ownership of nanobots a few microns in size and in aggregate totalling half the size of Earth. You can query it from the blockchain. And every function takes less gas than querying the ENS. This illustration makes clear: the ERC-721 standard scales.

Alternatives considered: remove the asset enumeration function if it requires a for-loop, return a Solidity array type from enumeration functions.

Privacy

Wallets/brokers/auctioneers identified in the motivation section have a strong need to identify which NFTs an owner owns. It may be interesting to consider a use case where NFTs are not enumerable, such as a private registry of property ownership, or a partially-private registry. However, privacy cannot be attained because an attacker can simply (!) call ownerOf for every possible tokenId.

Metadata Choices (metadata extension)

We have required name and symbol functions in the metadata extension. Every token EIP and draft we reviewed (ERC-20, ERC-223, ERC-677, ERC-777, ERC-827) included these functions. We remind implementation authors that the empty string is a valid response to name and symbol if you protest the usage of this mechanism. We also remind everyone that any smart contract can use the same name and symbol as your contract. How a client may determine which ERC-721 smart contracts are well-known (canonical) is outside the scope of this standard.

A mechanism is provided to associate NFTs with URIs. We expect that many implementations will take advantage of this to provide metadata for each NFT. The image size recommendation is taken from Instagram; they probably know much about image usability. The URI MAY be mutable (i.e. it changes from time to time). We considered an NFT representing ownership of a house; in this case metadata about the house (image, occupants, etc.) can naturally change.

Metadata is returned as a string value. Currently this is only usable by calling from web3, not from other contracts. This is acceptable because we have not considered a use case where an on-blockchain application would query such information.

Alternatives considered: put all metadata for each asset on the blockchain (too expensive), use URL templates to query metadata parts (URL templates do not work with all URL schemes, especially P2P URLs), multiaddr network address (not mature enough)
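As a purely hypothetical illustration of the URI mechanism discussed above, an implementation might simply store one URI per token and allow it to be updated, since the URI MAY be mutable. The contract name, the `tokenURIs` mapping, and the `setTokenURI` helper below are illustrative assumptions, not part of the standard.

pragma solidity ^0.4.20;

/// Hypothetical sketch of per-token URI storage for the optional metadata
/// extension; only tokenURI comes from the standard, the rest is illustrative.
contract TokenURIExample {
    mapping(uint256 => address) internal tokenOwner;
    mapping(uint256 => string) internal tokenURIs;

    /// @notice A distinct Uniform Resource Identifier (URI) for a given asset
    /// @dev Throws if `_tokenId` is not a valid NFT.
    function tokenURI(uint256 _tokenId) external view returns (string) {
        require(tokenOwner[_tokenId] != address(0));
        return tokenURIs[_tokenId];
    }

    /// Illustrative setter: lets the current owner point the token at updated
    /// metadata (for example, a new JSON document conforming to the schema above).
    function setTokenURI(uint256 _tokenId, string _uri) external {
        require(msg.sender == tokenOwner[_tokenId]);
        tokenURIs[_tokenId] = _uri;
    }
}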
Community Consensus

A significant amount of discussion occurred on the original ERC-721 issue; additionally we held a first live meeting on Gitter that had good representation and was well advertised (on Reddit, in the Gitter #ERC channel, and the original ERC-721 issue). Thank you to the participants:

@ImAllInNow Rob from DEC Gaming / Presenting Michigan Ethereum Meetup Feb 7
@Arachnid Nick Johnson
@jadhavajay Ajay Jadhav from AyanWorks
@superphly Cody Marx Bailey - XRAM Capital / Sharing at hackathon Jan 20 / UN Future of Finance Hackathon
@fulldecent William Entriken

A second event was held at ETHDenver 2018 to discuss distinguishable asset standards (notes to be published).

We have been very inclusive in this process and invite anyone with questions or contributions into our discussion. However, this standard is written only to support the identified use cases which are listed herein.

Backwards Compatibility

We have adopted balanceOf, totalSupply, name and symbol semantics from the ERC-20 specification. An implementation may also include a function decimals that returns uint8(0) if its goal is to be more compatible with ERC-20 while supporting this standard. However, we find it contrived to require all ERC-721 implementations to support the decimals function.

Example NFT implementations as of February 2018:

CryptoKitties -- Compatible with an earlier version of this standard.
CryptoPunks -- Partially ERC-20 compatible, but not easily generalizable because it includes auction functionality directly in the contract and uses function names that explicitly refer to the assets as "punks".
Auctionhouse Asset Interface -- The author needed a generic interface for the Auctionhouse ÐApp (currently ice-boxed). His "Asset" contract is very simple, but is missing ERC-20 compatibility, approve() functionality, and metadata. This effort is referenced in the discussion for EIP-173.

Note: "Limited edition, collectible tokens" like Curio Cards and Rare Pepe are not distinguishable assets. They're actually a collection of individual fungible tokens, each of which is tracked by its own smart contract with its own total supply (which may be 1 in extreme cases).

The onERC721Received function specifically works around old deployed contracts which may inadvertently return 1 (true) in certain circumstances even if they don't implement a function (see Solidity DelegateCallReturnValue bug). By returning and checking for a magic value, we are able to distinguish actual affirmative responses versus these vacuous trues.

Test Cases

0xcert ERC-721 Token includes test cases written using Truffle.
Implementations

0xcert ERC721 -- a reference implementation
  MIT licensed, so you can freely use it for your projects
  Includes test cases
  Active bug bounty, you will be paid if you find errors
Su Squares -- an advertising platform where you can rent space and place images
  Complete the Su Squares Bug Bounty Program to seek problems with this standard or its implementation
  Implements the complete standard and all optional interfaces
ERC721ExampleDeed -- an example implementation
  Implements using the OpenZeppelin project format
XXXXERC721, by William Entriken -- a scalable example implementation
  Deployed on testnet with 1 billion assets and supporting all lookups with the metadata extension. This demonstrates that scaling is NOT a problem.

References

Standards
ERC-20 Token Standard.
ERC-165 Standard Interface Detection.
ERC-173 Owned Standard.
ERC-223 Token Standard.
ERC-677 transferAndCall Token Standard.
ERC-827 Token Standard.
Ethereum Name Service (ENS). https://ens.domains
Instagram -- What's the Image Resolution? https://help.instagram.com/1631821640426723
JSON Schema. https://json-schema.org/
Multiaddr. https://github.com/multiformats/multiaddr
RFC 2119 Key words for use in RFCs to Indicate Requirement Levels. https://www.ietf.org/rfc/rfc2119.txt

Issues
The Original ERC-721 Issue. https://github.com/ethereum/eips/issues/721
Solidity Issue #2330 -- Interface Functions are External. https://github.com/ethereum/solidity/issues/2330
Solidity Issue #3412 -- Implement Interface: Allow Stricter Mutability. https://github.com/ethereum/solidity/issues/3412
Solidity Issue #3419 -- Interfaces Can't Inherit. https://github.com/ethereum/solidity/issues/3419
Solidity Issue #3494 -- Compiler Incorrectly Reasons About the selector Function. https://github.com/ethereum/solidity/issues/3494
Solidity Issue #3544 -- Cannot Calculate Selector of Function Named transfer. https://github.com/ethereum/solidity/issues/3544
CryptoKitties Bounty Issue #4 -- Listing all Kitties Owned by a User is O(n^2). https://github.com/axiomzen/cryptokitties-bounty/issues/4
OpenZeppelin Issue #438 -- Implementation of approve method violates ERC20 standard. https://github.com/OpenZeppelin/zeppelin-solidity/issues/438
Solidity DelegateCallReturnValue Bug. https://solidity.readthedocs.io/en/develop/bugs.html#DelegateCallReturnValue

Discussions
Reddit (announcement of first live discussion). https://www.reddit.com/r/ethereum/comments/7r2ena/friday_119_live_discussion_on_erc_nonfungible/
Gitter #EIPs (announcement of first live discussion). https://gitter.im/ethereum/EIPs?at=5a5f823fb48e8c3566f0a5e7
ERC-721 (announcement of first live discussion). https://github.com/ethereum/eips/issues/721#issuecomment-358369377
ETHDenver 2018. https://ethdenver.com

NFT Implementations and Other Projects
CryptoKitties. https://www.cryptokitties.co
0xcert ERC-721 Token. https://github.com/0xcert/ethereum-erc721
Su Squares. https://tenthousandsu.com
Decentraland. https://decentraland.org
CryptoPunks. https://www.larvalabs.com/cryptopunks
DMarket. https://www.dmarket.io
Enjin Coin. https://enjincoin.io
Ubitquity. https://www.ubitquity.io
Propy. https://tokensale.propy.com
CryptoKitties Deployed Contract. https://etherscan.io/address/0x06012c8cf97bead5deae237070f9587f8e7a266d#code
Su Squares Bug Bounty Program. https://github.com/fulldecent/su-squares-bounty
XXXXERC721. https://github.com/fulldecent/erc721-example
ERC721ExampleDeed. https://github.com/nastassiasachs/ERC721ExampleDeed
Curio Cards. https://mycuriocards.com
Rare Pepe. https://rarepepewallet.com
Auctionhouse Asset Interface. https://github.com/dob/auctionhouse/blob/master/contracts/Asset.sol
OpenZeppelin SafeERC20.sol Implementation. https://github.com/OpenZeppelin/zeppelin-solidity/blob/master/contracts/token/ERC20/SafeERC20.sol

Copyright

Copyright and related rights waived via CC0.

github-com-997 ---- GitHub - hughrun/yawp: command line app for publishing social media posts
hughrun / yawp (AGPL-3.0 License)

yawp

A command line (CLI) app for publishing social media posts.

In brief

yawp takes some text as an argument and publishes it to the social media accounts of your choice. No need to read the comments, just send your yawp and move on with your day. Current options are Twitter and Mastodon; it's possible more will be added in future (or not).
yawp is specifically designed to fit within a broader toolchain: in general terms it tries to follow "the Unix philosophy":

can take input from stdin (e.g. redirected from a file or another process)
outputs the message as plaintext to stdout (i.e. the output is the input)
takes all configuration from environment (ENV) values to enable flexibility

Installation

MacOS or Linux

Download the relevant binary file from the latest release. Save it somewhere in your PATH, e.g. in /usr/local/bin/. Alternatively you can symlink it from wherever you want to save it, like this:

ln -s /my/awesome/directory/yawp /usr/local/bin/

From source

If you're using another platform or don't trust my binaries you can build your own from source:

git clone or download the repository as a zip.
cargo build --release

Usage:

yawp [FLAGS] [OPTIONS]

Flags:

-h, --help       Prints help information
-m, --mastodon   Send toot
-q, --quiet      Suppress output (error messages will still be sent to stderr)
-t, --twitter    Send tweet
-V, --version    Prints version information

Options:

-e, --env        path to env file

Args:

Message (post) to send. If using stdin you must provide a hyphen (-) as the argument. However if you do this and are not redirecting stdin from somewhere, yawp will hang your shell unless you supply EOF by pressing Ctrl + D. (See example 5 below).

Environment variables

yawp requires some environment variables in order to actually publish your message. You can set these in a number of ways depending on your operating system. yawp also allows you to call them in from a file. See example 6 for using a file or example 7 for setting environment values at the same time you call yawp. An example environment variables file is provided at example.env. The possible values are:

Mastodon

For Mastodon you need the base url of your instance (server), and an API access token.

MASTODON_ACCESS_TOKEN - You can create a token at settings - applications in your Mastodon account. You require write:statuses permission.
MASTODON_BASE_URL - This is the base URL of your server, e.g. https://mastodon.social

Twitter

For Twitter you need the four tokens provided when you create an app at https://developer.twitter.com/en/apps.

TWITTER_CONSUMER_KEY
TWITTER_CONSUMER_SECRET
TWITTER_ACCESS_TOKEN
TWITTER_ACCESS_SECRET

Examples

Provide message on command line:

yawp 'Hello, World!' -t
# Output: Hello, World!
# Tweets: Hello, World!

Pipe in message:

echo 'Hello again, World!' | yawp - -m
# Output: Hello again, World!
# Toots: Hello again, World!

Read from file:

# create a file
(echo Hello fronds; echo " It's me"; echo ...a tree 🌳) > message.txt
# run yawp and direct the file content into it
yawp - < message.txt > output.txt
# the message.txt and output.txt files are now identical.

Read from user input:

This is not really recommended, but you may find yourself facing a user input prompt if you use a hyphen without providing any redirected input, i.e. if you do this:

yawp -
# machine awaits further user input from the command line

Don't panic, you can provide the message text by typing it in at the command prompt. There is a catch, however, in that yawp will wait for further input until it reaches EOF (End of File). This will not happen when you press Enter but can usually be provided by pressing Ctrl + D:

yawp -t -
# machine awaits further user input from the command line
Awoo!
[Ctrl + D]
# Output: Awoo!
# Tweets: Awoo!

Provide environment variables from file:

In some situations (e.g. when using Docker Compose) you may have already set the environment variables needed by yawp.
If not, you can call them in from a file by providing the filepath using -e or --env:

yawp -m --env 'yawp.env' 'I love to toot!'

Provide environment variables on command line:

You could also set ENV settings manually when you call yawp:

MASTODON_BASE_URL=https://ausglam.space MASTODON_ACCESS_TOKEN=abcd1234 yawp -m '🎺 I am tooting!'

go-to-hellman-blogspot-com-1227 ---- Go To Hellman If you wanna end war and stuff, you gotta sing loud!

Monday, February 22, 2021

Open Access for Backlist Books, Part II: The All-Stars

Libraries know that a big fraction of their book collections never circulate, even once. The flip side of this fact is that a small fraction of a library's collection accounts for most of the circulation. This is often referred to as Zipf's law; as a physicist I prefer to think of it as another manifestation of log-normal statistics resulting from a preferential attachment mechanism for reading. (English translation: "word-of-mouth".)

In my post about the value of Open Access for books, I suggested that usage statistics (circulation, downloads, etc.) are a useful proxy for the value that books generate for their readers. The logical conclusion is that the largest amount of value that can be generated from opening the backlist comes from the books that are most used, the "all-stars" of the library, not the discount rack or the discards. If libraries are to provide funding for Open Access backlist books, shouldn't they focus their resources on the books that create the most value?

The question, of course, is how the library community would ever convince publishers, who have monopolies on these books as a consequence of international copyright laws, to convert these books to Open Access. Although some sort of statutory licensing or fair-use carve-outs could eventually do the trick, I believe that Open Access for a significant number of "backlist All-Stars" can be achieved today by pushing ALL the buttons available to supporters of Open Access. Here's where Open Access can learn from the game (and business) of baseball.

"Baseball", Henry Sandham, L. Prang & Co. (1861), from Digital Commonwealth

Baseball's best player, Mike Trout, should earn $33.25 million this year, a bit over $205,000 per regular season game. If he's chosen for the All-Star game, he won't get even a penny extra to play unless he's named MVP, in which case he earns a $50,000 bonus. So why would he bother to play for free? It turns out there are lots of reasons. The most important have everything to do with the recognition and honor of being named as an All-Star, and with having respect for his fans. But being an All-Star is not without financial benefits, considering endorsement contracts and earning potential outside of baseball. Playing in the All-Star game is an all-around no-brainer for Mike Trout.

Open Access should be an All-Star game for backlist books. We need to create community-based award programs that recognize and reward backlist conversions to OA.
If the world's libraries want to spend $50,000 on backlist physics books, for example, isn't it better to spend it on the Mike Trout of physics books than on a team full of discount-rack replacement-level players?

Competent publishers would line up in droves for major-league all-star backlist OA programs. They know that publicity will drive demand for their print versions (especially if NC licenses are used). They know that awards will boost their prestige, and if they're trying to build Open Access publication programs, prestige and quality are a publisher's most important selling points.

The Newbery Medal

Over a hundred backlist books have been converted to open access already this year. Can you name one of them? Probably not, because the publicity value of existing OA conversion programs is negligible. To relicense an All-Star book, you need an all-star publicity program.

You've heard of the Newbery Medal, right? You've seen the Newbery medal sticker on children's books, maybe even special sections for them in bookstores. That prize, awarded by the American Library Association every year to honor the most distinguished contributions to American literature for children, is a powerful driver of sales. The winners get feted in a gala banquet and party (at least they did in the before-times). That's the sort of publicity we need to create for open access books.

If you doubt that "All-Star Open Access" could work, don't discount the fact that it's also the right thing to do. Authors of All-Star backlist books want their books to be used, cherished and remembered. Libraries want books that measurably benefit the communities they serve. Foundations and governmental agencies want to make a difference. Even publishers who look only at their bottom lines can structure a rights conversion as a charitable donation to reduce their tax bills. And did I mention that there could be Gala Award Celebrations? We need more celebrations, don't you think?

If your community is interested in creating an Open-Access program for backlist books, don't hesitate to contact me at the Free Ebook Foundation!

Notes

I've written about the statistics of book usage here, here and here.

This is the third in a series of posts about creating value with Open Access books. The first two are: Creating Value with Open Access Books; Open Access for Backlist Books, Part I: The Slush Pile

Posted by Eric at 9:49 PM. Labels: Baseball, Book Use, Open Access, Ungluing Ebooks

Tuesday, February 16, 2021

Open Access for Backlist Books, Part I: The Slush Pile

"Kale emerging from a slush pile" (CC BY, Eric Hellman)

Book publishers hate their "slush pile": books submitted for publication unsolicited, rarely with literary merit and unlikely to make money for the publisher if accepted. In contrast, book publishers love their backlist; a strong backlist is what allows a book publisher to remain consistently profitable even when most of their newly published books fail to turn a profit. A publisher's backlist typically consists of a large number of "slushy" books that generate negligible income and a few steady "evergreen" earners. Publishers don't talk much about the backlist slush pile, maybe because it reminds them of their inability to predict a book's commercial success.

With the advent of digital books have come new possibilities for generating value from the backlist slush pile.
Digital books can be kept "in print" at essentially no cost (printed books need warehouse space), which has allowed publishers to avoid rights reversion in many cases. Some types of books can be bundled in ebook aggregations that can be offered on a subscription basis. This is reminiscent of the way investment bankers created valuable securities by packaging junk bonds with opaque derivatives.

Open access is a more broadly beneficial way to generate value from the backlist slush pile. There is a reason that libraries keep large numbers of books on their shelves even when they don't circulate for years. The myriad ways that books can create value don't have to be tied to book sales, as I wrote in my previous post.

Those of us who want to promote Open Access for backlist ebooks have a number of strategies at our disposal. The most basic strategy is to promote the visibility of these books. Libraries can add listings for these ebooks in their catalogs. Aggregators can make these books easier to find.

Switching backlist books to Open Access licenses can be expensive and difficult. While the cost of digitization has dropped dramatically over the past decade, quality control is still a significant conversion expense. Licensing-related expenses are sometimes large. Unlike journals and journal articles, academic books are typically covered by publishing agreements that give authors royalties on sales and licensing, and give authors control over derivative works such as translations. No publisher would consent to OA relicensing without the consent and support of the author. For older books, a publisher may not even have electronic rights (in the US, the Tasini decision established that electronic rights are separate from print rights), or may need to have a lawyer interpret the language of the original publishing contract. While most scholarly publishers obtain worldwide rights to the books they publish, rights for trade books are very often divided among markets. Open-access licenses such as the Creative Commons licenses are not limited to markets, so a license conversion would require the participation of every rights holder worldwide.

The CC BY license can be problematic for books containing illustrations or figures used by permission from third party rights holders. "All Rights Reserved" illustrations are often included in Open Access Books, but they are carved out of the license by separate rights statements, and to be safe, publishers use the CC BY-ND or CC BY-NC-ND license for the complete book, as the permissions do not cover derivative works. Since the CC BY license allows derivative works, it cannot be used in cases where translation rights have been sold (without also buying out the translation rights). A publisher cannot use a CC BY license for a translated work without also having rights to the original work.

The bottom line is that converting a backlist book to OA often requires economic motivations quite apart from any lost sales. Luckily, there's evidence that opening access can lead to increased sales. Nagaraj and Reimers found that digitization and exposure through Google Books increased sales of print editions by 35% for books in the Public Domain. In addition, a publisher's commercial position and prestige can be enhanced by the attribution requirement in Creative Commons licenses.
Additional motivation for OA conversion of the backlist slush pile has been supplied by programs such as the one used by Knowledge Unlatched, where libraries contribute to a fund used for "unlatching" backlist books. (Knowledge Unlatched has programs for front list books as well.) While such programs can in principle be applied to the "evergreen" backlist, the incentives currently in place result in the unlatching of books in the "slush pile" backlist. While value for society is being gained this way, the willingness of publishers to "unlatch" hundreds of these books poses the question of how much library funding for Open Access should be allocated to the discount bin, as opposed to the backlist books most used in libraries. That's the topic of my next post!

Notes

This is the second in a series of posts about creating value with Open Access books. The others are: Creating Value with Open Access Books; Open Access for Backlist Books, Part II: The All-Stars

Posted by Eric at 9:32 PM. Labels: Creative Commons, ebooks, Open Access, Ungluing Ebooks

Friday, February 12, 2021

Creating Value with Open Access Books

Can a book be more valuable if it's free? How valuable? To whom? How do we unlock this value? I've been wrestling with these questions for over ten years now. And for each of these questions, the answer is... it depends. A truism of the bookselling business is that "Every book is different" and the same is true of the book freeing "business".

Recently there's been increased interest in academic communities around Open Access book publishing and in academic book relicensing (adding an Open Access License to an already published book). Both endeavors have been struggling with the central question of how to value an open access book. The uncertainty in OA book valuation has led to many rookie mistakes among OA stakeholders. For example, when we first started Unglue.it, we assumed that reader interest would accelerate the relicensing process for older books whose sales had declined. But the opposite turned out to be true. Evidence of reader interest let rights holders know that these backlist titles were much more valuable than sales would indicate, thus precluding any notion of making them Open Access. Pro tip: if you want to pay a publisher to make a book free, don't publish your list of incredibly valuable books!

Instead of a strictly transactional approach, it's more useful to consider the myriad ways that academic books create value. Each of these value mechanisms offers buttons that we can push to promote open access, and points to new structures for markets where participants join together to create mutual value.

First, consider the book's reader. The value created is the reader's increased knowledge, understanding and sometimes, sheer enjoyment. The fact of open access does not itself create the value, but removes some of the barriers which might suppress this value. It's almost impossible to quantify the understanding and enjoyment from books; but "hours spent reading" might be a useful proxy for it.

Next consider a book's creator. While a small number of creators derive an income stream from their books, most academic authors benefit primarily from the development and dissemination of their ideas. In many fields of inquiry, publishing a book is the academic's path to tenure. Educators (and their students!) similarly benefit.
In principle, you might assess a textbook's value by measuring student performance.

The value of a book to a publisher can be more than just direct sales revenue. A widely distributed book can be a marketing tool for a publisher's entire business. In the world of Open Access, we can see new revenue models emerging - publication charges, events, sponsorships, even grants and memberships.

The value of a book to society as a whole can be enormous. In areas of research, a book might lead to technological advances, healthier living, or a more equitable society. Or a book might create outrage, civil strife, and misinformation. That's another issue entirely!

Books can be valuable to secondary distributors as well. Both used book resellers and libraries add value to physical books by increasing their usage. This is much harder to accomplish for paywalled ebooks! Since academic libraries are often considered potential funding sources for Open Access publishing, it's worth noting that the value of an open access ebook to a library is entirely indirect. When a library acts as an Open Access funding source, it's acting as a proxy for the community it serves.

This brings us to communities. The vast majority of books create value for specific communities, not societies as a whole. I believe that community-based funding is the most sustainable path for support of Open Access Books. Community-supported OA article publishing has already had plenty of support. Communities organized by discipline have been particularly successful: consider the success that ArXiv has had in promoting Open Access in physics, both at the preprint level and for journals in high-energy physics. A similar story can be told for biomedicine, PubMed and PubMed Central. A different sort of community success story has been SciELO, which has used Open Access to address challenges faced by scholars in Latin America.

So far, however, sustainable Open Access has proven to be challenging for scholarly ebooks. My next few posts will discuss the challenges and ways forward for support of ebook relicensing and for OA ebook creation: Open Access for Backlist Books, Part I: The Slush Pile; Open Access for Backlist Books, Part II: The All-Stars

Posted by Eric at 12:31 PM. Labels: Open Access, Ungluing Ebooks

Tuesday, December 29, 2020

Infra-infrastructure, inter-infrastructure and para-infrastructure

No one is against "Investing in Infrastructure". No one wants bridges to collapse, investing is always more popular than spending, and it's even alliterative! What's more, since infrastructure is almost invisible by definition, it's politically safe to support investing in infrastructure because no one will see when you don't follow through on your commitment!

Ponte Morandi collapse - Michele Ferraris, CC BY-SA 4.0 via Wikimedia Commons

Geoffrey Bilder gives a talk where he asks us to think of Crossref and similar services as "information infrastructure" akin to "plumbing", where the implication is that since we, as a society, are accustomed to paying plumbers and bridge builders lots of money, we should also pony up for "information infrastructure", which is obvious once you say it.

What qualifies as infrastructure, anyway? If I invest in a new laptop, is that infrastructure for the Go-to-Hellman blog? Blogspot is Google-owned blogging infrastructure for sure.
It's certainly not open infrastructure, but it works, and I haven't had to do much maintenance on it.

There's a lot of infrastructure used to make Unglue.it, which supports distribution of open-access ebooks. It uses Django, which is open-source software originally developed to support newspaper websites. Unglue.it also uses modules that extend Django that were made possible by Django's Open license. It works really well, but I've had to put a fair amount of work into updating my code to keep up with new versions of Django. Ironically, most of this work has been in fixing the extensions that have not updated along with Django.

I deploy Unglue.it on AWS, which is DEFINITELY infrastructure. I have a love/hate relationship with AWS because it works so well, but every time I need to change something, I have to spend 2 hours with documentation to find the one-line incantation that makes it work. But every few months, the cost of using AWS goes down, which I like, but the money goes to Amazon, which is ironic because they really don't care for the free ebooks we distribute.

Aside from AWS and Django, the infrastructure I use to deliver Ebook Foundation services includes Python, Docker, Travis-CI, GitHub, git, Ubuntu Linux, MySQL, Postgres, Ansible, Requests, Beautiful Soup, and many others. The Unglue.it database relies on infrastructure services from DOAB, OAPEN, LibraryThing, Project Gutenberg, OpenLibrary and Google Books. My development environment relies heavily on BBEdit and Jupyter. We depend on Crossref and Internet Archive to resolve some links; we use subject vocabulary from Library of Congress and BISAC.

You can imagine why I was interested in "JROST 2020", which turns out to stand for "Joint Roadmap for Open Science Tools 2020", a meeting organized by a relatively new non-profit, "Invest in Open Infrastructure" (IOI). The meeting was open and free, and despite the challenges associated with such a meeting in our difficult times, it managed to present a provocative program along with a compelling vision.

If you think a bit about how to address the infrastructure needs of open science and open scholarship in general, you come up with at least 3 questions:

How do you identify the "leaky pipes" that need fixing so as to avoid systemic collapse?
How do you bolster healthy infrastructure so that it won't need repair?
How do you build new infrastructure that will be valuable and thrive?

If it were up to me, my first steps would be to:

Get people with a stake in open infrastructure to talk to each other. Break them out of their silos and figure out how their solutions can help solve problems in other communities.
Create a "venture fund" for new needed infrastructure. Work on solving the problems that no one wants to tackle on their own.

Invest in Open Infrastructure is already doing this! Kaitlin Thaney, who's been Executive Director of IOI for less than a year, seems to be pressing all the right buttons. The JROST 2020 meeting was a great start on #1 and #2 is the initial direction of the "JROST Rapid Response Fund", whose first round of awards was announced at the meeting.

Among the first awardees of the JROST Rapid Response Fund announced at JROST2020 was an organization that ties into the infrastructure that I use, 2i2c. It's a great example of much-needed infrastructure for scientific computing, education, digital humanities and data science. 2i2c aims to create hosted interactive computing environments that run in the cloud and are powered by entirely open-source technology (Jupyter).
As I'm a Jupyter user and enthusiast, this makes me happy. But while 2i2c is the awardee, it's being built on top of Jupyter. Is Jupyter also infrastructure? It needs investment too, doesn't it? There's a lot of overlap between the Jupyter team and the 2i2c team, so investment in one could be investment in the other. In fact, Chris Holdgraf, Executive Director of 2i2c, told me that "we see 2i2c as a way to both increase the impact of Jupyter in the research/education community, and a way to more sustainably drive resources back into the Jupyter community."

Open Science Infrastructure Interdependency (from "Scoping the Open Science Infrastructure Landscape in Europe", https://doi.org/10.5281/zenodo.4153809)

Where does Jupyter fit in the infrastructure landscape? It's nowhere to be seen on the neat "interdependency map" presented by SPARC EU at JROST. If 2i2c is an example of investment-worthy infrastructure, maybe the best way to think of Jupyter is "infra-infrastructure" - the open information infrastructure needed to build open information infrastructure. "Trickle-down" investment in this sort of infrastructure may be the best way to support projects like Jupyter so they stay open and are widely used.

But wait... Jupyter is built on top of Python, right? Python needs people investing in it. Is Python infra-infra-infrastructure? And Python is built on top of C (I won't even mention Jython or PyJS), right?? Turtles all the way down. Will 2i2c eventually get buried under other layers of infrastructure, be forgotten and underinvested in, only to be one day excavated and studied by technology archeologists?

Looking carefully at the interdependency map, I don't see a lot of layers. I see a network with lots of loops. And many of the nodes are connectors themselves. Orcid and CrossRef resemble roads, bridges and plumbing not because they're hidden underneath, but because they're visible and in-between. They exist because the entities they connect cooperate to make the connection robust instead of incidental. They're not infra-infrastructure, they're inter-infrastructure. Trickle-down investment probably wouldn't work for inter-infrastructure. Instead, investments need to come from the communities that benefit so that the communities can decide how to manage access to the inter-infrastructure to maximize the community benefit.

There's another type of infrastructure that needs investment. I work in ebooks, and a lot of overlapping communities have tackled their own special ebook problems. But the textbook people don't talk to the public domain people don't talk to the monograph people don't talk to the library people. (A slight exaggeration.) There are lots of "almost" solutions that work well for specific tasks. But with the total amount of effort being expended, we could do some really amazing things... if only we were better at collaborating. For example, the Jupyter folks have gotten funding from Sloan for the "Executable Book Project". This is really cool. Similarly, there's Bookdown, which comes out of the R community. And there are other efforts to give ebooks the functionality that a website could have. Gitbook is a commercial open-source effort targeting a similar space; Rebus, a non-profit, is using Pressbooks to gain traction in the textbook space, while MIT Press's PubPub has similar goals. I'll call these overlapping efforts "para-infrastructure." Should investors in open infrastructure target investment in "rolling up" or merging these efforts?
When private equity investors have done this to library automation companies, the results have not benefited the user communities, so I'd say "NO!" but what's the alternative? I've observed that the folks who are doing the best job of just making stuff work rarely have the time or resources to go off to conferences or workshops. Typically, these folks have no incentive to do the work to make their tools work for slightly different problems. That can be time consuming! But it's still easier than taking someone else's work and modifying it to solve your own special problem. I think the best way to invest in open para-infrastructure is to get lots of these folks together and give them the time and incentive to talk and to share solutions (and maybe code). It's hard work, but making the web of open infrastructure stronger and more resilient is what investment in open infrastructure is all about.

Different types of open infrastructure benefit from different styles of investment; I'm hoping that IOI will build on the directions exhibited by its Rapid Response Fund and invest effectively in infra-infrastructure, inter-infrastructure, and para-infrastructure.

Notes

1. Geoff Bilder and Cameron Neylon have a nice discussion of many of the issues in this post: "Bilder G, Lin J, Neylon C (2016) Where are the pipes? Building Foundational Infrastructures for Future Services, retrieved [date], http://cameronneylon.net/blog/where-are-the-pipes-building-foundational-infrastructures-for-future-services/"

2. "Trickle-down" has a negative connotation in economics, but that's how you feed a tree, right?

Posted by Eric at 1:17 PM. Labels: Crossref, ebooks, Infrastructure, Open Source

Monday, October 19, 2020

We should regulate virality

It turns out that virality on internet platforms is a social hazard!

Living in the age of the Covid pandemic, we see around us what happens when we let things grow exponentially. The reason that the novel coronavirus has changed our lives is not that it's often lethal - it's that it found a way to jump from one infected person to several others on average, leading to exponential growth. We are infected with a virus without regard to the lethality of the virus, but only its reproduction rate.

For years, websites have been built to optimize virality of content. What we see on Facebook or Twitter is not shown to us for its relevance to our lives, its educational value, or even its entertainment value. It's shown to us because it maximizes our "engagement" - our tendency to interact and spread it. The more we interact with a website, the more money it makes, and so a generation of minds has been employed in the pursuit of more engagement. Sometimes it's cat videos that delight us, but more often these days it's content that enrages and divides us.

Our dissatisfaction with what the internet has become has led to calls to regulate the giants of the internet. A lot of the political discourse has focused on Section 230 (https://en.wikipedia.org/wiki/Section_230), a part of US law that gives interactive platforms such as Facebook a set of rules that result in legal immunity for content posted by users. As might be expected, many of the proposals for reform have sounded attractive, but the details are typically unworkable in the real world, and often would have effects opposite of what is intended.
I'd like to argue that the only workable approaches to regulating internet platforms should target their virality. Our society has no problem with regulations that force restaurants, food preparation facilities, and even barbershops to prevent the spread of disease, and no one ever complains that the regulations affect "good" bacteria too. These regulations are a component of our society's immune system, and they are necessary for its healthy functioning.

You might think that platform virality is too technical to be amenable to regulation, but it's not. That's because of the statistical characteristics of exponential growth. My study of free ebook usage has made me aware of the pervasiveness of exponential statistics on the internet. Sometimes labeled the 80-20 rule, the Pareto principle, or log-normal statistics, it's the natural result of processes that grow at a rate proportional to their size. As a result, it's possible to regulate the virality of platforms because only a very small amount of content is viral enough to dominate the platform. Regulate that tiny amount of super-viral content, and you create an incentive to moderate the virality of platforms. The beauty of doing this is that a huge majority of content is untouched by regulation.

How might this work? Imagine a law that removed a platform's immunity for content that it shows to a million people (or maybe 10 million - I'm not sure what the cutoff should be). This makes sense, too; if a platform promotes illegal content in such a way that a million people see it, the platform shouldn't get immunity just because "algorithms"! It also makes it practical for platforms to curate the content for harmlessness - it won't kill off the cat videos! The Facebooks and Twitters of the world will complain, but they'll be able to add antibodies and T-cells to their platforms, and the platforms will be healthier for it. Smaller sites will be free to innovate, without too much worry, but to get funding they'll need to have plans for virality limits.

So we really do have a choice: healthy platforms with diverse content, or cesspools of viral content. Doesn't seem like such a hard decision!

Techdirt has excellent coverage of Section 230.

Posted by Eric at 9:29 PM. Labels: social networks
go-to-hellman-blogspot-com-7799 ---- Go To Hellman - If you wanna end war and stuff, you gotta sing loud! Open Access for Backlist Books, Part II: The All-Stars Open Access for Backlist Books, Part I: The Slush Pile Creating Value with Open Access Books Infra-infrastructure, inter-infrastructure and para-infrastructure We should regulate virality Notes on work-from-home teams Your Identity, Your Library Four-Leaf Clovers Responding to Critical Reviews RA21: Technology is not the problem. RA21 doesn't address the yet-another-WAYF problem. Radical inclusiveness would. RA21's recommended technical approach is broken by emerging browser privacy features RA21 Draft RP session timeout recommendation considered harmful RA21 RP does not require secure protocols. It should. Fudge, and open access ebook download statistics On the Surveillance Techno-state Towards Impact-based OA Funding A Milestone for GITenberg eBook DRM and Blockchain play CryptoKitty and Mouse. And the Winner is... My Face is Personally Identifiable Information The Vast Potential for Blockchain in Libraries The Shocking Truth About RA21: It's Made of People! Choose Privacy Week: Your Library Organization Is Watching You Everything* You Always Wanted To Know About Voodoo (But Were Afraid To Ask) Holtzbrinck has attacked Project Gutenberg in a new front in the War of Copyright Maximization
groups-google-com-1246 ---- Research Software Alliance (ReSA) - Google Groups. You don't have permission to access this content. For access, try logging in. If you are subscribed to this group and have noticed abuse, report abusive group.
groups-google-com-2597 ---- Google Groups. To use Google Groups Discussions, please enable JavaScript in your browser settings, and then refresh this page.
guides-library-ucsc-edu-6046 ---- Home - Research Data Management - Library Guides at University of California, Santa Cruz. Research Data Management: Home, Create Your Plan (DMPTool), Preserve & Publish (Dryad), Find Data For ReUse, Best Practices, Tools, Video Tutorials, Researcher to Researcher. We can help you: Create a Data Management Plan - Easily create a data management plan for your next grant proposal using the DMPTool. Preserve & Publish Your Data - Publish your data in Dryad for preservation and discovery. Manage your paper or data set with a unique persistent identifier; request a DOI (Digital Object Identifier). Manage Your Data - Check out these best practices for file naming, file organization, file formats, archival data storage, metadata creation and data sharing options. Find Data for Reuse - Locate an appropriate data repository. Research Data Management Lifecycle. Our Goal: To assist UCSC faculty, staff and students with strategies and tools for organizing, managing and preserving research data throughout the research data life cycle. Request a Data Consultation: Scholarly Communication & eResearch Team, email: research@library.ucsc.edu. UCSC Campus Services: UCSC ITS provides a range of research support services, including data backup. Next: Create Your Plan (DMPTool) >>
hangingtogether-org-1218 ---- None hangingtogether-org-261 ---- None hangingtogether-org-3299 ---- None hangingtogether-org-3574 ---- None hangingtogether-org-4228 ---- None hangingtogether-org-4999 ---- None hangingtogether-org-5859 ---- None hangingtogether-org-7556 ---- None hangingtogether-org-8658 ---- None
hbr-org-5273 ---- Strategies for Learning from Failure. We are programmed at an early age to think that failure is bad. That belief prevents organizations from effectively learning from their missteps. by Amy C. Edmondson. From the Magazine (April 2011). Summary. Reprint: R1104B Many executives believe that all failure is bad (although it usually provides lessons) and that learning from it is pretty straightforward. The author, a professor at Harvard Business School, thinks both beliefs are misguided. In organizational life, she says, some failures are inevitable and some are even good. And successful learning from failure is not simple: It requires context-specific strategies. But first leaders must understand how the blame game gets in the way and work to create an organizational culture in which employees feel safe admitting or reporting on failure. Failures fall into three categories: preventable ones in predictable operations, which usually involve deviations from spec; unavoidable ones in complex systems, which may arise from unique combinations of needs, people, and problems; and intelligent ones at the frontier, where “good” failures occur quickly and on a small scale, providing the most valuable information. Strong leadership can build a learning culture—one in which failures large and small are consistently reported and deeply analyzed, and opportunities to experiment are proactively sought. Executives commonly and understandably worry that taking a sympathetic stance toward failure will create an “anything goes” work environment. They should instead recognize that failure is inevitable in today’s complex work organizations. The wisdom of learning from failure is incontrovertible. Yet organizations that do it well are extraordinarily rare. This gap is not due to a lack of commitment to learning.
Managers in the vast majority of enterprises that I have studied over the past 20 years—pharmaceutical, financial services, product design, telecommunications, and construction companies; hospitals; and NASA’s space shuttle program, among others—genuinely wanted to help their organizations learn from failures to improve future performance. In some cases they and their teams had devoted many hours to after-action reviews, postmortems, and the like. But time after time I saw that these painstaking efforts led to no real change. The reason: Those managers were thinking about failure the wrong way. Most executives I’ve talked to believe that failure is bad (of course!). They also believe that learning from it is pretty straightforward: Ask people to reflect on what they did wrong and exhort them to avoid similar mistakes in the future—or, better yet, assign a team to review and write a report on what happened and then distribute it throughout the organization. These widely held beliefs are misguided. First, failure is not always bad. In organizational life it is sometimes bad, sometimes inevitable, and sometimes even good. Second, learning from organizational failures is anything but straightforward. The attitudes and activities required to effectively detect and analyze failures are in short supply in most companies, and the need for context-specific learning strategies is underappreciated. Organizations need new and better ways to go beyond lessons that are superficial (“Procedures weren’t followed”) or self-serving (“The market just wasn’t ready for our great new product”). That means jettisoning old cultural beliefs and stereotypical notions of success and embracing failure’s lessons. Leaders can begin by understanding how the blame game gets in the way. The Blame Game Failure and fault are virtually inseparable in most households, organizations, and cultures. Every child learns at some point that admitting failure means taking the blame. That is why so few organizations have shifted to a culture of psychological safety in which the rewards of learning from failure can be fully realized. Executives I’ve interviewed in organizations as different as hospitals and investment banks admit to being torn: How can they respond constructively to failures without giving rise to an anything-goes attitude? If people aren’t blamed for failures, what will ensure that they try as hard as possible to do their best work? This concern is based on a false dichotomy. In actuality, a culture that makes it safe to admit and report on failure can—and in some organizational contexts must—coexist with high standards for performance. To understand why, look at the exhibit “A Spectrum of Reasons for Failure,” which lists causes ranging from deliberate deviation to thoughtful experimentation. Which of these causes involve blameworthy actions? Deliberate deviance, first on the list, obviously warrants blame. But inattention might not. If it results from a lack of effort, perhaps it’s blameworthy. But if it results from fatigue near the end of an overly long shift, the manager who assigned the shift is more at fault than the employee. As we go down the list, it gets more and more difficult to find blameworthy acts. In fact, a failure resulting from thoughtful experimentation that generates valuable information may actually be praiseworthy. 
When I ask executives to consider this spectrum and then to estimate how many of the failures in their organizations are truly blameworthy, their answers are usually in single digits—perhaps 2% to 5%. But when I ask how many are treated as blameworthy, they say (after a pause or a laugh) 70% to 90%. The unfortunate consequence is that many failures go unreported and their lessons are lost. Not All Failures Are Created Equal A sophisticated understanding of failure’s causes and contexts will help to avoid the blame game and institute an effective strategy for learning from failure. Although an infinite number of things can go wrong in organizations, mistakes fall into three broad categories: preventable, complexity-related, and intelligent. Preventable failures in predictable operations. Most failures in this category can indeed be considered “bad.” They usually involve deviations from spec in the closely defined processes of high-volume or routine operations in manufacturing and services. With proper training and support, employees can follow those processes consistently. When they don’t, deviance, inattention, or lack of ability is usually the reason. But in such cases, the causes can be readily identified and solutions developed. Checklists (as in the Harvard surgeon Atul Gawande’s recent best seller The Checklist Manifesto) are one solution. Another is the vaunted Toyota Production System, which builds continual learning from tiny failures (small process deviations) into its approach to improvement. As most students of operations know well, a team member on a Toyota assembly line who spots a problem or even a potential problem is encouraged to pull a rope called the andon cord, which immediately initiates a diagnostic and problem-solving process. Production continues unimpeded if the problem can be remedied in less than a minute. Otherwise, production is halted—despite the loss of revenue entailed—until the failure is understood and resolved. Unavoidable failures in complex systems. A large number of organizational failures are due to the inherent uncertainty of work: A particular combination of needs, people, and problems may have never occurred before. Triaging patients in a hospital emergency room, responding to enemy actions on the battlefield, and running a fast-growing start-up all occur in unpredictable situations. And in complex organizations like aircraft carriers and nuclear power plants, system failure is a perpetual risk. Although serious failures can be averted by following best practices for safety and risk management, including a thorough analysis of any such events that do occur, small process failures are inevitable. To consider them bad is not just a misunderstanding of how complex systems work; it is counterproductive. Avoiding consequential failures means rapidly identifying and correcting small failures. Most accidents in hospitals result from a series of small failures that went unnoticed and unfortunately lined up in just the wrong way. Intelligent failures at the frontier. Failures in this category can rightly be considered “good,” because they provide valuable new knowledge that can help an organization leap ahead of the competition and ensure its future growth—which is why the Duke University professor of management Sim Sitkin calls them intelligent failures. They occur when experimentation is necessary: when answers are not knowable in advance because this exact situation hasn’t been encountered before and perhaps never will be again. 
Discovering new drugs, creating a radically new business, designing an innovative product, and testing customer reactions in a brand-new market are tasks that require intelligent failures. “Trial and error” is a common term for the kind of experimentation needed in these settings, but it is a misnomer, because “error” implies that there was a “right” outcome in the first place. At the frontier, the right kind of experimentation produces good failures quickly. Managers who practice it can avoid the unintelligent failure of conducting experiments at a larger scale than necessary. Leaders of the product design firm IDEO understood this when they launched a new innovation-strategy service. Rather than help clients design new products within their existing lines—a process IDEO had all but perfected—the service would help them create new lines that would take them in novel strategic directions. Knowing that it hadn’t yet figured out how to deliver the service effectively, the company started a small project with a mattress company and didn’t publicly announce the launch of a new business. Although the project failed—the client did not change its product strategy—IDEO learned from it and figured out what had to be done differently. For instance, it hired team members with MBAs who could better help clients create new businesses and made some of the clients’ managers part of the team. Today strategic innovation services account for more than a third of IDEO’s revenues. Tolerating unavoidable process failures in complex systems and intelligent failures at the frontiers of knowledge won’t promote mediocrity. Indeed, tolerance is essential for any organization that wishes to extract the knowledge such failures provide. But failure is still inherently emotionally charged; getting an organization to accept it takes leadership. Building a Learning Culture Only leaders can create and reinforce a culture that counteracts the blame game and makes people feel both comfortable with and responsible for surfacing and learning from failures. (See the sidebar “How Leaders Can Build a Psychologically Safe Environment.”) They should insist that their organizations develop a clear understanding of what happened—not of “who did it”—when things go wrong. This requires consistently reporting failures, small and large; systematically analyzing them; and proactively searching for opportunities to experiment. How Leaders Can Build a Psychologically Safe Environment If an organization’s employees are to help spot existing and pending failures and to learn from them, their leaders must make it safe to speak up. Julie Morath, the chief operating officer of Children’s Hospital and Clinics of Minnesota from 1999 to 2009, did just that when she led a highly successful effort to reduce medical errors. Here are five practices I’ve identified in my research, with examples of how Morath employed them to build a psychologically safe environment. Frame the Work Accurately People need a shared understanding of the kinds of failures that can be expected to occur in a given work context (routine production, complex operations, or innovation) and why openness and collaboration are important for surfacing and learning from them. Accurate framing detoxifies failure. In a complex operation like a hospital, many consequential failures are the result of a series of small events. To heighten awareness of this system complexity, Morath presented data on U.S. 
medical error rates, organized discussion groups, and built a team of key influencers from throughout the organization to help spread knowledge and understanding of the challenge. Embrace Messengers Those who come forward with bad news, questions, concerns, or mistakes should be rewarded rather than shot. Celebrate the value of the news first and then figure out how to fix the failure and learn from it. Morath implemented “blameless reporting”—an approach that encouraged employees to reveal medical errors and near misses anonymously. Her team created a new patient safety report, which expanded on the previous version by asking employees to describe incidents in their own words and to comment on the possible causes. Soon after the new system was implemented, the rate of reported failures shot up. Morath encouraged her people to view the data as good news, because the hospital could learn from failures—and made sure that teams were assigned to analyze every incident. Acknowledge Limits Being open about what you don’t know, mistakes you’ve made, and what you can’t get done alone will encourage others to do the same. As soon as she joined the hospital, Morath explained her passion for patient safety and acknowledged that as a newcomer, she had only limited knowledge of how things worked at Children’s. In group presentations and one-on-one discussions, she made clear that she would need everyone’s help to reduce errors. Invite Participation Ask for observations and ideas and create opportunities for people to detect and analyze failures and promote intelligent experiments. Inviting participation helps defuse resistance and defensiveness. Morath set up cross-disciplinary teams to analyze failures and personally asked thoughtful questions of employees at all levels. Early on, she invited people to reflect on their recent experiences in caring for patients: Was everything as safe as they would have wanted it to be? This helped them recognize that the hospital had room for improvement. Suddenly, people were lining up to help. Set Boundaries and Hold People Accountable Paradoxically, people feel psychologically safer when leaders are clear about what acts are blameworthy. And there must be consequences. But if someone is punished or fired, tell those directly and indirectly affected what happened and why it warranted blame. When she instituted blameless reporting, Morath explained to employees that although reporting would not be punished, specific behaviors (such as reckless conduct, conscious violation of standards, failing to ask for help when over one’s head) would. If someone makes the same mistake three times and is then laid off, coworkers usually express relief, along with sadness and concern—they understand that patients were at risk and that extra vigilance was required from others to counterbalance the person’s shortcomings. Leaders should also send the right message about the nature of the work, such as reminding people in R&D, “We’re in the discovery business, and the faster we fail, the faster we’ll succeed.” I have found that managers often don’t understand or appreciate this subtle but crucial point. They also may approach failure in a way that is inappropriate for the context. For example, statistical process control, which uses data analysis to assess unwarranted variances, is not good for catching and correcting random invisible glitches such as software bugs. Nor does it help in the development of creative new products. 
Conversely, though great scientists intuitively adhere to IDEO’s slogan, “Fail often in order to succeed sooner,” it would hardly promote success in a manufacturing plant. The slogan “Fail often in order to succeed sooner” would hardly promote success in a manufacturing plant. Often one context or one kind of work dominates the culture of an enterprise and shapes how it treats failure. For instance, automotive companies, with their predictable, high-volume operations, understandably tend to view failure as something that can and should be prevented. But most organizations engage in all three kinds of work discussed above—routine, complex, and frontier. Leaders must ensure that the right approach to learning from failure is applied in each. All organizations learn from failure through three essential activities: detection, analysis, and experimentation. Detecting Failure Spotting big, painful, expensive failures is easy. But in many organizations any failure that can be hidden is hidden as long as it’s unlikely to cause immediate or obvious harm. The goal should be to surface it early, before it has mushroomed into disaster. Shortly after arriving from Boeing to take the reins at Ford, in September 2006, Alan Mulally instituted a new system for detecting failures. He asked managers to color code their reports green for good, yellow for caution, or red for problems—a common management technique. According to a 2009 story in Fortune, at his first few meetings all the managers coded their operations green, to Mulally’s frustration. Reminding them that the company had lost several billion dollars the previous year, he asked straight out, “Isn’t anything not going well?” After one tentative yellow report was made about a serious product defect that would probably delay a launch, Mulally responded to the deathly silence that ensued with applause. After that, the weekly staff meetings were full of color. That story illustrates a pervasive and fundamental problem: Although many methods of surfacing current and pending failures exist, they are grossly underutilized. Total Quality Management and soliciting feedback from customers are well-known techniques for bringing to light failures in routine operations. High-reliability-organization (HRO) practices help prevent catastrophic failures in complex systems like nuclear power plants through early detection. Electricité de France, which operates 58 nuclear power plants, has been an exemplar in this area: It goes beyond regulatory requirements and religiously tracks each plant for anything even slightly out of the ordinary, immediately investigates whatever turns up, and informs all its other plants of any anomalies. Such methods are not more widely employed because all too many messengers—even the most senior executives—remain reluctant to convey bad news to bosses and colleagues. One senior executive I know in a large consumer products company had grave reservations about a takeover that was already in the works when he joined the management team. But, overly conscious of his newcomer status, he was silent during discussions in which all the other executives seemed enthusiastic about the plan. Many months later, when the takeover had clearly failed, the team gathered to review what had happened. Aided by a consultant, each executive considered what he or she might have done to contribute to the failure. 
The newcomer, openly apologetic about his past silence, explained that others’ enthusiasm had made him unwilling to be “the skunk at the picnic.” In researching errors and other failures in hospitals, I discovered substantial differences across patient-care units in nurses’ willingness to speak up about them. It turned out that the behavior of midlevel managers—how they responded to failures and whether they encouraged open discussion of them, welcomed questions, and displayed humility and curiosity—was the cause. I have seen the same pattern in a wide range of organizations. A horrific case in point, which I studied for more than two years, is the 2003 explosion of the Columbia space shuttle, which killed seven astronauts (see “Facing Ambiguous Threats,” by Michael A. Roberto, Richard M.J. Bohmer, and Amy C. Edmondson, HBR November 2006). NASA managers spent some two weeks downplaying the seriousness of a piece of foam’s having broken off the left side of the shuttle at launch. They rejected engineers’ requests to resolve the ambiguity (which could have been done by having a satellite photograph the shuttle or asking the astronauts to conduct a space walk to inspect the area in question), and the major failure went largely undetected until its fatal consequences 16 days later. Ironically, a shared but unsubstantiated belief among program managers that there was little they could do contributed to their inability to detect the failure. Postevent analyses suggested that they might indeed have taken fruitful action. But clearly leaders hadn’t established the necessary culture, systems, and procedures. One challenge is teaching people in an organization when to declare defeat in an experimental course of action. The human tendency to hope for the best and try to avoid failure at all costs gets in the way, and organizational hierarchies exacerbate it. As a result, failing R&D projects are often kept going much longer than is scientifically rational or economically prudent. We throw good money after bad, praying that we’ll pull a rabbit out of a hat. Intuition may tell engineers or scientists that a project has fatal flaws, but the formal decision to call it a failure may be delayed for months. Again, the remedy—which does not necessarily involve much time and expense—is to reduce the stigma of failure. Eli Lilly has done this since the early 1990s by holding “failure parties” to honor intelligent, high-quality scientific experiments that fail to achieve the desired results. The parties don’t cost much, and redeploying valuable resources—particularly scientists—to new projects earlier rather than later can save hundreds of thousands of dollars, not to mention kickstart potential new discoveries. Analyzing Failure Once a failure has been detected, it’s essential to go beyond the obvious and superficial reasons for it to understand the root causes. This requires the discipline—better yet, the enthusiasm—to use sophisticated analysis to ensure that the right lessons are learned and the right remedies are employed. The job of leaders is to see that their organizations don’t just move on after a failure but stop to dig in and discover the wisdom contained in it. Why is failure analysis often shortchanged? Because examining our failures in depth is emotionally unpleasant and can chip away at our self-esteem. Left to our own devices, most of us will speed through or avoid failure analysis altogether. 
Another reason is that analyzing organizational failures requires inquiry and openness, patience, and a tolerance for causal ambiguity. Yet managers typically admire and are rewarded for decisiveness, efficiency, and action—not thoughtful reflection. That is why the right culture is so important. The challenge is more than emotional; it’s cognitive, too. Even without meaning to, we all favor evidence that supports our existing beliefs rather than alternative explanations. We also tend to downplay our responsibility and place undue blame on external or situational factors when we fail, only to do the reverse when assessing the failures of others—a psychological trap known as fundamental attribution error. My research has shown that failure analysis is often limited and ineffective—even in complex organizations like hospitals, where human lives are at stake. Few hospitals systematically analyze medical errors or process flaws in order to capture failure’s lessons. Recent research in North Carolina hospitals, published in November 2010 in the New England Journal of Medicine, found that despite a dozen years of heightened awareness that medical errors result in thousands of deaths each year, hospitals have not become safer. Fortunately, there are shining exceptions to this pattern, which continue to provide hope that organizational learning is possible. At Intermountain Healthcare, a system of 23 hospitals that serves Utah and southeastern Idaho, physicians’ deviations from medical protocols are routinely analyzed for opportunities to improve the protocols. Allowing deviations and sharing the data on whether they actually produce a better outcome encourages physicians to buy into this program. (See “Fixing Health Care on the Front Lines,” by Richard M.J. Bohmer, HBR April 2010.) Motivating people to go beyond first-order reasons (procedures weren’t followed) to understanding the second- and third-order reasons can be a major challenge. One way to do this is to use interdisciplinary teams with diverse skills and perspectives. Complex failures in particular are the result of multiple events that occurred in different departments or disciplines or at different levels of the organization. Understanding what happened and how to prevent it from happening again requires detailed, team-based discussion and analysis. A team of leading physicists, engineers, aviation experts, naval leaders, and even astronauts devoted months to an analysis of the Columbia disaster. They conclusively established not only the first-order cause—a piece of foam had hit the shuttle’s leading edge during launch—but also second-order causes: A rigid hierarchy and schedule-obsessed culture at NASA made it especially difficult for engineers to speak up about anything but the most rock-solid concerns. Promoting Experimentation The third critical activity for effective learning is strategically producing failures—in the right places, at the right times—through systematic experimentation. Researchers in basic science know that although the experiments they conduct will occasionally result in a spectacular success, a large percentage of them (70% or higher in some fields) will fail. How do these people get out of bed in the morning? First, they know that failure is not optional in their work; it’s part of being at the leading edge of scientific discovery. Second, far more than most of us, they understand that every failure conveys valuable information, and they’re eager to get it before the competition does. 
In contrast, managers in charge of piloting a new product or service—a classic example of experimentation in business—typically do whatever they can to make sure that the pilot is perfect right out of the starting gate. Ironically, this hunger to succeed can later inhibit the success of the official launch. Too often, managers in charge of pilots design optimal conditions rather than representative ones. Thus the pilot doesn’t produce knowledge about what won’t work. Too often, pilots are conducted under optimal conditions rather than representative ones. Thus they can’t show what won’t work. In the very early days of DSL, a major telecommunications company I’ll call Telco did a full-scale launch of that high-speed technology to consumer households in a major urban market. It was an unmitigated customer-service disaster. The company missed 75% of its commitments and found itself confronted with a staggering 12,000 late orders. Customers were frustrated and upset, and service reps couldn’t even begin to answer all their calls. Employee morale suffered. How could this happen to a leading company with high satisfaction ratings and a brand that had long stood for excellence? A small and extremely successful suburban pilot had lulled Telco executives into a misguided confidence. The problem was that the pilot did not resemble real service conditions: It was staffed with unusually personable, expert service reps and took place in a community of educated, tech-savvy customers. But DSL was a brand-new technology and, unlike traditional telephony, had to interface with customers’ highly variable home computers and technical skills. This added complexity and unpredictability to the service-delivery challenge in ways that Telco had not fully appreciated before the launch. A more useful pilot at Telco would have tested the technology with limited support, unsophisticated customers, and old computers. It would have been designed to discover everything that could go wrong—instead of proving that under the best of conditions everything would go right. (See the sidebar “Designing Successful Failures.”) Of course, the managers in charge would have to have understood that they were going to be rewarded not for success but, rather, for producing intelligent failures as quickly as possible. Designing Successful Failures Perhaps unsurprisingly, pilot projects are usually designed to succeed rather than to produce intelligent failures—those that generate valuable information. To know if you’ve designed a genuinely useful pilot, consider whether your managers can answer yes to the following questions: Is the pilot being tested under typical circumstances (rather than optimal conditions)? Do the employees, customers, and resources represent the firm’s real operating environment? Is the goal of the pilot to learn as much as possible (rather than to demonstrate the value of the proposed offering)? Is the goal of learning well understood by all employees and managers? Is it clear that compensation and performance reviews are not based on a successful outcome for the pilot? Were explicit changes made as a result of the pilot test? In short, exceptional organizations are those that go beyond detecting and analyzing failures and try to generate intelligent ones for the express purpose of learning and innovating. It’s not that managers in these organizations enjoy failure. But they recognize it as a necessary by-product of experimentation. 
They also realize that they don’t have to do dramatic experiments with large budgets. Often a small pilot, a dry run of a new technique, or a simulation will suffice. The courage to confront our own and others’ imperfections is crucial to solving the apparent contradiction of wanting neither to discourage the reporting of problems nor to create an environment in which anything goes. This means that managers must ask employees to be brave and speak up—and must not respond by expressing anger or strong disapproval of what may at first appear to be incompetence. More often than we realize, complex systems are at work behind organizational failures, and their lessons and improvement opportunities are lost when conversation is stifled. Savvy managers understand the risks of unbridled toughness. They know that their ability to find out about and help resolve problems depends on their ability to learn about them. But most managers I’ve encountered in my research, teaching, and consulting work are far more sensitive to a different risk—that an understanding response to failures will simply create a lax work environment in which mistakes multiply. This common worry should be replaced by a new paradigm—one that recognizes the inevitability of failure in today’s complex work organizations. Those that catch, correct, and learn from failure before others do will succeed. Those that wallow in the blame game will not. A version of this article appeared in the April 2011 issue of Harvard Business Review. Amy C. Edmondson is the Novartis Professor of Leadership and Management at Harvard Business School. She is the author of The Fearless Organization: Creating Psychological Safety in the Workplace for Learning, Innovation, and Growth (Wiley, 2019).
hecticpace-com-3498 ---- Hectic Pace - A view on libraries, the library business, and the business of libraries. My Pre-Covid Things. Author's note: these parodies are always about libraries and always based on Christmas songs, stories, or poems.
2020 being what it is, this year is an exception to both…that's right, I'm siding with my family and admitting that My Favorite Things is not a Christmas song. (sung to the tune of "My Favorite Things") [Click the YouTube link to listen while you sing along.] Eating in restaurants and movies on big screens People who don't doubt the virtue of vaccines. Inspiring leaders who don't act like kings. These were a few of my pre-Covid things. Live music venues and in-person classes. No masks or ... Sitting in the Reading Room All Day (sung to the tune of "Walking in a Winter Wonderland") [Click the YouTube link to listen while you sing along.] People shhhhhh, are you listening? In the stacks, laptops glistening The reading light's bright The library's right For sitting in the reading room all day. Gone away are the book stacks Here to stay, the only town's fax. We share all our books Without judgy looks. Sitting in the reading room all day. In the lobby we could build a book tree. Readers Guide is green and they stack well. I'll say 'Do we have 'em?' You'll say, 'Yeah man.' ... It's the Best Library Time of the Year (sung to the tune of "It's the Most Wonderful Time of the Year") Press play to sing along with the instrumental track! It's the best library time of the year With no more children yelling And no one is telling you "get it in gear!" It's the best library time of the year It's the qui-quietest season at school Only smile-filled greetings and no more dull meetings Where bosses are cruel It's the qui-quietest season at school There'll be books for re-stocking Vendor end-of-year-hawking And overdue fine cash for beer Send the word out to pre-schools Drag queen visit ... Maybe It's Books We Need [I figured this was a song in desperate need of some new lyrics. Sung to the tune of Baby It's Cold Outside. You're gonna want to grab a singing partner and use the instrumental track for this one!] (Listen to the track while you sing!) I really must binge (But maybe it's books we need) You mustn't infringe (It's definitely books we need) This season has been (Reading will make you grin) So fun to watch (I'll hold the remote, you hold my scotch) My Netflix queue scrolls forever (Mystery, poems, whichever) And Stranger Things won't just watch itself (Grab ... Being a Better Ally: First, Believe Warning: I might make you uncomfortable. I'm uncomfortable. But it comes from an earnest place. I was recently lucky enough to participate with my OCLC Membership & Research Division colleagues in DeEtta Jones & Associates' Cultural Competency Training. This day-long session has a firm spot in the top 5 of my professional development experiences. (Not coincidentally, one of the others in that top 5 was DeEtta's management training I took part in when she was with the Association of Research Libraries). A week later, I'm still processing this incredible experience. And I'm very grateful to OCLC for sponsoring the workshop! ... Fake News Forever! Librarians were among the first to join the call to arms and combat the onslaught of fake news that has permeated our political discussions for the last several months. Frankly, it seems hard for anyone to be on the other side of this issue. But is it? Not long after the effort to stop fake news in its tracks, a group of librarians began to consider the long-term implications of eradicating an entire body of content from history. Thus began a concerted effort to preserve all the fake news that a vigilant group of librarians could gather up. Building on ...
How will you be remembered? My grandfather had a sizable library when he passed away, and his son (my father) would wind up with roughly half of it. I remember shelves and shelves of books of quotations. He was a criminal lawyer with a love of quotes. I either inherited this love or caught it through the osmosis of being surrounded by these books throughout my childhood. Most of the books were ruined over the years by mold and silverfish and a dose of neglect. But I managed to save a few handfuls of eclectic titles. Their smell still transports me to the basement of ... Seeking Certainty "Uncertain times" is a phrase you hear a lot these days. It was actually in the title of the ALA Town Hall that took place in Atlanta last month (ALA Town Hall: Library Advocacy and Core Values in Uncertain Times). Political turmoil, uncertainty, divisiveness, and vitriol have so many of us feeling a bit unhinged. When I feel rudderless, adrift, even completely lost at sea, I tend to seek a safer port. I've exercised this method personally, geographically, and professionally and it has always served me well. For example, the stability and solid foundation provided by my family gives me solace ... No Not Google Search Box, Just You (to the tune of "All I want for Christmas is You") (if you need a karaoke track, try this one) I don't need a lot for freedom, Peace, or love, democracy, and I Don't care about the Congress or their failed bureaucracy I just want a li-brar-y Filled with places just for me A librarian or two No not Google search box, just you I don't want a lot of features Search results are too grotesque I don't care about the systems Back behind your reference desk I don't need to download e-books On the de-vice of my choice Noisy ... We are ALA I've been thinking a lot about governance lately. That said, I will avoid the topic of the recent U.S. election as much as possible, even though it is a factor in what makes me think about governance. Instead, I will focus on library governance and what makes it work and not work. Spoiler alert: active participation. I am an admitted governance junky, an unapologetic lover of Robert's Rules of Order, and someone who tries to find beauty in bureaucratic process. I blame my heritage. I come from a long line of federal government employees, all of us born in the ...
hecticpace-com-5840 ---- Hectic Pace – A view on libraries, the library business, and the business of libraries. My Pre-Covid Things - Posted On Dec 21 2020 by Andrew K. Pace. Author's note: these parodies are always about libraries and always based on Christmas songs, stories, or poems. 2020 being what it is, this year is an exception to both…that's right, I'm siding with my family and admitting that My Favorite Things is not a Christmas song. (sung to the tune of "My Favorite Things") [Click the YouTube link to listen while you sing along.] Eating in restaurants and movies on big screens People who don't doubt the virtue of vaccines. Inspiring leaders who don't act like kings. These were a few of my pre-Covid things. Live music venues and in-person classes. No masks or … Category: Christmas Parody. Sitting in the Reading Room All Day - Posted On Dec 17 2019 by Andrew K. Pace (sung to the tune of "Walking in a Winter Wonderland") [Click the YouTube link to listen while you sing along.] People shhhhhh, are you listening? In the stacks, laptops glistening The reading light's bright The library's right For sitting in the reading room all day.
Gone away are the book stacks Here to stay, the only town's fax. We share all our books Without judgy looks. Sitting in the reading room all day. In the lobby we could build a book tree. Readers Guide is green and they stack well. I'll say 'Do we have 'em?' You'll say, 'Yeah man.' … Category: Christmas Parody. It's the Best Library Time of the Year - Posted On Dec 20 2018 by Andrew K. Pace (sung to the tune of "It's the Most Wonderful Time of the Year") Press play to sing along with the instrumental track! It's the best library time of the year With no more children yelling And no one is telling you "get it in gear!" It's the best library time of the year It's the qui-quietest season at school Only smile-filled greetings and no more dull meetings Where bosses are cruel It's the qui-quietest season at school There'll be books for re-stocking Vendor end-of-year-hawking And overdue fine cash for beer Send the word out to pre-schools Drag queen visit … Category: Christmas Parody.
Library Dude Thoughts from Carl Grant Search WorldCat Enter title, subject, person, or keyword Hectic Pace RSS feeds Entries RSS Comments RSS © 2006–2021. All Rights Reserved, Hectic Pace help-twitter-com-1683 ---- How to Tweet – what is a Tweet, keyboard shortcuts, and sources Open menu Help Center Help topics Using Twitter Managing your account Safety and security Rules and policies Guides New user FAQ Glossary A safer Twitter Our rules My privacy Getting Started Guide Contact us Provide Feedback Search Go to Twitter Sign out Sign in Search this site Search goglobalwithtwitterbanner Tweets Search Using Twitter Tweets Adding content to your Tweet Search and trends Following and unfollowing Blocking and muting Direct Messages Twitter on your device Website and app integrations Using Periscope Twitter Voices Fleets Managing your account Login and password Username, email, and phone Account settings Notifications Verified accounts Suspended accounts Deactivate and reactivate accounts Safety and security Security and hacked accounts Privacy Spam and fake accounts Sensitive content Abuse Rules and policies Twitter Rules and policies General guidelines and policies Law enforcement guidelines Research and experiments Help Center Tweets How to Tweet How to Tweet A Tweet may contain photos, GIFs, videos, links, and text. Looking for information on how to Tweet at someone? Check out our article about how to post replies and mentions on Twitter. View instructions for: How to Tweet Tap the Tweet compose icon  Compose your message (up to 280 characters) and tap Tweet. How to Tweet Tap on the Tweet compose icon  Enter your message (up to 280 characters), and then tap Tweet. A notification will appear in the status bar on your device and will go away once the Tweet successfully sends. How to Tweet Type your Tweet (up to 280 characters) into the compose box at the top of your Home timeline, or click the Tweet button in the navigation bar. You can include up to 4 photos, a GIF, or a video in your Tweet. Click the Tweet button to post the Tweet to your profile. To save a draft of your Tweet, click the X icon in the top left corner of the compose box, then click Save. To schedule your Tweet to be sent at a later date/time, click on the calendar icon at the bottom of the compose box and make your schedule selections, then click Confirm. To access your drafts and scheduled Tweets, click on Unsent Tweets from the Tweet compose box.   Tweet source labels Tweet source labels help you better understand how a Tweet was posted. This additional information provides context about the Tweet and its author. If you don’t recognize the source, you may want to learn more to determine how much you trust the content.   Click on a Tweet to go to the Tweet details page. At the bottom of the Tweet, you’ll see the label for the source of the account’s Tweet. For example, Twitter for iPhone, Twitter for Android, or Twitter for Web. Tweets containing the Twitter for Advertisers label indicate they are created through the Twitter Ads Composer and not whether they are paid content or not. Paid content contains a Promoted badge across all ad formats. In some cases you may see a third-party client name, which indicates the Tweet came from a non-Twitter application. Authors sometimes use third-party client applications to manage their Tweets, manage marketing campaigns, measure advertising performance, provide customer support, and to target certain groups of people to advertise to. 
Third-party clients are software tools used by authors and therefore are not affiliated with, nor do they reflect the views of, the Tweet content. Tweets and campaigns can be directly created by humans or, in some circumstances, automated by an application. Visit our partners page for a list of common third-party sources. Deleting Tweets Read about how to delete a Tweet. Note that you can only delete your own Tweets. You cannot delete Tweets which were posted by other accounts. Instead, you can unfollow, block or mute accounts whose Tweets you do not want to receive. Read about how to delete or undo a Retweet. Keyboard shortcuts  The following are a list of keyboard shortcuts to use on twitter.com. Actions n  =  new Tweet l  =  like r  =  reply t  =  Retweet m  =  Direct Message u  =  mute account b  =  block account enter  =  open Tweet details o   =  expand photo /  =  search cmd-enter | ctrl-enter  =  send Tweet Navigation ?  =  full keyboard menu j  =  next Tweet k  =  previous Tweet space  =  page down .  =  load new Tweets Timelines g and h  =  Home timeline g and o  =  Moments g and n  =  Notifications tab g and r  =  Mentions g and p  =  profile  g and l  =  likes tab g and i  =  lists tab g and m  =  Direct Messages g and s  =  Settings and privacy g and u  =  go to someone’s profile Bookmark or share this article Scroll to top Twitter platform Twitter.com Status Card validator Privacy Center Transparency Center Twitter, Inc. About the company Twitter for Good Company news Brand toolkit Jobs and internships Investors Help Help Center Using Twitter Twitter Media Ads Help Center Managing your account Safety and security Rules and policies Contact us Developer resources Developer home Documentation Forums Communities Developer blog Engineering blog Developer terms Business resources Advertise Twitter for business Resources and guides Twitter for marketers Marketing insights Brand inspiration Twitter Data Twitter Flight School © 2021 Twitter, Inc. Cookies Privacy Terms and conditions English Help Center English Español 日本語 한국어 Português Deutsch Türkçe Français Italiano العربيّة Nederlands Bahasa Indonesia Русский हिंदी সহায়তা কেন্দ্র मदत केंद्र સહાયતા કેન્દ્ર உதவி மையம் ಸಹಾಯ ಕೇಂದ್ರ By using Twitter’s services you agree to our Cookies Use. We use cookies for purposes including analytics, personalisation, and ads. OK homosaurus-org-167 ---- Homosaurus Vocabulary Site homosaurus.org Toggle navigation Home Vocabulary Search Releases About Contact Welcome to the Homosaurus! The Homosaurus is an international linked data vocabulary of Lesbian, Gay, Bisexual, Transgender, and Queer (LGBTQ) terms. This vocabulary is intended to function as a companion to broad subject term vocabularies, such as the Library of Congress Subject Headings. Libraries, archives, museums, and other institutions are encouraged to use the Homosaurus to support LGBTQ research by enhancing the discoverability of their LGBTQ resources. If you are using the Homosaurus, we want to hear from you! Please contact us to let us know how you are using this vocabulary and share any feedback you might have. Homosaurus.org is a linked data service maintained by the Digital Transgender Archive Loading... homosaurus-org-6206 ---- Homosaurus Vocabulary Site homosaurus.org Toggle navigation Home Vocabulary Search Releases About Contact Welcome to the Homosaurus! The Homosaurus is an international linked data vocabulary of Lesbian, Gay, Bisexual, Transgender, and Queer (LGBTQ) terms. 
This vocabulary is intended to function as a companion to broad subject term vocabularies, such as the Library of Congress Subject Headings. Libraries, archives, museums, and other institutions are encouraged to use the Homosaurus to support LGBTQ research by enhancing the discoverability of their LGBTQ resources. If you are using the Homosaurus, we want to hear from you! Please contact us to let us know how you are using this vocabulary and share any feedback you might have. Homosaurus.org is a linked data service maintained by the Digital Transgender Archive Loading... hopeforgirlsandwomen-com-6443 ---- Hope for girls & women Skip to content Facebook Instagram Twitter LinkedIn Search for: Hope for girls & women Menu News News from Hope Upcoming events About Us About Rhobi About Hope About FGM in Tanzania Background Updates from Rhobi Our Supporters Awards & Articles Contact Challenges Team members Marketing material COVID-19 What We Do Safe Houses Sponsor a girl Sponsored girls Community Road Shows Alternative Rites of Passage Film Screenings: In the Name of your Daughter Digital Champions Mapping Re-educating cutters Donate We provide a safe environment for girls escaping Female Genital Mutilation (FGM) Girls often arrive at Hope’s safe houses late at night with just the clothes they have run away in. Those arriving on foot have to navigate from remote, rural areas in the dark. We also work with local police teams to rescue girls when we are alerted that FGM is going to take place. We provide girls with safety, education and hope. Donate to hope Sponsor a girl According to the United Nations, in the Mara region of Tanzania, 32% of women aged between 15 and 49 report having undergone FGM. Hope for Girls and Women was founded by the Tanzanian activist Rhobi Samwelly in 2017. Rhobi’s personal experience of being forced to undergo female genital mutilation (FGM) as a child inspired her lifelong commitment to fight for the rights of girls and women. Our organisation runs two safe houses in the Butiama and Serengeti Districts of the Mara Region of Tanzania, which shelter and support those fleeing FGM, child marriage, and other forms of gender based violence. Read more here. Find out more about our important work to provide Alternative Rites of Passage ceremonies here. We’re continually working on raising awareness locally and globally, whilst also raising funds for our safe houses. Watch our new film here Subscribe here to follow our updates: Email Address: Sign Up Share this: Twitter Facebook Recent Posts 14/02/2021 beccadash Human Rights Detecting pests in Maize and Cassava with the PlantNuru app 20/12/202020/12/2020 beccadash Event reports Rhobi Participates in Women’s Health Talk 13/12/2020 beccadash Event reports Debating Gender-Based Violence with male villagers in Northern Tanzania 13/12/202020/12/2020 hopeforgirlsandwomen Event reports Fighting FGM with Maps 02/12/202006/12/2020 beccadash Human Rights How mapping is helping Tanzanian villages source water More Posts→ Create a website or blog at WordPress.com Email (Required) Name (Required) Website   Loading Comments... 
Comment × i0-wp-com-7448 ---- None i0-wp-com-7651 ---- None i0-wp-com-8298 ---- None i0-wp-com-8704 ---- None i1-wp-com-5169 ---- None i1-wp-com-5773 ---- None i1-wp-com-8412 ---- None i1-wp-com-8783 ---- None i2-wp-com-1556 ---- None i2-wp-com-240 ---- None i2-wp-com-2606 ---- None i2-wp-com-3644 ---- None i2-wp-com-4290 ---- None i2-wp-com-5138 ---- None i2-wp-com-5720 ---- None i2-wp-com-7223 ---- None i2-wp-com-8043 ---- None i2-wp-com-9284 ---- None idatosabiertos-org-8870 ---- Home - ILDA About About ILDA Transparency Report 2020 Strategic areas Community Gender and inclusion Developing technologies Transparency and governance Projects Femicide data standard Artificial Intelligence Regional Open Data Barometer Global Data Barometer Resources Papers Reports Tools Blog Contact  Español  English  Português do Brasil We work towards an open, equal and data-driven region Featured projects Proyectos Status: active ILDA: The Next Generation Proyectos Status: active Empatía Proyectos Status: active Femicide Data Standardization Proyectos Status: active Global Data Barometer Proyectos Status: active Regional Open Data Barometer Proyectos Status: active Data+Art News Posts 21/08/2020 Open data standards design behind closed doors? Recursos 06/10/2020 Data for development – a road ahead Recursos 15/09/2020 Flow to identify femicides Dirección Legal Rincon 477/803 Montevideo - Uruguay Impact Hub Av.12, entre calle 35 y 37, San Pedro San José - Costa Rica Home Researches Projects Blog Contacto Suscribite a nuestro newsletter: Leave this field empty if you're human: Contactanos Seguinos Seguinos en: Apoyan: inkdroid-org-1054 ---- inkdroid inkdroid Paper or Plastic 856 Coincidence? twarc2 This post was originally published on Medium but I spent time writing it so I wanted to have it here too. TL;DR twarc has been redesigned from the ground up to work with the new Twitter v2 API and their Academic Research track. Many thanks for the code and design contributions of Betsy Alpert, Igor Brigadir, Sam Hames, Jeff Sauer, and Daniel Verdeer that have made twarc2 possible, as well as early feedback from Dan Kerchner, Shane Lin, Miles McCain, 李荣蓬, David Thiel, Melanie Walsh and Laura Wrubel. Extra special thanks to the Institute for Future Environments at Queensland University of Technology for supporting Betsy and Sam in their work, and for the continued support of the Mellon Foundation. Back in August of last year Twitter announced early access to their new v2 API, and their plans to sunset the v1.1 API that has been active for almost the last 10 years. Over the lifetime of their v1.1 API Twitter has become deeply embedded in the media landscape. As magazines, newspapers and television have moved onto the web they have increasingly adopted tweets as a mechanism for citing politicians, celebrities and organizations, while also using them to document current events, generate leads and gather feedback for evolving stories. As a result Twitter has also become a popular object of study for humanities and social science researchers looking to understand the world as reflected, refracted and distorted by/in social media. On the surface the v2 API update seems pretty insignificant since the shape of a tweet, its parts, properties and affordances, aren’t changing at all. Tweets with 280 characters of text, images and video will continue to be posted, retweeted and quoted. 
However behind the scenes the representation of a tweet as data, and the quotas that control the rates at which this data can flow between apps and other third party services will be greatly transformed. Needless to say, v2 represents a big change for the Documenting the Now project. Along with community members we’ve developed and maintained open source tools like twarc that talk directly to the Twitter API to help users to search for and collect live tweets that match criteria like hashtags, names and geographic locations. Today we’re excited to announce the release of twarc v2 which has been designed from the ground up to work with the v2 API and Twitter’s new Academic Research track. Clearly it’s extremely problematic having a multi-national corporation act as a gatekeeper for who counts as an academic researcher, and what constitutes academic research. We need look no further than the recent experiences of Timnit Gebru and Margaret Mitchell at Google for an example of what happens when research questions run up against the business objectives of capital. We only know their stories because Gebru and Mitchell bravely took a principled approach, where many researchers would have knowingly or unknowingly shaped their research to better fit the needs of the company. So it is important for us that twarc still be usable by people with and without access to the Academic Research Track. But we have heard from many users that the Academic Research Track presents new opportunities for Twitter data collection that are essential for researchers interested in the observability of social media platforms. Twitter is making a good faith effort to work with the academic research community, and we thought twarc should support it, even if big challenges lie ahead. So why are people interested in the Academic Research Track? Once your application has been approved you are able to collect data from the full history of Tweets, at no cost. This is a massive improvement over the v1.1 access, which was limited to a one week window and required researchers to pay for access. Access to the full archive means it’s now possible to study events that have happened in the past back to the beginning of Twitter in 2006. If you do create any historical datasets we’d love for you to share the tweet identifier datasets in The Catalog. However this opening up of access on the one hand comes with a simultaneous contraction in terms of how much data can be collected at one time. The remainder of this post describes some of the details and the design decisions we have made with twarc2 to address them. If you would prefer to watch a quick introduction to using twarc v2 please check out this short video: Installation If you are familiar with installing twarc nothing is changed. You still install (or upgrade) with pip as you did before: $ pip install --upgrade twarc In fact you will still have full access to the v1.1 API just as you did before. So the old commands will continue to work as they did [1]: $ twarc search blacklivesmatter > tweets.jsonl twarc2 was designed to let you continue to use Twitter’s v1.1 API undisturbed until it is finally turned off by Twitter, at which point the functionality will be removed from twarc. All the support for the v2 API is mediated by a new command line utility twarc2.
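If this is your first time using twarc you will also need to give the tool your Twitter API credentials. I'm assuming here that the configure subcommand carries over from the v1.1 client, where it prompts for your keys and stores them in a config file for later use:

$ twarc2 configure

With the credentials in place, the search and stream examples that follow should work as written.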
For example to search for blacklivesmatter tweets and write them to a file tweets.jsonl: $ twarc2 search blacklivesmatter > tweets.jsonl All the usual twarc functionality such as searching for tweets, collecting live tweets from the streaming API endpoint, requesting user timelines and user metadata are all still there, twarc2 --help gives you the details. But while the interface looks the same there’s quite a bit different going on behind the scenes. Representation Truth be told, there is no shortage of open source libraries and tools for interacting with the Twitter API. In the past twarc has made a bit of a name for itself by catering to a niche group of users who want a reliable, programmable way to collect the canonical JSON representation of a tweet. JavaScript Object Notation (JSON) is the language of Web APIs, and Twitter has kept its JSON representation of a tweet relatively stable over the years. Rather than making lots of decisions about the many ways you might want to collect, model and analyze tweets twarc has tried to do one thing and do it well (data collection) and get out of the way so that you can use (or create) the tools for putting this data to use. But the JSON representation of a tweet in the Twitter v2 API is completely burst apart. The v2 base representation of a tweet is extremely lean and minimal, and just includes the text of the tweet its identifier and a handful of other things. All the details about the user who created the tweet, embedded media, and more are not included. Fortunately this information is still available, but the user needs to craft their API request to request tweets using a set of expansions that tell the Twitter API what additional entities to include. In addition for each expansion there are a set of field options to include that control what of these expansions is returned. So rather than there being a single JSON representation of a tweet API users now have the ability to shape the data based on what they need, much like how GraphQL APIs work. This kind of makes you wonder why Twitter didn’t make their GraphQL API available. For specific use cases this customizability is very useful, but the mutability of the representation of a tweet presents challenges when collecting data for future use. If you didn’t request the right expansions or fields when collecting the data then you won’t be able to analyze that data later when doing your research. To solve for this twarc2 has been designed to collect the richest possible representation for a tweet, by requesting all possible expansions and field combinations for tweets. See the expansions module for the details if you are interested. This takes a significant burden off of users to digest the API documentation, and craft the correct API requests themselves. In addition the twarc community will be monitoring the Twitter API documentation going forward to incorporate new expansions and fields as they will inevitably be added in the future. Flattening This is diving into the weeds a little bit, but it’s worth noting here that Twitter’s introduction of expansions allows data that was once duplicated across multiple tweets (such as user information, media, retweets, etc) to be included once per response from the API. This means that instead of seeing information about the user who created a tweet in the context of their tweet the user will be referenced using an identifier, and this identifier will map to user metadata in the outer envelope of the response. 
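To make the by-reference layout concrete, here is a minimal sketch, in plain Python, of how a response like that can be stitched back together so each tweet carries its user again, which is essentially what the flattening described below does. This is my own illustration, not twarc's actual code (see the expansions module mentioned above for that), and the "author" key is just my naming choice for where the referenced user object gets attached:

def flatten(response):
    # index the expanded user objects from the outer envelope by their id
    users = {u["id"]: u for u in response.get("includes", {}).get("users", [])}
    for tweet in response.get("data", []):
        # re-attach the referenced user so each tweet is self-contained again
        tweet["author"] = users.get(tweet.get("author_id"))
        yield tweet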
It makes sense why Twitter has introduced expansions since it means in a set of 100 tweets from a given user the user information will just be included once rather than repeated 100 times, which means less data, less network traffic and less money. It’s even more significant when you consider the large number of possible expansions. However this pass-by-reference rather than pass-by-value approach presents some challenges for stream-based processing which expects each tweet to be self-contained. For this reason we’ve introduced the idea of flattening the response data when persisting the JSON to disk. This means that tools and data pipelines that expect to operate on a stream of tweets can continue to do so. Since the representation of a tweet is so dependent on how data is requested we’ve taken the opportunity to introduce a small stanza of twarc-specific metadata using the __twarc prefix. This metadata records what API endpoint the data was requested from, and when. This information is critically important when interpreting the data, because some information about a tweet, like its retweet and quote counts, is constantly changing. Data Flows As mentioned above you can still collect tweets from the search and streaming API endpoints in a way that seems quite similar to the v1 API. The big changes however are the quotas associated with these endpoints which govern how much can be collected. These quotas control how many requests can be sent to Twitter in 15 minute intervals. In fact these quotas are not much changed, but what’s new are app wide quotas that constrain how many tweets a given application (app) can collect every month. An app in this context is a piece of software (e.g. your twarc software) identified by unique API keys set up in the Twitter Developer Portal. The standard API access sets a 500,000 tweet per month limit. This is a huge change considering there were no monthly app limits before. If you get approved for the Academic Research track your app quota is increased to 10 million per month. This is markedly better but the achievable data volume is still nothing like the v1.1 API, as these graphs attempt to illustrate: twarc2 will still observe the same rate limits, but once you’ve collected your portion for the month there’s not much that can be done, for that app at least. Apart from the quotas Twitter’s streaming endpoint in v2 is substantially changed, which impacts how users interact with twarc. Previously twarc users would be able to create up to two connections to the filter stream API. This could be done by simply: twarc filter obama > obama.jsonl However in the Twitter v2 API only apps can connect to the filter stream, and they can only connect once. At first this seems like a major limitation but rather than creating a connection per query the v2 API allows you to build a set of rules for tweets to match, which in turn controls what tweets are included in the stream. This means you can collect for multiple types of queries at the same time, and the tweets will come back with a piece of metadata indicating what rule caused their inclusion. This translates into a markedly different set of interactions at the command line for collecting from the stream, where you first need to set your stream rules and then open a connection to fetch it.
twarc2 stream-rules add blacklivesmatter twarc2 stream > tweets.jsonl One useful side effect of this is that you can update the stream (add and remove rules) while the stream is in motion: twarc2 stream-rules add blm While you are limited by the API quota in terms of how many tweets you can collect, tweets are not “dropped on the floor” when the volume gets too high. Once upon a time the v1.1 filter stream was rumored to be rate limited when your stream exceeded 1% of the total volume of new tweets. Plugins In addition to twarc helping you collect tweets the GitHub repository has also been a place to collect a set of utilities for working with the data. For example there are scripts for extracting and unshortening urls, identifying suspended/deleted content, extracting videos, building wordclouds, putting tweets on maps, displaying network graph visualizations, counting hashtags, and more. These utilities all work like Unix filters where the input is a stream of tweets and the output varies depending on what the utility is doing, e.g. a Gephi file for a network visualization, or a folder of mp4 files for video extraction. While this has worked well in general the kitchen sink approach has been difficult to manage from a configuration management perspective. Users have to download these scripts manually from GitHub or by cloning the repository. For some users this is fine, but it’s a bit of a barrier to entry for users who have just installed twarc with pip. Furthermore these plugins often have their own dependencies which twarc itself does not. This lets twarc stay pretty lean, and things like youtube_dl, NetworkX or Pandas can be installed by people that want to use utilities that need them. But since there is no way to install the utilities there isn’t a way to ensure that the dependencies are installed, which can lead to users needing to diagnose missing libraries themselves. Finally the plugins have typically lacked their own tests. twarc’s test suite has really helped us track changes to the Twitter API and to make sure that it continues to operate properly as new functionality has been added. But nothing like this has existed for the utilities. We’ve noticed that over time some of them need updating. Also their command line arguments have drifted over time which can lead to some inconsistencies in how they are used. So with twarc2 we’ve introduced the idea of plugins which extend the functionality of the twarc2 command, are distributed on PyPI separately from twarc, and exist in their own GitHub repositories where they can be developed and tested independently of twarc itself. This is all achieved through twarc2’s use of the click library and specifically click-plugins. So now if you would like to convert your collected tweets to CSV you can install the twarc-csv plugin: $ pip install twarc-csv $ twarc2 search covid19 > covid19.jsonl $ twarc2 csv covid19.jsonl > covid19.csv Or if you want to extract embedded and referenced videos from tweets you can install twarc-videos which will write all the videos to a directory: $ pip install twarc-videos $ twarc2 videos covid19.jsonl --download-dir covid19-videos You can write these plugins yourself and release them as needed. Check out the plugin reference implementation tweet-ids for a simple example to adapt. We’re still in the process of porting some of the most useful utilities over and would love to see ideas for new plugins. Check out the current list of twarc2 plugins and use the twarc issue tracker on GitHub to join the discussion.
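If you want to try writing a plugin yourself, here is a minimal sketch modeled loosely on the tweet-ids reference implementation. The command is just a click command that reads line-oriented JSON tweets; the module name here is made up, and the entry point group name below is an assumption on my part, so check tweet-ids for the exact packaging details:

# hypothetical module twarc_tweet_text.py
import json
import click

@click.command("tweet-text")
@click.argument("infile", type=click.File("r"), default="-")
def tweet_text(infile):
    """Print the text of each tweet in a file of line-oriented JSON tweets."""
    for line in infile:
        tweet = json.loads(line)
        click.echo(tweet.get("text", ""))

Packaged up with an entry point along the lines of entry_points={"twarc.plugins": ["tweet-text = twarc_tweet_text:tweet_text"]} in setup.py (assuming that is the group name twarc2 registers with click-plugins), a pip install would make the command show up as twarc2 tweet-text.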
You may notice from the list of plugins that twarc now (finally) has documentation on ReadTheDocs external from the documentation that was previously only available on GitHub. We got by with GitHub’s rendering of Markdown documents for a while, but GitHub’s boilerplate designed for developers can prove to be quite confusing for users who aren’t used to selectively ignoring it. ReadTheDocs allows us to manage the command line and API documentation for twarc, and to showcase the work that has gone into the Spanish, Japanese, Portuguese, Swedish, Swahili and Chinese translations. Feedback Thanks for reading this far! We hope you will give twarc2 a try. Let us know what you think either in comments here, in the DocNow Slack or over on GitHub. ✨ ✨ Happy twarcing! ✨ ✨ ✨ Windows users will want to indicate the output file using a second argument rather than redirecting output with >. See this page for details.↩ $ j You may have noticed that I try to use this static website as a journal. But, you know, not everything I want to write down is really ready (or appropriate) to put here. Some of these things end up in actual physical notebooks–there’s no beating the tactile experience of writing on paper for some kind of thinking. But I also spend a lot of time on my laptop, and at the command line in some form or another. So I have a directory of time stamped Markdown files stored on Dropbox, for example: ... /home/ed/Dropbox/Journal/2019-08-25.md /home/ed/Dropbox/Journal/2020-01-27.md /home/ed/Dropbox/Journal/2020-05-24.md /home/ed/Dropbox/Journal/2020-05-25.md /home/ed/Dropbox/Journal/2020-05-31.md ... Sometimes these notes migrate into a blog post or some other writing I’m doing. I used this technique quite a bit when writing my dissertation when I wanted to jot down things on my phone when an idea arrived. I’ve tried a few different apps for editing Markdown on my phone, but mostly settled on iA Writer which mostly just gets out of the way. But when editing on my laptop I tend to use my favorite text editor Vim with the vim-pencil plugin for making Markdown fun and easy. If Vim isn’t your thing and you use another text editor keep reading since this will work for you too. The only trick to this method of journaling is that I just need to open the right file. With command completion on the command line this isn’t so much of a chore. But it does take a moment to remember the date, and craft the right path. Today while reflecting on how nice it is to still be using Unix, it occurred to me that I could create a little shell script to open my journal for that day (or a previous day). So I put this little file j in my PATH: #!/bin/zsh journal_dir="/home/ed/Dropbox/Journal" if [ "$1" ]; then date=$1 else date=`date +%Y-%m-%d` fi vim "$journal_dir/$date.md" So now when I’m in the middle of something else and want to jot a note in my journal I just type j. Unix, still crazy after all these years. Strengths and Weaknesses Quoting Macey (2019), quoting Foucault, quoting Nietzsche: One thing is needful. – To ‘give style’ to one’s character – a great and rare art! It is practised by those who survey all the strengths and weaknesses that their nature has to offer and then fit them into an artistic plan until each appears as art and reason and even weaknesses delight the eye. Nietzsche, Williams, Nauckhoff, & Del Caro (2001), p. 290 This is a generous and lively image of what art does when it is working. Art is not perfection. Macey, D. (2019). The lives of Michel Foucault: A biography. Verso. Nietzsche, F. 
W., Williams, B., Nauckhoff, J., & Del Caro, A. (2001). The gay science: with a prelude in German rhymes and an appendix of songs. Cambridge, U.K. ; New York: Cambridge University Press. Data Speculation I’ve taken the ill-advised approach of using the Coronavirus as a topic to frame the exercises in my computer programming class this semester. I say “ill-advised” because given the impact that COVID has been having on students I’ve been thinking they probably need a way to escape news of the virus by way of writing code, rather than diving into it more. It’s late in the semester to modulate things but I think we will shift gears to look at programming through another lens after spring break. That being said, one of the interesting things we’ve been doing is looking at vaccination data that is being released by the Maryland Department of Health through their ESRI ArcGIS Hub. Note: this dataset has since been removed from the web because it has been superseded by a new dataset that includes single dose vaccinations. I guess it’s good that students get a feel for how ephemeral data on the web is, even when it is published by the government. We noticed that this dataset recorded a small number of vaccinations as happening as early as the 1930s up until December 11, 2020 when vaccines were approved for use. I asked students to apply what we have been learning about Python (files, strings, loops, and sets) to identify the Maryland counties that were responsible for generating this anomalous data. I thought this exercise provided a good demonstration using real, live data that critical thinking about the provenance of data is always important because there is no such thing as raw data (Gitelman, 2013). While we were working with the data to count the number of anomalous vaccinations per county one of my sharp eyed students noticed that the results we were seeing with my version of the dataset (downloaded on February 28) were different from what we saw with his (downloaded on March 4). We expected to see new rows in the later one because new vaccination data seem to be reported daily–which is cool in itself. But we were surprised to find new vaccination records for dates earlier than December 11, 2020. Why would new vaccinations for these erroneous older dates still be entering the system? For example the second dataset downloaded March 4 acquired 6 new rows:

Object ID | Vaccination Date | County | Daily First Dose | Cumulative First Dose | Daily Second Dose | Cumulative Second Dose
4 | 1972/10/13 | Allegany | 1 | 1 | 0 | 0
5 | 1972/12/16 | Baltimore | 1 | 1 | 0 | 0
6 | 2012/02/03 | Baltimore | 1 | 2 | 0 | 0
28 | 2020/02/24 | Baltimore City | 1 | 2 | 0 | 0
34 | 2020/08/24 | Baltimore | 1 | 4 | 0 | 0
64 | 2020/12/10 | Prince George’s | 1 | 3 | 0 | 0

And these rows present in the February 28 version were deleted in the March 4 version:

Object ID | Vaccination Date | County | Daily First Dose | Cumulative First Dose | Daily Second Dose | Cumulative Second Dose
4 | 2019/12/26 | Frederick | 1 | 1 | 0 | 0
15 | 2020/01/25 | Talbot | 1 | 1 | 0 | 0
19 | 2020/01/28 | Baltimore | 1 | 1 | 0 | 0
20 | 2020/01/30 | Caroline | 1 | 1 | 0 | 0
28 | 2020/02/12 | Prince George’s | 1 | 1 | 0 | 0
30 | 2020/02/20 | Anne Arundel | 1 | 6 | 0 | 0
56 | 2020/10/16 | Frederick | 1 | 7 | 0 | 4
59 | 2020/11/01 | Wicomico | 1 | 1 | 0 | 0
60 | 2020/11/04 | Frederick | 1 | 8 | 0 | 4

I found these additions perplexing at first, because I assumed these outliers were part of an initial load. But it appears that the anomalies are still being generated? The deletions suggest that perhaps the anomalous data is being identified and scrubbed in a live system that is then dumping out the data?
Or maybe the code that is being used to update the dataset in ArcGIS Hub itself is malfunctioning in some way? If you are interested in toying around with the code and data it is up on GitHub. I was interested to learn about pandas.DataFrame.merge which is useful for diffing tables when you use indicator=True. At any rate, having students notice, measure and document anomalies like this seems pretty useful. I also asked them to speculate about what kinds of activities could generate these errors. I meant speculate in the speculative fiction sense of imagining a specific scenario that caused it. I think this made some students scratch their head a bit, because I wasn’t asking them for the cause, but to invent a possible cause. Based on the results so far I’d like to incorporate more of these speculative exercises concerned with the functioning of code and data representations into my teaching. I want to encourage students to think creatively about data processing as they learn about the nuts and bolts of how code operates. For example the treatments in How to Run a City Like Amazon, and Other Fables which use sci-fi to test ideas about how information technologies are deployed in society. Another model is the Speculative Ethics Book Club which also uses sci-fi to explore the ethical and social consequences of technology. I feel like I need to read up on specualtive research more generally before doing this though (Michael & Wilkie, 2020). I’d also like to focus the speculation down at the level of the code or data processing, rather than at the macro super-system level. But that has its place too. Another difference is that I was asking students to engage in speculation about the past rather than the future. How did the data end up this way? Perhaps this is more of a genealogical approach, of winding things backwards, and tracing what is known. Maybe it’s more Mystery than Sci-Fi. The speculative element is important because (in this case) operations at the MD Dept of Health, and their ArcGIS Hub setup are mostly opaque to us. But even when access isn’t a problem these systems they can feel opaque, because rather than there being a dearth of information you are drowning in it. Speculation is a useful abductive approach to hypothesis generation and, hopefully, understanding. Update 2021-03-17: Over in the fediverse David Benque recommended I take a look at Matthew Stanley’s chapter in (Gitelman, 2013) “Where Is That Moon, Anyway? The Problem of Interpreting Historical Solar Eclipse Observations” for the connection to Mystery. For the connection to Peirce and abduction he also pointed to Luciana Parisi’s chapter “Speculation: A method for the unattainable” in Lury & Wakeford (2012). Definitely things to follow up on! References Gitelman, L. (Ed.). (2013). “Raw data” is an oxymoron. MIT Press. Lury, C., & Wakeford, N. (2012). Inventive methods: The happening of the social. Routledge. Michael, M., & Wilkie, A. (2020). Speculative research. In The Palgrave encyclopedia of the possible (pp. 1–8). Cham: Springer International Publishing. Retrieved from https://doi.org/10.1007/978-3-319-98390-5_118-1 Recovering Foucault I’ve been enjoying reading David Macey’s biography of Michel Foucault, that was republished in 2019 by Verso. Macey himself is an interesting figure, both a scholar and an activist who took leave from academia to do translation work and to write this biography and others of Lacan and Fanon. 
One thing that struck me as I’m nearing the end of Macey’s book is the relationship between Foucault and archives. I think Foucault has become emblematic of a certain brand of literary analysis of “the archive” that is far removed from the research literature of archival studies, while using “the archive” as a metaphor (Caswell, 2016). I’ve spent much of my life working in libraries and digital preservation, and now studying and teaching about them from the perspective of practice, so I am very sympathetic to this critique. It is perhaps ironic that the disconnect between these two bodies of research is a difference in discourse which Foucault himself brought attention to. At any rate, the thing that has struck me while reading this biography is how much time Foucault himself spent working in libraries and archives. Here’s Foucault in his own words talking about his thesis: In Histoire de la folie à l’âge classique I wished to determine what could be known about mental illness in a given epoch … An object took shape for me: the knowledge invested in complex systems of institutions. And a method became imperative: rather than perusing … only the library of scientific books, it was necessary to consult a body of archives comprising decrees, rules hospital and prison registers, and acts of jurisprudence. It was in the Arsenal or the Archives Nationales that I undertook the analysis of a knowledge whose visible body is neither scientific nor theoretical discourse, nor literature, but a daily and regulated practice. (Macey, 2019, p. 94) Foucault didn’t simply use archives for his research: understanding the processes and practices of archives were integral to his method. Even though the theory and practice of libraries and archives are quite different given their different functions and materials, they are often lumped together as a convenience in the same buildings. Macey blurs them a little bit, in sections like this where he talks about how important libraries were to Foucault’s work: Foucault required access to Paris for a variety of reasons, not least because he was also teaching part-time at ENS. The putative thesis he had begun at the Fondation Thiers – and which he now described to Polin as being on the philosophy of psychology – meant that he had to work at the Bibliothèque Nationale and he had already become one of its habitues. For the next thirty years, Henri Labrouste’s great building in the rue de Richelieu, with its elegant pillars and arches of cast iron, would be his primary place of work. His favourite seat was in the hemicycle, the small, raised section directly opposite the entrance, sheltered from the main reading room, where a central aisle separates rows of long tables subdivided into individual reading desks. The hemicycle affords slighty more quiet and privacy. For thirty years, Foucault pursued his research here almost daily, with occasional forays to the manuscript department and to other libraries, and contended with the Byzantine cataloguing system: two incomplete and dated printed catalogues supplemented by cabinets containing countless index cards, many of them inscribed with copperplate handwriting. Libraries were to become Foucault’s natural habitat: ‘those greenish institutions where books accumulate and where there grows the dense vegetation of their knowledge’ There’s a metaphor for you: libraries as vegetation :) It kind of reminds me of some recent work looking at decentralized web technologies in terms of mushrooms. But I digress. 
I really just wanted to note here that the erasure of archival studies from humanities research about “the archive” shouldn’t really be attributed to Foucault, whose own practice centered the work of libraries and archives. Foucault wasn’t just writing about an abstract archive, he was practically living out of them. As someone who has worked in libraries and archives I can appreciate how power users (pun intended) often knew aspects of the holdings and intricacies of their their management better than I did. Archives, when they are working, are always collaborative endeavours, and the important thing is to recognize and attribute the various sides of that collaboration. PS. Writing this blog post led me to dig up a few things I want to read (Eliassen, 2010; Radford, Radford, & Lingel, 2015 ). References Caswell, M. (2016). The archive is not an archives: On acknowledging the intellectual contributions of archival studies. Reconstruction, 16(1). Retrieved from http://reconstruction.eserver.org/Issues/161/Caswell.shtml Eliassen, K. (2010). Archives of Michel Foucualt. In E. Røssaak (Ed.), The archive in motion, new conceptions of the archive in contemporary thought and new media practices. Novus Press. Macey, D. (2019). The lives of Michel Foucault: A biography. Verso. Radford, G. P., Radford, M. L., & Lingel, J. (2015). The library as heterotopia: Michel Foucault and the experience of library space. Journal of Documentation, 71(4), 773–751. Teaching OOP in the Time of COVID I’ve been teaching a section of the Introduction to Object Oriented Programming at the UMD College for Information Studies this semester. It’s difficult for me, and for the students, because we are remote due to the Coronavirus pandemic. The class is largely asynchronous, but every week I’ve been holding two synchronous live coding sessions in Zoom to discuss the material and the exercises. These have been fun because the students are sharp, and haven’t been shy about sharing their screen and their VSCode session to work on the details. But students need quite a bit of self-discipline to move through the material, and probably only about 1/4 of the students take advantage of these live sessions. I’m quite lucky because I’m working with a set of lectures, slides and exercises that have been developed over the past couple of years by other instructors: Josh Westgard, Aric Bills and Gabriel Cruz. You can see some of the public facing materials here. Having this backdrop of content combined with Severance’s excellent (and free) Python for Everybody has allowed me to focus more on my live sessions, on responsive grading, and to also spend some time crafting additional exercises that are geared to this particular moment. This class is in the College for Information Studies and not in the Computer Science Department, so it’s important for the students to not only learn how to use a programming language, but to understand programming as a social activity, with real political and material effects in the world. Being able to read, understand, critique and talk about code and its documentation is just as important as being able to write it. In practice, out in the “real world” of open source software I think these aspects are arguably more important. 
One way I’ve been trying to do this in the first few weeks of class is to craft a sequence of exercises that form a narrative around Coronavirus testing and data collection to help remind the students of the basics of programming: variables, expressions, conditionals, loops, functions, files. In the first exercise we imagined a very simple data entry program that needed to record results of Real-time polymerase chain reaction tests (RT-PCR). I gave them the program and described how it was supposed to work, and asked them describe (in English) any problems that they noticed and to submit a version of the program with problems fixed. I also asked them to reflect on a request from their boss about adding the collection of race, gender and income information. The goal here was to test their ability to read the program and write English about it while also demonstrating a facility for modifying the program. Most importantly I wanted them to think about how inputs such as race or gender have questions about categories and standards behind them, and weren’t simply a matter of syntax. The second exercise builds on the first by asking them to adjust the revised program to be able to save the data in a very particular format. Yes, in the first exercise the data is stored in memory and printed to the screen in aggregate at the end. The scenario here is that the Department of Health and Human Services has assumed the responsibility for COVID test data collection from the Centers for Disease Control. Of course this really happened, but the data format I chose was completely made up (maybe we will be working with some real data at the end of the semester if I continue with this theme). The goal in this exercise was to demonstrate their ability to read another program and fit a function into it. The students were given a working program that had a save_results() function stubbed out. In addition to submitting their revised code I asked them to reflect on some limitations of the data format chosen, and the data processing pipeline that it was a part of. And in the third exercise I asked them to imagine that this lab they were working in had a scientist who discovered a problem with some of the thresholds for acceptable testing, which required an update to the program from Exercise 2, and also a test suite to make sure the program was behaving properly. In addition to writing the tests I asked them to reflect on what functionality was not being tested that probably should be. This alternation between writing code and writing prose is something I started doing as part of a Digital Curation class. I don’t know if this dialogical or perhaps dialectical, approach is something others have tried. I should probably do some research to see. In my last class I alternated week by week: one week reading and writing code, the next week reading and writing prose. But this semester I’ve stayed focused on code, but required the reading and writing of code as well as prose about code in the same week. I hope to write more about how this goes, and these exercises as I go. I’m not sure if I will continue with the Coronavirus data examples. One thing I’m sensitive to is that my students themselves are experiencing the effects of the Coronavirus, and may want to escape it just for a bit in their school work. Just writing in the open about it here, in addition to the weekly meetings I’ve had with Aric, Josh and Gabriel has been very useful. Speaking of those meetings. 
I learned today from Aric that tomorrow (February 20th, 2021) is the 30th anniversary of Python’s first public release! You can see this reflected in this timeline. This v0.9.1 release was the first release Guido van Rossum made outside of CWI and was made on the Usenet newsgroup alt.sources where it is split out into chunks that need to be reassembled. Back in 2009 Andrew Dalke located and repackaged these sources in Google Groups, which acquired alt.sources as part of DejaNews in 2001. But if you look at the time stamp on the first part of the release you can see that it was made February 19, 1991 (not February 20). So I’m not sure if the birthday is actually today. I sent this little note out to my students with this wonderful two part oral history that the Computer History Museum did with Guido van Rossum a couple years ago. It turns out both of his parents were atheists and pacifists. His dad went to jail because he refused to be conscripted into the military. That and many more details of his background and thoughts about the evolution of Python can be found in these delightful interviews: Happy Birthday Python! GPT-3 Jam One of the joys of pandemic academic life has been a true feast of online events to attend, on a wide variety of topics, some of which are delightfully narrow and esoteric. Case in point was today’s Reflecting on Power and AI: The Case of GPT-3 which lived up to its title. I’ll try to keep an eye out for when the video posts, and update here. The workshop was largely organized around an exploration of whether GPT-3, the largest known machine learning language model, changes anything for media studies theory, or if it amounts to just more of the same. So the discussion wasn’t focused so much on what games could be played with GPT-3, but rather if GPT-3 changes the rules of the game for media theory, at all. I’m not sure there was a conclusive answer at the end, but it sounded like the consensus was that current theorization around media is adequate for understanding GPT-3, but it matters greatly what theory or theories are deployed. The online discussion after the presentations indicated that attendees didn’t see this as merely a theoretical issue, but one that has direct social and political impacts on our lives. James Steinhoff looked at GPT-3 using a Marxist media theory perspective where he told the story of GPT-3 as a project of OpenAI and as a project of capital. OpenAI started with much fanfare in 2015 as a non-profit initiative where the technology, algorithms and models developed would be kept openly licensed and freely available so that the world could understand the benefits and risks of AI technology. Steinhoff described how in 2019 the project’s needs for capital (compute power and staff) transitioned it from a non-profit into a capped-profit company, which is now owned, or at least controlled, by Microsoft. The code for generating the model as well as the model itself are gated behind a token-driven Web API run by Microsoft. You can get on a waiting list to use it, but apparently a lot of people have been waiting a while, so … Being a Microsoft employee probably helps. I grabbed a screenshot of the pricing page that Steinhoff shared during his presentation:
I did find Shreya Shankar’s gpt3-sandbox project for interacting with the API in your browser (mostly for iteratively crafting text input in order to generate desired output). It depends on the openai Python package created by OpenAI themselves. The docs for openai then point at a page on the openai.com website which is behind a login. You can create an account, but you need to be pre-approved (made it through the waitlist) to be able to see the docs. There’s probably some sense that can be made from examining the python client though. All of the presentations in some form or another touched on the 175 billion parameters that were used to generate the model. But the API to the model doesn’t have that many parameters. It allows you to enter text and get text back. But the API surface that the GPT-3 service provides could be interesting to examine a bit more closely, especially to track how it changes over time. In terms of how this model mediates knowledge and understanding it’ll be important watch. Steinhoff’s message seemed to be that, despite the best of intentions, GPT-3 functions in the service of very large corporations with very particular interests. One dimension that he didn’t explore perhaps because of time, is how the GPT-3 model itself is fed massive amounts of content from the web, or the commons. Indeed 60% of the data came from the CommonCrawl project. GPT-3 is an example of an extraction project that has been underway at large Internet companies for some time. I think the critique of these corporations has often been confined to seeing them in terms of surveillance capitalism rather than in terms of raw resource extraction, or the primitive accumulation of capital. The behavioral indicators of who clicked on what are certainly valuable, but GPT-3 and sister projects like CommonCrawl shows just the accumulation of data with modest amounts of metadata can be extremely valuable. This discussion really hit home for me since I’ve been working with Jess Ogden and Shawn Walker using CommonCrawl as a dataset for talking about the use of web archives, while also reflecting on the use of web archives as data. CommonCrawl provides a unique glimpse into some of the data operations that are at work in the accumulation of web archives. I worry that the window is closing and the CommonCrawl itself will be absorbed into Microsoft. Following Steinhoff Olya Kudina and Bas de Boer jointly presented some compelling thoughts about how its important to understand GPT-3 in terms of sociotechnical theory, using ideas drawn from Foucault and Arendt. I actually want to watch their presentation again because it followed a very specific path that I can’t do justice to here. But their main argument seemed to be that GPT-3 is an expression of power and that where there is power there is always resistance to power. GPT-3 can and will be subverted and used to achieve particular political ends of our own choosing. Because of my own dissertation research I’m partial to Foucault’s idea of governmentality, especially as it relates to ideas of legibility (Scott, 1998)–the who, what and why of legibility projects, aka archives. GPT-3 presents some interesting challenges in terms of legibility because the model is so complex, the results it generates defy deductive logic and auditing. 
In some ways GPT-3 obscures more than it makes a population legible, as Foucault moved from disciplinary analysis of the subject, to the ways in which populations are described and governed through the practices of pastoral power, of open datasets. Again the significance of CommonCrawl as an archival project, as a web legibility project, jumps to the fore. I’m not as up on Arendt as I should be, so one outcome of their presentation is that I’m going to read her The Human Condition which they had in a slide. I’m long overdue. References Scott, J. C. (1998). Seeing like a state: How certain schemes to improve the human condition have failed. Yale University Press. mimetypes Today I learned that Python has a mimetypes module, and has ever since Guido van Rossum added it in 1997. Honestly I’m just a bit sheepish to admit this discovery, as someone who has been using Python for digital preservation work for about 15 years. But maybe there’s a good reason for that. Since the entire version history for Python is available on GitHub (which is a beautiful thing in itself) you can see that the mimetypes module started as a guess_type() function built around a pretty simple hard coded mapping of file extensions to mimetypes. The module also includes a little bit of code to look for, and parse, mimetype registries that might be available on the host operating system. The initial mimetype registries used included one from the venerable Apache httpd web server, and the Netscape web browser, which was about three years old at the time. It makes sense why this function to look up a mimetype for a filename would be useful at that time, since Python was being used to serve up files on the nascent web and for sending email, and whatnot. Today the module looks much the same, but has a few new functions and about twice as many mimetypes in its internal list. Some of the new mimetypes include text/csv, audio/mpeg, application/vnd.ms-powerpoint, application/x-shockwave-flash, application/xml, and application/json. Comparing the first commit to the most recent provides a thumbnail sketch of 25 years of web format evolution. I’ll admit, this is a bit of an esoteric thing to be writing a blog post about. So I should explain. At work I’ve been helping out on a community archiving project which has accumulated a significant amount of photographs, scans, documents of various kinds, audio files and videos. Some of these files are embedded in web applications like Omeka, some are in cloud storage like Google Drive, or on the office network attached storage, and others are on scattered storage devices in people’s desk drawers and closets. We’ve also created new files during community digitization events, and oral history interviews. As part of this work we’ve wanted to start building a place on the web where all these materials live. This has required not only describing the files, but also putting all the files in one place so that access can be provided. In principle this sounds simple. But it turns out that collecting the files from all these diverse locations poses significant challenges, because their context matters. The filenames, and the directories they are found in, are sometimes the only descriptive metadata that exists for this data. In short, the original order matters. But putting this content on the web means that the files need to be brought together and connected with their metadata programmatically. This is how I stumbled across the mimetypes module.
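For reference, here is a quick sketch of the two functions that come up here and in what follows. The exact return values can vary a little between Python versions, since they depend on the module's internal table and whatever mimetype registries it finds on the host system:

import mimetypes

print(mimetypes.guess_type("interview.mp3"))          # e.g. ('audio/mpeg', None)
print(mimetypes.guess_type("scan.tiff"))              # e.g. ('image/tiff', None)
print(mimetypes.guess_extension("image/jpeg"))        # e.g. '.jpg'
print(mimetypes.guess_extension("application/json"))  # e.g. '.json'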
I’ve been writing some throwaway code to collect the files together into the same directory structure while preserving their original filenames and locations in an Airtable database. I’ve been using the magic module to identify the format of the file, which is used to copy the file into a Dropbox storage location. The extension is important because we are expecting this to be a static site serving up the content and we want the files to also be browsable using the Dropbox drive. It turns out the mimetypes.guess_extension function is pretty useful for turning a mediatype into a file extension. I’m kind of surprised that it took me this long to discover mimetypes, but I’m glad I did. As an aside I think this highlights for me how important Git can be as an archive and research method for software studies work.
Northwest Branch Cairn
Here is a short recording and a couple photos from my morning walk along the Northwest Branch trail with Penny. I can’t go every day but at 7 months old she has tons of energy, so it’s generally a good idea for all concerned to go at least every other morning. And it’s a good thing, because the walk is surprisingly peaceful, and it’s such a joy to see her run through the woods. After walking about 30 minutes there is this little cairn that is a reminder for me to turn around. After seeing it grow in size I was sad to see it knocked down one day. But, ever so slowly, it is getting built back up again.
inkdroid-org-180 ---- inkdroid
twarc2
This post was originally published on Medium but I spent time writing it so I wanted to have it here too. TL;DR twarc has been redesigned from the ground up to work with the new Twitter v2 API and their Academic Research track. Many thanks for the code and design contributions of Betsy Alpert, Igor Brigadir, Sam Hames, Jeff Sauer, and Daniel Verdeer that have made twarc2 possible, as well as early feedback from Dan Kerchner, Shane Lin, Miles McCain, 李荣蓬, David Thiel, Melanie Walsh and Laura Wrubel. Extra special thanks to the Institute for Future Environments at Queensland University of Technology for supporting Betsy and Sam in their work, and for the continued support of the Mellon Foundation. Back in August of last year Twitter announced early access to their new v2 API, and their plans to sunset the v1.1 API that has been active for almost the last 10 years. Over the lifetime of their v1.1 API Twitter has become deeply embedded in the media landscape. As magazines, newspapers and television have moved onto the web they have increasingly adopted tweets as a mechanism for citing politicians, celebrities and organizations, while also using them to document current events, generate leads and gather feedback for evolving stories. As a result Twitter has also become a popular object of study for humanities and social science researchers looking to understand the world as reflected, refracted and distorted by/in social media. On the surface the v2 API update seems pretty insignificant since the shape of a tweet, its parts, properties and affordances, aren’t changing at all. Tweets with 280 characters of text, images and video will continue to be posted, retweeted and quoted. However behind the scenes the representation of a tweet as data, and the quotas that control the rates at which this data can flow between apps and other third party services will be greatly transformed. Needless to say, v2 represents a big change for the Documenting the Now project.
Along with community members we’ve developed and maintained open source tools like twarc that talk directly to the Twitter API to help users search for and collect live tweets that match criteria like hashtags, names and geographic locations. Today we’re excited to announce the release of twarc v2 which has been designed from the ground up to work with the v2 API and Twitter’s new Academic Research track. Clearly it’s extremely problematic having a multi-national corporation act as a gatekeeper for who counts as an academic researcher, and what constitutes academic research. We need look no further than the recent experiences of Timnit Gebru and Margaret Mitchell at Google for an example of what happens when research questions run up against the business objectives of capital. We only know their stories because Gebru and Mitchell bravely took a principled approach, where many researchers would have knowingly or unknowingly shaped their research to better fit the needs of the company. So it is important for us that twarc still be usable by people with and without access to the Academic Research Track. But we have heard from many users that the Academic Research Track presents new opportunities for Twitter data collection that are essential for researchers interested in the observability of social media platforms. Twitter is making a good faith effort to work with the academic research community, and we thought twarc should support it, even if big challenges lie ahead. So why are people interested in the Academic Research Track? Once your application has been approved you are able to collect data from the full history of Tweets, at no cost. This is a massive improvement over the v1.1 access which was limited to a one week window and researchers had to pay for access. Access to the full archive means it’s now possible to study events that have happened in the past back to the beginning of Twitter in 2006. If you do create any historical datasets we’d love for you to share the tweet identifier datasets in The Catalog. However this opening up of access comes with a simultaneous contraction in terms of how much data can be collected at one time. The remainder of this post describes some of the details and the design decisions we have made with twarc2 to address them. If you would prefer to watch a quick introduction to using twarc v2 please check out this short video:
Installation
If you are familiar with installing twarc nothing is changed. You still install (or upgrade) with pip as you did before:
$ pip install --upgrade twarc
In fact you will still have full access to the v1.1 API just as you did before. So the old commands will continue to work as they did.1
$ twarc search blacklivesmatter > tweets.jsonl
twarc2 was designed to let you continue to use Twitter’s v1.1 API undisturbed until it is finally turned off by Twitter, at which point the functionality will be removed from twarc. All the support for the v2 API is mediated by a new command line utility twarc2. For example to search for blacklivesmatter tweets and write them to a file tweets.jsonl:
$ twarc2 search blacklivesmatter > tweets.jsonl
All the usual twarc functionality such as searching for tweets, collecting live tweets from the streaming API endpoint, requesting user timelines and user metadata are all still there, twarc2 --help gives you the details. But while the interface looks the same there’s quite a bit different going on behind the scenes.
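The command line is only one way in; twarc can also be used as a Python library. Very roughly, and assuming an app’s bearer token from the Developer Portal (this is a sketch from my reading of the code rather than a tutorial, so check the twarc documentation for the exact class and method names):

from twarc import Twarc2

# assumes a bearer token for your app from the Twitter Developer Portal
client = Twarc2(bearer_token="REPLACE_ME")

# search_recent yields pages of API responses, each with "data" and "includes",
# rather than individual tweets
for page in client.search_recent("blacklivesmatter"):
    for tweet in page.get("data", []):
        print(tweet["id"], tweet["text"])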
Representation
Truth be told, there is no shortage of open source libraries and tools for interacting with the Twitter API. In the past twarc has made a bit of a name for itself by catering to a niche group of users who want a reliable, programmable way to collect the canonical JSON representation of a tweet. JavaScript Object Notation (JSON) is the language of Web APIs, and Twitter has kept its JSON representation of a tweet relatively stable over the years. Rather than making lots of decisions about the many ways you might want to collect, model and analyze tweets, twarc has tried to do one thing and do it well (data collection) and get out of the way so that you can use (or create) the tools for putting this data to use. But the JSON representation of a tweet in the Twitter v2 API is completely burst apart. The v2 base representation of a tweet is extremely lean and minimal, and just includes the text of the tweet, its identifier, and a handful of other things. All the details about the user who created the tweet, embedded media, and more are not included. Fortunately this information is still available, but the user needs to craft their API request to request tweets using a set of expansions that tell the Twitter API what additional entities to include. In addition, for each expansion there is a set of field options that control which parts of these expansions are returned. So rather than there being a single JSON representation of a tweet, API users now have the ability to shape the data based on what they need, much like how GraphQL APIs work. This kind of makes you wonder why Twitter didn’t make their GraphQL API available. For specific use cases this customizability is very useful, but the mutability of the representation of a tweet presents challenges when collecting data for future use. If you didn’t request the right expansions or fields when collecting the data then you won’t be able to analyze that data later when doing your research. To solve this, twarc2 has been designed to collect the richest possible representation for a tweet, by requesting all possible expansions and field combinations for tweets. See the expansions module for the details if you are interested. This takes a significant burden off of users to digest the API documentation, and craft the correct API requests themselves. In addition the twarc community will be monitoring the Twitter API documentation going forward to incorporate new expansions and fields as they will inevitably be added in the future.
Flattening
This is diving into the weeds a little bit, but it’s worth noting here that Twitter’s introduction of expansions allows data that was once duplicated across multiple tweets (such as user information, media, retweets, etc) to be included once per response from the API. This means that instead of seeing information about the user who created a tweet in the context of their tweet the user will be referenced using an identifier, and this identifier will map to user metadata in the outer envelope of the response. It makes sense why Twitter have introduced expansions since it means in a set of 100 tweets from a given user the user information will just be included once rather than repeated 100 times, which means less data, less network traffic and less money. It’s even more significant when you consider the large number of possible expansions. However this pass-by-reference rather than by-value approach presents some challenges for stream based processing which expects each tweet to be self-contained. For this reason we’ve introduced the idea of flattening the response data when persisting the JSON to disk. This means that tools and data pipelines that expect to operate on a stream of tweets can continue to do so.
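To make that a bit more concrete, here is a rough conceptual sketch of what flattening involves (this is not twarc’s actual implementation, just an illustration of the idea using the v2 response layout of data plus includes):

# a conceptual sketch of flattening, not twarc's actual code
def flatten(response):
    # index the expanded user objects in the response envelope by their id
    users = {u["id"]: u for u in response.get("includes", {}).get("users", [])}
    tweets = []
    for tweet in response.get("data", []):
        # re-attach the referenced user so each tweet is self-contained again
        tweet["author"] = users.get(tweet.get("author_id"))
        tweets.append(tweet)
    return tweets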
Since the representation of a tweet is so dependent on how data is requested we’ve taken the opportunity to introduce a small stanza of twarc-specific metadata using the __twarc prefix. This metadata records what API endpoint the data was requested from, and when. This information is critically important when interpreting the data, because some information about a tweet like its retweet and quote counts are constantly changing.
Data Flows
As mentioned above you can still collect tweets from the search and streaming API endpoints in a way that seems quite similar to the v1 API. The big changes however are the quotas associated with these endpoints which govern how much can be collected. These quotas control how many requests can be sent to Twitter in 15 minute intervals. In fact these quotas are not much changed, but what’s new are app wide quotas that constrain how many tweets a given application (app) can collect every month. An app in this context is a piece of software (e.g. your twarc software) identified by unique API keys set up in the Twitter Developer Portal. The standard API access sets a 500,000 tweet per month limit. This is a huge change considering there were no monthly app limits before. If you get approved for the Academic Research track your app quota is increased to 10 million per month. This is markedly better but the achievable data volume is still nothing like the v1.1 API, as these graphs attempt to illustrate: twarc2 will still observe the same rate limits, but once you’ve collected your portion for the month there’s not much that can be done, for that app at least. Apart from the quotas Twitter’s streaming endpoint in v2 is substantially changed which impacts how users interact with twarc. Previously twarc users would be able to create up to two connections to the filter stream API. This could be done by simply:
twarc filter obama > obama.jsonl
However in the Twitter v2 API only apps can connect to the filter stream, and they can only connect once. At first this seems like a major limitation but rather than creating a connection per query the v2 API allows you to build a set of rules for tweets to match, which in turn controls what tweets are included in the stream. This means you can collect for multiple types of queries at the same time, and the tweets will come back with a piece of metadata indicating what rule caused their inclusion. This translates into a markedly different set of interactions at the command line for collecting from the stream where you first need to set your stream rules and then open a connection to fetch it.
twarc2 stream-rules add blacklivesmatter
twarc2 stream > tweets.jsonl
One useful side effect of this is that you can update the stream (add and remove rules) while the stream is in motion:
twarc2 stream-rules add blm
While you are limited by the API quota in terms of how many tweets you can collect, tweets are not “dropped on the floor” when the volume gets too high. Once upon a time the v1.1 filter stream was rumored to be rate limited when your stream exceeds 1% of the total volume of new tweets.
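To put those monthly app quotas in perspective, a bit of back-of-the-envelope arithmetic (the tweets-per-second rate here is just an assumption for illustration):

# rough illustration of how quickly a stream can use up the monthly app quota
tweets_per_second = 5          # an assumed, fairly modest collection rate
standard_quota = 500_000       # standard access: tweets per month
academic_quota = 10_000_000    # Academic Research track: tweets per month

per_day = tweets_per_second * 60 * 60 * 24
print(per_day)                        # 432000 tweets a day
print(standard_quota / per_day)       # ~1.2 days until the standard quota is used up
print(academic_quota / per_day)       # ~23 days until the academic quota is used up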
Plugins
In addition to twarc helping you collect tweets the GitHub repository has also been a place to collect a set of utilities for working with the data. For example there are scripts for extracting and unshortening urls, identifying suspended/deleted content, extracting videos, building wordclouds, putting tweets on maps, displaying network graph visualizations, counting hashtags, and more. These utilities all work like Unix filters where the input is a stream of tweets and the output varies depending on what the utility is doing, e.g. a Gephi file for a network visualization, or a folder of mp4 files for video extraction. While this has worked well in general the kitchen sink approach has been difficult to manage from a configuration management perspective. Users have to download these scripts manually from GitHub or by cloning the repository. For some users this is fine, but it’s a bit of a barrier to entry for users who have just installed twarc with pip. Furthermore these plugins often have their own dependencies which twarc itself does not. This lets twarc stay pretty lean, and things like youtube_dl, NetworkX or Pandas can be installed by people that want to use utilities that need them. But since there is no way to install the utilities there isn’t a way to ensure that the dependencies are installed, which can lead to users needing to diagnose missing libraries themselves. Finally the plugins have typically lacked their own tests. twarc’s test suite has really helped us track changes to the Twitter API and to make sure that it continues to operate properly as new functionality has been added. But nothing like this has existed for the utilities. We’ve noticed that over time some of them need updating. Also their command line arguments have drifted over time which can lead to some inconsistencies in how they are used. So with twarc2 we’ve introduced the idea of plugins which extend the functionality of the twarc2 command, are distributed on PyPI separately from twarc, and exist in their own GitHub repositories where they can be developed and tested independently of twarc itself. This is all achieved through twarc2’s use of the click library and specifically click-plugins. So now if you would like to convert your collected tweets to CSV you can install the twarc-csv:
$ pip install twarc-csv
$ twarc2 search covid19 > covid19.jsonl
$ twarc2 csv covid19.jsonl > covid19.csv
Or if you want to extract embedded and referenced videos from tweets you can install twarc-videos which will write all the videos to a directory:
$ pip install twarc-videos
$ twarc2 videos covid19.jsonl --download-dir covid19-videos
You can write these plugins yourself and release them as needed. Check out the plugin reference implementation tweet-ids for a simple example to adapt. We’re still in the process of porting some of the most useful utilities over and would love to see ideas for new plugins. Check out the current list of twarc2 plugins and use the twarc issue tracker on GitHub to join the discussion. You may notice from the list of plugins that twarc now (finally) has documentation on ReadTheDocs external from the documentation that was previously only available on GitHub. We got by with GitHub’s rendering of Markdown documents for a while, but GitHub’s boilerplate designed for developers can prove to be quite confusing for users who aren’t used to selectively ignoring it. ReadTheDocs allows us to manage the command line and API documentation for twarc, and to showcase the work that has gone into the Spanish, Japanese, Portuguese, Swedish, Swahili and Chinese translations.
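If you are curious what a plugin amounts to, it is mostly just a click command published with a Python entry point that twarc2 discovers when it starts up. A hypothetical sketch (the module, command and entry point group names here are my assumptions, so copy whatever the tweet-ids reference implementation does):

# twarc_hello.py -- a minimal, hypothetical twarc2 plugin
import click

@click.command()
@click.argument("infile", type=click.File("r"))
def hello(infile):
    # count the lines of collected tweet JSON in a file
    count = sum(1 for line in infile if line.strip())
    click.echo(f"{count} lines of tweet JSON")

# in setup.py the command gets advertised to twarc2 with an entry point, e.g.
# entry_points={"twarc.plugins": ["hello = twarc_hello:hello"]}
# (the "twarc.plugins" group name is an assumption, not taken from the docs)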
Feedback
Thanks for reading this far! We hope you will give twarc2 a try. Let us know what you think either in comments here, in the DocNow Slack or over on GitHub. ✨ ✨ Happy twarcing! ✨ ✨ ✨
1. Windows users will want to indicate the output file using a second argument rather than redirecting output with >. See this page for details.↩
$ j
You may have noticed that I try to use this static website as a journal. But, you know, not everything I want to write down is really ready (or appropriate) to put here. Some of these things end up in actual physical notebooks–there’s no beating the tactile experience of writing on paper for some kinds of thinking. But I also spend a lot of time on my laptop, and at the command line in some form or another. So I have a directory of time stamped Markdown files stored on Dropbox, for example:
...
/home/ed/Dropbox/Journal/2019-08-25.md
/home/ed/Dropbox/Journal/2020-01-27.md
/home/ed/Dropbox/Journal/2020-05-24.md
/home/ed/Dropbox/Journal/2020-05-25.md
/home/ed/Dropbox/Journal/2020-05-31.md
...
Sometimes these notes migrate into a blog post or some other writing I’m doing. I used this technique quite a bit when writing my dissertation, when I wanted to jot things down on my phone as an idea arrived. I’ve tried a few different apps for editing Markdown on my phone, but mostly settled on iA Writer which mostly just gets out of the way. But when editing on my laptop I tend to use my favorite text editor Vim with the vim-pencil plugin for making Markdown fun and easy. If Vim isn’t your thing and you use another text editor keep reading since this will work for you too. The only trick to this method of journaling is that I just need to open the right file. With command completion on the command line this isn’t so much of a chore. But it does take a moment to remember the date, and craft the right path. Today while reflecting on how nice it is to still be using Unix, it occurred to me that I could create a little shell script to open my journal for that day (or a previous day). So I put this little file j in my PATH:
#!/bin/zsh
journal_dir="/home/ed/Dropbox/Journal"
if [ "$1" ]; then
    date=$1
else
    date=`date +%Y-%m-%d`
fi
vim "$journal_dir/$date.md"
So now when I’m in the middle of something else and want to jot a note in my journal I just type j. Unix, still crazy after all these years.
Strengths and Weaknesses
Quoting Macey (2019), quoting Foucault, quoting Nietzsche:
One thing is needful. – To ‘give style’ to one’s character – a great and rare art! It is practised by those who survey all the strengths and weaknesses that their nature has to offer and then fit them into an artistic plan until each appears as art and reason and even weaknesses delight the eye. Nietzsche, Williams, Nauckhoff, & Del Caro (2001), p. 290
This is a generous and lively image of what art does when it is working. Art is not perfection.
Macey, D. (2019). The lives of Michel Foucault: A biography. Verso.
Nietzsche, F. W., Williams, B., Nauckhoff, J., & Del Caro, A. (2001). The gay science: with a prelude in German rhymes and an appendix of songs. Cambridge, U.K.; New York: Cambridge University Press.
Data Speculation
I’ve taken the ill-advised approach of using the Coronavirus as a topic to frame the exercises in my computer programming class this semester. I say “ill-advised” because given the impact that COVID has been having on students I’ve been thinking they probably need a way to escape news of the virus by way of writing code, rather than diving into it more.
It’s late in the semester to modulate things but I think we will shift gears to look at programming through another lens after spring break. That being said, one of the interesting things we’ve been doing is looking at vaccination data that is being released by the Maryland Department of Health through their ESRI ArcGIS Hub. Note: this dataset has since been removed from the web because it has been superseded by a new dataset that includes single dose vaccinations. I guess it’s good that students get a feel for how ephemeral data on the web is, even when it is published by the government. We noticed that this dataset recorded a small number of vaccinations as happening as early as the 1930s up until December 11, 2020 when vaccines were approved for use. I asked students to apply what we have been learning about Python (files, strings, loops, and sets) to identify the Maryland counties that were responsible for generating this anomalous data. I thought this exercise provided a good demonstration using real, live data that critical thinking about the provenance of data is always important because there is no such thing as raw data (Gitelman, 2013). While we were working with the data to count the number of anomalous vaccinations per county one of my sharp eyed students noticed that the results we were seeing with my version of the dataset (downloaded on February 28) were different from what we saw with his (downloaded on March 4). We expected to see new rows in the later one because new vaccination data seem to be reported daily–which is cool in itself. But we were surprised to find new vaccination records for dates earlier than December 11, 2020. Why would new vaccinations for these erroneous older dates still be entering the system? For example the second dataset downloaded March 4 acquired 6 new rows:
Object ID | Vaccination Date | County | Daily First Dose | Cumulative First Dose | Daily Second Dose | Cumulative Second Dose
4 | 1972/10/13 | Allegany | 1 | 1 | 0 | 0
5 | 1972/12/16 | Baltimore | 1 | 1 | 0 | 0
6 | 2012/02/03 | Baltimore | 1 | 2 | 0 | 0
28 | 2020/02/24 | Baltimore City | 1 | 2 | 0 | 0
34 | 2020/08/24 | Baltimore | 1 | 4 | 0 | 0
64 | 2020/12/10 | Prince George’s | 1 | 3 | 0 | 0
And these rows present in the February 28 version were deleted in the March 4 version:
Object ID | Vaccination Date | County | Daily First Dose | Cumulative First Dose | Daily Second Dose | Cumulative Second Dose
4 | 2019/12/26 | Frederick | 1 | 1 | 0 | 0
15 | 2020/01/25 | Talbot | 1 | 1 | 0 | 0
19 | 2020/01/28 | Baltimore | 1 | 1 | 0 | 0
20 | 2020/01/30 | Caroline | 1 | 1 | 0 | 0
28 | 2020/02/12 | Prince George’s | 1 | 1 | 0 | 0
30 | 2020/02/20 | Anne Arundel | 1 | 6 | 0 | 0
56 | 2020/10/16 | Frederick | 1 | 7 | 0 | 4
59 | 2020/11/01 | Wicomico | 1 | 1 | 0 | 0
60 | 2020/11/04 | Frederick | 1 | 8 | 0 | 4
I found these additions perplexing at first, because I assumed these outliers were part of an initial load. But it appears that the anomalies are still being generated? The deletions suggest that perhaps the anomalous data is being identified and scrubbed in a live system that is then dumping out the data? Or maybe the code that is being used to update the dataset in ArcGIS Hub itself is malfunctioning in some way? If you are interested in toying around with the code and data it is up on GitHub. I was interested to learn about pandas.DataFrame.merge which is useful for diffing tables when you use indicator=True.
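Roughly what that diff looks like in code (a sketch: the file names here are made up, and the columns follow the tables above):

import pandas as pd

# two snapshots of the same ArcGIS Hub dataset, downloaded on different days
feb28 = pd.read_csv("md-vaccinations-2021-02-28.csv")
mar04 = pd.read_csv("md-vaccinations-2021-03-04.csv")

# an outer merge with indicator=True labels each row left_only, right_only or both
diff = feb28.merge(mar04, how="outer", indicator=True)

deleted = diff[diff["_merge"] == "left_only"]   # rows only in the Feb 28 download
added = diff[diff["_merge"] == "right_only"]    # rows only in the Mar 4 download
print(len(added), len(deleted))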
At any rate, having students notice, measure and document anomalies like this seems pretty useful. I also asked them to speculate about what kinds of activities could generate these errors. I meant speculate in the speculative fiction sense of imagining a specific scenario that caused it. I think this made some students scratch their heads a bit, because I wasn’t asking them for the cause, but to invent a possible cause. Based on the results so far I’d like to incorporate more of these speculative exercises concerned with the functioning of code and data representations into my teaching. I want to encourage students to think creatively about data processing as they learn about the nuts and bolts of how code operates. For example, the treatments in How to Run a City Like Amazon, and Other Fables use sci-fi to test ideas about how information technologies are deployed in society. Another model is the Speculative Ethics Book Club which also uses sci-fi to explore the ethical and social consequences of technology. I feel like I need to read up on speculative research more generally before doing this though (Michael & Wilkie, 2020). I’d also like to focus the speculation down at the level of the code or data processing, rather than at the macro super-system level. But that has its place too. Another difference is that I was asking students to engage in speculation about the past rather than the future. How did the data end up this way? Perhaps this is more of a genealogical approach, of winding things backwards, and tracing what is known. Maybe it’s more Mystery than Sci-Fi. The speculative element is important because (in this case) operations at the MD Dept of Health, and their ArcGIS Hub setup, are mostly opaque to us. But even when access isn’t a problem these systems can feel opaque, because rather than there being a dearth of information you are drowning in it. Speculation is a useful abductive approach to hypothesis generation and, hopefully, understanding. Update 2021-03-17: Over in the fediverse David Benque recommended I take a look at Matthew Stanley’s chapter in (Gitelman, 2013) “Where Is That Moon, Anyway? The Problem of Interpreting Historical Solar Eclipse Observations” for the connection to Mystery. For the connection to Peirce and abduction he also pointed to Luciana Parisi’s chapter “Speculation: A method for the unattainable” in Lury & Wakeford (2012). Definitely things to follow up on!
References
Gitelman, L. (Ed.). (2013). “Raw data” is an oxymoron. MIT Press.
Lury, C., & Wakeford, N. (2012). Inventive methods: The happening of the social. Routledge.
Michael, M., & Wilkie, A. (2020). Speculative research. In The Palgrave encyclopedia of the possible (pp. 1–8). Cham: Springer International Publishing. Retrieved from https://doi.org/10.1007/978-3-319-98390-5_118-1
Recovering Foucault
I’ve been enjoying reading David Macey’s biography of Michel Foucault, which was republished in 2019 by Verso. Macey himself is an interesting figure, both a scholar and an activist who took leave from academia to do translation work and to write this biography and others of Lacan and Fanon. One thing that struck me as I’m nearing the end of Macey’s book is the relationship between Foucault and archives. I think Foucault has become emblematic of a certain brand of literary analysis of “the archive” that is far removed from the research literature of archival studies, while using “the archive” as a metaphor (Caswell, 2016). I’ve spent much of my life working in libraries and digital preservation, and now studying and teaching about them from the perspective of practice, so I am very sympathetic to this critique. It is perhaps ironic that the disconnect between these two bodies of research is a difference in discourse which Foucault himself brought attention to.
At any rate, the thing that has struck me while reading this biography is how much time Foucault himself spent working in libraries and archives. Here’s Foucault in his own words talking about his thesis:
In Histoire de la folie à l’âge classique I wished to determine what could be known about mental illness in a given epoch … An object took shape for me: the knowledge invested in complex systems of institutions. And a method became imperative: rather than perusing … only the library of scientific books, it was necessary to consult a body of archives comprising decrees, rules, hospital and prison registers, and acts of jurisprudence. It was in the Arsenal or the Archives Nationales that I undertook the analysis of a knowledge whose visible body is neither scientific nor theoretical discourse, nor literature, but a daily and regulated practice. (Macey, 2019, p. 94)
Foucault didn’t simply use archives for his research: understanding the processes and practices of archives was integral to his method. Even though the theory and practice of libraries and archives are quite different given their different functions and materials, they are often lumped together as a convenience in the same buildings. Macey blurs them a little bit, in sections like this where he talks about how important libraries were to Foucault’s work:
Foucault required access to Paris for a variety of reasons, not least because he was also teaching part-time at ENS. The putative thesis he had begun at the Fondation Thiers – and which he now described to Polin as being on the philosophy of psychology – meant that he had to work at the Bibliothèque Nationale and he had already become one of its habitues. For the next thirty years, Henri Labrouste’s great building in the rue de Richelieu, with its elegant pillars and arches of cast iron, would be his primary place of work. His favourite seat was in the hemicycle, the small, raised section directly opposite the entrance, sheltered from the main reading room, where a central aisle separates rows of long tables subdivided into individual reading desks. The hemicycle affords slightly more quiet and privacy. For thirty years, Foucault pursued his research here almost daily, with occasional forays to the manuscript department and to other libraries, and contended with the Byzantine cataloguing system: two incomplete and dated printed catalogues supplemented by cabinets containing countless index cards, many of them inscribed with copperplate handwriting. Libraries were to become Foucault’s natural habitat: ‘those greenish institutions where books accumulate and where there grows the dense vegetation of their knowledge’
There’s a metaphor for you: libraries as vegetation :) It kind of reminds me of some recent work looking at decentralized web technologies in terms of mushrooms. But I digress. I really just wanted to note here that the erasure of archival studies from humanities research about “the archive” shouldn’t really be attributed to Foucault, whose own practice centered the work of libraries and archives. Foucault wasn’t just writing about an abstract archive, he was practically living out of them. As someone who has worked in libraries and archives I can appreciate how power users (pun intended) often knew aspects of the holdings and intricacies of their management better than I did. Archives, when they are working, are always collaborative endeavours, and the important thing is to recognize and attribute the various sides of that collaboration. PS.
Writing this blog post led me to dig up a few things I want to read (Eliassen, 2010; Radford, Radford, & Lingel, 2015).
References
Caswell, M. (2016). The archive is not an archives: On acknowledging the intellectual contributions of archival studies. Reconstruction, 16(1). Retrieved from http://reconstruction.eserver.org/Issues/161/Caswell.shtml
Eliassen, K. (2010). Archives of Michel Foucault. In E. Røssaak (Ed.), The archive in motion, new conceptions of the archive in contemporary thought and new media practices. Novus Press.
Macey, D. (2019). The lives of Michel Foucault: A biography. Verso.
Radford, G. P., Radford, M. L., & Lingel, J. (2015). The library as heterotopia: Michel Foucault and the experience of library space. Journal of Documentation, 71(4), 773–751.
Teaching OOP in the Time of COVID
I’ve been teaching a section of the Introduction to Object Oriented Programming at the UMD College for Information Studies this semester. It’s difficult for me, and for the students, because we are remote due to the Coronavirus pandemic. The class is largely asynchronous, but every week I’ve been holding two synchronous live coding sessions in Zoom to discuss the material and the exercises. These have been fun because the students are sharp, and haven’t been shy about sharing their screen and their VSCode session to work on the details. But students need quite a bit of self-discipline to move through the material, and probably only about 1/4 of the students take advantage of these live sessions. I’m quite lucky because I’m working with a set of lectures, slides and exercises that have been developed over the past couple of years by other instructors: Josh Westgard, Aric Bills and Gabriel Cruz. You can see some of the public facing materials here. Having this backdrop of content combined with Severance’s excellent (and free) Python for Everybody has allowed me to focus more on my live sessions, on responsive grading, and to also spend some time crafting additional exercises that are geared to this particular moment. This class is in the College for Information Studies and not in the Computer Science Department, so it’s important for the students to not only learn how to use a programming language, but to understand programming as a social activity, with real political and material effects in the world. Being able to read, understand, critique and talk about code and its documentation is just as important as being able to write it. In practice, out in the “real world” of open source software I think these aspects are arguably more important. One way I’ve been trying to do this in the first few weeks of class is to craft a sequence of exercises that form a narrative around Coronavirus testing and data collection to help remind the students of the basics of programming: variables, expressions, conditionals, loops, functions, files. In the first exercise we imagined a very simple data entry program that needed to record results of Real-time polymerase chain reaction tests (RT-PCR). I gave them the program and described how it was supposed to work, and asked them to describe (in English) any problems that they noticed and to submit a version of the program with problems fixed. I also asked them to reflect on a request from their boss about adding the collection of race, gender and income information. The goal here was to test their ability to read the program and write English about it while also demonstrating a facility for modifying the program. Most importantly I wanted them to think about how inputs such as race or gender have questions about categories and standards behind them, and weren’t simply a matter of syntax.
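To give a flavor of the exercise, the handout looked very roughly like this (a reconstruction purely for illustration: the function names, threshold and prompts are made up and are not the actual assignment):

# a made-up sketch of a simple RT-PCR data entry program, for illustration only
def interpret(ct_value):
    # hypothetical cycle threshold cutoff; the real exercise's rules differed
    if ct_value <= 0:
        return "invalid"
    elif ct_value < 40:
        return "positive"
    else:
        return "negative"

results = []
while True:
    value = input("Enter Ct value (or q to quit): ")
    if value == "q":
        break
    results.append(interpret(float(value)))

# print the results in aggregate at the end
for outcome in ["positive", "negative", "invalid"]:
    print(outcome, results.count(outcome))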
The second exercise builds on the first by asking them to adjust the revised program to be able to save the data in a very particular format. In the first exercise the data was simply stored in memory and printed to the screen in aggregate at the end. The scenario here is that the Department of Health and Human Services has assumed the responsibility for COVID test data collection from the Centers for Disease Control. Of course this really happened, but the data format I chose was completely made up (maybe we will be working with some real data at the end of the semester if I continue with this theme). The goal in this exercise was to demonstrate their ability to read another program and fit a function into it. The students were given a working program that had a save_results() function stubbed out. In addition to submitting their revised code I asked them to reflect on some limitations of the data format chosen, and the data processing pipeline that it was a part of. And in the third exercise I asked them to imagine that this lab they were working in had a scientist who discovered a problem with some of the thresholds for acceptable testing, which required an update to the program from Exercise 2, and also a test suite to make sure the program was behaving properly. In addition to writing the tests I asked them to reflect on what functionality was not being tested that probably should be. This alternation between writing code and writing prose is something I started doing as part of a Digital Curation class. I don’t know if this dialogical, or perhaps dialectical, approach is something others have tried. I should probably do some research to see. In my last class I alternated week by week: one week reading and writing code, the next week reading and writing prose. But this semester I’ve stayed focused on code, but required the reading and writing of code as well as prose about code in the same week. I hope to write more about how this goes, and these exercises as I go. I’m not sure if I will continue with the Coronavirus data examples. One thing I’m sensitive to is that my students themselves are experiencing the effects of the Coronavirus, and may want to escape it just for a bit in their school work. Just writing in the open about it here, in addition to the weekly meetings I’ve had with Aric, Josh and Gabriel, has been very useful. Speaking of those meetings: I learned today from Aric that tomorrow (February 20th, 2021) is the 30th anniversary of Python’s first public release! You can see this reflected in this timeline. This v0.9.1 release was the first release Guido van Rossum made outside of CWI and was made on the Usenet newsgroup alt.sources where it is split out into chunks that need to be reassembled. Back in 2009 Andrew Dalke located and repackaged these sources in Google Groups, which acquired alt.sources as part of DejaNews in 2001. But if you look at the time stamp on the first part of the release you can see that it was made February 19, 1991 (not February 20). So I’m not sure if the birthday is actually today. I sent this little note out to my students with this wonderful two part oral history that the Computer History Museum did with Guido van Rossum a couple years ago. It turns out both of his parents were atheists and pacifists.
His dad went to jail because he refused to be conscripted into the military. That and many more details of his background and thoughts about the evolution of Python can be found in these delightful interviews: Happy Birthday Python!
GPT-3 Jam
One of the joys of pandemic academic life has been a true feast of online events to attend, on a wide variety of topics, some of which are delightfully narrow and esoteric. Case in point was today’s Reflecting on Power and AI: The Case of GPT-3 which lived up to its title. I’ll try to keep an eye out for when the video posts, and update here. The workshop was largely organized around an exploration of whether GPT-3, the largest known machine learning language model, changes anything for media studies theory, or if it amounts to just more of the same. So the discussion wasn’t focused so much on what games could be played with GPT-3, but rather if GPT-3 changes the rules of the game for media theory, at all. I’m not sure there was a conclusive answer at the end, but it sounded like the consensus was that current theorization around media is adequate for understanding GPT-3, but it matters greatly what theory or theories are deployed. The online discussion after the presentations indicated that attendees didn’t see this as merely a theoretical issue, but one that has direct social and political impacts on our lives. James Steinhoff looked at GPT-3 using a Marxist media theory perspective where he told the story of GPT-3 as a project of OpenAI and as a project of capital. OpenAI started with much fanfare in 2015 as a non-profit initiative where the technology, algorithms and models developed would be kept openly licensed and freely available so that the world could understand the benefits and risks of AI technology. Steinhoff described how in 2019 the project’s needs for capital (compute power and staff) transitioned it from a non-profit into a capped-profit company, which is now owned, or at least controlled, by Microsoft. The code for generating the model as well as the model itself are gated behind a token driven Web API run by Microsoft. You can get on a waiting list to use it, but apparently a lot of people have been waiting a while, so … Being a Microsoft employee probably helps. I grabbed a screenshot of the pricing page that Steinhoff shared during his presentation: I’d be interested to hear more about how these tokens operate. Are they per-request, or are they measured according to something else? I googled around a bit during the presentation to try to find some documentation for the Web API, and came up empty handed.
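From what I could piece together from the openai Python package mentioned earlier, the basic interaction is a prompt in and a completion out, roughly along these lines (a sketch from memory: the engine name and parameters here are my assumptions, not anything taken from the gated docs):

import openai  # the client package published by OpenAI

openai.api_key = "REPLACE_ME"  # you still need an approved account for this to work

# the engine name and parameters are assumptions, for illustration only
response = openai.Completion.create(
    engine="davinci",
    prompt="The role of the archive in machine learning is",
    max_tokens=50,
)
print(response["choices"][0]["text"])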
inkdroid-org-2856 ---- twarc2
inkdroid-org-4236 ---- inkdroid twarc2 This post was originally published on Medium but I spent time writing it so I wanted to have it here too. TL;DR twarc has been redesigned from the ground up to work with the new Twitter v2 API and their Academic Research track.
Many thanks for the code and design contributions of Betsy Alpert, Igor Brigadir, Sam Hames, Jeff Sauer, and Daniel Verdeer that have made twarc2 possible, as well as early feedback from Dan Kerchner, Shane Lin, Miles McCain, 李荣蓬, David Thiel, Melanie Walsh and Laura Wrubel. Extra special thanks to the Institute for Future Environments at Queensland University of Technology for supporting Betsy and Sam in their work, and to the Mellon Foundation for its continued support. Back in August of last year Twitter announced early access to their new v2 API, and their plans to sunset the v1.1 API that has been active for almost ten years. Over the lifetime of the v1.1 API Twitter has become deeply embedded in the media landscape. As magazines, newspapers and television have moved onto the web they have increasingly adopted tweets as a mechanism for citing politicians, celebrities and organizations, while also using them to document current events, generate leads and gather feedback for evolving stories. As a result Twitter has also become a popular object of study for humanities and social science researchers looking to understand the world as reflected, refracted and distorted by/in social media. On the surface the v2 API update seems pretty insignificant, since the shape of a tweet, its parts, properties and affordances, isn't changing at all. Tweets with 280 characters of text, images and video will continue to be posted, retweeted and quoted. Behind the scenes, however, the representation of a tweet as data, and the quotas that control the rates at which this data can flow between apps and other third-party services, will be greatly transformed. Needless to say, v2 represents a big change for the Documenting the Now project. Along with community members we've developed and maintained open source tools like twarc that talk directly to the Twitter API to help users search for and collect live tweets that match criteria like hashtags, names and geographic locations. Today we're excited to announce the release of twarc v2, which has been designed from the ground up to work with the v2 API and Twitter's new Academic Research track. Clearly it's extremely problematic having a multi-national corporation act as a gatekeeper for who counts as an academic researcher, and what constitutes academic research. We need look no further than the recent experiences of Timnit Gebru and Margaret Mitchell at Google for an example of what happens when research questions run up against the business objectives of capital. We only know their stories because Gebru and Mitchell bravely took a principled approach, where many researchers would have knowingly or unknowingly shaped their research to better fit the needs of the company. So it is important for us that twarc still be usable by people with and without access to the Academic Research Track. But we have heard from many users that the Academic Research Track presents new opportunities for Twitter data collection that are essential for researchers interested in the observability of social media platforms. Twitter is making a good faith effort to work with the academic research community, and we thought twarc should support it, even if big challenges lie ahead. So why are people interested in the Academic Research Track? Once your application has been approved you are able to collect data from the full history of Tweets, at no cost.
This is a massive improvement over v1.1 access, where search was limited to a one-week window and researchers had to pay for anything more. Access to the full archive means it's now possible to study events that have happened in the past, back to the beginning of Twitter in 2006. If you do create any historical datasets we'd love for you to share the tweet identifier datasets in The Catalog. However, this opening up of access comes with a simultaneous contraction in how much data can be collected at one time. The remainder of this post describes some of the details and the design decisions we have made with twarc2 to address them. If you would prefer to watch a quick introduction to using twarc v2 please check out this short video: Installation If you are familiar with installing twarc nothing has changed. You still install (or upgrade) with pip as you did before: $ pip install --upgrade twarc In fact you will still have full access to the v1.1 API just as you did before, so the old commands will continue to work as they did:1 $ twarc search blacklivesmatter > tweets.jsonl twarc2 was designed to let you continue to use Twitter's v1.1 API undisturbed until it is finally turned off by Twitter, at which point the functionality will be removed from twarc. All the support for the v2 API is mediated by a new command-line utility, twarc2. For example, to search for blacklivesmatter tweets and write them to a file tweets.jsonl: $ twarc2 search blacklivesmatter > tweets.jsonl All the usual twarc functionality, such as searching for tweets, collecting live tweets from the streaming API endpoint, and requesting user timelines and user metadata, is still there; twarc2 --help gives you the details. But while the interface looks the same there's quite a bit different going on behind the scenes. Representation Truth be told, there is no shortage of open source libraries and tools for interacting with the Twitter API. In the past twarc has made a bit of a name for itself by catering to a niche group of users who want a reliable, programmable way to collect the canonical JSON representation of a tweet. JavaScript Object Notation (JSON) is the language of Web APIs, and Twitter has kept its JSON representation of a tweet relatively stable over the years. Rather than making lots of decisions about the many ways you might want to collect, model and analyze tweets, twarc has tried to do one thing and do it well (data collection) and get out of the way so that you can use (or create) the tools for putting this data to use. But the JSON representation of a tweet in the Twitter v2 API is completely burst apart. The v2 base representation of a tweet is extremely lean and minimal: it includes just the text of the tweet, its identifier, and a handful of other things. All the details about the user who created the tweet, embedded media, and more are not included. Fortunately this information is still available, but users need to craft their API request with a set of expansions that tell the Twitter API what additional entities to include. In addition, for each expansion there is a set of field options that control which attributes of the expanded objects are returned. So rather than there being a single JSON representation of a tweet, API users now have the ability to shape the data based on what they need, much like how GraphQL APIs work. This kind of makes you wonder why Twitter didn't make their GraphQL API available.
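To make that shape-shifting a bit more concrete, here is a minimal sketch of what a request for a richer representation can look like against Twitter's v2 recent search endpoint. It is a sketch only, not twarc2's internal code: the endpoint and parameter names follow Twitter's v2 API documentation as I understand it, the bearer token is assumed to come from your own developer app, and twarc2 itself asks for a much longer list of expansions and fields than shown here.

import requests

BEARER_TOKEN = "..."  # from your own app in the Twitter Developer Portal

params = {
    "query": "blacklivesmatter",
    "max_results": 100,
    # expansions pull in referenced objects: authors, media, retweeted/quoted tweets
    "expansions": "author_id,attachments.media_keys,referenced_tweets.id",
    # field parameters control which attributes of each object are returned
    "tweet.fields": "created_at,public_metrics,entities,lang",
    "user.fields": "username,description,created_at",
    "media.fields": "type,url,preview_image_url",
}

resp = requests.get(
    "https://api.twitter.com/2/tweets/search/recent",
    headers={"Authorization": f"Bearer {BEARER_TOKEN}"},
    params=params,
)
resp.raise_for_status()
page = resp.json()  # {"data": [...], "includes": {...}, "meta": {...}}
print(len(page.get("data", [])), "tweets in this page")

Forget one expansion here and the corresponding objects simply never make it to disk, which is exactly the problem twarc2's request-everything approach is meant to avoid.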
For specific use cases this customizability is very useful, but the mutability of the representation of a tweet presents challenges when collecting data for future use. If you didn't request the right expansions or fields when collecting the data then you won't be able to analyze that data later when doing your research. To address this, twarc2 has been designed to collect the richest possible representation of a tweet by requesting all possible expansions and field combinations. See the expansions module for the details if you are interested. This takes a significant burden off of users, who would otherwise have to digest the API documentation and craft the correct API requests themselves. In addition the twarc community will be monitoring the Twitter API documentation going forward to incorporate new expansions and fields as they are inevitably added in the future. Flattening This is diving into the weeds a little bit, but it's worth noting here that Twitter's introduction of expansions allows data that was once duplicated across multiple tweets (such as user information, media, retweets, etc.) to be included once per response from the API. This means that instead of seeing information about the user who created a tweet in the context of their tweet, the user will be referenced using an identifier, and this identifier will map to user metadata in the outer envelope of the response. It makes sense why Twitter introduced expansions: in a set of 100 tweets from a given user, the user information is included just once rather than repeated 100 times, which means less data, less network traffic and less money. It's even more significant when you consider the large number of possible expansions. However this pass-by-reference rather than pass-by-value approach presents some challenges for stream-based processing, which expects each tweet to be self-contained. For this reason we've introduced the idea of flattening the response data when persisting the JSON to disk. This means that tools and data pipelines that expect to operate on a stream of tweets can continue to do so. Since the representation of a tweet is so dependent on how data is requested, we've taken the opportunity to introduce a small stanza of twarc-specific metadata using the __twarc prefix. This metadata records what API endpoint the data was requested from, and when. This information is critically important when interpreting the data, because some information about a tweet, like its retweet and quote counts, is constantly changing.
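To illustrate what flattening means in practice, here is a toy version of the idea, assuming a saved v2 response envelope like the one sketched earlier (the file names are hypothetical). This is illustrative only: twarc2's own expansions module does a much more complete job, handling media, polls, places and referenced tweets as well as users.

import json

def flatten_page(page):
    """Copy user objects from the response's includes back into each tweet."""
    users = {u["id"]: u for u in page.get("includes", {}).get("users", [])}
    for tweet in page.get("data", []):
        # each record becomes self-contained once the referenced user is inlined
        tweet["author"] = users.get(tweet.get("author_id"))
        yield tweet

# e.g. turn one saved API response into one self-contained tweet per line,
# which is roughly how twarc2 persists data to disk
with open("response.json") as fh:
    page = json.load(fh)
with open("tweets.jsonl", "a") as fh:
    for tweet in flatten_page(page):
        fh.write(json.dumps(tweet) + "\n")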
This is markedly better but the achievable data volume is still nothing like the v1.1 API, as these graphs attempt to illustrate: twarc2 will still observe the same rate limits, but once you’ve collected your portion for the month there’s not much that can be done, for that app at least. Apart from the quotas Twitter’s streaming endpoint in v2 is substantially changed which impacts how users interact with twarc. Previously twarc users would be able to create up to to two connections to the filter stream API. This could be done by simply: twarc filter obama > obama.jsonl However in the Twitter v2 API only apps can connect to the filter stream, and they can only connect once. At first this seems like a major limitation but rather than creating a connection per query the v2 API allows you to build a set of rules for tweets to match, which in turns controls what tweets are included in the stream. This means you can collect for multiple types of queries at the same time, and the tweets will come back with a piece of metadata indicating what rule caused its inclusion. This translates into a markedly different set of interactions at the command line for collecting from the stream where you first need to set your stream rules and then open a connection to fetch it. twarc2 stream-rules add blacklivesmatter twarc2 stream > tweets.jsonl One useful side effect of this is that you can update the stream (add and remove rules) while the stream is in motion: twarc2 stream-rules add blm While you are limited by the API quota in terms of how many tweets you can collect, tweets are not “dropped on the floor” when the volume gets too high. Once upon a time the v1.1 filter stream was rumored to be rate limited when your stream exceeds 1% of the total volume of new tweets. Plugins In addition to twarc helping you collect tweets the GitHub repository has also been a place to collect a set of utilities for working with the data. For example there are scripts for extracting and unshortening urls, identifying suspended/deleted content, extracting videos, buiding wordclouds, putting tweets on maps, displaying network graph visualizations, counting hashtags, and more. These utilities all work like Unix filters where the input is a stream of tweets and the output varies depending on what the utility is doing, e.g. a Gephi file for a network visualization, or a folder of mp4 files for video extraction. While this has worked well in general the kitchen sink approach has been difficult to manage from a configuration management perspective. Users have to download these scripts manually from GitHub or by cloning the repository. For some users this is fine, but it’s a bit of a barrier to entry for users who have just installed twarc with pip. Furthermore these plugins often have their own dependencies which twarc itself does not. This lets twarc can stay pretty lean, and things like youtube_dl, NetworkX or Pandas can be installed by people that want to use utilities that need them. But since there is no way to install the utilities there isn’t a way to ensure that the dependencies are installed, which can lead to users needing to diagnose missing libraries themselves. Finally the plugins have typically lacked their own tests. twarc’s test suite has really helped us track changes to the Twitter API and to make sure that it continues to operate properly as new functionality has been added. But nothing like this has existed for the utilities. We’ve noticed that over time some of them need updating. 
Plugins In addition to twarc helping you collect tweets, the GitHub repository has also been a place to collect a set of utilities for working with the data. For example there are scripts for extracting and unshortening URLs, identifying suspended/deleted content, extracting videos, building wordclouds, putting tweets on maps, displaying network graph visualizations, counting hashtags, and more. These utilities all work like Unix filters, where the input is a stream of tweets and the output varies depending on what the utility is doing, e.g. a Gephi file for a network visualization, or a folder of mp4 files for video extraction. While this has worked well in general, the kitchen sink approach has been difficult to manage from a configuration management perspective. Users have to download these scripts manually from GitHub or by cloning the repository. For some users this is fine, but it's a bit of a barrier to entry for users who have just installed twarc with pip. Furthermore these utilities often have their own dependencies which twarc itself does not. This lets twarc stay pretty lean, and things like youtube_dl, NetworkX or Pandas can be installed by people that want to use the utilities that need them. But since there is no way to install the utilities there isn't a way to ensure that their dependencies are installed, which can lead to users needing to diagnose missing libraries themselves. Finally the utilities have typically lacked their own tests. twarc's test suite has really helped us track changes to the Twitter API and to make sure that it continues to operate properly as new functionality has been added. But nothing like this has existed for the utilities. We've noticed that over time some of them need updating. Also their command-line arguments have drifted over time, which has led to some inconsistencies in how they are used. So with twarc2 we've introduced the idea of plugins, which extend the functionality of the twarc2 command, are distributed on PyPI separately from twarc, and exist in their own GitHub repositories where they can be developed and tested independently of twarc itself. This is all achieved through twarc2's use of the click library, and specifically click-plugins. So now if you would like to convert your collected tweets to CSV you can install twarc-csv: $ pip install twarc-csv $ twarc2 search covid19 > covid19.jsonl $ twarc2 csv covid19.jsonl > covid19.csv Or if you want to extract embedded and referenced videos from tweets you can install twarc-videos, which will write all the videos to a directory: $ pip install twarc-videos $ twarc2 videos covid19.jsonl --download-dir covid19-videos You can write these plugins yourself and release them as needed. Check out the plugin reference implementation tweet-ids for a simple example to adapt. We're still in the process of porting some of the most useful utilities over and would love to see ideas for new plugins. Check out the current list of twarc2 plugins and use the twarc issue tracker on GitHub to join the discussion.
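For a sense of what a plugin can look like, here is a hypothetical sketch built on the click mechanism described above. The command itself is ordinary click usage; the entry point group name in the final comment is an assumption on my part, so check the tweet-ids reference implementation for the real registration details before copying it.

# twarc_hashtags.py -- a hypothetical twarc2 plugin sketch
import json
from collections import Counter

import click

@click.command("hashtags")
@click.argument("infile", type=click.File("r"), default="-")
def hashtags(infile):
    """Count hashtags in a file of flattened tweets (JSONL)."""
    counts = Counter()
    for line in infile:
        tweet = json.loads(line)
        for tag in tweet.get("entities", {}).get("hashtags", []):
            counts[tag.get("tag", "").lower()] += 1
    for tag, n in counts.most_common():
        click.echo(f"{n}\t#{tag}")

# The package's setup.py would then register the command so that
# `twarc2 hashtags` works once it is pip installed, e.g. (assumed group name):
# entry_points={"twarc.plugins": ["hashtags = twarc_hashtags:hashtags"]}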
You may notice from the list of plugins that twarc now (finally) has documentation on ReadTheDocs, separate from the documentation that was previously only available on GitHub. We got by with GitHub's rendering of Markdown documents for a while, but GitHub's boilerplate designed for developers can prove to be quite confusing for users who aren't used to selectively ignoring it. ReadTheDocs allows us to manage the command line and API documentation for twarc, and to showcase the work that has gone into the Spanish, Japanese, Portuguese, Swedish, Swahili and Chinese translations. Feedback Thanks for reading this far! We hope you will give twarc2 a try. Let us know what you think either in comments here, in the DocNow Slack or over on GitHub. ✨ ✨ Happy twarcing! ✨ ✨ ✨ 1. Windows users will want to indicate the output file using a second argument rather than redirecting output with >. See this page for details.↩ $ j You may have noticed that I try to use this static website as a journal. But, you know, not everything I want to write down is really ready (or appropriate) to put here. Some of these things end up in actual physical notebooks–there's no beating the tactile experience of writing on paper for some kinds of thinking. But I also spend a lot of time on my laptop, and at the command line in some form or another. So I have a directory of time-stamped Markdown files stored on Dropbox, for example:
...
/home/ed/Dropbox/Journal/2019-08-25.md
/home/ed/Dropbox/Journal/2020-01-27.md
/home/ed/Dropbox/Journal/2020-05-24.md
/home/ed/Dropbox/Journal/2020-05-25.md
/home/ed/Dropbox/Journal/2020-05-31.md
...
Sometimes these notes migrate into a blog post or some other writing I'm doing. I used this technique quite a bit when writing my dissertation, when I wanted to jot down things on my phone as an idea arrived. I've tried a few different apps for editing Markdown on my phone, but mostly settled on iA Writer, which mostly just gets out of the way. But when editing on my laptop I tend to use my favorite text editor Vim with the vim-pencil plugin for making Markdown fun and easy. If Vim isn't your thing and you use another text editor keep reading, since this will work for you too. The only trick to this method of journaling is that I just need to open the right file. With command completion on the command line this isn't so much of a chore. But it does take a moment to remember the date, and craft the right path. Today while reflecting on how nice it is to still be using Unix, it occurred to me that I could create a little shell script to open my journal for that day (or a previous day). So I put this little file j in my PATH:
#!/bin/zsh
journal_dir="/home/ed/Dropbox/Journal"
if [ "$1" ]; then
    date=$1
else
    date=`date +%Y-%m-%d`
fi
vim "$journal_dir/$date.md"
So now when I'm in the middle of something else and want to jot a note in my journal I just type j. Unix, still crazy after all these years. Strengths and Weaknesses Quoting Macey (2019), quoting Foucault, quoting Nietzsche: One thing is needful. – To 'give style' to one's character – a great and rare art! It is practised by those who survey all the strengths and weaknesses that their nature has to offer and then fit them into an artistic plan until each appears as art and reason and even weaknesses delight the eye. Nietzsche, Williams, Nauckhoff, & Del Caro (2001), p. 290 This is a generous and lively image of what art does when it is working. Art is not perfection. Macey, D. (2019). The lives of Michel Foucault: A biography. Verso. Nietzsche, F. W., Williams, B., Nauckhoff, J., & Del Caro, A. (2001). The gay science: With a prelude in German rhymes and an appendix of songs. Cambridge, U.K.; New York: Cambridge University Press. Data Speculation I've taken the ill-advised approach of using the Coronavirus as a topic to frame the exercises in my computer programming class this semester. I say "ill-advised" because, given the impact that COVID has been having on students, I've been thinking they probably need a way to escape news of the virus by way of writing code, rather than diving into it more. It's late in the semester to modulate things but I think we will shift gears to look at programming through another lens after spring break. That being said, one of the interesting things we've been doing is looking at vaccination data that is being released by the Maryland Department of Health through their ESRI ArcGIS Hub. Note: this dataset has since been removed from the web because it has been superseded by a new dataset that includes single dose vaccinations. I guess it's good that students get a feel for how ephemeral data on the web is, even when it is published by the government. We noticed that this dataset recorded a small number of vaccinations as happening as early as the 1930s, up until December 11, 2020 when vaccines were approved for use. I asked students to apply what we have been learning about Python (files, strings, loops, and sets) to identify the Maryland counties that were responsible for generating this anomalous data. I thought this exercise provided a good demonstration, using real, live data, that critical thinking about the provenance of data is always important, because there is no such thing as raw data (Gitelman, 2013). While we were working with the data to count the number of anomalous vaccinations per county, one of my sharp-eyed students noticed that the results we were seeing with my version of the dataset (downloaded on February 28) were different from what we saw with his (downloaded on March 4). We expected to see new rows in the later one because new vaccination data seem to be reported daily–which is cool in itself.
But we were surprised to find new vaccination records for dates earlier than December 11, 2020. Why would new vaccinations for these erroneous older dates still be entering the system? For example the second dataset, downloaded March 4, acquired 6 new rows:
Object ID | Vaccination Date | County | Daily First Dose | Cumulative First Dose | Daily Second Dose | Cumulative Second Dose
4 | 1972/10/13 | Allegany | 1 | 1 | 0 | 0
5 | 1972/12/16 | Baltimore | 1 | 1 | 0 | 0
6 | 2012/02/03 | Baltimore | 1 | 2 | 0 | 0
28 | 2020/02/24 | Baltimore City | 1 | 2 | 0 | 0
34 | 2020/08/24 | Baltimore | 1 | 4 | 0 | 0
64 | 2020/12/10 | Prince George's | 1 | 3 | 0 | 0
And these rows present in the February 28 version were deleted in the March 4 version:
Object ID | Vaccination Date | County | Daily First Dose | Cumulative First Dose | Daily Second Dose | Cumulative Second Dose
4 | 2019/12/26 | Frederick | 1 | 1 | 0 | 0
15 | 2020/01/25 | Talbot | 1 | 1 | 0 | 0
19 | 2020/01/28 | Baltimore | 1 | 1 | 0 | 0
20 | 2020/01/30 | Caroline | 1 | 1 | 0 | 0
28 | 2020/02/12 | Prince George's | 1 | 1 | 0 | 0
30 | 2020/02/20 | Anne Arundel | 1 | 6 | 0 | 0
56 | 2020/10/16 | Frederick | 1 | 7 | 0 | 4
59 | 2020/11/01 | Wicomico | 1 | 1 | 0 | 0
60 | 2020/11/04 | Frederick | 1 | 8 | 0 | 4
I found these additions perplexing at first, because I assumed these outliers were part of an initial load. But it appears that the anomalies are still being generated? The deletions suggest that perhaps the anomalous data is being identified and scrubbed in a live system that is then dumping out the data? Or maybe the code that is being used to update the dataset in ArcGIS Hub itself is malfunctioning in some way? If you are interested in toying around with the code and data it is up on GitHub. I was interested to learn about pandas.DataFrame.merge, which is useful for diffing tables when you use indicator=True.
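For anyone who wants to reproduce that kind of diff, here is a rough sketch of the pandas.DataFrame.merge approach mentioned above. The CSV file names are hypothetical and the columns follow the tables shown here:

import pandas as pd

old = pd.read_csv("md-vaccinations-2021-02-28.csv")
new = pd.read_csv("md-vaccinations-2021-03-04.csv")

cols = ["Vaccination Date", "County", "Daily First Dose",
        "Cumulative First Dose", "Daily Second Dose", "Cumulative Second Dose"]

# an outer merge on all shared columns, with indicator=True, labels each row
# as present in the left file, the right file, or both
diff = old[cols].merge(new[cols], how="outer", indicator=True)

added = diff[diff["_merge"] == "right_only"]   # only in the March 4 download
deleted = diff[diff["_merge"] == "left_only"]  # only in the February 28 download
print(len(added), "rows added,", len(deleted), "rows deleted")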
At any rate, having students notice, measure and document anomalies like this seems pretty useful. I also asked them to speculate about what kinds of activities could generate these errors. I meant speculate in the speculative fiction sense of imagining a specific scenario that caused it. I think this made some students scratch their heads a bit, because I wasn't asking them for the cause, but to invent a possible cause. Based on the results so far I'd like to incorporate more of these speculative exercises concerned with the functioning of code and data representations into my teaching. I want to encourage students to think creatively about data processing as they learn about the nuts and bolts of how code operates. For example the treatments in How to Run a City Like Amazon, and Other Fables use sci-fi to test ideas about how information technologies are deployed in society. Another model is the Speculative Ethics Book Club, which also uses sci-fi to explore the ethical and social consequences of technology. I feel like I need to read up on speculative research more generally before doing this though (Michael & Wilkie, 2020). I'd also like to focus the speculation down at the level of the code or data processing, rather than at the macro super-system level. But that has its place too. Another difference is that I was asking students to engage in speculation about the past rather than the future. How did the data end up this way? Perhaps this is more of a genealogical approach, of winding things backwards, and tracing what is known. Maybe it's more Mystery than Sci-Fi. The speculative element is important because (in this case) operations at the MD Dept of Health, and their ArcGIS Hub setup, are mostly opaque to us. But even when access isn't a problem these systems can feel opaque, because rather than there being a dearth of information you are drowning in it. Speculation is a useful abductive approach to hypothesis generation and, hopefully, understanding. Update 2021-03-17: Over in the fediverse David Benque recommended I take a look at Matthew Stanley's chapter in (Gitelman, 2013), "Where Is That Moon, Anyway? The Problem of Interpreting Historical Solar Eclipse Observations," for the connection to Mystery. For the connection to Peirce and abduction he also pointed to Luciana Parisi's chapter "Speculation: A method for the unattainable" in Lury & Wakeford (2012). Definitely things to follow up on! References Gitelman, L. (Ed.). (2013). "Raw data" is an oxymoron. MIT Press. Lury, C., & Wakeford, N. (2012). Inventive methods: The happening of the social. Routledge. Michael, M., & Wilkie, A. (2020). Speculative research. In The Palgrave encyclopedia of the possible (pp. 1–8). Cham: Springer International Publishing. Retrieved from https://doi.org/10.1007/978-3-319-98390-5_118-1 Recovering Foucault I've been enjoying reading David Macey's biography of Michel Foucault, which was republished in 2019 by Verso. Macey himself is an interesting figure, both a scholar and an activist who took leave from academia to do translation work and to write this biography and others of Lacan and Fanon. One thing that struck me as I'm nearing the end of Macey's book is the relationship between Foucault and archives. I think Foucault has become emblematic of a certain brand of literary analysis of "the archive" that is far removed from the research literature of archival studies, while using "the archive" as a metaphor (Caswell, 2016). I've spent much of my life working in libraries and digital preservation, and now studying and teaching about them from the perspective of practice, so I am very sympathetic to this critique. It is perhaps ironic that the disconnect between these two bodies of research is a difference in discourse, which Foucault himself brought attention to. At any rate, the thing that has struck me while reading this biography is how much time Foucault himself spent working in libraries and archives. Here's Foucault in his own words talking about his thesis: In Histoire de la folie à l'âge classique I wished to determine what could be known about mental illness in a given epoch … An object took shape for me: the knowledge invested in complex systems of institutions. And a method became imperative: rather than perusing … only the library of scientific books, it was necessary to consult a body of archives comprising decrees, rules, hospital and prison registers, and acts of jurisprudence. It was in the Arsenal or the Archives Nationales that I undertook the analysis of a knowledge whose visible body is neither scientific nor theoretical discourse, nor literature, but a daily and regulated practice. (Macey, 2019, p. 94) Foucault didn't simply use archives for his research: understanding the processes and practices of archives was integral to his method. Even though the theory and practice of libraries and archives are quite different, given their different functions and materials, they are often lumped together as a convenience in the same buildings. Macey blurs them a little bit, in sections like this where he talks about how important libraries were to Foucault's work: Foucault required access to Paris for a variety of reasons, not least because he was also teaching part-time at ENS.
The putative thesis he had begun at the Fondation Thiers – and which he now described to Polin as being on the philosophy of psychology – meant that he had to work at the Bibliothèque Nationale, and he had already become one of its habitués. For the next thirty years, Henri Labrouste's great building in the rue de Richelieu, with its elegant pillars and arches of cast iron, would be his primary place of work. His favourite seat was in the hemicycle, the small, raised section directly opposite the entrance, sheltered from the main reading room, where a central aisle separates rows of long tables subdivided into individual reading desks. The hemicycle affords slightly more quiet and privacy. For thirty years, Foucault pursued his research here almost daily, with occasional forays to the manuscript department and to other libraries, and contended with the Byzantine cataloguing system: two incomplete and dated printed catalogues supplemented by cabinets containing countless index cards, many of them inscribed with copperplate handwriting. Libraries were to become Foucault's natural habitat: 'those greenish institutions where books accumulate and where there grows the dense vegetation of their knowledge' There's a metaphor for you: libraries as vegetation :) It kind of reminds me of some recent work looking at decentralized web technologies in terms of mushrooms. But I digress. I really just wanted to note here that the erasure of archival studies from humanities research about "the archive" shouldn't really be attributed to Foucault, whose own practice centered the work of libraries and archives. Foucault wasn't just writing about an abstract archive; he was practically living out of real ones. As someone who has worked in libraries and archives I can appreciate how power users (pun intended) often knew aspects of the holdings and the intricacies of their management better than I did. Archives, when they are working, are always collaborative endeavours, and the important thing is to recognize and attribute the various sides of that collaboration. PS. Writing this blog post led me to dig up a few things I want to read (Eliassen, 2010; Radford, Radford, & Lingel, 2015). References Caswell, M. (2016). The archive is not an archives: On acknowledging the intellectual contributions of archival studies. Reconstruction, 16(1). Retrieved from http://reconstruction.eserver.org/Issues/161/Caswell.shtml Eliassen, K. (2010). Archives of Michel Foucault. In E. Røssaak (Ed.), The archive in motion: New conceptions of the archive in contemporary thought and new media practices. Novus Press. Macey, D. (2019). The lives of Michel Foucault: A biography. Verso. Radford, G. P., Radford, M. L., & Lingel, J. (2015). The library as heterotopia: Michel Foucault and the experience of library space. Journal of Documentation, 71(4), 733–751. Teaching OOP in the Time of COVID I've been teaching a section of the Introduction to Object Oriented Programming at the UMD College for Information Studies this semester. It's difficult for me, and for the students, because we are remote due to the Coronavirus pandemic. The class is largely asynchronous, but every week I've been holding two synchronous live coding sessions in Zoom to discuss the material and the exercises. These have been fun because the students are sharp, and haven't been shy about sharing their screen and their VSCode session to work on the details.
But students need quite a bit of self-discipline to move through the material, and probably only about 1/4 of the students take advantage of these live sessions. I'm quite lucky because I'm working with a set of lectures, slides and exercises that have been developed over the past couple of years by other instructors: Josh Westgard, Aric Bills and Gabriel Cruz. You can see some of the public-facing materials here. Having this backdrop of content, combined with Severance's excellent (and free) Python for Everybody, has allowed me to focus more on my live sessions, on responsive grading, and also to spend some time crafting additional exercises that are geared to this particular moment. This class is in the College for Information Studies and not in the Computer Science Department, so it's important for the students to not only learn how to use a programming language, but to understand programming as a social activity, with real political and material effects in the world. Being able to read, understand, critique and talk about code and its documentation is just as important as being able to write it. In practice, out in the "real world" of open source software, I think these aspects are arguably more important. One way I've been trying to do this in the first few weeks of class is to craft a sequence of exercises that form a narrative around Coronavirus testing and data collection to help remind the students of the basics of programming: variables, expressions, conditionals, loops, functions, files. In the first exercise we imagined a very simple data entry program that needed to record results of real-time polymerase chain reaction (RT-PCR) tests. I gave them the program and described how it was supposed to work, and asked them to describe (in English) any problems that they noticed and to submit a version of the program with the problems fixed. I also asked them to reflect on a request from their boss about adding the collection of race, gender and income information. The goal here was to test their ability to read the program and write English about it while also demonstrating a facility for modifying the program. Most importantly I wanted them to think about how inputs such as race or gender have questions about categories and standards behind them, and aren't simply a matter of syntax. The second exercise builds on the first by asking them to adjust the revised program to be able to save the data in a very particular format (in the first exercise the data was simply stored in memory and printed to the screen in aggregate at the end). The scenario here is that the Department of Health and Human Services has assumed the responsibility for COVID test data collection from the Centers for Disease Control. Of course this really happened, but the data format I chose was completely made up (maybe we will be working with some real data at the end of the semester if I continue with this theme). The goal in this exercise was to demonstrate their ability to read another program and fit a function into it. The students were given a working program that had a save_results() function stubbed out. In addition to submitting their revised code I asked them to reflect on some limitations of the data format chosen, and the data processing pipeline that it was a part of.
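The post doesn't spell out the made-up HHS format, so purely as an illustration of the shape of the exercise, a stubbed-out save_results() might get filled in with something like this (the field names here are hypothetical, not the ones used in class):

import csv

def save_results(results, path="rt_pcr_results.csv"):
    """Write a list of test result dicts out in the (made-up) reporting format."""
    fields = ["patient_id", "collection_date", "ct_value", "result"]
    with open(path, "w", newline="") as fh:
        writer = csv.DictWriter(fh, fieldnames=fields)
        writer.writeheader()
        for row in results:
            writer.writerow({field: row.get(field, "") for field in fields})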
And in the third exercise I asked them to imagine that the lab they were working in had a scientist who discovered a problem with some of the thresholds for acceptable testing, which required an update to the program from Exercise 2, and also a test suite to make sure the program was behaving properly. In addition to writing the tests I asked them to reflect on what functionality was not being tested that probably should be. This alternation between writing code and writing prose is something I started doing as part of a Digital Curation class. I don't know if this dialogical, or perhaps dialectical, approach is something others have tried. I should probably do some research to see. In my last class I alternated week by week: one week reading and writing code, the next week reading and writing prose. This semester I've stayed focused on code, but required the reading and writing of code as well as prose about code in the same week. I hope to write more about how this goes, and about these exercises, as I go. I'm not sure if I will continue with the Coronavirus data examples. One thing I'm sensitive to is that my students themselves are experiencing the effects of the Coronavirus, and may want to escape it just for a bit in their school work. Just writing in the open about it here, in addition to the weekly meetings I've had with Aric, Josh and Gabriel, has been very useful. Speaking of those meetings: I learned today from Aric that tomorrow (February 20th, 2021) is the 30th anniversary of Python's first public release! You can see this reflected in this timeline. This v0.9.1 release was the first release Guido van Rossum made outside of CWI and was made on the Usenet newsgroup alt.sources, where it was split out into chunks that need to be reassembled. Back in 2009 Andrew Dalke located and repackaged these sources in Google Groups, which acquired alt.sources as part of DejaNews in 2001. But if you look at the time stamp on the first part of the release you can see that it was made February 19, 1991 (not February 20). So I'm not sure if the birthday is actually today. I sent this little note out to my students with this wonderful two-part oral history that the Computer History Museum did with Guido van Rossum a couple years ago. It turns out both of his parents were atheists and pacifists. His dad went to jail because he refused to be conscripted into the military. That and many more details of his background and thoughts about the evolution of Python can be found in these delightful interviews: Happy Birthday Python! GPT-3 Jam One of the joys of pandemic academic life has been a true feast of online events to attend, on a wide variety of topics, some of which are delightfully narrow and esoteric. Case in point was today's Reflecting on Power and AI: The Case of GPT-3, which lived up to its title. I'll try to keep an eye out for when the video posts, and update here. The workshop was largely organized around an exploration of whether GPT-3, the largest known machine learning language model, changes anything for media studies theory, or if it amounts to just more of the same. So the discussion wasn't focused so much on what games could be played with GPT-3, but rather on whether GPT-3 changes the rules of the game for media theory at all. I'm not sure there was a conclusive answer at the end, but it sounded like the consensus was that current theorization around media is adequate for understanding GPT-3, but it matters greatly what theory or theories are deployed.
The online discussion after the presentations indicated that attendees didn't see this as merely a theoretical issue, but one that has direct social and political impacts on our lives. James Steinhoff looked at GPT-3 from a Marxist media theory perspective, where he told the story of GPT-3 as a project of OpenAI and as a project of capital. OpenAI started with much fanfare in 2015 as a non-profit initiative where the technology, algorithms and models developed would be kept openly licensed and freely available so that the world could understand the benefits and risks of AI technology. Steinhoff described how in 2019 the project's needs for capital (compute power and staff) transitioned it from a non-profit into a capped-profit company, which is now owned, or at least controlled, by Microsoft. The code for generating the model, as well as the model itself, is gated behind a token-driven Web API run by Microsoft. You can get on a waiting list to use it, but apparently a lot of people have been waiting a while, so … Being a Microsoft employee probably helps. I grabbed a screenshot of the pricing page that Steinhoff shared during his presentation: I'd be interested to hear more about how these tokens operate. Are they per-request, or are they measured according to something else? I googled around a bit during the presentation to try to find some documentation for the Web API, and came up empty-handed. I did find Shreya Shankar's gpt3-sandbox project for interacting with the API in your browser (mostly for iteratively crafting text input in order to generate desired output). It depends on the openai Python package created by OpenAI themselves. The docs for openai then point at a page on the openai.com website which is behind a login. You can create an account, but you need to be pre-approved (made it through the waitlist) to be able to see the docs. There's probably some sense that can be made from examining the Python client though. All of the presentations in some form or another touched on the 175 billion parameters that were used to generate the model. But the API to the model doesn't have that many parameters: it allows you to enter text and get text back. Still, the API surface that the GPT-3 service provides could be interesting to examine a bit more closely, especially to track how it changes over time. In terms of how this model mediates knowledge and understanding it'll be important to watch. Steinhoff's message seemed to be that, despite the best of intentions, GPT-3 functions in the service of very large corporations with very particular interests. One dimension that he didn't explore, perhaps because of time, is how the GPT-3 model itself is fed massive amounts of content from the web, or the commons. Indeed 60% of the data came from the CommonCrawl project. GPT-3 is an example of an extraction project that has been underway at large Internet companies for some time. I think the critique of these corporations has often been confined to seeing them in terms of surveillance capitalism rather than in terms of raw resource extraction, or the primitive accumulation of capital. The behavioral indicators of who clicked on what are certainly valuable, but GPT-3 and sister projects like CommonCrawl show that just the accumulation of data with modest amounts of metadata can be extremely valuable.
This discussion really hit home for me since I've been working with Jess Ogden and Shawn Walker using CommonCrawl as a dataset for talking about the use of web archives, while also reflecting on the use of web archives as data. CommonCrawl provides a unique glimpse into some of the data operations that are at work in the accumulation of web archives. I worry that the window is closing and that CommonCrawl itself will be absorbed into Microsoft. Following Steinhoff, Olya Kudina and Bas de Boer jointly presented some compelling thoughts about how it's important to understand GPT-3 in terms of sociotechnical theory, using ideas drawn from Foucault and Arendt. I actually want to watch their presentation again because it followed a very specific path that I can't do justice to here. But their main argument seemed to be that GPT-3 is an expression of power, and that where there is power there is always resistance to power. GPT-3 can and will be subverted and used to achieve particular political ends of our own choosing. Because of my own dissertation research I'm partial to Foucault's idea of governmentality, especially as it relates to ideas of legibility (Scott, 1998)–the who, what and why of legibility projects, aka archives. GPT-3 presents some interesting challenges in terms of legibility because the model is so complex that the results it generates defy deductive logic and auditing. In some ways GPT-3 obscures more than it makes a population legible, as Foucault moved from disciplinary analysis of the subject to the ways in which populations are described and governed through the practices of pastoral power, of open datasets. Again the significance of CommonCrawl as an archival project, as a web legibility project, jumps to the fore. I'm not as up on Arendt as I should be, so one outcome of their presentation is that I'm going to read her The Human Condition, which they had in a slide. I'm long overdue. References Scott, J. C. (1998). Seeing like a state: How certain schemes to improve the human condition have failed. Yale University Press. mimetypes Today I learned that Python has a mimetypes module, and has ever since Guido van Rossum added it in 1997. Honestly I'm just a bit sheepish to admit this discovery, as someone who has been using Python for digital preservation work for about 15 years. But maybe there's a good reason for that. Since the entire version history for Python is available on GitHub (which is a beautiful thing in itself) you can see that the mimetypes module started as a guess_type() function built around a pretty simple hard-coded mapping of file extensions to mimetypes. The module also includes a little bit of code to look for, and parse, mimetype registries that might be available on the host operating system. The initial mimetype registries used included one from the venerable Apache httpd web server, and one from the Netscape web browser, which was about three years old at the time. It makes sense why this function to look up a mimetype for a filename would be useful at that time, since Python was being used to serve up files on the nascent web and for sending email, and whatnot. Today the module looks much the same, but has a few new functions and about twice as many mimetypes in its internal list. Some of the new mimetypes include text/csv, audio/mpeg, application/vnd.ms-powerpoint, application/x-shockwave-flash, application/xml, and application/json. Comparing the first commit to the latest provides a thumbnail sketch of 25 years of web format evolution.
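For a quick feel for the module discussed above, the standard library calls look like this (the exact return values can vary a little between Python versions):

import mimetypes

print(mimetypes.guess_type("report.csv"))             # e.g. ('text/csv', None)
print(mimetypes.guess_type("backup.tar.gz"))          # e.g. ('application/x-tar', 'gzip')
print(mimetypes.guess_extension("application/json"))  # e.g. '.json'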
I'll admit, this is a bit of an esoteric thing to be writing a blog post about. So I should explain. At work I've been helping out on a community archiving project which has accumulated a significant number of photographs, scans, documents of various kinds, audio files and videos. Some of these files are embedded in web applications like Omeka, some are in cloud storage like Google Drive, or on the office network-attached storage, and others are on scattered storage devices in people's desk drawers and closets. We've also created new files during community digitization events, and oral history interviews. As part of this work we've wanted to start building a place on the web where all these materials live. This has required not only describing the files, but also putting all the files in one place so that access can be provided. In principle this sounds simple. But it turns out that collecting the files from all these diverse locations poses significant challenges, because their context matters. The filenames, and the directories they are found in, are sometimes the only descriptive metadata that exists for this data. In short, the original order matters. But putting this content on the web means that the files need to be brought together and connected with their metadata programmatically. This is how I stumbled across the mimetypes module. I've been writing some throwaway code to collect the files together into the same directory structure while preserving their original filenames and locations in an Airtable database. I've been using the magic module to identify the format of each file, which is used to copy the file into a Dropbox storage location. The extension is important because we are expecting this to be a static site serving up the content, and we want the files to also be browsable using the Dropbox drive. It turns out mimetypes.guess_extension() is pretty useful for turning a mediatype into a file extension. I'm kind of surprised that it took me this long to discover mimetypes, but I'm glad I did. As an aside, I think this highlights for me how important Git can be as an archive and research method for software studies work. Northwest Branch Cairn Here is a short recording and a couple photos from my morning walk along the Northwest Branch trail with Penny. I can't go every day but at 7 months old she has tons of energy, so it's generally a good idea for all concerned to go at least every other morning. And it's a good thing, because the walk is surprisingly peaceful, and it's such a joy to see her run through the woods. After walking about 30 minutes there is this little cairn that is a reminder for me to turn around. After seeing it grow in size I was sad to see it knocked down one day. But, ever so slowly, it is getting built back up again. inkdroid-org-4669 ---- None inkdroid-org-616 ---- 856 Toggle Navigation inkdroid About Bookmarks Photos Music Software Social Talks 856 April 27, 2021 metadata Coincidence?
Unless otherwise noted all the content here is licensed CC-BY inkdroid-org-798 ---- None inkdroid-org-8502 ---- None inkdroid-org-8885 ---- None inkdroid-org-9563 ---- inkdroid Toggle Navigation inkdroid About Bookmarks Photos Music Software Social Talks 2021-04-27 ~ 856 2021-04-07 ~ twarc2 2021-03-27 ~ $ j 2021-03-19 ~ Strengths and Weaknesses 2021-03-16 ~ Data Speculation 2021-02-26 ~ Recovering Foucault 2021-02-19 ~ Teaching OOP in the Time of COVID 2021-02-18 ~ GPT-3 Jam 2021-02-13 ~ mimetypes 2021-02-12 ~ Northwest Branch Cairn 2021-02-11 ~ Blow back derelict wind 2021-02-04 ~ Outgoing 2021-01-21 ~ Trump's Tweets 2020-12-31 ~ noarchive 2020-12-23 ~ What's the diff? 2020-12-07 ~ 25 for 2020 2020-12-05 ~ Diss Music 2020-12-02 ~ 25 Years of robots.txt 2020-12-01 ~ Curation Communities 2020-11-30 ~ Mystery File! 2020-11-27 ~ Kettle 2020-11-24 ~ Static-Dynamic 2020-11-08 ~ Dark Reading 2020-10-28 ~ Seeing Software 2020-10-16 ~ Curating Corpora 2020-10-14 ~ Fuzzy 2020-10-11 ~ Penny 2020-10-09 ~ Fuzzy File Formats 2020-09-26 ~ Pandoc 2020-09-26 ~ Fuzzy Matching 2020-09-22 ~ Less is (sometimes) More 2020-09-20 ~ Teaching Digital Curation 2020-09-08 ~ RSS 2020-09-05 ~ Organizations on Twitter 2020-09-03 ~ BibDesk, Zotero and JabRef 2020-09-02 ~ Disinformation Metadata 2020-08-30 ~ Equipment 2020-08-27 ~ Twitter 2020-08-26 ~ Music for Hard Times 2020-08-23 ~ Digital Curation 2020-08-22 ~ Dependency Hell 2020-08-14 ~ Keyboard 2020-08-06 ~ Tech Tree 2020-07-02 ~ Appraisal Talk in Web Archives 2020-06-16 ~ Talk Talk 2020-06-07 ~ Original Voice 2020-06-02 ~ Write It Down 2020-05-15 ~ Sun and Moon 2020-05-07 ~ First Thought 2020-04-23 ~ Studying the COVID-19 Web « Prev 1 2 3 4 5 6 7 8 9 10 11 12 13 Next » Unless otherwise noted all the content here is licensed CC-BY inkdroid-org-9635 ---- None invidious-xyz-1352 ---- Collaborations Workshop 2021 - Keynotes Live Stream - Invidious true Invidious Log in Collaborations Workshop 2021 - Keynotes Live Stream Video unavailable. Watch on YouTube Show annotations Download is disabled. 190 0 0 Genre: Family friendly? No Wilson score: 0.0 Rating: 0.0 / 5 Engagement: 0.0% SoftwareSaved Subscribe | - Shared March 30, 2021 Hi! Looks like you have JavaScript turned off. Click here to view comments, keep in mind they may take a bit longer to load. 
invidious-xyz-3206 ---- Collaborations Workshop 2021 - Panel Live Stream - Invidious
Collaborations Workshop 2021 - Panel Live Stream, SoftwareSaved, shared March 31, 2021
Current version: 0.20.1-99ba987 @ master ipfs-io-5542 ---- IPFS Powers the Distributed Web IPFS About Install Docs Team Blog Help IPFS powers the Distributed Web A peer-to-peer hypermedia protocol designed to make the web faster, safer, and more open. Get started How it works View more Disable animation The web of tomorrow needs IPFS today IPFS aims to surpass HTTP in order to build a better web for all of us. Today's web is inefficient and expensive HTTP downloads files from one computer at a time instead of getting pieces from multiple computers simultaneously. Peer-to-peer IPFS saves big on bandwidth — up to 60% for video — making it possible to efficiently distribute high volumes of data without duplication. Today's web can't preserve humanity's history The average lifespan of a web page is 100 days before it's gone forever. It's not good enough for the primary medium of our era to be this fragile. IPFS keeps every version of your files and makes it simple to set up resilient networks for mirroring data. Today's web is centralized, limiting opportunity The Internet has turbocharged innovation by being one of the great equalizers in human history — but increasing consolidation of control threatens that progress. IPFS stays true to the original vision of an open, flat web by delivering technology to make that vision a reality. Today's web is addicted to the backbone IPFS powers the creation of diversely resilient networks that enable persistent availability — with or without Internet backbone connectivity. This means better connectivity for the developing world, during natural disasters, or just when you're on flaky coffee shop wi-fi. Install IPFS Join the future of the web right now — just choose the option that's right for you. Store and share files IPFS Desktop IPFS for everyone The desktop app offers menubar/tray shortcuts and an easy interface for adding, pinning, and sharing files — plus a full IPFS node ready for heavy-duty hosting and development too. A great choice for devs and non-devs alike. Get IPFS Desktop Command-line install All IPFS, no frills Just want IPFS in your terminal? Get step-by-step instructions for getting up and running on the command line using the Go implementation of IPFS. Includes directions for Windows, macOS, and Linux. Get the CLI IPFS Companion Add IPFS to your browser Get ipfs:// URL support and much more in your web browser with this extension. Get Companion IPFS Cluster For servers or big data Automatically allocate, replicate, and track your data as pinsets across multiple IPFS nodes. Get Cluster Build with IPFS Go implementation The original IPFS, with core implementation, daemon server, CLI tooling, and more. Get go-ipfs JS implementation Written entirely in JavaScript for a world of possibilities in browser implementations. Get js-ipfs Here's how IPFS works Take a look at what happens when you add a file to IPFS. Your file, and all of the blocks within it, is given a unique fingerprint called a cryptographic hash. IPFS removes duplications across the network. Each network node stores only content it is interested in, plus some indexing information that helps figure out which node is storing what. When you look up a file to view or download, you're asking the network to find the nodes that are storing the content behind that file's hash. You don't need to remember the hash, though — every file can be found by human-readable names using a decentralized naming system called IPNS. Take a closer look Want to dig in? 
Check out the docs Hands-on learner? Explore ProtoSchool Curious where it all began? Read the whitepaper IPFS can help here and now No matter what you do with the web, IPFS helps make it better today. Archivists IPFS provides deduplication, high performance, and clustered persistence — empowering you to store the world's information for future generations. Service providers Providing large amounts of data to users? IPFS offers secure, peer-to-peer content delivery — an approach that could save you millions in bandwidth costs. Researchers If you're working with or distributing large data sets, IPFS can help provide fast performance and decentralized archiving. Developing world High-latency networks are a big barrier for those with poor internet infrastructure. IPFS provides resilient access to data independent of latency or backbone connectivity. Blockchains With IPFS, you can address large amounts of data and put immutable, permanent links in transactions — timestamping and securing content without having to put the data itself on-chain. Content creators IPFS brings the freedom and independent spirit of the web in full force — and can help you deliver your content at a much lower cost. Who's already using IPFS? Companies and organizations worldwide are already building amazing things on IPFS. See the list News and more IPFS blog 07 April 2021 Welcome to IPFS Weekly 130 07 April 2021 Meet the New IPFS Blog & News 05 April 2021 Storing NFTs on IPFS 31 March 2021 Welcome to IPFS Weekly 129 In the media TechCrunch Why The Internet Needs IPFS Before It's Too Late Motherboard IPFS Wants to Create a Permanent Web MakeUseOf Faster, Safer, Decentralized Internet With IPFS Videos Why IPFS? Developers Speak: Building on IPFS More videos Stay on top of the latest Sign up for the IPFS Weekly newsletter to get project updates, community news, event details, and more. In your inbox, each Tuesday. © Protocol Labs | Except as noted, content licensed CC-BY 3.0.

islandora-ca-1879 ---- Islandora Open Meeting: April 27, 2021 | Islandora
Islandora Open Meeting: April 27, 2021
We are happy to announce the date of our next Open Meeting! Join us on April 27, 2021 any time between 10:00-2:00pm EDT. The Open Meetings are drop-in style sessions where users of all levels and abilities gather to ask questions, share use cases and get updates on Islandora. There will be experienced Islandora 8 users on hand to answer questions or give demos. We would love for you to join us any time during the 4-hour window, so feel free to pop by any time! More details about the Open Meeting, and the Zoom link to join, are in this Google doc. Registration is not required. If you would like a calendar invite as a reminder, please let us know at community@islandora.ca. Submitted by agriffith on Tue, 04/13/2021 - 16:11
islandora-ca-4289 ---- Islandorans Unite! It's Release Time | Islandora
Islandorans Unite! It's Release Time
It's that time again everyone! Our amazing community contributors have made all sorts of improvements and upgrades to Islandora. Some have been merged, but some are still hanging out, waiting for the love they need to make it into the code base. We're calling on you - yes you! - to help us get things merged, tested, documented, and released to the world. I would like to kick off this release cycle with a sprint to mop up some of the amazing improvements that still have unmerged pull requests. Did you know that we have pull requests for an advanced search module and a basic batch ingest form just lounging around? And that's not all. There are all kinds of great improvements that just need some time and attention. A little code review and some basic testing by others are all that is needed before we freeze the code and start turning the crank on the release process. Here's a rough timetable for the release:
April 19 - 30th: Code Sprint
May 3rd: Code Freeze
May 3rd - 14th: Testing, bug fixing, responding to feedback
May 17th - 28th: Documentation sprint
May 31st - June 18th: More testing, bug fixing, and responding to feedback
June 21st - July 2nd: Testing sprint
Release!
This is, of course, an optimistic plan. If major issues are discovered we will take the time to address them, which can affect the timeline. I also plan on liaising with the Documentation Interest Group and folks from the Users' Call / Open Meetings for the documentation and testing sprints, and their availabilities may nudge things a week in either direction. An open and transparent release process is one of the hallmarks of our amazing community. If you or your organization have any interest in helping out, please feel free to reach out or sign up for any of the upcoming sprints. There are plenty of opportunities to contribute regardless of your skill set or level of experience with Islandora. There's something for everyone! We'll make further announcements for the other sprints, but you can sign up for the code sprint now using our sign up sheet. Hope to see you there! Submitted by dlamb on Mon, 03/29/2021 - 19:07

islandora-ca-5670 ---- Islandora
What's New, manez, Tue, 05/12/2020 - 14:05
Our website has been overhauled in a big way. We have moved to Drupal 8, changed our look, and shifted content around to make it easier to find the Islandora information and resources that you need. Can't find something you expect from the old site? Let us know and we'll get it fixed.
Islandora Open Meeting: April 27, 2021, agriffith, Tue, 04/13/2021 - 16:11
Upcoming DIG Sprint, agriffith, Thu, 04/08/2021 - 20:03
Community Announcement, agriffith, Wed, 03/31/2021 - 16:49
As you know, the Islandora Foundation has recently updated its governance structure to remain compliant with Canadian non-profit regulations. Islandora Foundation members approved these changes at the Annual General Meeting in early March. A summary of these changes is provided here, as well as our emerging roadmap for moving forward. A newly formed "Leadership Group", composed of representatives from our Partner-level member organizations, replaces the pre-existing Board of Directors, and a smaller Board of Directors remains responsible for Islandora's administrative and fiscal responsibilities. This Leadership Group met for the first time on Friday, March 26th to begin to discuss their goals going forward, and the ways the Leadership Group will interact with the other governance structures of the Islandora community. The Leadership Group immediately affirmed their commitment to transparent communication and collaboration with the vibrant, robust Islandora community and will be creating a Terms of Reference over the next month. The Terms of Reference will be written with agility and transformation in mind, as we work together to secure a strong future for both the community and codebase. In the meantime, please let us know if you have any questions regarding the formation of the Leadership Group, and stay tuned to hear more about the initial goals of this group.
Islandorans Unite! It's Release Time, dlamb, Mon, 03/29/2021 - 19:07
Islandora Open Meeting: March 30, 2021, agriffith, Wed, 03/24/2021 - 16:26
We will be holding our next Open Meeting on Tuesday, March 30 from 10:00 AM to 2:00 PM Eastern. Full details, and the Zoom link to join, are in this Google doc. The meeting is drop-in and will be free form, with experienced Islandora 8 users on hand to answer questions or give demos on request. We would love for you to join us any time during the 4-hour window, so feel free to pop by any time. Registration is not required. If you would like a calendar invite as a reminder, please let us know at community@islandora.ca.
ISLE: Now with Islandora 8, dlamb, Tue, 03/23/2021 - 20:12
The Islandora Foundation is pleased to announce that ISLE for Islandora 8 has gone alpha and is now available! What is ISLE? ISLE (short for ISLandora Enterprise) is "Dockerized" Islandora, and seeks to create community managed infrastructure, streamlining the installation and maintenance of an Islandora repository. With ISLE, the bulk of your repository's infrastructure is managed for you, and updates are as easy as pulling in new Docker images. System administrators are only responsible for maintaining and updating their Drupal site, and can rely on ISLE to handle Fedora, Solr, the triplestore, and all the other services we use to run a digital repository. The project began as a Mellon grant funded initiative by the Islandora Collaboration Group back in 2017 for Islandora 7.
Then in January 2020, the ICG, Born Digital, Lyrasis, CWRC, and the Islandora Foundation got together and started working on a version for Islandora 8.  This version would be a full community project, worked on in the open and residing in the Islandora-Devops Github organization. What are the benefits of using ISLE? On top of being easier to install, run, and update, there are many awesome reasons to use ISLE for running Islandora.  First and foremost: speed. Simply put, ISLE is fast! Installation time is simply the amount of time it takes to download the images from Dockerhub.  For those who are building the images themselves, ISLE takes advantage of Docker's buildkit feature for blazing fast builds.  A complete rebuild of the entire stack consistently takes less than ten minutes on my laptop.  And for small tweaks to the environment, builds often take seconds to make a change. Compared to our Ansible playbook, which usually takes around 45 minutes for me, this is a significant boost to productivity when testing/deploying changes! Because it's so quick, it lends itself well to automation using CI/CD tools like Github Actions and Gitlab. The Islandora Foundation is "dogfooding" with ISLE, putting it at the center of its deployment strategy for future.islandora.ca and release testing. ISLE is also cross-platform. It is the first and only community supported way to run Islandora on a Windows machine. Any Windows computer with WSL2 can build and run ISLE.  ISLE also supports ARM builds, and can be run on cheaper cloud resources, newer Macs with M1 chips, and even (theoretically) Raspberry Pis. How can I get ISLE? Docker images for Islandora 8 are automatically pushed to Dockerhub and are available here. If you want to run them using docker-compose, you can use isle-dc to build yourself a sandbox or a local development environment.  Upcoming Sprint: Metadata Upcoming Sprint: Metadata dlamb Wed, 02/24/2021 - 16:12 Body Our very own Metadata Interest Group is running a sprint from March 8th to the 19th, and everyone's invited to participate.  We'll be auditing the default metadata fields that we ship with and comparing them to the excellent metadata profile the MIG has worked so hard to create for us. The goal of the sprint is just to find out where the gaps are so we know the full scope of work needed to implement their recommendations.  If you can navigate the Drupal fields UI (or just want to learn!), contributing is easy and would be super helpful to us. NO PROGRAMMING REQUIRED. And if you don't have an Islandora 8 instance to work on (or are having a hard time installing one), we're making a fresh sandbox just for the sprint. Also, Islandora Foundation staff (a.k.a. me) and representatives from the MIG will be on hand to help out and answer any questions you may have. You can sign up for the sprint here, and choose a metadata field to audit in this spreadsheet.  As always, commit to as much or as little as you like.  It only takes a couple minutes to check out a field and its settings to see if they line up with the recommendations. If we get enough folks to sign up, then many hands will make light work of this task! This is yet another sign of the strength of our awesome community.  An interest group is taking it upon themselves to run a sprint to help achieve their goals, and the Islandora Foundation couldn't be happier to help. 
If you're a member of an interest group and want help engaging the community to make your goals happen, please feel free to reach out on Slack or email me (dlamb@islandora.ca).
Islandora Open Meeting: February 23, 2021, manez, Wed, 02/03/2021 - 19:09
We will be holding another open drop-in session on Tuesday, February 23 from 10:00 AM to 2:00 PM Eastern. Full details, and the Zoom link to join, are in this Google doc. The meeting is free form, with experienced Islandora 8 users on hand to answer questions or give demos on request. Please drop in at any time during the four-hour window. Registration is not required. If you would like a calendar invite as a reminder, please let us know at community@islandora.ca.
Islandora Open Meeting: January 28, 2021, manez, Thu, 01/14/2021 - 15:55
We will be holding another open drop-in session on January 28th from 10:00 AM to 2:00 PM Eastern. Full details, and the Zoom link to join, are in this Google doc. The meeting is free form, with experienced Islandora 8 users on hand to answer questions or give demos on request. Please drop in at any time during the four-hour window. Registration is not required. If you would like a calendar invite as a reminder, please let us know at community@islandora.ca.

islandora-ca-7106 ---- Upcoming DIG Sprint | Islandora
Upcoming DIG Sprint
The Islandora Documentation Interest Group is holding a sprint! To support the upcoming release of Islandora, the DIG has planned a 2-week documentation, writing-and-updating sprint to occur as part of the release process. To prepare for that effort, we're going to spend April 19 – 30th on an Auditing Sprint, where volunteers will review existing documentation and complete this spreadsheet, providing a solid overview of the current status of our docs so we know where to best deploy our efforts during the release. This sprint will run alongside the upcoming Pre-Release Code Sprint, so if you're not up for coding, auditing docs is a great way to contribute during sprint season! We are looking for volunteers to sign up to take on two sprint roles:
Auditor: Review a page of documentation and fill out a row in the spreadsheet indicating things like the current status ('Good Enough' or 'Needs Work'), the goal for that particular page (e.g., "Explain how to create an object," or "Compare Islandora 7 concepts to Islandora 8 concepts"), and the intended audience (beginners, developers, etc.).
Reviewer: Read through a page that has been audited and indicate if you agree with the auditor's assessment, adding additional notes or suggestions as needed; basically, give a second set of eyes on each page.
You can sign up for the sprint here, and sign up for individual pages here. Submitted by agriffith on Thu, 04/08/2021 - 20:03

isni-org-9658 ---- ISNI | Home Page
ABOUT ISNI
ISNI is the ISO certified global standard number for identifying the millions of contributors to creative works and those active in their distribution, including researchers, inventors, writers, artists, visual creators, performers, producers, publishers, aggregators, and more. As ISO 27729, it is part of a family of international standard identifiers that includes identifiers of works, recordings, products and right holders in all repertoires, e.g. DOI, ISAN, ISBN, ISRC, ISSN, and ISWC. The mission of the ISNI International Agency (ISNI-IA) is to assign to the public name(s) of a researcher, inventor, writer, artist, performer, publisher, etc. a persistent unique identifying number in order to resolve the problem of name ambiguity in search and discovery, and to diffuse each assigned ISNI across all repertoires in the global supply chain so that every published work can be unambiguously attributed to its creator wherever that work is described. By achieving these goals, the ISNI will act as a bridge identifier across multiple domains and become a critical component in Linked Data and Semantic Web applications.
KEY STATISTICS
12.22 million: ISNI holds public records of more than 12 million identities
11.10 million: ISNI holds public records of over 11.10 million individuals (of which 2.93 million are researchers)
1.11 million: ISNI holds public records of 1,119,480 organizations
104 sources: The ISNI database is a cross-domain resource with direct contributions from 104 sources
NEWS
The British Library launches its ISNI Portal: A Brand New, Online Service for ISNI Users. We are delighted and privileged to announce that the British Library has now launched its online, all-in-one service for the International Standard Name...
BDS Builds a New Website for ISNI. BDSDigital, the web services and IT arm of BDS, has built a new ISNI website, which went live in June 2020. BDS also transferred existing content from the...
Music Industry ISNI Registrations Now Free and Automated. Sound Credit music credit cloud profile system offers world's only free and automated ISNI registration service. MEMPHIS, TENN., OCTOBER 23, 2020 – Every creative work...
ISNI International Agency (ISNI-IA) LIMITED. Registered address: c/o EDItEUR, United House, North Road, London, N7 9DP, UK. Company registration number: 07476425

iwatchafrica-org-721 ---- iWatch Home - iWatch Africa
Latest Articles: Transforming climate finance for debt-distressed economies during COVID-19; EC proposed Carbon Border Adjustment mechanism: Key considerations for Least Developed Countries; iWatch Africa marks 2021 Open Data Day with focus on women safety online; How Big Tech's Content Moderation Policies Could Jeopardize Users in Authoritarian Regimes; iWatch Africa launches its 2021 Policy Dialogue Series; Where women journalists in Ghana go to 'die'; Predictions for 2021: Digital Rights, Global Security, Climate Change & Expectations of the Biden Administration – Part 1; Stolen at sea: An investigation into illegal Chinese transhipment activities in Ghana and Nigeria; On the other side of Saiko; Value your personal and public integrity – Co-founder, iWatch Africa
Transforming climate finance for debt-distressed economies during COVID-19: One year after the World Health Organisation declared the COVID-19 disease as a global pandemic, many…
EC proposed Carbon Border Adjustment mechanism: Key considerations for Least Developed Countries: Although most nations recognise the need to transition to a decarbonised world, carbon tax policies…
iWatch Africa marks 2021 Open Data Day with focus on women safety online: iWatch Africa marked the 2021 Open Data Day last Saturday with a virtual event focused…
Ocean & Climate Action: Transforming climate finance for debt-distressed economies during COVID-19. One year after the World Health Organisation declared the COVID-19 disease as a global pandemic, many emerging markets and developing economies…
Ocean & Climate Action: EC proposed Carbon Border Adjustment mechanism: Key considerations for Least Developed Countries. Although most nations recognise the need to transition to a decarbonised world, carbon tax policies have usually encountered significant roadblocks,…
Digital Rights: iWatch Africa marks 2021 Open Data Day with focus on women safety online. iWatch Africa marked the 2021 Open Data Day last Saturday with a virtual event focused on leveraging data to promote…
Digital Rights: How Big Tech's Content Moderation Policies Could Jeopardize Users in Authoritarian Regimes. Social media advocates have historically lauded its ability to facilitate democratic progress by connecting people over space and time, enabling…
News: iWatch Africa launches its 2021 Policy Dialogue Series. iWatch Africa has launched its 2021 Policy Dialogue Series which seeks to bring diverse experts and stakeholders across the world…
Most Read: Transforming climate finance for debt-distressed economies during COVID-19 17 seconds ago EC proposed Carbon Border Adjustment mechanism: Key considerations for Least Developed Countries 2 weeks ago iWatch Africa marks 2021 Open Data Day with focus on women safety online March 11, 2021 Watch Video: iWatch Africa Open Data Day Event 2021 March 6, 2021 Open Data Day 2021: iWatch Africa to focus on safety of women journalists & equal development online
March 1, 2021
iWatch Video Playlist (20 videos): 1 Inside iWatch Africa's Digital Rights Campaign 01:11 2 iWatch Digital Rights Campaign. What is doxxing and the effects of doxxing? 00:40 3 Gideon Sarpong, Policy and News Director at iWatch Africa interviewed on Plus TV Africa, Nigeria 06:07 4 iWatch Africa campaign against domestic violence. 00:29 5 iWatch Africa investigation into use of corporal punishment in Ghana 00:50 6 iWatch Africa campaign against online trolling and impersonation (cyber-stalking) 00:46 7 iWatch Africa's video highlighting the negative impact of abuse on journalists 00:40 8 iWatch Africa campaign against cyberstalking and its impact on journalists and rights activists 00:36 9 Together Against Corruption iWatch Africa, Socioserve & JMK 01:52 10 Budget Tracking: Key educational commitments to be tracked in 2018 01:22 11 Budget Tracking: iWatch Africa Budget Tracking 2018 Health 01:14 12 iWatch Africa: Ministry of Finance released close to GH¢ 30 million to the Electoral Commission 02:59 13 iWatch Africa assessment of GoG commitment in education 2017 03:21 14 iWatch Africa third-quarter assessment of GoG commitments - Health Sector, 2017 02:40 15 iWatch Africa assessment of Planting for Food and Jobs Program 2017 04:02 16 A new age for Data Journalism | Nana Boakye-Yiadom 10:06 17 Government promise to distribute school uniforms and sandals yet to take off 02:04 18 iWatch Progress Report: One District, One Factory Initiative (1D1F) 02:12 19 iWatch Africa: Over GH¢600 million in off shore accounts at risk of abuse & recovery 03:35 20 iWatch Review: How Assemblies in Ghana mismanaged their Common Fund-Ranking 04:00
Popular Posts: Watch Africa Missing Gold: How Ghana lost over $6 billion in gold export revenue to major trading partners May 29, 2018 iWatch Africa joins the World Economic Forum 1 Trillion Trees Initiative as part of our Climate Action March 9, 2020 Full List: Volta Region ranked 1st for mismanagement of Assemblies' Common Fund August 23, 2017 Third Quarter Assessment of the 'One Village One Dam' Promise October 16, 2017 Parents with wards in Class A schools must be allowed to pay fees - Ass. Headmaster Mfantsipim August 24, 2017
Most Commented: 17 seconds ago Transforming climate finance for debt-distressed economies during COVID-19 August 10, 2017 Everybody is affected by climate change [Infographic] August 11, 2017 Reasons Journalists should use data to improve their stories August 11, 2017 Ghana's shameful record in child marriages [Infographics] August 12, 2017 Meet Maukeni Padiki Kodjo, an iWatch Africa Transparency Launch facilitator August 12, 2017 Meet Sandister Tei, an iWatch Africa Transparency Project facilitator

jakoblog-de-6672 ---- Jakoblog — Das Weblog von Jakob Voß
The first explicit design of a digital library (1959), 18 March 2018 at 23:38, 3 comments
I have been researching digital libraries (once again) and wondered when the term was first used.
According to Google Books, "digital library" first turns up in 1959 (after weeding out false positives) in a report for the US State Department. I have entered the bibliographic data in Wikidata. The report, "The Need for Fundamental Research in Seismology", was produced to investigate how seismic waves could be used to detect nuclear weapons tests. In Appendix 19, John Gerrard, one of fourteen scientists involved in the study, laid out on two pages the case for a computing center with an IBM 704. Since this US government document is in the public domain, here are the relevant pages:

The planned digital library is a collection of research data together with scientific software for gaining new insights from that data:

The following facilities should be available:
A computer equivalent to the IBM 704 series, plus necessary peripheral equipment.
Facilities for converting standard seismograms into digital form.
A library of records of earthquakes and explosions in form suitable for machine analysis.
A (growing) library of basic programs which have proven useful in investigations of seismic disturbances and related phenomena. …

Sounds rather topical, doesn't it? I also liked the description of the computing center as an "open shop" and the remark that "nothing can dampen enthusiasm for new ideas quite as effectively as long periods of waiting time". In the text the term "digital library" refers primarily to the collection of digitized seismograms. At the end of the recommendation the term "digitized library" is used instead, which suggests that the two terms were used synonymously. Interestingly, "library" also refers to the collection of computer programs. Unfortunately I could not find out whether the recommended computing center with its digital library was ever built (probably not).

About the author: I know little more about Dr. John Gerrard than that in 1957 he worked as Director of Data Systems and Earth Science Research at Texas Instruments (TI). TI was founded in 1930 as "Geophysical Service Incorporated" for the seismic exploration of oil deposits, and in 1965 it received the government contract to monitor nuclear weapons tests (Project Vela Uniform). A former colleague remembers Gerrard in this interview:

John Gerrard: into digital seismology, and he could see a little bit of the future of digital processing and he talked about how that could be effective in seismology, he was right that this would be important in seismology

There is a geologist of the same name in Birmingham, but he was not born until 1944. I suspect that Gerrard was involved at TI in the development of the Texas Instruments Automatic Computer (TIAC), which was built specifically for the digital processing of seismic data.

Incidentally, the use of computers in traditional libraries only came with the next generation of machines: the MARC format was developed in the 1960s on the IBM System/360 (by Henriette Avram, who had previously worked with the IBM 701 at the NSA). Before that there was the fictional library computer EMMARAC (a nod to ENIAC and UNIVAC) in "Desk Set" ("Eine Frau, die alles weiß"), with Katharine Hepburn as the librarian and Spencer Tracy as the computer salesman. Up to the end of the 1980s, by the way, the term "digital library" appears only sporadically in Google Books.

Tags: digital library, history. 3 comments.

Data models age like parents, 15
March 2018 at 21:51, no comments

Denny Vrandečić, employed as an ontologist at Google, noticed that all six of the six linked data applications linked to eight years ago (IWB, Tabulator, Disko, Marbles, rdfbrowser2, and Zitgist) have disappeared or changed their calling syntax. This reminded me of a proverb about software and data: software ages like fish, data ages like wine. The original form of this saying seems to come from James Governor (@monkchips), who in 2007 derived it from an earlier phrase: Hardware is like fish, operating systems are like wine. The analogy of fishy applications and delightful data has been repeated and explained and criticized several times. I fully agree with the part about software rot, but I doubt that data actually ages like wine (I'd prefer whisky anyway). A more accurate simile may be "data ages like things you put into your crowded cellar and then forget about". Thinking a lot about data I found that data is less interesting than the structures and rules that shape and restrict it: data models, ontologies, schemas, forms etc. How do they age compared with software and data? I soon realized: data models age like parents. First they guide you, give good advice, and support you as best as they can. But at some point data begin to rebel against their models. Sooner or later parents become uncool, disconnected from current trends, outdated or even embarrassing. Eventually you have to accept their quaint peculiarities and live your own life. That's how standards proliferate. Both ontologies and parents ultimately become weaker and need support. And in the end you have to let them go, sadly looking back. (The analogy could be extended further, for instance data models might be frustrated when confronted by how actual data compares to their ideals, but that's another story.)

Tags: Data Modeling. No comments.

In memoriam Ingetraut Dahlberg, 28 October 2017 at 09:24, 3 comments

The information scientist Ingetraut Dahlberg, known among other things as the founder of the International Society for Knowledge Organization (ISKO), died last week at the age of 91. My first reaction, after an appropriate moment of regret, was to enter the date of death in Wikipedia and Wikidata, but others had already taken care of that. So I browsed her biography a bit and instead created Wikidata items for the McLuhan Institute for Digital Culture and Knowledge Organization, to which Dahlberg had already bequeathed her library during her lifetime, but which was closed back in 2004. The former director Kim Veltman still runs a website about the institute and in his memoirs mentions Ingetraut Dahlberg, Douglas Engelbart, Ted Nelson and Tim Berners-Lee in the same breath. That alone should be reason enough for me to engage with her. To be honest, though, my relationship to Ingetraut Dahlberg was rather one of distanced ignorance. I knew of her importance in the knowledge organization scene, to which I inevitably belong as well, but I only met her once or twice at ISKO conferences and never had any interest in engaging with her more closely. To me as a "young wild one" she always seemed like a person whose time had long passed and whose contributions were hopelessly outdated.
That old ideas are by no means uninteresting or irrelevant within knowledge organization should really have been clear to me from my engagement with Ted Nelson and Paul Otlet; somehow, though, I have never found a point of connection to Dahlberg's work. Looking back, the trigger for my ignorance must lie in my first encounter with representatives of knowledge organization at an ISKO conference in the early 2000s: I was still a fresh student of library and information science with a computer science background, and everywhere I found exciting topics such as Wikipedia, social tagging and ontologies, all of which in principle had something to do with knowledge organization. At ISKO, by contrast, I found none of that. The Internet, at any rate, still seemed very far away. What I found alarming was not so much the lack of substantive engagement with what were then the newest developments on the net, but the formal strangeness: as I recall, several of the scientists involved did not even have an email address. People who dealt with information and knowledge in the early 2000s without email were people I simply could not take seriously. So in my ignorance ISKO long remained a relic that, much like the International Federation for Information and Documentation (FID, why did the two never join forces, anyway?), had been tragically overtaken by technical development. And for me Ingetraut Dahlberg stood as the very example of this failure of a profession. By now I see things in a more nuanced way and am glad to be part of this small but fine community (and once ISKO finally switches to open access I will also give up my publication boycott). In any case I did Ingetraut Dahlberg an injustice, and I hope for more nuanced engagements with her work.

Tags: obituary. 3 comments.

Wikidata documentation on the 2017 Hackathon in Vienna, 21 May 2017 at 15:21, 2 comments

At Wikimedia Hackathon 2017, a couple of volunteers sat together to work on the help pages of Wikidata. As part of that Wikidata documentation sprint, Ziko and I took a look at the Wikidata glossary. We identified several shortcomings and made a list of rules for how the glossary should look. The result is the glossary guidelines. Where the old glossary partly replicated Wikidata:Introduction, the new version aims to allow quick lookup of concepts. We have already rewritten some entries of the glossary according to these guidelines, but several entries are outdated and still need to be improved. We changed the structure of the glossary into a sortable table so it can be displayed as an alphabetical list in all languages. The entries can still be translated with the translation system (it took some time to get familiar with this feature). We also created some missing help pages such as Help:Wikimedia and Help:Wikibase to explain general concepts with regard to Wikidata. Some of these concepts are already explained elsewhere, but Wikidata needs at least short introductions written especially for Wikidata users. Image taken by Andrew Lih (CC-BY-SA)

Tags: Wikidata, wmhack. 2 comments.

Introduction to Phabricator at Wikimedia Hackathon, 20 May 2017 at 09:44, 1 comment

This weekend I am participating in the Wikimedia Hackathon in Vienna. I am mostly contributing to Wikidata-related events and practicing the phrase "long time no see", but I am also looking into some introductory talks.
In the late afternoon of day one I attended an introduction to the Phabricator project management tool given by André Klapper. Phabricator was introduced at the Wikimedia Foundation about three years ago to replace and unify Bugzilla and several other management tools. Phabricator is much more than an issue tracker for software projects (although it is mainly used for this purpose by Wikimedia developers). In summary there are tasks, projects, and teams. Tasks can be tagged, assigned, followed, discussed, and organized with milestones and workboards. The latter are Kanban boards like those I know from Trello, waffle, and GitHub project boards. Phabricator is open source, so you can self-host it and add your own user management without having to pay for each new user and feature (I am looking at you, JIRA). Internally I would like to use Phabricator, but for fully open projects I don't see enough benefit compared to using GitHub. P.S.: Wikimedia Hackathon is also organized with Phabricator. There is also a task for blogging about the event.

Tags: Wikimedia, wmhack. 1 comment.

Some thoughts on IIIF and Metadata, 5 May 2017 at 22:40, no comments

Yesterday at the DINI AG KIM Workshop 2017, Martin Baumgartner and Stefanie Rühle gave an introduction to the International Image Interoperability Framework (IIIF) with a focus on metadata. I already knew that IIIF is a great technology for providing access to (especially large) images, but I had not had a detailed look yet. The main part of IIIF is its Image API and I hope that all major media repositories (I am looking at you, Wikimedia Commons) will implement it. In addition the IIIF community has defined a "Presentation API", a "Search API", and an "Authentication API". I understand the need for such additional APIs within the IIIF community, but I doubt that solving the underlying problems with their own standards (instead of reusing existing standards) is the right way to go. Standards should rather "Do One Thing and Do It Well" (Unix philosophy). If images are the "One Thing" of IIIF, then search and authentication are a different matter. In the workshop we only looked at parts of the Presentation API to see where metadata (creator, dates, places, provenance etc. and structural metadata such as lists and hierarchies) could be integrated into IIIF. Such metadata is already expressed in many other formats such as METS/MODS and TEI, so the question is not whether to use IIIF or other metadata standards but how to connect IIIF with existing metadata standards. A quick look at the Presentation API revealed, to my surprise, that the metadata element is explicitly not intended for additional metadata but only "to be displayed to the user". The element contains an ordered list of key-value pairs that "might be used to convey the author of the work, information about its creation, a brief physical description, or ownership information, amongst other use cases". At the same time the standard emphasizes that "there are no semantics conveyed by this information". Hello, McFly? Without semantics conveyed it isn't information! In particular there is no such thing as structured data (e.g. a list of key-value pairs) without semantics. I think the design of field metadata in IIIF is based on a common misconception about the nature of (meta)data, which I already wrote about elsewhere (sorry, German article – some background in my PhD and found by Ballsun-Stanton). In a short discussion on Twitter, Rob Sanderson (Getty) pointed out that the data format of the IIIF Presentation API for describing intellectual works (called a manifest) is expressed in JSON-LD, so it can be extended by other RDF statements. For instance the field "license" is already defined with dcterms:rights. Adding a field "author" for dcterms:creator only requires defining this field in the JSON-LD @context of a manifest. After some experimenting I found a possible way to connect the "meaningless" metadata field with JSON-LD fields:

{
  "@context": [
    "http://iiif.io/api/presentation/2/context.json",
    {
      "author": "http://purl.org/dc/terms/creator",
      "bibo": "http://purl.org/ontology/bibo/"
    }
  ],
  "@id": "http://example.org/iiif/book1/manifest",
  "@type": ["sc:Manifest", "bibo:book"],
  "metadata": [
    {
      "label": "Author",
      "property": "http://purl.org/dc/terms/creator",
      "value": "Allen Smithee"
    },
    {
      "label": "License",
      "property": "http://purl.org/dc/terms/license",
      "value": "CC-BY 4.0"
    }
  ],
  "license": "http://creativecommons.org/licenses/by/4.0/",
  "author": {
    "@id": "http://www.wikidata.org/entity/Q734916",
    "label": "Allen Smithee"
  }
}

This solution requires an additional element property in the IIIF specification to connect a metadata field with its meaning. IIIF applications could then enrich the display of metadata fields, for instance with links or additional translations. In JSON-LD some names such as "CC-BY 4.0" and "Allen Smithee" need to be given twice, but this is ok because normal names (in contrast to field names such as "Author" and "License") don't have semantics.

Tags: iiif, Metadata. No comments.
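A minimal sketch of how a client could consume the proposed property key, assuming a manifest like the one above has been saved to a local file; the helper name and the file name are made up for illustration and are not part of any IIIF library.

    import json

    def semantic_metadata(manifest: dict) -> dict:
        """Collect the display-oriented metadata pairs, keyed by the
        proposed 'property' URI where present, otherwise by the label."""
        pairs = {}
        for entry in manifest.get("metadata", []):
            key = entry.get("property", entry["label"])
            pairs[key] = entry["value"]
        return pairs

    with open("manifest.json") as f:   # assumed local copy of the manifest shown above
        manifest = json.load(f)

    print(semantic_metadata(manifest))
    # {'http://purl.org/dc/terms/creator': 'Allen Smithee',
    #  'http://purl.org/dc/terms/license': 'CC-BY 4.0'}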
Spare parts from the 3D printer, 30 December 2014 at 10:43, 2 comments

Crash, bang, boom! The window blind is down. A small plastic part broke off, and that would be a perfect use case for a 3D printer, wouldn't it? For quite a while I have been toying with the idea of getting a 3D printer, but I can't really say what for. Producing spare parts with a 3D printer strikes me as a promise rather like the intelligent fridge: great in theory, but not really practical. It would probably take me hours to find the right part on platforms like Thingiverse or to construct it myself in CAD. Without reliable 3D models even the best 3D printer is of no use, so the devices are only one part of the solution for producing spare parts. I very much doubt that manufacturers will offer 3D models of their products for download any time soon, unless they are open hardware. Apart from electronics tinkering projects, though, the range of open hardware products for household use is still very limited. Nevertheless I think that open hardware, that is, products whose plans are freely licensed and available at no cost, together with standardized components, is the only right approach for using 3D printers in the home. For now I will tackle the problem with the broken blind using analog technology and see what suitable materials and tools I have lying around. Maybe gaffer tape will help?

Tags: 3D printer, maker, open hardware. 2 comments.

The simplest project homepage on GitHub, 24 September 2014 at 09:57, 1 comment

The simplest form of a project homepage on GitHub Pages consists of a start page that merely points to the repository. Locally such a page can be set up as follows: 1.
Create the new, empty branch gh-pages:
git checkout --orphan gh-pages
git rm -rf .
2. Create the file index.md with the following content:
---
---
# {{site.github.project_title}}
[{{site.github.repository_url}}]({{site.github.repository_url}}#readme).
3. Add the file and push it to GitHub:
git add index.md
git commit -m "homepage"
git push origin gh-pages

Tags: github. 1 comment.

Abbreviated URIs with rdfns, 9 September 2014 at 11:26, 4 comments

Working with RDF and URIs can be annoying because URIs such as "http://purl.org/dc/elements/1.1/title" are long and difficult to remember and type. Most RDF serializations make use of namespace prefixes to abbreviate URIs, for instance "dc" is frequently used to abbreviate "http://purl.org/dc/elements/1.1/", so "http://purl.org/dc/elements/1.1/title" can be written as the qualified name "dc:title". This simplifies working with URIs, but someone still has to remember the mappings between prefixes and namespaces. Luckily there is a registry of common mappings at prefix.cc. A few years ago I created the simple command line tool rdfns and a Perl library to look up URI namespace/prefix mappings. Meanwhile the program is also available as the Debian and Ubuntu package librdf-ns-perl. The newest version (not included in Debian yet) also supports reverse lookup to abbreviate a URI to a qualified name. Features of rdfns include:

look up namespaces (as RDF/Turtle, RDF/XML, SPARQL…):
$ rdfns foaf.ttl foaf.xmlns dbpedia.sparql foaf.json
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
xmlns:foaf="http://xmlns.com/foaf/0.1/"
PREFIX dbpedia:
"foaf": "http://xmlns.com/foaf/0.1/"

expand a qualified name:
$ rdfns dc:title
http://purl.org/dc/elements/1.1/title

look up a preferred prefix:
$ rdfns http://www.w3.org/2003/01/geo/wgs84_pos#
geo

create a short qualified name for a URL:
$ rdfns http://purl.org/dc/elements/1.1/title
dc:title

I use RDF-NS for all RDF processing to improve readability and to avoid typing long URIs. For instance Catmandu::RDF can be used to parse RDF into a very concise data structure:
$ catmandu convert RDF --file rdfdata.ttl to YAML

Tags: Perl, rdf. 4 comments.
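The core idea of rdfns is easy to illustrate outside of Perl. Here is a small Python sketch in which a hard-coded prefix map stands in for the prefix.cc registry that rdfns actually consults; the function names are mine and not part of the tool.

    # Hard-coded prefix/namespace pairs stand in for a prefix.cc lookup.
    PREFIXES = {
        "dc": "http://purl.org/dc/elements/1.1/",
        "foaf": "http://xmlns.com/foaf/0.1/",
        "geo": "http://www.w3.org/2003/01/geo/wgs84_pos#",
    }

    def expand(qname: str) -> str:
        """Expand a qualified name such as 'dc:title' to a full URI."""
        prefix, local = qname.split(":", 1)
        return PREFIXES[prefix] + local

    def abbreviate(uri: str) -> str:
        """Abbreviate a full URI to a qualified name if a namespace matches."""
        for prefix, namespace in PREFIXES.items():
            if uri.startswith(namespace):
                return prefix + ":" + uri[len(namespace):]
        return uri

    print(expand("dc:title"))                            # http://purl.org/dc/elements/1.1/title
    print(abbreviate("http://xmlns.com/foaf/0.1/name"))  # foaf:name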
Even with access to Wikipedia the number is different for every person, since not all content is available in all languages and since much content is incomprehensible without prior knowledge and thus practically inaccessible. The numbers for the individual accessibility of the world's knowledge can now be plotted, sorted, in a diagram that lists all people from left (maximum knowledge) to right (no knowledge accessible). As Denny illustrates with the following image, the Wikimedia community can approach its goal in different ways: (1) Expanding many articles in a complex specialist field or a small language benefits only a few people. (2) Alternatively, the most important articles and topics could be improved and extended in languages that are understood by many people. (3) Finally, Wikimedia can also make sure that more people get access to Wikimedia content at all, for example through initiatives such as Wikipedia Zero. I consider the representation proposed by Denny helpful for getting beyond simply counting Wikipedia articles. As he himself admits, however, there are numerous open questions, since the actual numbers for the availability of knowledge cannot easily be determined. In my opinion, a fundamental problem is that knowledge, and especially the entire knowledge of humanity, cannot be quantified. It is also misleading to assume that the Wikimedia products collect or contain knowledge. Perhaps this error does not matter for the metric, but it does matter for what is actually supposed to be measured (the accessibility of the world's knowledge). If Wikimedia is interested in an unobstructed view of the question of how much of humanity's knowledge is made accessible to people through its offerings, it might help to ask a few philosophers. Seriously. It may be (and with my abandoned philosophy studies I suspect as much) that in the end it only becomes clear why the whole Wikimedia project cannot be realized; but even insights into the possible reasons for this failure would be helpful. Presumably, though, it is too frowned upon to seriously ask philosophers for advice, or the remaining philosophers prefer to occupy themselves with other questions. P.S.: Another discipline relevant for answering the question of how much of the world's knowledge is made accessible to humanity by Wikipedia & Co is pedagogy, but I know even less about that than about philosophy.

Tags: Freie Inhalte, Wikipedia, Wissensordnung 2 comments
jakoblog-de-7417 ----
    Warning: "continue" targeting switch is equivalent to "break". Did you mean to use "continue 2"? in /kunden/116716_10965/jakoblog.de/wp/wp-content/plugins/mendeleyplugin/wp-mendeley.php on line 548

    Warning: Cannot modify header information - headers already sent by (output started at /kunden/116716_10965/jakoblog.de/wp/wp-content/plugins/mendeleyplugin/wp-mendeley.php:548) in /kunden/116716_10965/jakoblog.de/wp/wp-includes/feed-atom.php on line 8
en – Jakoblog Das Weblog von Jakob Voß 2018-03-18T21:52:22Z http://jakoblog.de/feed/atom/ WordPress jakob <![CDATA[Data models age like parents]]> http://jakoblog.de/?p=1499 2018-03-15T19:51:45Z 2018-03-15T19:51:45Z Denny Vrandečić, employed as an ontologist at Google, noticed that all six of the six linked data applications linked to 8 years ago (IWB, Tabulator, Disko, Marbles, rdfbrowser2, and Zitgist) have disappeared or changed their calling syntax. This reminded me of a proverb about software and data:

    software ages like fish, data ages like wine.


The original form of this saying seems to come from James Governor (@monkchips), who derived it in 2007 from an earlier phrase:

    Hardware is like fish, operating systems are like wine.

    The analogy of fishy applications and delightful data has been repeated and explained and criticized several times. I fully agree with the part about software rot but I doubt that data actually ages like wine (I’d prefer Whisky anyway). A more accurate simile may be „data ages like things you put into your crowded cellar and then forget about“.

    Thinking a lot about data I found that data is less interesting than the structures and rules that shape and restrict data: data models, ontologies, schemas, forms etc. How do they age compared with software and data? I soon realized:

    data models age like parents.

First they guide you, give good advice, and support you as best they can. But at some point data begin to rebel against their models. Sooner or later parents become uncool, disconnected from current trends, outdated or even embarrassing. Eventually you have to accept their quaint peculiarities and live your own life. That’s how standards proliferate. Both ontologies and parents ultimately become weaker and need support. And in the end you have to let them go, sadly looking back.

(The analogy could be extended further; for instance, data models might be frustrated when confronted with how actual data compares to their ideals, but that’s another story.)

    ]]>
    0
jakob <![CDATA[Wikidata documentation on the 2017 Hackathon in Vienna]]> http://jakoblog.de/?p=1490 2017-05-21T13:47:47Z 2017-05-21T13:21:39Z At Wikimedia Hackathon 2017, a couple of volunteers sat together to work on the help pages of Wikidata. As part of that Wikidata documentation sprint, Ziko and I took a look at the Wikidata glossary. We identified several shortcomings and made a list of rules for how the glossary should look. The result is the glossary guidelines. Where the old glossary partly replicated Wikidata:Introduction, the new version aims to allow quick lookup of concepts. We already rewrote some entries of the glossary according to these guidelines, but several entries are outdated and still need to be improved. We changed the structure of the glossary into a sortable table so it can be displayed as an alphabetical list in all languages. The entries can still be translated with the translation system (it took some time to get familiar with this feature).

    We also created some missing help pages such as Help:Wikimedia and Help:Wikibase to explain general concepts with regard to Wikidata. Some of these concepts are already explained elsewhere but Wikidata needs at least short introductions especially written for Wikidata users.

    Image taken by Andrew Lih (CC-BY-SA)

    ]]>
    2
jakob <![CDATA[Introduction to Phabricator at Wikimedia Hackathon]]> http://jakoblog.de/?p=1484 2017-05-20T07:47:48Z 2017-05-20T07:44:30Z This weekend I participate in the Wikimedia Hackathon in Vienna. I mostly contribute to Wikidata-related events and practice the phrase "long time no see", but I also look into some introductory talks.

In the late afternoon of day one I attended an introduction to the Phabricator project management tool given by André Klapper. Phabricator was introduced at the Wikimedia Foundation about three years ago to replace and unify Bugzilla and several other management tools.

Phabricator is much more than an issue tracker for software projects (although it is mainly used for this purpose by Wikimedia developers). In summary there are tasks, projects, and teams. Tasks can be tagged, assigned, followed, discussed, and organized with milestones and workboards. The latter are Kanban boards like those I know from Trello, Waffle, and GitHub project boards.

    Phabricator is Open Source so you can self-host it and add your own user management without having to pay for each new user and feature (I am looking at you, JIRA). Internally I would like to use Phabricator but for fully open projects I don’t see enough benefit compared to using GitHub.

    P.S.: Wikimedia Hackathon is also organized with Phabricator. There is also a task for blogging about the event.

    ]]>
    1
jakob <![CDATA[Some thoughts on IIIF and Metadata]]> http://jakoblog.de/?p=1476 2017-05-05T20:40:59Z 2017-05-05T20:40:59Z Yesterday at the DINI AG KIM Workshop 2017, Martin Baumgartner and Stefanie Rühle gave an introduction to the International Image Interoperability Framework (IIIF) with a focus on metadata. I already knew that IIIF is a great technology for providing access to (especially large) images, but I had not had a detailed look yet. The main part of IIIF is its Image API and I hope that all major media repositories (I am looking at you, Wikimedia Commons) will implement it. In addition the IIIF community has defined a „Presentation API“, a „Search API“, and an „Authentication API“. I understand the need for such additional APIs within the IIIF community, but I doubt that solving the underlying problems with their own standards (instead of reusing existing standards) is the right way to go. Standards should better „Do One Thing and Do It Well“ (Unix philosophy). If images are the „One Thing“ of IIIF, then search and authentication are a different matter.

In the workshop we only looked at parts of the Presentation API to see where metadata (creator, dates, places, provenance etc. and structural metadata such as lists and hierarchies) could be integrated into IIIF. Such metadata is already expressed in many other formats such as METS/MODS and TEI, so the question is not whether to use IIIF or other metadata standards but how to connect IIIF with existing metadata standards. Taking a quick look at the Presentation API, I was surprised to find out that the metadata element is explicitly not intended for additional metadata but only „to be displayed to the user“. The element contains an ordered list of key-value pairs that „might be used to convey the author of the work, information about its creation, a brief physical description, or ownership information, amongst other use cases“. At the same time the standard emphasizes that „there are no semantics conveyed by this information“. Hello, McFly? Without semantics conveyed it isn’t information! In particular there is no such thing as structured data (e.g. a list of key-value pairs) without semantics.

I think the design of the metadata field in IIIF is based on a common misconception about the nature of (meta)data, which I have already written about elsewhere (sorry, German article; some background can be found in my PhD thesis and in work by Ballsun-Stanton).

In a short discussion on Twitter, Rob Sanderson (Getty) pointed out that the data format of the IIIF Presentation API for describing intellectual works (called a manifest) is expressed in JSON-LD, so it can be extended with other RDF statements. For instance, the field „license“ is already defined with dcterms:rights. Adding a field „author“ for dcterms:creator only requires defining this field in the JSON-LD @context of a manifest. After some experimenting I found a possible way to connect the „meaningless“ metadata field with JSON-LD fields:

     { "@context": [ "http://iiif.io/api/presentation/2/context.json", { "author": "http://purl.org/dc/terms/creator", "bibo": "http://purl.org/ontology/bibo/" } ], "@id": "http://example.org/iiif/book1/manifest", "@type": ["sc:Manifest", "bibo:book"], "metadata": [ { "label": "Author", "property": "http://purl.org/dc/terms/creator", "value": "Allen Smithee" }, { "label": "License", "property": "http://purl.org/dc/terms/license", "value": "CC-BY 4.0" } ], "license": "http://creativecommons.org/licenses/by/4.0/", "author": { "@id": "http://www.wikidata.org/entity/Q734916", "label": "Allen Smithee" } } 

    This solution requires an additional element property in the IIIF specification to connect a metadata field with its meaning. IIIF applications could then enrich the display of metadata fields for instance with links or additional translations. In JSON-LD some names such as „CC-BY 4.0“ and „Allen Smithee“ need to be given twice, but this is ok because normal names (in contrast to field names such as „Author“ and „License“) don’t have semantics.
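The following short Python sketch illustrates how a viewer could make use of such a key. Note that the property key is only the extension proposed above, not part of the current IIIF Presentation API, and the manifest is assumed to have been parsed from JSON already.

# Illustrative sketch only: "property" is the extension proposed in this post,
# not a field of the official IIIF Presentation API.
import json

manifest_json = """
{
  "metadata": [
    {"label": "Author", "value": "Allen Smithee",
     "property": "http://purl.org/dc/terms/creator"},
    {"label": "License", "value": "CC-BY 4.0",
     "property": "http://purl.org/dc/terms/license"}
  ]
}
"""

manifest = json.loads(manifest_json)

# Render each metadata field; if a property URI is given, a viewer could turn
# the label into a link or look up a translation for it.
for field in manifest.get("metadata", []):
    label, value = field.get("label"), field.get("value")
    prop = field.get("property")
    line = f"{label}: {value}"
    if prop:
        line += f" ({prop})"
    print(line)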

    ]]>
    0
jakob <![CDATA[Abbreviated URIs with rdfns]]> http://jakoblog.de/?p=1459 2014-09-09T09:26:13Z 2014-09-09T09:26:13Z Working with RDF and URIs can be annoying because URIs such as „http://purl.org/dc/elements/1.1/title“ are long and difficult to remember and type. Most RDF serializations make use of namespace prefixes to abbreviate URIs; for instance „dc“ is frequently used to abbreviate „http://purl.org/dc/elements/1.1/“, so „http://purl.org/dc/elements/1.1/title“ can be written as the qualified name „dc:title“. This simplifies working with URIs, but someone still has to remember the mappings between prefixes and namespaces. Luckily there is a registry of common mappings at prefix.cc.

A few years ago I created the simple command line tool rdfns and a Perl library to look up URI namespace/prefix mappings. Meanwhile the program is also available as the Debian and Ubuntu package librdf-ns-perl. The newest version (not included in Debian yet) also supports reverse lookup to abbreviate a URI to a qualified name. Features of rdfns include:

    look up namespaces (as RDF/Turtle, RDF/XML, SPARQL…)

$ rdfns foaf.ttl foaf.xmlns dbpedia.sparql foaf.json
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
xmlns:foaf="http://xmlns.com/foaf/0.1/"
PREFIX dbpedia: <http://dbpedia.org/resource/>
"foaf": "http://xmlns.com/foaf/0.1/"

    expand a qualified name

     $ rdfns dc:title http://purl.org/dc/elements/1.1/title 

    lookup a preferred prefix

     $ rdfns http://www.w3.org/2003/01/geo/wgs84_pos# geo 

    create a short qualified name of an URL

     $ rdfns http://purl.org/dc/elements/1.1/title dc:title 

    I use RDF-NS for all RDF processing to improve readability and to avoid typing long URIs. For instance Catmandu::RDF can be used to parse RDF into a very concise data structure:

     $ catmandu convert RDF --file rdfdata.ttl to YAML 
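The underlying idea of such a lookup is simple. Here is a minimal sketch in Python, purely illustrative and not the actual RDF-NS API, using a small hand-maintained mapping instead of the full prefix.cc registry:

# Illustrative sketch of prefix expansion and reverse lookup; the real rdfns
# tool uses the prefix.cc registry instead of this hard-coded mapping.
NAMESPACES = {
    "dc":   "http://purl.org/dc/elements/1.1/",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "geo":  "http://www.w3.org/2003/01/geo/wgs84_pos#",
}

def expand(qname):
    # dc:title -> http://purl.org/dc/elements/1.1/title
    prefix, local = qname.split(":", 1)
    return NAMESPACES[prefix] + local

def abbreviate(uri):
    # reverse lookup: pick the longest matching namespace
    matches = [(p, ns) for p, ns in NAMESPACES.items() if uri.startswith(ns)]
    if not matches:
        return uri
    prefix, ns = max(matches, key=lambda m: len(m[1]))
    return prefix + ":" + uri[len(ns):]

print(expand("dc:title"))
print(abbreviate("http://purl.org/dc/elements/1.1/title"))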
    ]]>
    4
    jakob <![CDATA[Testing command line apps with App::Cmd]]> http://jakoblog.de/?p=1435 2013-11-01T08:49:19Z 2013-11-01T08:49:19Z This posting has also been published at blogs.perl.org.

    Ricardo Signes‘ App::Cmd has been praised a lot so I gave it a try for my recent command line app. In summary, the module is great although I missed some minor features and documentation (reminder to all: if you miss some feature in a CPAN module, don’t create yet another module but try to improve the existing one!). One feature I like a lot is how App::Cmd facilitates writing tests for command line apps. After having written a short wrapper around App::Cmd::Tester my formerly ugly unit tests look very simple and clean. Have a look at this example:

use Test::More;
use App::PAIA::Tester;

new_paia_test;

# an empty configuration prints an empty JSON object
paia qw(config);
is stdout, "{}\n";
is error, undef;

# a missing config file results in an error and a non-zero exit code
paia qw(config -c x.json --verbose);
is error, "failed to open config file x.json\n";
ok exit_code;

# setting values creates and updates the config file
paia qw(config --config x.json --verbose foo bar);
is output, "# saved config file x.json\n";

paia qw(config foo bar);
paia qw(config base http://example.org/);
is exit_code, 0;
is output, '';

# read back the full configuration
paia qw(config);
is_deeply stdout_json, {
    base => 'http://example.org/',
    foo  => 'bar',
}, "get full config";

done_paia_test;

The application is called paia – that’s how it is called at the command line and that’s how it is simply called as a function in the tests. The wrapper class (here: App::PAIA::Tester) creates a singleton App::Cmd::Tester::Result object and exports its methods (stdout, stderr, exit_code…). This alone makes the test much more readable. The wrapper further exports two methods to set up a testing environment (new_paia_test) and to finish testing (done_paia_test). In my case the setup creates an empty temporary directory; other applications might clean up environment variables etc. Depending on your application you might also add some handy functions like stdout_json to parse the app’s output in a form that can better be tested.

    ]]>
    0
    jakob <![CDATA[My PhD thesis about data]]> http://jakoblog.de/?p=1422 2013-09-23T11:22:22Z 2013-09-23T07:03:55Z

    I have finally received paper copies of my PhD thesis „Describing Data Patterns“, published and printed via CreateSpace. The full PDF has already been archived as CC-BY-SA, but a paper print may still be nice and more handy (it’s printed as small paperback instead of the large A4-PDF). You can get a copy for 12.80€ or 12.24€ via Amazon (ISBN 1-4909-3186-4).

    I also set up a little website at aboutdata.org. The site contains an HTML view of the pattern language that I developed as one result of the thesis.

    I am sorry for not having written the thesis in Pandoc Markdown but in LaTeX (source code available at GitHub), so there is no EPUB/HTML version.

    ]]>
    3
    jakob <![CDATA[On the way to a library ontology]]> http://jakoblog.de/?p=1379 2013-04-11T13:02:50Z 2013-04-11T13:02:50Z I have been working for some years on specification and implementation of several APIs and exchange formats for data used in, and provided by libraries. Unfortunately most existing library standards are either fuzzy, complex, and misused (such as MARC21), or limited to bibliographic data or authority data, or both. Libraries, however, are much more than bibliographic data – they involve library patrons, library buildings, library services, library holdings, library databases etc.

During the work on formats and APIs for these parts of the library world, the Patrons Account Information API (PAIA) being the newest piece, I found myself more and more on the way to a whole library ontology. The idea of a library ontology started in 2009 (now moved to this location), but designing such a broad data model from the bottom up would surely have led to yet another complex, impractical and unused library standard. Meanwhile there are several smaller ontologies for parts of the library world, to be combined and used as Linked Open Data.

In my opinion, ontologies, RDF, Semantic Web, Linked Data and all the buzz is overrated, but it includes some opportunities for clean data modeling and data integration, which one rarely finds in library data. For this reason I try to design all APIs and formats to be at least compatible with RDF. For instance the Document Availability Information API (DAIA), created in 2008 (and now being slightly redesigned for version 1.0), can be accessed in XML and in JSON format, and both can fully be mapped to RDF. Other micro-ontologies include:

    • Document Service Ontology (DSO) defines typical document-related services such as loan, presentation, and digitization
• Simple Service Status Ontology (SSSO) defines a service instance as a kind of event that connects a service provider (e.g. a library) with a service consumer (e.g. a library patron). SSSO further defines typical service statuses (e.g. reserved, prepared, executed…) and limitations of a service (e.g. a waiting queue or a delay).
    • Patrons Account Information API (PAIA) will include a mapping to RDF to express basic patron information, fees, and a list of current services in a patron account, based on SSSO and DSO.
    • Document Availability Information API (DAIA) includes a mapping to RDF to express the current availability of library holdings for selected services. See here for the current draft.
    • A holdings ontology should define properties to relate holdings (or parts of holdings) to abstract documents and editions and to holding institutions.
    • GBV Ontology contains several concepts and relations used in GBV library network that do not fit into other ontologies (yet).
• One might further create a database ontology to describe library databases with their provider, extent, APIs etc. – right now we use the GBV ontology for this purpose. Is there anything to reuse instead of creating just another ontology?!
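To make the goal of RDF compatibility a bit more concrete, here is a small illustrative sketch with rdflib. The namespaces and property names are placeholders for whatever terms the micro-ontologies above actually define, so do not read them as the real DAIA, DSO or SSSO vocabulary.

# Illustrative sketch only: namespaces and property names below are
# placeholders, not the actual terms defined by DAIA, DSO or SSSO.
from rdflib import Graph, Namespace, URIRef

DAIA = Namespace("http://example.org/ns/daia#")  # placeholder namespace
DSO = Namespace("http://example.org/ns/dso#")    # placeholder namespace

g = Graph()
item = URIRef("http://example.org/item/123")
library = URIRef("http://example.org/library/1")

g.add((item, DAIA.heldBy, library))          # placeholder property
g.add((item, DAIA.availableFor, DSO.Loan))   # placeholder property and class

print(g.serialize(format="turtle"))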

The next step will probably be the creation of a small holdings ontology that nicely fits the other micro-ontologies. This ontology should be aligned or compatible with the BIBFRAME initiative, other ontologies such as Schema.org, and existing holdings formats, without becoming too complex. The German initiative DINI-KIM has just launched a working group to define such a holdings format or ontology.

    ]]>
    1
    jakob <![CDATA[Dead End Electronic Resource Citation (ERC)]]> http://jakoblog.de/?p=1376 2013-03-29T09:51:26Z 2013-03-29T09:51:26Z Tidying up my PhD notes, I found this short rant about „Electronic Resource Citation“. I have not used it anywhere, so I publish it here, licensed under CC-BY-SA.

    Electronic Resource Citation (ERC) was introduced by John Kunze with a presentation at the International Conference on Dublin Core and Metadata Applications 2001 and with a paper in the Journal of Digital Information, Vol. 2, No 2 (2002). Kunze cited his paper in a call for an ERC Interest Group within the Dublin Core Metadata Initiative (DCMI) at the PERL4LIB mailing list, giving the following example of an ERC:

     erc: Kunze, John A. | A Metadata Kernel for Electronic Permanence | 20011106 | http://jodi.ecs.soton.ac.uk/Articles/v02/i02/Kunze/ 

An ERC is a minimal „kernel“ metadata record that consists of four elements: who, what, when and where. In the given example they are:

who:   Kunze, John A.
what:  A Metadata Kernel for Electronic Permanence
when:  20011106
where: http://jodi.ecs.soton.ac.uk/Articles/v02/i02/Kunze/
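Just to make the structure concrete, a few lines of Python (a quick sketch, not an official ERC parser) can split the pipe-delimited example above into its four kernel elements:

# Quick sketch, not an official ERC parser: split the pipe-delimited record
# from the example above into its four kernel elements.
record = ("erc: Kunze, John A. | A Metadata Kernel for Electronic Permanence"
          " | 20011106 | http://jodi.ecs.soton.ac.uk/Articles/v02/i02/Kunze/")

label, _, body = record.partition(":")
who, what, when, where = (part.strip() for part in body.split("|"))
print({"who": who, "what": what, "when": when, "where": where})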

Ironically the given URL is obsolete: the host ‚jodi.ecs.soton.ac.uk‘ does not even exist anymore. The ERC is pretty useless if it just uses a fragile URL to cite a resource. How about some value that does not change over time, e.g.:

     where: Journal of Digital Information, Volume 2 Issue 2 

    As ERC is defined as „a location or machine-oriented identifier“, one could also use stable identifiers:

     where: ISSN 1368-7506, Article No. 81 

Both the ISSN and the article number are much better identifiers than URLs. Citing a URL is more like

     where: at the desk in the little reading room of my library 

    By the way the current location is http://www.rice.edu/perl4lib/archives/2002-09/msg00017.html – but who knows whether Texas A&M University will still host the journal at this URL in 20 years?

There are some interesting ideas in the original ERC proposal (different kinds of missing values, TEMPER date values, the four questions etc.), but its specification and implementation are just ridiculous and missing references to current technology (you know that you are doing something wrong in a specification if you start to define your own encodings for characters, dates etc. instead of concentrating on your core subject and referring to existing specifications for the rest). The current draft (2010) is a typical example of badly mixing modeling and encoding issues and of losing touch with existing, established data standards.

    In addition to problems at the „low level“ of encoding, the „high level“ of conceptual modeling lacks appropriate references. What about the relation of ERC concepts to models such as FRBR and CIDOC-CRM? Why are ‚who‘, ‚when‘, ‚where‘, ‚what‘ the important metadata fields (in many cases the most interesting question is ‚why‘)? How about Ranganathan’s colon classification with personality, matter, energy, space, and time?

    In summary the motivation behind ERC contains some good ideas, but its form is misdirected.

    ]]>
    0
    jakob <![CDATA[Access to library accounts for better user experience]]> http://jakoblog.de/?p=1362 2013-02-08T09:10:03Z 2013-02-08T09:10:03Z I just stumbled upon ReadersFirst, a coalition of (public) libraries that call for a better user experience for library patrons, especially to access e-books. The libraries regret that

    the products currently offered by e-content distributors, the middlemen from whom libraries buy e-books, create a fragmented, disjointed and cumbersome user experience.

    One of the explicit goals of ReadersFirst is to urge providers of e-content and integrated library systems for systems that allow users to

    Place holds, check-out items, view availability, manage fines and receive communications within individual library catalogs or in the venue the library believes will serve them best, without having to visit separate websites.

In a summary of the first ReadersFirst meeting on January 28, the president of Queens Library (NY) is quoted with the following request:

    The reader should be able to look at their library account and see what they have borrowed regardless of the vendor that supplied the ebook.

This goal matches well with my activity at GBV: as part of a project to implement a mobile library app, I designed an API to access library accounts. The Patrons Account Information API (PAIA) is currently being implemented and tested by two independent developers. It will also be used to provide a better user experience in VuFind discovery interfaces.

During the research for PAIA I was surprised by the lack of existing methods to access library patron accounts. Some library systems do not even provide an internal API to connect to the loan system – not to speak of a public API that could directly be used by patrons and third parties. The only example I could find was York University Libraries with a simple, XML-based, read-only API. This lack of public APIs to library patron accounts is disappointing, given that it’s almost ten years after the buzz around Web 2.0, service-oriented architecture, and mashups. All major providers of web applications (Google, Twitter, Facebook, StackExchange, GitHub etc.) support access to user accounts via APIs.

The Patrons Account Information API will hopefully fill this gap with defined methods to place holds and to view checked-out items and fines. PAIA is agnostic to specific library systems, aligned with similar APIs as listed above, and designed with RDF in mind (without any need to bother with RDF, apart from the requirement to use URIs as identifiers). Feedback and implementations are very welcome!
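As a rough illustration of what client access to such an API could look like, here is a hypothetical Python sketch; the base URL, path and response fields are placeholders, so consult the actual PAIA specification for the real methods and parameters.

# Hypothetical sketch of a client for a PAIA-style patron account API.
# Base URL, path and response fields are placeholders, not the official spec.
import requests

BASE = "https://example.org/paia/core"   # placeholder base URL
TOKEN = "secret-access-token"            # placeholder access token

def list_loans(patron_id):
    """Fetch the list of items currently on loan for a patron."""
    response = requests.get(
        f"{BASE}/{patron_id}/items",     # placeholder path
        headers={"Authorization": f"Bearer {TOKEN}"},
        timeout=10,
    )
    response.raise_for_status()
    return response.json()

# Example call (would require a running service):
# for item in list_loans("alice").get("doc", []):
#     print(item)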

    ]]>
    5
    jamanetwork-com-2664 ---- None jodischneider-com-1630 ---- jodischneider.com/blog reading, technology, stray thoughts Blog About Categories argumentative discussions books and reading computer science Firefox future of publishing higher education information ecosystem Information Quality Lab news intellectual freedom iOS: iPad, iPhone, etc. library and information science math old newspapers PhD diary programming random thoughts reviews scholarly communication semantic web social semantic web social web Uncategorized Search Paid graduate hourly research position at UIUC for Spring 2021 December 3rd, 2020 by jodi Jodi Schneider’s Information Quality Lab (http://infoqualitylab.org) seeks a graduate hourly student for a research project on bias in citation networks. Biased citation benefits authors in the short-term by bolstering grants and papers, making them more easily accepted. However, it can have severe negative consequences for scientific inquiry. Our goal is to find quantitative measures of network structure that can indicate the existence of citation bias.  This job starts January 4, 2021. Pay depending on experience (Master’s students start at $18/hour). Optionally, the student can also take a graduate independent study course (generally 1-2 credits IS 589 or INFO 597). Apply on Handshake Responsibilities will include: Assist in the development of algorithms to simulate an unbiased network Carry out statistical significance tests for candidate network structure measures Attend weekly meetings Assist with manuscript and grant preparation Required Skills Proficiency in Python or R Demonstrated ability to systematically approach a simulation or modeling problem Statistical knowledge, such as developed in a course on mathematical statistics and probability (e.g. STAT400 Statistics and Probability I https://courses.illinois.edu/schedule/2021/spring/STAT/400 ) Preferred Skills Knowledge of stochastic processes Experience with simulation Knowledge of random variate generation and selection of input probability distribution Knowledge of network analysis May have taken classes such as STAT433 Stochastic Processes (https://courses.illinois.edu/schedule/2021/spring/STAT/433) or IE410 Advanced Topics in Stochastic Processes & Applications (https://courses.illinois.edu/schedule/2020/fall/IE/410) MORE INFORMATION: https://ischool.illinois.edu/people/jodi-schneider http://infoqualitylab.org APPLICATION DEADLINE: Monday December 14th. Apply on Handshake with the following APPLICATION MATERIALS: Resume Transcript – Such as free University of Illinois academic history from Banner self-service (https://apps.uillinois.edu, click “Registration & Records”, “Student Records and Transcripts”, “View Academic History”, choose “Web Academic History”) Cover letter: Just provide short answers to the following two questions: 1) Why are you interested in this particular project? 2) What past experience do you have that is related to this project?  Tags: citation bias, jobs, network analysis, statistical modeling Posted in Information Quality Lab news | Comments (0) Avoiding long-haul air travel during the COVID-19 pandemic October 28th, 2020 by jodi I would not recommend long-haul air travel at this time. An epidemiological study of a 7.5 hour flight from the Middle East to Ireland concluded that 4 groups (13 people), traveling from 3 continents in four groups, who used separate airport lounges, were likely infected in flight. 
The flight had 17% occupancy (49 passengers/283 seats; 12 crew) and took place in summer 2020. (Note: I am not an epidemiologist.) The study (published open access): Murphy Nicola, Boland Máirín, Bambury Niamh, Fitzgerald Margaret, Comerford Liz, Dever Niamh, O’Sullivan Margaret B, Petty-Saphon Naomi, Kiernan Regina, Jensen Mette, O’Connor Lois. A large national outbreak of COVID-19 linked to air travel, Ireland, summer 2020. Euro Surveill. 2020;25(42):pii=2001624. https://doi.org/10.2807/1560-7917.ES.2020.25.42.2001624 Irish news sites including RTE and the Irish Times also covered the paper. Figure 2 from “A large national outbreak of COVID-19 linked to air travel, Ireland, summer 2020” https://doi.org/10.2807/1560-7917.ES.2020.25.42.2001624 Caption in original “Passenger seating diagram on flight, Ireland, summer 2020 (n=49 passengers)” “Numbers on the seats indicate the Flight Groups 1–4.” The age of the 13 flight cases ranged from 1 to 65 years with a median age of 23 years. Twelve of 13 flight cases and almost three quarters (34/46) of the non-flight cases were symptomatic. After the flight, the earliest onset of symptoms occurred 2 days after arrival, and the latest case in the entire outbreak occurred 17 days after the flight. Of 12 symptomatic flight cases, symptoms reported included cough (n = 7), coryza (n = 7), fever (n = 6) and sore throat (n = 5), and six reported loss of taste or smell. No symptoms were reported for one flight case. A mask was worn during the flight by nine flight cases, not worn by one (a child), and unknown for three. Murphy Nicola, Boland Máirín, Bambury Niamh, Fitzgerald Margaret, Comerford Liz, Dever Niamh, O’Sullivan Margaret B, Petty-Saphon Naomi, Kiernan Regina, Jensen Mette, O’Connor Lois. A large national outbreak of COVID-19 linked to air travel, Ireland, summer 2020. Euro Surveill. 2020;25(42):pii=2001624. https://doi.org/10.2807/1560-7917.ES.2020.25.42.2001624 (Notes to Figure 1 Caption) “It is interesting that four of the flight cases were not seated next to any other positive case, had no contact in the transit lounge, wore face masks in-flight and would not be deemed close contacts under current guidance from the European Centre for Disease Prevention and Control (ECDC) [1].” Murphy Nicola, Boland Máirín, Bambury Niamh, Fitzgerald Margaret, Comerford Liz, Dever Niamh, O’Sullivan Margaret B, Petty-Saphon Naomi, Kiernan Regina, Jensen Mette, O’Connor Lois. A large national outbreak of COVID-19 linked to air travel, Ireland, summer 2020. Euro Surveill. 2020;25(42):pii=2001624. https://doi.org/10.2807/1560-7917.ES.2020.25.42.2001624 “The source case is not known. The first two cases in Group 1 became symptomatic within 48 h of the flight, and COVID-19 was confirmed in three, including an asymptomatic case from this Group in Region A within 5 days of the flight. Thirteen secondary cases and one tertiary case were later linked to these cases. Two cases from Flight Group 2 were notified separately in Region A with one subsequent secondary family case, followed by three further flight cases notified from Region B in two separate family units (Flight Groups 1 and 2). These eight cases had commenced their journey from the same continent and had some social contact before the flight. The close family member of a Group 2 case seated next to the case had tested positive abroad 3 weeks before, and negative after the flight. Flight Group 3 was a household group of which three cases were notified in Region C and one case in Region D. 
These cases had no social or airport lounge link with Groups 1 or 2 pre-flight and were not seated within two rows of them. Their journey origin was from a different continent. A further case (Flight Group 4) had started the journey from a third continent, had no social or lounge association with other cases and was eated in the same row as passengers from Group 1. Three household contacts and a visitor of Flight Group 4 became confirmed cases. One affected contact travelled to Region E, staying in shared accommodation with 34 others; 25 of these 34 became cases (attack rate 73%) notified in regions A, B, C, D, E and F, with two cases of quaternary spread.” Murphy Nicola, Boland Máirín, Bambury Niamh, Fitzgerald Margaret, Comerford Liz, Dever Niamh, O’Sullivan Margaret B, Petty-Saphon Naomi, Kiernan Regina, Jensen Mette, O’Connor Lois. A large national outbreak of COVID-19 linked to air travel, Ireland, summer 2020. Euro Surveill. 2020;25(42):pii=2001624. https://doi.org/10.2807/1560-7917.ES.2020.25.42.2001624 “In-flight transmission is a plausible exposure for cases in Group 1 and Group 2 given seating arrangements and onset dates. One case could hypothetically have acquired the virus as a close household contact of a previous positive case, with confirmed case onset date less than two incubation periods before the flight, and symptom onset in the flight case was 48 h after the flight. In-flight transmission was the only common exposure for four other cases (Flight Groups 3 and 4) with date of onset within four days of the flight in all but the possible tertiary case. This case from Group 3 developed symptoms nine days after the flight and so may have acquired the infection in-flight or possibly after the flight through transmission within the household.” Murphy Nicola, Boland Máirín, Bambury Niamh, Fitzgerald Margaret, Comerford Liz, Dever Niamh, O’Sullivan Margaret B, Petty-Saphon Naomi, Kiernan Regina, Jensen Mette, O’Connor Lois. A large national outbreak of COVID-19 linked to air travel, Ireland, summer 2020. Euro Surveill. 2020;25(42):pii=2001624. https://doi.org/10.2807/1560-7917.ES.2020.25.42.2001624 “Genomic sequencing for cases travelling from three different continents strongly supports the epidemiological transmission hypothesis of a point source for this outbreak. The ability of genomics to resolve transmission events may increase as the virus evolves and accumulates greater diversity [23].” Murphy Nicola, Boland Máirín, Bambury Niamh, Fitzgerald Margaret, Comerford Liz, Dever Niamh, O’Sullivan Margaret B, Petty-Saphon Naomi, Kiernan Regina, Jensen Mette, O’Connor Lois. A large national outbreak of COVID-19 linked to air travel, Ireland, summer 2020. Euro Surveill. 2020;25(42):pii=2001624. https://doi.org/10.2807/1560-7917.ES.2020.25.42.2001624 Authors note that a large percentage of the flight passengers were infected: “We calculated high attack rates, ranging plausibly from 9.8 % to 17.8% despite low flight occupancy and lack of passenger proximity on-board.” Murphy Nicola, Boland Máirín, Bambury Niamh, Fitzgerald Margaret, Comerford Liz, Dever Niamh, O’Sullivan Margaret B, Petty-Saphon Naomi, Kiernan Regina, Jensen Mette, O’Connor Lois. A large national outbreak of COVID-19 linked to air travel, Ireland, summer 2020. Euro Surveill. 2020;25(42):pii=2001624. 
https://doi.org/10.2807/1560-7917.ES.2020.25.42.2001624 Among the reasons for the uncertainty of this range is that “11 flight passengers could not be contacted and were consequently not tested.” (A twelfth passenger “declined testing”.) There is also some inherent uncertainty due to incubation period and possibility of “transmission within the household”, especially after the flight; authors note that “Exposure possibilities for flight cases include in-flight, during overnight transfer/pre-flight or unknown acquisition before the flight.” Beyond the 13 people on the flight, cases spread to several social groups, across “six of the eight different health regions (Regions A–H) throughout the Republic of Ireland”. Flight groups 1 and 2 started their travel from one continent; Flight group 3 from another; Flight group 4 from a third continent. Figure 3 from “A large national outbreak of COVID-19 linked to air travel, Ireland, summer 2020” https://doi.org/10.2807/1560-7917.ES.2020.25.42.2001624 caption in original: “Diagram of chains of transmission, flight-related COVID-19 cases, Ireland, summer 2020 (n=59)” Tags: air travel, attack rate, COVID-19, COVID19, epidemiology, flights, flying, Ireland, Middle East, pandemic Posted in random thoughts | Comments (0) Paid Undergraduate Research position at UIUC for Fall & Spring 2020 August 18th, 2020 by jodi University of Illinois undergraduates are encouraged to apply for a position in my lab. I particularly welcome applications from students in the new iSchool BS/IS degree or in the university-wide informatics minor. While I only have 1 paid position open, I also supervise unpaid independent study projects. Dr. Jodi Schneider and the Information Quality Lab seek undergraduate research assistants for 100% REMOTE WORK. Past students have published research articles, presented posters, earned independent study credit, James Scholar research credit, etc. One paid position in news analytics/data science for Assessing the Impact of Media Polarization on Public Health Emergencies, funded by the Cline Center for Advanced Research in the Social Sciences. (8hrs/week at $12.50/hour + possible independent study – 100% REMOTE WORK). COVID-19 news analytics: We seek to understand how public health emergencies are reported and to assess the polarization and politicization of the U.S. news coverage. You will be responsible for testing and improving search parameters, investigating contextual information such as media bias and media circulation, using text mining and data science, and close reading of sample texts. You will work closely with a student who has worked on the opioid crisis – see the past work following poster (try the link TWICE – you have to log in with an Illinois NetID): https://compass2g.illinois.edu/webapps/discussionboard/do/message?action=list_messages&course_id=_50281_1&nav=discussion_board&conf_id=_247818_1&forum_id=_417427_1&message_id=_6264991_1 Applications should be submitted here: https://forms.illinois.edu/sec/742264484 DEADLINE: 5 pm Central Time SUNDAY AUGUST 30, 2020 Tags: COVID19, data science, health controversies, jobs, media polarization, news analytics, research experiences for undergraduates, undergraduate research Posted in Information Quality Lab news | Comments (0) #ShutDownSTEM #strike4blacklives #ShutDownAcademia June 10th, 2020 by jodi I greatly appreciated receiving messages from senior people about their participation in the June 10th #ShutDownSTEM #strike4blacklives #ShutDownAcademia. 
In that spirit, I am sharing my email bounce message for tomorrow, and the message I sent to my research lab. Email bounce: I am not available by email today:  This June 10th is a day of action about understanding and addressing racism, and its impact on the academy, and on STEM.  -Jodi Email to my research lab Wednesday is a day of action about understanding and addressing racism, and its impact on the academy, and on STEM. I strongly encourage you to use tomorrow for this purpose. Specifically, I invite you to think about what undoing racism – moving towards antiracism – means, and what you can do. One single day, by itself, will not cure racism; but identifying what we can do on an ongoing basis, and taking those actions day after day – that can and will have an impact. And, if racism is vivid in your daily life, make #ShutDownSTEM a day of rest. If tomorrow doesn’t suit, I encourage you to reserve a day over the course of the next week, to replace your everyday duties. What does taking this time actually mean? It means scheduling a dedicated block of time to learn more; rescheduling meetings; shutting down your email; reading books and articles and watching videos; and taking time to reflect on recent events and the stress that they cause every single person in our community. What am I doing personally? I’ve cancelled meetings tomorrow, and set an email bounce. I will spend part of the day to think more seriously about what real antiracist action looks like from my position, as a white female academic. This week I will also be using time to re-read White Fragility, to finish Dreamland Burning (a YA novel about the 1921 Tulsa race riot), and to investigate how to bring bystander training to the iSchool. I will also be thinking about the relationship of racism to other forms of oppression – classism, sexism, homophobia, transphobia, xenophobia. If you are looking for readings of your own, I can point to a list curated by an Anti-Racism Task Force: https://idea.illinois.edu/education For basic information, #ShutDownSTEM #strike4blacklives #ShutDownAcademia website: https://www.shutdownstem.com Physicists’ Particles for Justice: https://www.particlesforjustice.org -Jodi Tags: #ShutDownAcademia, #ShutDownSTEM, #strike4blacklives, email Posted in random thoughts | Comments (0) QOTD: Storytelling in protest and politics March 16th, 2020 by jodi I recently read Francesca Polletta‘s book It was like a fever: Storytelling in protest and politics (2006, University of Chicago Press). I recommend it! It will appeal to researchers interested in topics such as narrative, strategic communication, (narrative) argumentation, or epistemology (here, of narrative). Parts may also interest activists. The book’s case studies are drawn from the Student Nonviolent Coordinating Committee (SNCC) (Chapters 2 & 3); online deliberation about the 9/11 memorial (Listening to the City, summer 2002) (Chapter 4); women’s stories in law (including, powerfully, battered women who had killed their abusers, and the challenges in making their stories understandable) (Chapter 5); references to Martin Luther King by African American Congressmen (in the Congressional Record) and by “leading back political figures who were not serving as elected or appointed officials” (Chapter 6). Several are extended from work Polletta previously published from 1998 through 2005 (see page xiii for citations). 
The conclusion—”Conclusion: Folk Wisdom and Scholarly Tales” (pages 166-187)—takes up several topics, starting with canonicity, interpretability, ambivalence. I especially plan to go back to the last two sections: “Scholars Telling Stories” (pages 179-184)—about narrative and storytelling in analysts’ telling of events—and “Towards a Sociology of Discursive Forms” (pages 185-187)—about investigating the beliefs and conventions of narrative and its institutional conventions (and relating those to conventions of other “discursive forms” such as interviews). These set forward a research agenda likely useful to other scholars interested in digging in further. These are foreshadowed a bit in the introduction (“Why Stories Matter”) which, among other things, sets out the goal of developing “A Sociology of Storytelling”. A few quotes I noted—may give you the flavor of the book: page 141: “But telling stories also carries risks. People with unfamiliar experiences have found those experiences assimilated to canonical plot lines and misheard as a result. Conventional expectations about how stories work, when they are true, and when they are appropriate have also operated to diminish the impact of otherwise potent political stories. For the abused women whom juries disbelieved because their stories had changed in small details since their first traumatized [p142] call to police, storytelling has not been especially effective. Nor was it effective for the citizen forum participants who did not say what it was like to search fruitlessly for affordable housing because discussions of housing were seen as the wrong place in which to tell stories.” pages 166-167: “So which is it? Is narrative fundamentally subversive or hegemonic? Both. As a rhetorical form, narrative is equipped to puncture reigning verities and to uphold them. At times, it seems as if most of the stories in circulation are subtly or not so subtly defying authorities; at others as if the most effective storytelling is done by authorities. To make it more complicated, sometimes authorities unintentionally undercut their own authority when they tell stories. And even more paradoxically, undercutting their authority by way of a titillating but politically inconsequential story may actually strengthen it. Dissenters, for their part, may find their stories misread in ways that support the very institutions that are challenging….”For those interested in the relations between storytelling, protest, and politics, this all suggests two analytical tasks. One is to identify the features of narrative that allow it to [p167] achieve certain rhetorical effects. The other is to identify the social conditions in which those rhetorical effects are likely to be politically consequential. The surprise is that scholars of political processes have devoted so little attention to either task.” pages 177-8 – “So institutional conventions of storytelling influence what people can do strategically with stories. In the previously pages, I have described the narrative conventions that operate in legal adjudication, media reporting, television talk shows, congressional debate, and public deliberation. Sociolinguists have documented such conventions in other settings: in medical intake interviews, for example, parole hearings, and jury deliberations. One could certainly generate a catalogue of the institutional conventions of storytelling. To some extent, those conventions reflect the peculiarities of the institution as it has developed historically. 
They also serve practical functions; some explicit, others less so. I have argued that the lines institutions draw between suitable and unsuitable occasions for storytelling or for certain kinds of stories serve to legitimate the institution.” [specific examples follow] ….”As these examples suggest, while institutions have different conventions of storytelling, storytelling does some of the same work in many institutions. It does so because of broadly shared assumptions about narrative’s epistemological status. Stories are generally thought to be more affecting by less authoritative than analysis, in part because narrative is associated with women rather than men, the private sphere rather than the public one, and custom rather than law. Of course, conventions of storytelling and the symbolic associations behind them are neither unitary nor fixed. Nor are they likely to be uniformly advantageous for those in power and disadvantageous for those without it. Narrative’s alignment [179] along the oppositions I noted is complex. For example, as I showed in chapter 5, Americans’ skepticism of expert authority gives those telling stories clout. In other words, we may contrast science with folklore (with science seen as much more credible), but we may also contrast it with common sense (with science seen as less credible). Contrary to the lamentation of some media critics and activists, when disadvantaged groups have told personal stories to the press and on television talk shows, they have been able to draw attention not only to their own victimization but to the social forces responsible for it.“ Tags: Congressional Record, Francesca Polletta, Listening to the City, Martin Luther King, narrative, QOTD, SNCC, storytelling, strategic communication, Student Nonviolent Coordinating Committee Posted in argumentative discussions, books and reading | Comments (0) Knowledge Graphs: An Aggregation of Definitions March 3rd, 2019 by jodi I am not aware of a consensus definition of knowledge graph. I’ve been discussing this for awhile with Liliana Giusti Serra, and the topic came up again with my fellow organizers of the knowledge graph session at US2TS as we prepare for a panel. I’ve proposed the following main features: RDF-compatible, has a defined schema (usually an OWL ontology) items are linked internally may be a private enterprise dataset (e.g. not necessarily openly available for external linking) or publicly available covers one or more domains Below are some quotes. I’d be curious to hear of other definitions, especially if you think there’s a consensus definition I’m just not aware of. “A knowledge graph consists of a set of interconnected typed entities and their attributes.” Jose Manuel Gomez-Perez, Jeff Z. Pan, Guido Vetere and Honghan Wu. “Enterprise Knowledge Graph: An Introduction.”  In Exploiting Linked Data and Knowledge Graphs in Large Organisations. Springer. Part of the whole book: http://link.springer.com/10.1007/978-3-319-45654-6 “A knowledge graph is a structured dataset that is compatible with the RDF data model and has an (OWL) ontology as its schema. A knowledge graph is not necessarily linked to external knowledge graphs; however, entities in the knowledge graph usually have type information, defined in its ontology, which is useful for providing contextual information about such entities. 
Knowledge graphs are expected to be reliable, of high quality, of high accessibility and providing end user oriented information services.” Boris Villazon-Terrazas, Nuria Garcia-Santa, Yuan Ren, Alessandro Faraotti, Honghan Wu, Yuting Zhao, Guido Vetere and Jeff Z. Pan. “Knowledge graphs: Foundations”. In Exploiting Linked Data and Knowledge Graphs in Large Organisations. Springer. Part of the whole book: http://link.springer.com/10.1007/978-3-319-45654-6

“The term Knowledge Graph was coined by Google in 2012, referring to their use of semantic knowledge in Web Search (“Things, not strings”), and is recently also used to refer to Semantic Web knowledge bases such as DBpedia or YAGO. From a broader perspective, any graph-based representation of some knowledge could be considered a knowledge graph (this would include any kind of RDF dataset, as well as description logic ontologies). However, there is no common definition about what a knowledge graph is and what it is not. Instead of attempting a formal definition of what a knowledge graph is, we restrict ourselves to a minimum set of characteristics of knowledge graphs, which we use to tell knowledge graphs from other collections of knowledge which we would not consider as knowledge graphs. A knowledge graph mainly describes real world entities and their interrelations, organized in a graph. defines possible classes and relations of entities in a schema. allows for potentially interrelating arbitrary entities with each other. covers various topical domains.” Paulheim, H. (2017). Knowledge graph refinement: A survey of approaches and evaluation methods. Semantic Web, 8(3), 489-508. http://www.semantic-web-journal.net/system/files/swj1167.pdf

“ISI’s Center on Knowledge Graphs research group combines artificial intelligence, the semantic web, and database integration techniques to solve complex information integration problems. We leverage general research techniques across information-intensive disciplines, including medical informatics, geospatial data integration and the social Web.” http://usc-isi-i2.github.io/home/

Just as I was “finalizing” my list to send to colleagues, I found a poster all about definitions: Ehrlinger, L., & Wöß, W. (2016). Towards a Definition of Knowledge Graphs. SEMANTiCS (Posters, Demos, SuCCESS), 48. http://ceur-ws.org/Vol-1695/paper4.pdf

Its Table 1: Selected definitions of knowledge graph has the following definitions (for citations see that paper):

“A knowledge graph (i) mainly describes real world entities and their interrelations, organized in a graph, (ii) defines possible classes and relations of entities in a schema, (iii) allows for potentially interrelating arbitrary entities with each other and (iv) covers various topical domains.” Paulheim [16]

“Knowledge graphs are large networks of entities, their semantic types, properties, and relationships between entities.” Journal of Web Semantics [12]

“Knowledge graphs could be envisaged as a network of all kind things which are relevant to a specific domain or to an organization. They are not limited to abstract concepts and relations but can also contain instances of things like documents and datasets.” Semantic Web Company [3]

“We define a Knowledge Graph as an RDF graph. An RDF graph consists of a set of RDF triples where each RDF triple (s, p, o) is an ordered set of the following RDF terms: a subject s ∈ U ∪ B, a predicate p ∈ U, and an object o ∈ U ∪ B ∪ L. An RDF term is either a URI u ∈ U, a blank node b ∈ B, or a literal l ∈ L.” Färber et al.
[7] “[…] systems exist, […], which use a variety of techniques to extract new knowledge, in the form of facts, from the web. These facts are interrelated, and hence, recently this extracted knowledge has been referred to as a knowledge graph.” Pujara et al. [17] “A knowledge graph is a graph that models semantic knowledge, where each node is a real-world concept, and each edge represents a relationship between two concepts” Fang, Y., Kuan, K., Lin, J., Tan, C., & Chandrasekhar, V. (2017). Object detection meets knowledge graphs. https://oar.a-star.edu.sg/jspui/handle/123456789/2147 “things not strings” – Google https://googleblog.blogspot.com/2012/05/introducing-knowledge-graph-things-not.html Tags: knowledge graph, knowledge representation, quotations Posted in information ecosystem, semantic web | Comments (0) QOTD: Doing more requires thinking less December 1st, 2018 by jodi by the aid of symbolism, we can make transitions in reasoning almost mechanically by the eye which would otherwise call into play the higher faculties of the brain. …Civilization advances by extending the number of important operations that we can perform without thinking about them. Operations of thought are like cavalry charges in a battle — they are strictly limited in number, they require fresh horses, and must only be made at decisive moments. One very important property for symbolism to possess is that it should be concise, so as to be visible at one glance of the eye and be rapidly written. – Whitehead, A.N. (1911). An introduction to mathematics, Chapter 5, “The Symbolism of Mathematics” (page 61 in this version) HT to Santiago Nuñez-Corrales (Illinois page for Santiago Nuñez-Corrales, LinkedIn for Santiago Núñez-Corrales) who used part of this quote in a Conceptual Foundations Group talk, Nov 29. From my point of view, this is why memorizing multiplication tables is not now irrelevant; why new words for concepts are important; and underlies a lot of scientific advancement. Tags: cavalry, modes of thought, QOTD, symbolism Posted in information ecosystem, random thoughts | Comments (0) QOTD: Sally Jackson on how disagreement makes arguments more explicit June 19th, 2018 by jodi Sally Jackson explicates the notion of the “disagreement space” in a new Topoi article: “a position that remains in doubt remains in need of defense”1   “The most important theoretical consequence of seeing argumentation as a system for management of disagreement is a reversal of perspective on what arguments accomplish. Are arguments the means by which conclusions are built up from established premises? Or are they the means by which participants drill down from disagreements to locate how it is that they and others have arrived at incompatible positions? A view of argumentation as a process of drilling down from disagreements suggests that arguers themselves do not simply point to the reasons they hold for a particular standpoint, but sometimes discover where their own beliefs come from, under questioning by others who do not share their beliefs. A logical analysis of another’s argument nearly always involves first making the argument more explicit, attributing more to the author than was actually said. This is a familiar enough problem for analysts; my point is that it is also a pervasive problem for participants, who may feel intuitively that something is seriously wrong in what someone else has said but need a way to pinpoint exactly what. 
Getting beliefs externalized is not a precondition for argument, but one of its possible outcomes.”2 From Sally Jackson’s Reason-Giving and the Natural Normativity of Argumentation.3 The original treatment of disagreement space is cited to a book chapter revising an ISSA 1992 paper4, somewhat harder to get one’s hands on. p 12, Sally Jackson. Reason-Giving and the Natural Normativity of Argumentation. Topoi. 2018 Online First. http://doi.org/10.1007/s11245-018-9553-5 [↩] p 10, Sally Jackson. Reason-Giving and the Natural Normativity of Argumentation. Topoi. 2018 Online First. http://doi.org/10.1007/s11245-018-9553-5 [↩] Sally Jackson. Reason-Giving and the Natural Normativity of Argumentation. Topoi. 2018 Online First. http://doi.org/10.1007/s11245-018-9553-5 [↩] Jackson S (1992) “Virtual standpoints” and the pragmatics of conversational argument. In: van Eemeren FH, Grootendorst R, Blair JA, Willard CA (eds) Argument illuminated. International Centre for the Study of Argumentation, Amsterdam, pp. 260–226 [↩] Tags: argumentation, argumentation norms, disagreement space Posted in argumentative discussions | Comments (0) QOTD: Working out scientific insights on paper, Lavoisier case study July 12th, 2017 by jodi …language does do much of our thinking for us, even in the sciences, and rather than being an unfortunate contamination, its influence has been productive historically, helping individual thinkers generate concepts and theories that can then be put to the test. The case made here for the constitutive power of figures [of speech] per se supports the general point made by F.L. Holmes in a lecture addressed to the History of Science Society in 1987. A distinguished historian of medicine and chemistry, Holmes based his study of Antoine Lavoisier on the French chemist’s laboratory notebooks. He later examined drafts of Lavoisier’s published papers and discovered that Lavoisier wrote many versions of his papers and in the course of careful revisions gradually worked out the positions he eventually made public (Holmes, 221). Holmes, whose goal as a historian is to reconstruct the careful pathways and fine structure of scientific insights, concluded from his study of Lavoisier’s drafts We cannot always tell whether a thought that led him to modify a passage, recast an argument, or develop an alternative interpretation occurred while he was still engaged in writing what he subsequently altered, or immediately afterward, or after some interval during which he occupied himself with something else; but the timing is, I believe, less significant than the fact that the new developments were consequences of the effort to express ideas and marshall supporting information on paper (225). – page xi of Rhetorical Figures in Science by Jeanne Fahnestock, Oxford University Press, 1999. She is quoting Frederich L. Holmes. 1987. Scientific writing and scientific discovery. Isis 78:220-235. DOI:10.1086/354391 As Moore summarizes, Lavoisier wrote at least six drafts of the paper over a period of at least six months. However, his theory of respiration did not appear until the fifth draft. Clearly, Lavoisier’s writing helped him refine and understand his ideas. Moore, Randy. Language—A Force that Shapes Science. Journal of College Science Teaching 28.6 (1999): 366. 
http://www.jstor.org/stable/42990615 (which I quoted in a review I wrote recently) Fahnestock adds: “…Holmes’s general point [is that] there are subtle interactions ‘between writing, thought, and operations in creative scientific activity’ (226).” Tags: Lavoisier, revision, rhetoric of science, scientific communication, scientific writing Posted in future of publishing, information ecosystem, scholarly communication | Comments (0) David Liebovitz: Achieving Care transformation by Infusing Electronic Health Records with Wisdom May 1st, 2017 by jodi Today I am at the Health Data Analytics summit. The title of the keynote talk is Achieving Care transformation by Infusing Electronic Health Records with Wisdom. It’s a delight to hear from a medical informaticist: David M. Liebovitz (publications in Google Scholar), MD, FACP, Chief Medical Information Officer, The University of Chicago. He graduated from University of Illinois in electrical engineering, making this a timely talk as the engineering-focused Carle Illinois College of Medicine gets going. David Liebovitz started with a discussion of the data problems — problem lists, medication lists, family history, rules, results, notes — which will be familiar to anyone using EHRs or working with EHR data. He draws attention also to the human problems — both in terms of provider “readiness” (e.g. their vision for population-level health) as well as about “current expectations”. (An example of such an expectation is a “main clinician satisfier” he closed with: U Chicago is about to turn on outbound faxing from the EHR!) He mentioned also the importance of resilience. He mentioned customizing systems as a risk when the vendor makes upstream changes (this is not unique to healthcare but a threat to innovation and experimentation with information systems in other industries.) Still, in managing the EHR, there is continual optimization, scored based on a number of factors. He mentioned: Safety Quality/patient experience Regulatory/legal Financial Usability/productivity Availability of alternative solutions As well as weighting for old requests. He emphasized the complexity of healthcare in several ways: “Nobody knew that healthcare could be so complicated.” – POTUS Showing the Medicare readmissions adjustment factors Pharmacy pricing, an image (showing kickbacks among other things) from “Prices That Are Too High”, Chapter 5, The Healthcare Imperative: Lowering Costs and Improving Outcomes: Workshop Series Summary (2010)  National Academies Press doi:10.17226/12750 An image from “Prices That Are Too High”, Chapter 5, The Healthcare Imperative: Lowering Costs and Improving Outcomes: Workshop Series Summary (2010) Icosystem’s diagram of the complexity of the healthcare system Icosystem – complexity of the healthcare system Another complexity is the modest impact of medical care compared to other factors such as the impact of socioeconomic and political context on equity in health and well-being (see the WHO image below). For instance, there is a large impact of health behaviors, which “happen in larger social contexts.” (See the Relative Contribution of Multiple Determinants to Health, August 21, 2014, Health Policy Briefs) Solar O, Irwin A. A conceptual framework for action on the social determinants of health. Social Determinants of Health Discussion Paper 2 (Policy and Practice). 
Given this complexity, David Liebovitz stresses that we need to start with the right model, “simultaneously improving population health, improving the patient experience of care, and reducing per capita cost”. (See Stiefel M, Nolan K. A Guide to Measuring the Triple Aim: Population Health, Experience of Care, and Per Capita Cost. IHI Innovation Series white paper. Cambridge, Massachusetts: Institute for Healthcare Improvement; 2012). Table 1 from Stiefel M, Nolan K. A Guide to Measuring the Triple Aim: Population Health, Experience of Care, and Per Capita Cost. IHI Innovation Series white paper. Cambridge, Massachusetts: Institute for Healthcare Improvement; 2012. Given the modest impact of medical care, and of data, he suggests that we should choose the right outcomes. David Liebovitz says that “not enough attention has been paid to usability”; I completely agree and suggest that information scientists, human factors engineers, and cognitive ergonomists help mainstream medical informaticists fill this gap. He put up Jakob Nielsen’s 10 usability heuristics for user interface design. A vivid example is whether a patient’s resuscitation preferences are shown (which seems to depend on the particular EHR screen): the system doesn’t highlight where we are in the system. For providers, he says user control and freedom are very important. He suggests that there are only a few key tasks. A provider should be able to do ANY of these things wherever they are in the chart: put a note, order something, or send a message. Similarly, EHR should support recognition (“how do I admit a patient again?”) rather than requiring recall. Meanwhile, on the decision support side he highlights the (well-known) problems around interruptions by saying that speed is everything and changing direction is much easier than stopping. Here he draws on some of his own work, describing what he calls a “diagnostic process aware workflow”: David Liebovitz. Next steps for electronic health records to improve the diagnostic process. Diagnosis 2015 2(2) 111-116. doi:10.1515/dx-2014-0070 Can we predict X better? Yes, he says (for instance pointing to Table 3 of “Can machine-learning improve cardiovascular risk prediction using routine clinical data?” and its machine learning analysis of over 300,000 patients, based on variables chosen from previous guidelines and expert-informed selection–generating further support for aspects such as aloneness, access to resources, socio-economic status). But what’s really needed, he says, is to: Predict the best next medical step, iteratively Predict the best next lifestyle step, iteratively (And what to do about genes and epigenetic measures?) He shows an image of “All of our planes in the air” from flightaware, drawing the analogy that we want to work on “optimal patient trajectories” — predicting what are the “turbulent events” to avoid. This is not without challenges. He points to three: Data privacy (He suggests Google DeepMind and healthcare in an age of algorithms. Powles, J. & Hodson, H. Health Technol. (2017). doi:10.1007/s12553-017-0179-1) Two sorts of mismatches between the current situation and where we want to go: For instance the source of data being from finance Certain basic current clinician needs (e.g. that a main clinician satisfier is that UChicago is soon to turn on outbound faxing from their EHR — and that an ongoing source of dissatisfaction: managing volume of inbound faxes.)
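To make the flavor of that prediction work concrete, here is a minimal sketch of the kind of risk model the cited study describes: a classifier trained on routine clinical variables. It is an illustration only, not the paper's actual pipeline; the file name and column names are hypothetical placeholders.

# A minimal sketch (not the cited study's method) of predicting a
# cardiovascular event from routine clinical variables with scikit-learn.
# The CSV file and column names below are hypothetical placeholders.
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import roc_auc_score

df = pd.read_csv("routine_clinical_data.csv")  # hypothetical EHR extract
features = ["age", "systolic_bp", "total_cholesterol", "smoker",
            "diabetes", "socioeconomic_index"]  # guideline-style variables
X_train, X_test, y_train, y_test = train_test_split(
    df[features], df["cv_event_10yr"], test_size=0.2, random_state=0)

model = GradientBoostingClassifier().fit(X_train, y_train)
print("Held-out AUC:", roc_auc_score(y_test, model.predict_proba(X_test)[:, 1]))

The point is only that such a model scores risk from data already in the record; the harder question raised in the talk is what to do with that score, one next step at a time.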
He closes suggesting that we: Finish the basics Address key slices of the spectrum Descriptive/prescriptive Begin the prescriptive journey: impact one trajectory at a time. Tags: data analytics, electronic health records, healthcare systems, medical informatics Posted in information ecosystem | Comments (0) « Older Entries Recent Posts Paid graduate hourly research position at UIUC for Spring 2021 Avoiding long-haul air travel during the COVID-19 pandemic Paid Undergraduate Research position at UIUC for Fall & Spring 2020 #ShutDownSTEM #strike4blacklives #ShutDownAcademia QOTD: Storytelling in protest and politics Monthly December 2020 October 2020 August 2020 June 2020 March 2020 Meta Log in Valid XHTML XFN WordPress Wordpress powers jodischneider.com/blog. Layers theme Designed by Jai Pandya. jodischneider-com-7291 ---- jodischneider.com/blog jodischneider.com/blog reading, technology, stray thoughts Paid graduate hourly research position at UIUC for Spring 2021 Jodi Schneider’s Information Quality Lab (http://infoqualitylab.org) seeks a graduate hourly student for a research project on bias in citation networks. Biased citation benefits authors in the short-term by bolstering grants and papers, making them more easily accepted. However, it can have severe negative consequences for scientific inquiry. Our goal is to find quantitative measures of […] Avoiding long-haul air travel during the COVID-19 pandemic I would not recommend long-haul air travel at this time. An epidemiological study of a 7.5 hour flight from the Middle East to Ireland concluded that 4 groups (13 people), traveling from 3 continents in four groups, who used separate airport lounges, were likely infected in flight. The flight had 17% occupancy (49 passengers/283 seats; […] Paid Undergraduate Research position at UIUC for Fall & Spring 2020 University of Illinois undergraduates are encouraged to apply for a position in my lab. I particularly welcome applications from students in the new iSchool BS/IS degree or in the university-wide informatics minor. While I only have 1 paid position open, I also supervise unpaid independent study projects. Dr. Jodi Schneider and the Information Quality Lab <https://infoqualitylab.org> seek […] #ShutDownSTEM #strike4blacklives #ShutDownAcademia I greatly appreciated receiving messages from senior people about their participation in the June 10th #ShutDownSTEM #strike4blacklives #ShutDownAcademia. In that spirit, I am sharing my email bounce message for tomorrow, and the message I sent to my research lab. Email bounce: I am not available by email today: This June 10th is a day of action […] QOTD: Storytelling in protest and politics I recently read Francesca Polletta‘s book It was like a fever: Storytelling in protest and politics (2006, University of Chicago Press). I recommend it! It will appeal to researchers interested in topics such as narrative, strategic communication, (narrative) argumentation, or epistemology (here, of narrative). Parts may also interest activists. The book’s case studies are drawn from the […] Knowledge Graphs: An Aggregation of Definitions I am not aware of a consensus definition of knowledge graph. I’ve been discussing this for awhile with Liliana Giusti Serra, and the topic came up again with my fellow organizers of the knowledge graph session at US2TS as we prepare for a panel. 
I’ve proposed the following main features: RDF-compatible, has a defined schema (usually an […] QOTD: Doing more requires thinking less by the aid of symbolism, we can make transitions in reasoning almost mechanically by the eye which would otherwise call into play the higher faculties of the brain. …Civilization advances by extending the number of important operations that we can perform without thinking about them. Operations of thought are like cavalry charges in a battle […] QOTD: Sally Jackson on how disagreement makes arguments more explicit Sally Jackson explicates the notion of the “disagreement space” in a new Topoi article: “a position that remains in doubt remains in need of defense”1   “The most important theoretical consequence of seeing argumentation as a system for management of disagreement is a reversal of perspective on what arguments accomplish. Are arguments the means by […] QOTD: Working out scientific insights on paper, Lavoisier case study …language does do much of our thinking for us, even in the sciences, and rather than being an unfortunate contamination, its influence has been productive historically, helping individual thinkers generate concepts and theories that can then be put to the test. The case made here for the constitutive power of figures [of speech] per se […] David Liebovitz: Achieving Care transformation by Infusing Electronic Health Records with Wisdom Today I am at the Health Data Analytics summit. The title of the keynote talk is Achieving Care transformation by Infusing Electronic Health Records with Wisdom. It’s a delight to hear from a medical informaticist: David M. Liebovitz (publications in Google Scholar), MD, FACP, Chief Medical Information Officer, The University of Chicago. He graduated from […] joinpeertube-org-5198 ---- JoinPeerTube developed by Home Create an account News Help Contribute Git Languages English Français Deutsch Español Esperanto Italiano Polski Português русский svenska magyar galego 日本語 繁體中文(台灣) Translate Free software to take back control of your videos image/svg+xml PeerTube, developed by Framasoft, is the free and decentralized alternative to video platforms, providing you over 400,000 videos published by 60,000 users and viewed over 15 million times What is PeerTube? See the instances listDiscover our content selection The Hackers War Watch the video The Hackers War The Hacker Wars (2014) is a film about the targeting of hacktivists and journalists by the US government. The film follows the information warriors who are fighting back, and it depicts the dangerous battle in which (h)ac(k)tivists fight for information freedom. Hacktivists impact the world in a new way by using the government's information against itself to call out those in power. The Hacker Wars takes you to the front lines of the high-stakes battle over the fate of the Internet, freedom and privacy. #information #freedom #privacy Gaby Weber Documentaries Discover the channel Gaby Weber Documentaries Gabriele "Gaby" Weber is a German journalist. She has been reporting from South America since the mid-eighties, mainly for ARD. Her focal points are international politics, human rights and the history of German-Latin American relations. On this channel, you can discover her documentary movies in german, english and spanish languages. 
#documentary #geopolitical #journalism Blender Go on the instance Blender The Official Blender Foundation PeerTube instance gives you access to videos presenting the evolutions of the 3D creation software, tutorials and animated films supported by the Blender Foundation. All videos published on this instance are under a Creative Commons Attribution licence. #Blender Browse contents What is? PeerTube aspires to be a decentralized and free/libre alternative to video broadcasting services. Our aim is not to replace them, but rather to simultaneously offer something else, with different values. A federation of interconnected hosting services PeerTube is not meant to become a huge platform that would centralize videos from all around the world. Rather, it is a network of small, inter-connected video hosters. Anyone with a modicum of technical skills can host a PeerTube server, aka an instance. Each instance hosts its users and their videos. In this way, every instance is created, moderated and maintained independently by various administrators. Discover PeerTube instances You can still watch videos hosted by other instances from your account, though, if the administrator of your instance has previously connected it with other instances. This is just how a federation works! And there's more! PeerTube uses ActivityPub, a federating protocol that allows you to interact with other software, provided they also use this protocol. For example, PeerTube and Mastodon (a Twitter alternative) are connected: you can follow a PeerTube user from Mastodon (the latest videos from the PeerTube account you follow will appear in your feed), and even comment on a PeerTube-hosted video directly from your Mastodon account. Open-source, free/libre license code Mainstream online video broadcasting services make money off of your data by analyzing your interactions so that they can then bombard you with targeted advertising. PeerTube is not subject to any corporate monopoly, does not rely on ads and does not track you. Most importantly, you are a person to PeerTube, not a product in need of profiling so as to be stuck in video loops. For example, PeerTube doesn't use any biased recommendation algorithms to keep you online for hours on end. All of this is made possible by PeerTube's free/libre license (GNU-AGPL). Its code is a digital "common", that belongs to everybody, instead of a secret formula that belongs to Google (in the case of Youtube) or to Vivendi/Bolloré (Dailymotion). This free/libre license guarantees our fundamental freedoms as users and allows many contributors to offer evolutions and new features. Are you a video maker? With PeerTube, choose your hosting company and the rules you believe in. YouTube has clearly gone astray: its hoster, Google-Alphabet, can enforce its ContentID system (the infamous "Robocopyright") or its video recommendation system, all of which appear to be as obscure as they are unfair. Direct contact with a human-scale hoster allows for two things: you no longer are the client of a huge tech company, and you can nurture a special relationship with your hoster, who distributes your data. With PeerTube, you get to choose your hosting provider according to their terms of use, such as their disk space limit per user, their moderation policy, who they choose to federate with... You are not speaking with a huge tech company, so you can talk it out in case of any issue, need, desire...
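As a rough illustration of the federation described above, the following is a minimal sketch of the kind of ActivityPub "Follow" activity exchanged when a Mastodon account follows a PeerTube account. The instance domains and actor URLs are hypothetical placeholders, not real servers, and real implementations add HTTP signatures and delivery details omitted here.

# Sketch of an ActivityPub "Follow" activity (ActivityStreams 2.0 JSON),
# of the kind sent when a Mastodon user follows a PeerTube account.
# The domains and URLs below are hypothetical placeholders.
import json

follow_activity = {
    "@context": "https://www.w3.org/ns/activitystreams",
    "id": "https://mastodon.example/users/alice/follows/42",
    "type": "Follow",
    "actor": "https://mastodon.example/users/alice",             # the follower
    "object": "https://peertube.example/accounts/some_channel",  # the PeerTube account
}

print(json.dumps(follow_activity, indent=2))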
Browse/discover PeerTube instances About peer-to-peer broadcasting and watching The PeerTube software can, whenever necessary, use a peer-to-peer protocol (P2P) to broadcast viral videos, lowering the load on their hosts. In this way, when you watch a video, your computer contributes to its broadcast. If a lot of people are watching the same video at the same time, their browsers automatically send small pieces of the video to the other viewers. The server resources are not over-exploited: the stream is split, the network optimized. It might not look like it, but thanks to peer-to-peer broadcasting, popular video makers and their videos are no longer forced to be hosted by big companies, whose infrastructure can stand thousands of views at the same time... or to pay for a robust but extremely expensive independent video host. Your move! Browse contents Sign up Enjoy every feature: history, subscriptions, playlists, notifications... Who is behind? PeerTube is free/libre software funded by a French non-profit organization: Framasoft Our organization started in 2004, and now devotes itself to popular education about digital technology issues. We are a small structure of less than 40 members and under 10 employees, well-known for the De-google-ify Internet project, when we offered 34 ethical and alternative online tools. As a public interest organization, over 90% of our funding comes from donations (tax deductible for French taxpayers). Thanks to our crowdfunding (from March to July 2018), Framasoft was able to employ PeerTube's main developer. After a beta release in March 2018, release 1 came out in November 2018. Since then, several intermediary releases have brought many features along. Several collectives have already created PeerTube hosts, laying the foundation for the federation. The more people use, support, and contribute to PeerTube, the quicker it will become a concrete alternative to platforms like YouTube. Donate to Framasoft Legal notices Contact Newsletter Forum Press kit JoinPeerTube Git PeerTube Git Website developed by Framasoft and designed by Maiwann Illustrations from What is PeerTube video, created by LILA - ZeMarmot Team PeerTube mascot created by David Revoy PeerTube news! content licensed under CC-BY-SA journal-code4lib-org-790 ---- The Code4Lib Journal Mission Editorial Committee Process and Structure Code4Lib Issue 50, 2021-02-10 Editorial Eric Hanson Resuming our publication schedule Managing an institutional repository workflow with GitLab and a folder-based deposit system Whitney R. Johnson-Freeman, Mark E. Phillips, and Kristy K. Phillips Institutional Repositories (IR) exist in a variety of configurations and in various states of development across the country. Each organization with an IR has a workflow that can range from explicitly documented and codified sets of software and human workflows, to ad hoc assortments of methods for working with faculty to acquire, process and load items into a repository. The University of North Texas (UNT) Libraries has managed an IR called UNT Scholarly Works for the past decade but has until recently relied on ad hoc workflows. Over the past six months, we have worked to improve our processes in a way that is extensible and flexible while also providing a clear workflow for our staff to process submitted and harvested content. Our approach makes use of GitLab and its associated tools to track and communicate priorities for a multi-user team processing resources.
We paired this Web-based management with a folder-based system for moving the deposited resources through a sequential set of processes that are necessary to describe, upload, and preserve the resource. This strategy can be used in a number of different applications and can serve as a set of building blocks that can be configured in different ways. This article will discuss which components of GitLab are used together as tools for tracking deposits from faculty as they move through different steps in the workflow. Likewise, the folder-based workflow queue will be presented and described as implemented at UNT, and examples for how we have used it in different situations will be presented. Customizing Alma and Primo for Home & Locker Delivery Christina L. Hennessey Like many Ex Libris libraries in Fall 2020, our library at California State University, Northridge (CSUN) was not physically open to the public during the 2020-2021 academic year, but we wanted to continue to support the research and study needs of our over 38,000 university students and 4,000 faculty and staff. This article will explain our Alma and Primo implementation to allow for home mail delivery of physical items, including policy decisions, workflow changes, customization of request forms through labels and delivery skins, customization of Alma letters, a Python solution to add the “home” address type to patron addresses to make it all work, and will include relevant code samples in Python, XSL, CSS, XML, and JSON. In Spring 2021, we will add the on-site locker delivery option in addition to home delivery, and this article will include new system changes made for that option. GaNCH: Using Linked Open Data for Georgia’s Natural, Cultural and Historic Organizations’ Disaster Response Cliff Landis, Christine Wiseman, Allyson F. Smith, Matthew Stephens In June 2019, the Atlanta University Center Robert W. Woodruff Library received a LYRASIS Catalyst Fund grant to support the creation of a publicly editable directory of Georgia’s Natural, Cultural and Historical Organizations (NCHs), allowing for quick retrieval of location and contact information for disaster response. By the end of the project, over 1,900 entries for NCH organizations in Georgia were compiled, updated, and uploaded to Wikidata, the linked open data database from the Wikimedia Foundation. These entries included directory contact information and GIS coordinates that appear on a map presented on the GaNCH project website (https://ganch.auctr.edu/), allowing emergency responders to quickly search for NCHs by region and county in the event of a disaster. In this article we discuss the design principles, methods, and challenges encountered in building and implementing this tool, including the impact the tool has had on statewide disaster response after implementation. Archive This Moment D.C.: A Case Study of Participatory Collecting During COVID-19 Julie Burns, Laura Farley, Siobhan C. Hagan, Paul Kelly, and Lisa Warwick When the COVID-19 pandemic brought life in Washington, D.C. to a standstill in March 2020, staff at DC Public Library began looking for ways to document how this historic event was affecting everyday life. Recognizing the value of first-person accounts for historical research, staff launched Archive This Moment D.C. to preserve the story of daily life in the District during the stay-at-home order. Materials were collected from public Instagram and Twitter posts submitted through the hashtag #archivethismomentdc. 
In addition to social media, creators also submitted materials using an Airtable webform set up for the project and through email. Over 2,000 digital files were collected. This article will discuss the planning, professional collaboration, promotion, selection, access, and lessons learned from the project; as well as the technical setup, collection strategies, and metadata requirements. In particular, this article will include a discussion of the evolving collection scope of the project and the need for clear ethical guidelines surrounding privacy when collecting materials in real-time. Advancing ARKs in the Historical Ontology Space Mat Kelly, Christopher B. Rauch, Jane Greenberg, Sam Grabus, Joan Boone, John Kunze and Peter M. Logan This paper presents the application of Archival Resource Keys (ARKs) for persistent identification and resolution of concepts in historical ontologies. Our use case is the 1910 Library of Congress Subject Headings (LCSH), which we have converted to the Simple Knowledge Organization System (SKOS) format and will use for representing a corpus of historical Encyclopedia Britannica articles. We report on the steps taken to assign ARKs in support of the Nineteenth-Century Knowledge Project, where we are using the HIVE vocabulary tool to automatically assign subject metadata from both the 1910 LCSH and the contemporary LCSH faceted, topical vocabulary to enable the study of the evolution of knowledge. Considered Content: a Design System for Equity, Accessibility, and Sustainability Erinn Aspinall, Amy Drayer, Gabe Ormsby, and Jen Neveau The University of Minnesota Libraries developed and applied a principles-based design system to their Health Sciences Library website. With the design system at its center, the revised site was able to achieve accessible, ethical, inclusive, sustainable, responsible, and universal design. The final site was built with elegantly accessible semantic HTML-focused code on Drupal 8 with highly curated and considered content, meeting and exceeding WCAG 2.1 AA guidance and addressing cognitive and learning considerations through the use of plain language, templated pages for consistent page-level organization, and no hidden content. As a result, the site better supports all users regardless of their abilities, attention level, mental status, reading level, and reliability of their internet connection, all of which are especially critical now as an elevated number of people experience crises, anxieties, and depression. Robustifying Links To Combat Reference Rot Shawn Jones, Martin Klein, and Herbert Van de Sompel Links to web resources frequently break, and linked content can change at unpredictable rates. These dynamics of the Web are detrimental when references to web resources provide evidence or supporting information. In this paper, we highlight the significance of reference rot, provide an overview of existing techniques and their characteristics to address it, and introduce our Robust Links approach, including its web service and underlying API. Robustifying links offers a proactive, uniform, and machine-actionable way to combat reference rot. In addition, we discuss our reasoning and approach aimed at keeping the approach functional for the long term. To showcase our approach, we have robustified all links in this article. 
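To give a concrete sense of what the robustified links described in that abstract can look like, here is a minimal sketch of decorating an HTML anchor with the original URL, a dated archival snapshot, and the snapshot date. The data attribute names follow the Robust Links convention as I understand it, and the snapshot URL and date are hypothetical examples, not output of the authors' web service.

# Sketch: a "robustified" link that carries a dated archival snapshot
# alongside the visible href. The attribute names follow the Robust Links
# convention as I understand it; the snapshot URL and date are hypothetical.
original_url = "https://example.org/report.pdf"
snapshot_url = "https://archive.example/web/20210110120000/https://example.org/report.pdf"
version_date = "2021-01-10"

robust_anchor = (
    f'<a href="{original_url}" '
    f'data-versionurl="{snapshot_url}" '
    f'data-versiondate="{version_date}">the cited report</a>'
)
print(robust_anchor)

Because the original URL, the snapshot, and the date all travel with the link, a reader or a machine can fall back to the archived copy if the live page rots or drifts, which is the "proactive, uniform, and machine-actionable" behavior the abstract describes.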
Machine Learning Based Chat Analysis Christopher Brousseau, Justin Johnson, Curtis Thacker The BYU library implemented a Machine Learning-based tool to perform various text analysis tasks on transcripts of chat-based interactions between patrons and librarians. These text analysis tasks included estimating patron satisfaction and classifying queries into various categories such as Research/Reference, Directional, Tech/Troubleshooting, Policy/Procedure, and others. An accuracy of 78% or better was achieved for each category. This paper details the implementation details and explores potential applications for the text analysis tool. Always Be Migrating Elizabeth McAulay At the University of California, Los Angeles, the Digital Library Program is in the midst of a large, multi-faceted migration project. This article presents a narrative of migration and a new mindset for technology and library staff in their ever-changing infrastructure and systems. This article posits that migration from system to system should be integrated into normal activities so that it is not a singular event or major project, but so that it is a process built into the core activities of a unit. ISSN 1940-5758 Current Issue Issue 50, 2021-02-10 Previous Issues Issue 49, 2020-08-10 Issue 48, 2020-05-11 Issue 47, 2020-02-17 Issue 46, 2019-11-05 Older Issues For Authors Call for Submissions Article Guidelines Log in This work is licensed under a Creative Commons Attribution 3.0 United States License. kcoyle-blogspot-com-8471 ---- Coyle's InFormation Coyle's InFormation Comments on the digital age, which, as we all know, is 42. Monday, March 01, 2021 Digitization Wars, Redux  (NB: IANAL)   Because this is long, you can download it as a PDF here. From 2004 to 2016 the book world (authors, publishers, libraries, and booksellers) was involved in the complex and legally fraught activities around Google’s book digitization project. Once known as “Google Book Search,” the company claimed that it was digitizing books to be able to provide search services across the print corpus, much as it provides search capabilities over texts and other media that are hosted throughout the Internet.  Both the US Authors Guild and the Association of American Publishers sued Google (both separately and together) for violation of copyright. These suits took a number of turns including proposals for settlements that were arcane in their complexity and that ultimately failed. Finally, in 2016 the legal question was decided: digitizing to create an index is fair use as long as only minor portions of the original text are shown to users in the form of context-specific snippets.  We now have another question about book digitization: can books be digitized for the purpose of substituting remote lending in the place of the lending of a physical copy? This has been referred to as “Controlled Digital Lending (CDL),” a term developed by the Internet Archive for its online book lending services. The Archive has considerable experience with both digitization and providing online access to materials in various formats, and its Open Library site has been providing digital downloads of out of copyright books for more than a decade. Controlled digital lending applies solely to works that are presumed to be in copyright.  Controlled digital lending works like this: the Archive obtains and retains a physical copy of a book. The book is digitized and added to the Open Library catalog of works. 
Users can borrow the book for a limited time (2 weeks) after which the book “returns” to the Open Library. While the book is checked out to a user no other user can borrow that “copy.” The digital copy is linked one-to-one with a physical copy, so if more than one copy of the physical book is owned then there is one digital loan available for each physical copy.  The Archive is not alone in experimenting with lending of digitized copies: some libraries have partnered with the Archive’s digitization and lending service to provide digital lending for library-owned materials. In the case of the Archive the physical books are not available for lending. Physical libraries that are experimenting with CDL face the added step of making sure that the physical book is removed from circulation while the digitized book is on loan, and reversing that on return of the digital book.  Although CDL has an air of legality due to limiting lending to one user at a time, authors and publishers associations had raised objections to the practice. [nwu] However, in March of 2020 the Archive took a daring step that pushed their version of the CDL into litigation: using the closing of many physical libraries due to the COVID pandemic as its rationale, the Archive renamed its lending service the National Emergency Library [nel] and eliminated the one-to-one link between physical and digital copies. Ironically this meant that the Archive was then actually doing what the book industry had accused it of (either out of misunderstanding or as an exaggeration of the threat posed): it was making and lending digital copies beyond its physical holdings. The Archive stated that the National Emergency Library would last only until June of 2020, presumably because by then the COVID danger would have passed and libraries would have re-opened. In June the Archive’s book lending service returned to the one-to-one model. Also in June a suit was filed by four publishers (Hachette, HarperCollins, Penguin Random House, and Wiley) in the US District Court of the Southern District of New York. [suit]  The Controlled Digital Lending, like the Google Books project, holds many interesting questions about the nature of “digital vs physical,” not only in a legal sense but in a sense of what it means to read and to be a reader today. The lawsuit not only does not further our understanding of this fascinating question; it sinks immediately into hyperbole, fear-mongering, and either mis-information or mis-direction. That is, admittedly, the nature of a lawsuit. What follows here is not that analysis but gives a few of the questions that are foremost in my mind.  Apples and Oranges   Each of the players in this drama has admirable reasons for their actions. The publishers explain in their suit that they are acting in support of authors, in particular to protect the income of authors so that they may continue to write. The Authors’ Guild provides some data on author income, and by their estimate the average full-time author earns less than $20,000 per year, putting them at poverty level.[aghard] (If that average includes the earnings of highly paid best selling authors, then the actual earnings of many authors is quite a bit less than that.)  The Internet Archive is motivated to provide democratic access to the content of books to anyone who needs or wants it. Even before the pandemic caused many libraries to close the collection housed at the Archive contained some works that are available only in a few research libraries. 
This is because many of the books were digitized during the Google Books project which digitized books from a small number of very large research libraries whose collections differ significantly from those of the public libraries available to most citizens.  Where the pronouncements of both parties fail is in making a false equivalence between some authors and all authors, and between some books and all books, and the result is that this is a lawsuit pitting apples against oranges. We saw in the lawsuits against Google that some academic authors, who may gain status based on their publications but very little if any income, did not see themselves as among those harmed by the book digitization project. Notably the authors in this current suit, as listed in the bibliography of pirated books in the appendix to the lawsuit, are ones whose works would be characterized best as “popular” and “commercial,” not academic: James Patterson, J. D. Salinger, Malcolm Gladwell, Toni Morrison, Laura Ingalls Wilder, and others. Not only do the living authors here earn above the poverty level, all of them provide significant revenue for the publishers themselves. And all of the books listed are in print and available in the marketplace. No mention is made of out-of-print books, no academic publishers seem to be involved.  On the part of the Archive, they state that their digitized books fill an educational purpose, and that their collection includes books that are not available in digital format from publishers: “ While Overdrive, Hoopla, and other streaming services provide patrons access to latest best sellers and popular titles,  the long tail of reading and research materials available deep within a library’s print collection are often not available through these large commercial services.  What this means is that when libraries face closures in times of crisis, patrons are left with access to only a fraction of the materials that the library holds in its collection.”[cdl-blog] This is undoubtedly true for some of the digitized books, but the main thesis of the lawsuit points out that the Archive has digitized and is also lending current popular titles. The list of books included in the appendix of the lawsuit shows that there are in-copyright and most likely in-print books of a popular reading nature that have been part of the CDL. These titles are available in print and may also be available as ebooks from the publishers. Thus while the publishers are arguing that current, popular books should not be digitized and loaned (apples), the Archive is arguing that they are providing access to items not available elsewhere, and for educational purposes (oranges).  The Law  The suit states that publishers are not questioning copyright law, only violations of the law. “For the avoidance of doubt, this lawsuit is not about the occasional transmission of a title under appropriately limited circumstances, nor about anything permissioned or in the public domain. On the contrary, it is about IA’s purposeful collection of truckloads of in-copyright books to scan, reproduce, and then distribute digital bootleg versions online.” ([Suit] Page 3). This brings up a whole range of legal issues in regard to distributing digital copies of copyrighted works. 
There have been lengthy arguments about whether copyright law could permit first sale rights for digital items, and the answer has generally been no; some copyright holders have made the argument that since transfer of a digital file is necessarily the making of a copy there can be no first sale rights for those files. [1stSale] [ag1] Some ebook systems, such as the Kindle, have allowed time-limited person-to-person lending for some ebooks. This is governed by license terms between Amazon and the publishers, not by the first sale rights of the analog world. Section 108 of the copyright law does allow libraries and archives to make a limited number of copies. The first point of section 108 states that libraries can make a single copy of a work as long as 1) it is not for commercial advantage, 2) the collection is open to the public and 3) the reproduction includes the copyright notice from the original. This sounds like what the Archive is doing. However, the next two sections (b and c) provide limitations on that first section that appear to put the Archive in legal jeopardy: section “b” clarifies that copies may be made for preservation or security; section “c” states that the copies can be made if the original item is deteriorating and a replacement can no longer be purchased. Neither of these applies to the Archive’s lending. In addition to its lending program, the Archive provides downloads of scanned books in DAISY format for those who are certified as visually impaired by the National Library Service for the Blind and Physically Handicapped in the US. This is covered in section 121A of the copyright law, Title 17, which allows the distribution of copyrighted works in accessible formats. This service could possibly be cited as a justification of the scanning of in-copyright works at the Archive, although without mitigating the complaints about lending those copies to others. This is a laudable service of the Archive if scans are usable by the visually impaired, but the DAISY-compatible files are based on the OCR’d text, which can be quite dirty. Without data on downloads under this program it is hard to know the extent to which this program benefits visually impaired readers. Lending Most likely as part of the strategy of the lawsuit, very little mention is made of “lending.” Instead the suit uses terms like “download” and “distribution” which imply that the user of the Archive’s service is given a permanent copy of the book: “With just a few clicks, any Internet-connected user can download complete digital copies of in-copyright books from Defendant.” ([suit] Page 2). “... distributing the resulting illegal bootleg copies for free over the Internet to individuals worldwide.” ([suit] Page 14). Publishers were reluctant to allow the creation of ebooks for many years until they saw that DRM would protect the digital copies. It then was another couple of years before they could feel confident about lending - and by lending I mean lending by libraries. It appears that Overdrive, the main library lending platform for ebooks, worked closely with publishers to gain their trust. The lawsuit questions whether the lending technology created by the Archive can be trusted. “...Plaintiffs have legitimate fears regarding the security of their works both as stored by IA on its servers” ([suit] Page 47). In essence, the suit accuses IA of a lack of transparency about its lending operation.
Of course, any collaboration between IA and publishers around the technology is not possible because the two are entirely at odds and the publishers would reasonably not cooperate with folks they see as engaged in piracy of their property.  Even if the Archive’s lending technology were proven to be secure, lending alone is not the issue: the Archive copied the publishers’ books without permission prior to lending. In other words, they were lending content that they neither owned (in digital form) nor had licensed for digital distribution. Libraries pay, and pay dearly, for the ebook lending service that they provide to their users. The restrictions on ebooks may seem to be a money-grab on the part of publishers, but from their point of view it is a revenue stream that CDL threatens.  Is it About the Money? “... IA rakes in money from its infringing services…” ([suit] Page 40). (Note: publishers earn, IA “rakes in”) “Moreover, while Defendant promotes its non-profit status, it is in fact a highly commercial enterprise with millions of dollars of annual revenues, including financial schemes that provide funding for IA’s infringing activities. ([suit] Page 4). These arguments directly address section (a)(1) of Title 17, section 108: “(1) the reproduction or distribution is made without any purpose of direct or indirect commercial advantage”.  At various points in the suit there are references to the Archive’s income, both for its scanning services and donations, as well as an unveiled show of envy at the over $100 million that Brewster Kahle and his wife have in their jointly owned foundation. This is an attempt to show that the Archive derives “direct or indirect commercial advantage” from CDL. Non-profit organizations do indeed have income, otherwise they could not function, and “non-profit” does not mean a lack of a revenue stream, it means returning revenue to the organization instead of taking it as profit. The argument relating to income is weakened by the fact that the Archive is not charging for the books it lends. However, much depends on how the courts will interpret “indirect commercial advantage.” The suit argues that the Archive benefits generally from the scanned books because this enhances the Archive’s reputation which possibly results in more donations. There is a section in the suit relating to the “sponsor a book” program where someone can donate a specific amount to the Archive to digitize a book. How many of us have not gotten a solicitation from a non-profit that makes statements like: “$10 will feed a child for a day; $100 will buy seed for a farmer, etc.”? The attempt to correlate free use of materials with income may be hard to prove.  Reading  Decades ago, when the service Questia was just being launched (Questia ceased operation December 21, 2020), Questia sales people assured a group of us that their books were for “research, not reading.” Google used a similar argument to support its scanning operation, something like “search, not reading.” The court decision in Google’s case decided that Google’s scanning was fair use (and transformative) because the books were not available for reading, as Google was not presenting the full text of the book to its audience.[suit-g]  The Archive has taken the opposite approach, a “books are for reading” view. Beginning with public domain books, many from the Google books project, and then with in-copyright books, the Archive has promoted reading. 
It developed its own in-browser reading software to facilitate reading of the books online. [reader] (*See note below) Although the publishers sued Google for its scanning, they lost due to the “search, not reading” aspect of that project. The Archive has been very clear about its support of reading, which takes the Google justification off the table. “Moreover, IA’s massive book digitization business has no new purpose that is fundamentally different than that of the Publishers: both distribute entire books for reading.” ([suit] Page 5). However, the Archive's statistics on loaned books show that a large proportion of the books are used for 30 minutes or less. “Patrons may be using the checked-out book for fact checking or research, but we suspect a large number of people are browsing the book in a way similar to browsing library shelves.” [ia1] In its article on the CDL, the Center for Democracy and Technology notes that “the majority of books borrowed through NEL were used for less than 30 minutes, suggesting that CDL’s primary use is for fact-checking and research, a purpose that courts deem favorable in a finding of fair use.” [cdt] The complication is that the same service seems to be used both for reading of entire books and as a place to browse or to check individual facts (the facts themselves cannot be copyrighted). These may involve different sets of books, once again making it difficult to characterize the entire set of digitized books under a single legal claim. The publishers claim that the Archive is competing with them using pirated versions of their own products. That leads us to the question of whether the Archive’s books, presented for reading, are effectively substitutes for those of the publishers. Although the Archive offers actual copies, those copies are significantly inferior to the original. However, the question of quality did not change the judgment in the lawsuit against copying of texts by Kinko’s [kinkos], which produced mediocre photocopies from printed and bound publications. It seems unlikely that the quality differential will serve to absolve the Archive from copyright infringement even though the poor quality of some of the books interferes with their readability. Digital is Different Publishers have found a way to monetize digital versions, in spite of some risks, by taking advantage of the ability to control digital files with technology and by licensing, not selling, those files to individuals and to libraries. It’s a “new product” that gets around First Sale because, as it is argued, every transfer of a digital file makes a copy, and it is the making of copies that is covered by copyright law. [1stSale] The upshot of this is that because a digital resource is licensed, not sold, the right to pass along, lend, or re-sell a copy (as per Title 17 section 109) does not apply even though technology solutions that would delete the sender’s copy as the file safely reaches the recipient are not only plausible but have been developed.
[resale]  “Like other copyright sectors that license education technology or entertainment software, publishers either license ebooks to consumers or sell them pursuant to special agreements or terms.” ([suit] Page 15) “When an ebook customer obtains access to the title in a digital format, there are set terms that determine what the user can or cannot do with the underlying file.”([suit] Page 16) This control goes beyond the copyright holder’s rights in law: DRM can exercise controls over the actual use of a file, limiting it to specific formats or devices, allowing or not allowing text-to-speech capabilities, even limiting copying to the clipboard. Publishers and Libraries  The suit claims that publishers and libraries have reached an agreement, an equilibrium. “To Plaintiffs, libraries are not just customers but allies in a shared mission to make books available to those who have a desire to read, including, especially, those who lack the financial means to purchase their own copies.” ([suit] Page 17). In the suit, publishers contrast the Archive’s operation with the relationship that publishers have with libraries. In contrast with the Archive’s lending program, libraries are the “good guys.” “... the Publishers have established independent and distinct distribution models for ebooks, including a market for lending ebooks through libraries, which are governed by different terms and expectations than print books.”([suit] Page 6). These “different terms” include charging much higher prices to libraries for ebooks, limiting the number of times an ebook can be loaned. [pricing1] [pricing2] “Legitimate libraries, on the other hand, license ebooks from publishers for limited periods of time or a limited number of loans; or at much higher prices than the ebooks available for individual purchase.” [agol] The equilibrium of which publishers speak looks less equal from the library side of the equation: library literature is replete with stories about the avarice of publishers in relation to library lending of ebooks. Some authors/publishers even speak out against library lending of physical books, claiming that this cuts into sales. (This same argument has been made for physical books.) “If, as Macmillan has determined, 45% of ebook reads are occurring through libraries and that percentage is only growing, it means that we are training readers to read ebooks for free through libraries instead of buying them. With author earnings down to new lows, we cannot tolerate ever-decreasing book sales that result in even lower author earnings.” [agliblend][ag42] The ease of access to digital books has become a boon for book sales, and ebook sales are now rising while hard copy sales fall. This economic factor is a motivator for any of those engaged with the book market. The Archive’s CDL is a direct affront to the revenue stream that publishers have carved out for specific digital products. There are indications that the ease of borrowing of ebooks - not even needing to go to the physical library to borrow a book - is seen as a threat by publishers. This has already played out in other media, from music to movies.  It would be hard to argue that access to the Archive’s digitized books is merely a substitute for library access. Many people do not have actual physical library access to the books that the Archive lends, especially those digitized from the collections of academic libraries. 
This is particularly true when you consider that the Archive’s materials are available to anyone in the world with access to the Internet. If you don’t have an economic interest in book sales, and especially if you are an educator or researcher, this expanded access could feel long overdue. We need numbers We really do not know much about the uses of the Archive’s book collection. The lawsuit cites some statistics of “views” to show that the infringement has taken place, but the page in question does not explain what is meant by a “view”. Archive pages for downloadable files of metadata records also report “views” which most likely reflect views of that web page, since there is nothing viewable other than the page itself. Open Library book pages give “currently reading” and “have read” stats, but these are tags that users can manually add to the page for the work. To compound things, the 127 books cited in the suit have been removed from the lending service (and are identified in the Archive as being in the collection “litigation works”). Although numbers may not affect the legality of the controlled digital lending, the social impact of the Archive’s contribution to reading and research would be clearer if we had this information. Although the Archive has provided a small number of testimonials, proof of use in educational settings would bolster the claims of social benefit, which in turn could strengthen a fair use defense. Notes (*) The NWU has a slide show [nwu2] that explains what it calls Controlled Digital Lending at the Archive. Unfortunately this document conflates the Archive's book Reader with CDL and therefore muddies the water. It muddies it because it does not distinguish between sending files to dedicated devices (which is what the Kindle is) or to dedicated software like Libby, which libraries use, and the Archive's use of a web-based reader. It is not beyond reason to suppose that the Archive's Reader software does not fully secure loaned items. The NWU claims that files are left in the browser cache that represent all book pages viewed: "There’s no attempt whatsoever to restrict how long any user retains these images". (I cannot reproduce this. In my minor experiments those files disappear at the end of the lending period, but this requires more concerted study.) However, this is not a fault of CDL but a fault of the Reader software. The reader is software that works within a browser window. In general, electronic files that require secure and limited use are not used within browsers, which are general purpose programs. Conflating the Archive's Reader software with Controlled Digital Lending will only hinder understanding. Already CDL has multiple components: 1) digitization of in-copyright materials, and 2) lending of digital copies of in-copyright materials that are owned by the library in a 1-to-1 relation to physical copies (this one-to-one model is sketched below). We can add #3, the leakage of page copies via the browser cache, but I maintain that poorly functioning software does not automatically moot points 1 and 2. I would prefer that we take each point on its own in order to get a clear idea of the issues. The NWU slides also refer to the Archive's API which allows linking to individual pages within books. This is an interesting legal area because it may be determined to be fair use regardless of the legality of the underlying copy. This becomes yet another issue to be discussed by the legal teams, but it is separate from the question of controlled digital lending. Let's stay focused.
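To pin down what the one-to-one model in point 2 amounts to, here is a toy sketch of the lending invariant: concurrent digital loans never exceed the owned physical copies, and each loan expires after the two-week period mentioned earlier. This is an illustration of the concept only, not the Internet Archive's actual implementation; the class and method names are hypothetical.

# Toy model of the controlled digital lending invariant described above
# (point 2): concurrent digital loans never exceed owned physical copies.
# This is an illustration only, not the Internet Archive's implementation.
from datetime import date, timedelta

class CDLTitle:
    LOAN_DAYS = 14  # the two-week loan period mentioned in the post

    def __init__(self, owned_physical_copies: int):
        self.owned = owned_physical_copies
        self.loans = {}  # borrower -> due date

    def checkout(self, borrower: str) -> date:
        self._expire_overdue()
        if borrower in self.loans:
            raise ValueError("this borrower already has the title")
        if len(self.loans) >= self.owned:
            raise ValueError("all owned copies are currently on digital loan")
        due = date.today() + timedelta(days=self.LOAN_DAYS)
        self.loans[borrower] = due
        return due

    def checkin(self, borrower: str) -> None:
        self.loans.pop(borrower, None)

    def _expire_overdue(self) -> None:
        today = date.today()
        self.loans = {b: d for b, d in self.loans.items() if d >= today}

title = CDLTitle(owned_physical_copies=1)
print(title.checkout("reader_a"))   # succeeds: one copy owned, none on loan
try:
    title.checkout("reader_b")      # fails: the single copy is checked out
except ValueError as err:
    print(err)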
Citations [1stSale] https://abovethelaw.com/2017/11/a-digital-take-on-the-first-sale-doctrine/ [ag1] https://www.authorsguild.org/industry-advocacy/reselling-a-digital-file-infringes-copyright/ [ag42] https://www.authorsguild.org/industry-advocacy/authors-guild-survey-shows-drastic-42-percent-decline-in-authors-earnings-in-last-decade/ [aghard] https://www.authorsguild.org/the-writing-life/why-is-it-so-goddamned-hard-to-make-a-living-as-a-writer-today/ [aglibend] https://www.authorsguild.org/industry-advocacy/macmillan-announces-new-library-lending-terms-for-ebooks/ [agol] https://www.authorsguild.org/industry-advocacy/update-open-library/ [cdl-blog] https://blog.archive.org/2020/03/09/controlled-digital-lending-and-open-libraries-helping-libraries-and-readers-in-times-of-crisis/ [cdt] https://cdt.org/insights/up-next-controlled-digital-lendings-first-legal-battle-as-publishers-take-on-the-internet-archive/ [kinkos] https://law.justia.com/cases/federal/district-courts/FSupp/758/1522/1809457 [nel] http://blog.archive.org/national-emergency-library/ [nwu] "Appeal from the victims of Controlled Digital Lending (CDL)". (Retrieved 2021-01-10) [nwu2] "What is the Internet Archive doing with our books?" https://nwu.org/wp-content/uploads/2020/04/NWU-Internet-Archive-webinar-27APR2020.pdf [pricing1] https://www.authorsguild.org/industry-advocacy/e-book-library-pricing-the-game-changes-again/ [pricing2] https://americanlibrariesmagazine.org/blogs/e-content/ebook-pricing-wars-publishers-perspective/ [reader] Bookreader [resale] https://www.hollywoodreporter.com/thr-esq/appeals-court-weighs-resale-digital-files-1168577 [suit] https://www.courtlistener.com/recap/gov.uscourts.nysd.537900/gov.uscourts.nysd.537900.1.0.pdf [suit-g] https://cases.justia.com/federal/appellate-courts/ca2/13-4829/13-4829-2015-10-16.pdf?ts=1445005805 Posted by Karen Coyle at 11:54 AM No comments: Labels: ebooks books digitization Internet Archive Open Library Controlled digital lending Thursday, June 25, 2020 Women designing Those of us in the library community are generally aware of our premier "designing woman," the so-called "Mother of MARC," Henriette Avram. Avram designed the MAchine Readable Cataloging (MARC) record in the mid-1960's, a record format that is still being used today. MARC was way ahead of its time, using variable-length data fields and a unique character set that was sufficient for most European languages, all thanks to Avram's vision and skill. I'd like to introduce you here to some of the designing women of the University of California library automation project, the project that created one of the first online catalogs in the beginning of the 1980's, MELVYL. Briefly, MELVYL was a union catalog that combined data from the libraries of the nine (at that time) University of California campuses. It was first brought up as a test system in 1980 and went "live" to the campuses in 1982. Work on the catalog began in or around 1980, and various designs were put forward and tested. Key designers were Linda Gallaher-Brown, who had one of the first master's degrees in computer science from UCLA, and Kathy Klemperer, who like many of us was a librarian turned systems designer. We were struggling with how to create a functional relational database of bibliographic data (as defined by the MARC record) with computing resources that today would seem laughable but were "cutting edge" for that time.
I remember Linda remarking that during one of her school terms she returned to her studies to learn that the newer generation of computers would have this thing called an "operating system," and she thought "why would you need one?" By the time of this photo she had come to appreciate what an operating system could do for you. The one we used at the time was IBM's OS 360/370. Kathy Klemperer was the creator of the database design diagrams that were so distinctive we called them "Klemperer-grams." Here's one from 1985: MELVYL database design Klemperer-gram, 1985 Drawn and lettered by hand, not only did these describe a workable database design, they were impressively beautiful. Note that this not only predates the proposed 2009 RDA "database scenario" for a relational bibliographic design by 24 years, it provides a more detailed and most likely more accurate such design. RDA "Scenario 1" data design, 2009 In the early days of the catalog we had a separate file and interface for the cataloged serials, based on a statewide project (including the California State Universities). Although it was possible to catalog serials in the MARC format, the detailed information about which issues the libraries held was stored in serials control databases that were separate from the library catalog, and many serials were represented by crusty cards that had been created decades before library automation. The group below developed and managed the CALLS (California Academic Library List of Serials). Four of those pictured were programmers, two were serials data specialists, and four had library degrees. Obviously, these are overlapping sets. The project heads were Barbara Radke (right) and Theresa Montgomery (front, second from right). At one point while I was still working on the MELVYL project, probably around the very late 1990's or early 2000's, I gathered up some organization charts that had been issued over the years and quickly calculated that during its history the project's technical staff, the people who had created this early marvel, had varied from 3/4 to 2/3 female. I did some talks at various conferences in which I called MELVYL a system "created by women." At my retirement in 2003 I said the same thing in front of the entire current staff, and it was not well-received by all. In that audience was one well-known member of the profession who later declared that he felt women needed more mentoring in technology because he had always worked primarily with men, even though he had in fact worked in an organization with a predominantly female technical staff, and another colleague who was incredulous when I stated once that women are not a minority but over 50% of the world's population. He just couldn't believe it. While outright discrimination and harassment of women are issues that need to be addressed, the invisibility of women in the eyes of their colleagues and institutions is horribly damaging. There are many interesting projects, not the least of them the Wikipedia Women in Red project, that aim to show that there is no lack of accomplished women in the world; it's the acknowledgment of their accomplishments that falls short. In the library profession we have many women whose stories are worth telling. Please, let's make sure that future generations know that they have foremothers to look to for inspiration.
Posted by Karen Coyle at 9:47 AM 1 comment: Labels: library catalogs, library history, open data, women and technology Monday, May 25, 2020 1982 I've been trying to capture what I remember about the early days of library automation. Mostly my memory is about fun discoveries in my particular area (processing MARC records into the online catalog). I did run into an offprint of some articles in ITAL from 1982 (*) which provide very specific information about the technical environment, and I thought some folks might find that interesting. This refers to the University of California MELVYL union catalog, which at the time had about 800,000 records. Operating system: IBM OS 360/370. Programming language: PL/I. Memory: 24 megabytes. Storage: 22 disk drives, ~10 gigabytes. DBMS: ADABAS. The disk drives were each about the size of an industrial washing machine. In fact, we referred to the room that held them as "the laundromat." Telecommunications was a big deal because there was no telecommunications network linking the libraries of the University of California. There wasn't even one connecting the campuses at all. The article talks about the various possibilities, from an X.25 network to the new TCP/IP protocol that allows "internetwork communication." The first network was a set of dedicated lines leased from the phone company that could transmit 120 characters per second (character = byte) to about 8 ASCII terminals at each campus over a 9600 baud line. There was a hope to be able to double the number of terminals. In the speculation about the future, there was doubt that it would be possible to open up the library system to folks outside of the UC campuses, much less internationally. (MELVYL was one of the early library catalogs to be open access worldwide over the Internet, just a few years later.) It was also thought that libraries would charge other libraries to view their catalogs, kind of like an inter-library loan. And for anyone who has an interest in Z39.50, one section of the article by David Shaughnessy and Clifford Lynch on telecommunications outlines a need for catalog-to-catalog communication which sounds very much like the first glimmer of that protocol. ----- (*) Various authors in a special edition: (1982). In-Depth: University of California MELVYL. Information Technology and Libraries, 1(4). I wish I could give a better citation but my offprint does not have page numbers and I can't find this indexed anywhere. (Cue here the usual irony that libraries are terrible at preserving their own story.) Posted by Karen Coyle at 6:25 AM No comments: Labels: library catalogs, library history Monday, April 27, 2020 Ceci n'est pas une Bibliothèque On March 24, 2020, the Internet Archive announced that it would "suspend waitlists for the 1.4 million (and growing) books in our lending library," a service it then named the National Emergency Library. These books were previously available for lending on a one-to-one basis with the physical book owned by the Archive, and as with physical books, users would have to wait for the book to be returned before they could borrow it. Worded as a suspension of waitlists due to the closure of schools and libraries caused by the COVID-19 pandemic, this announcement essentially eliminated the one-to-one nature of the Archive's Controlled Digital Lending program. Publishers were already making threatening noises about the digital lending when it adhered to lending limitations, and will surely be even more incensed about this unrestricted lending.
I am not going to comment on the legality of the Internet Archive's lending practices. Legal minds, perhaps motivated by future lawsuits, will weigh in on that. I do, however, have much to say on the use of the term "library" for this set of books. It's a topic worthy of a lengthy treatment, but I'll give only a brief account here. LIBRARY … BIBLIOTHÈQUE … BIBLIOTEK The roots “LIBR…” and “BIBLIO…” both come down to us from ancient words for trees and tree bark. It is presumed that said bark was the surface for early writings. “LIBR…”, from the Latin word liber meaning “book,” in many languages is a prefix that indicates a bookseller’s shop, while in English it has come to mean a collection of books and from that also the room or building where books are kept. “BIBLIO…” derives instead from the Greek biblion (one book) and biblia (books, plural). We get the word Bible through the Greek root, which leaked into old Latin and meant The Book. Therefore it is no wonder that in the minds of many people, books = library.  In fact, most libraries are large collections of books, but that does not mean that every large collection of books is a library. Amazon has a large number of books, but is not a library; it is a store where books are sold. Google has quite a few books in its "book search" and even allows you to view portions of the books without payment, but it is also not a library, it's a search engine. The Internet Archive, Amazon, and Google all have catalogs of metadata for the books they are offering, some of it taken from actual library catalogs, but a catalog does not make a quantity of books into a library. After all, Home Depot has a catalog, Walmart has a catalog; in essence, any business with an inventory has a catalog. "...most libraries are large collections of books, but that does not mean that every large collection of books is a library." The Library Test First, I want to note that the Internet Archive has met the State of California test to be defined as a library, and this has made it possible for the Archive to apply for library-related grants for some of its projects. That is a Good Thing because it has surely strengthened the Archive and its activities. However, it must be said that the State of California requirements are pretty minimal, and seem to be limited to a non-profit organization making materials available to the general public without discrimination. There doesn't seem to be a distinction between "library" and "archive" in the state legal code, although librarians and archivists would not generally consider them easily lumped together as equivalent services. The Collection The Archive's blog post says "the Internet Archive currently lends about as many as a US library that serves a population of about 30,000." As a comparison, I found in the statistics gathered by the California State Library those of the Benicia Public Library in Benicia California. Benicia is a city with a population of 31,000; the library has about 88,000 books. Well, you might say, that's not as good as over one million books at the Internet Archive. But, here's the thing: those are not 88,000 random books, they are books chosen to be, as far as the librarians could know, the best books for that small city. If Benicia residents were, for example, primarily Chinese-speaking, the library would surely have many books in Chinese. If the city had a large number of young families then the children's section would get particular attention. 
The users of the Internet Archive's books are a self-selected (and currently un-defined) set of Internet users. Equally difficult to define is the collection that is available to them: This library brings together all the books from Phillips Academy Andover and Marygrove College, and much of Trent University’s collections, along with over a million other books donated from other libraries to readers worldwide that are locked out of their libraries. Each of these is (or was, in the case of Marygrove, which has closed) a collection tailored to the didactic needs of that institution. How one translates that, if one can, to the larger Internet population is unknown. That a collection has served a specific set of users does not mean that it can serve all users equally well. Then there is that other million books, which are a complete black box. Library science I've argued before against dumping a large and undistinguished set of books on a populace, regardless of the good intentions of those doing so. Why not give the library users of a small city these one million books? The main reason is the ability of the library to fulfill the 5 Laws of Library Science: Books are for use. Every reader his or her book. Every book its reader. Save the time of the reader. The library is a growing organism. [0] The online collection of the Internet Archive nicely fulfills laws 1 and 5: the digital books are designed for use, and the library can grow somewhat indefinitely. The other three laws are unfortunately hindered by the somewhat haphazard nature of the set of books, combined with the lack of user services. Of the goals of librarianship, matching readers to books is the most difficult. Let's start with law 3, "every book its reader." When you follow the URL to the National Emergency Library, you see something like this: The lack of cover art is not the problem here. Look at what books you find: two meeting reports, one journal publication, and a book about hand surgery, all from 1925. Scroll down for a bit and you will find it hard to locate items that are less obscure than this, although undoubtedly there are some good reads in this collection. These are not the books whose readers will likely be found in our hypothetical small city. These are books that even some higher education institutions would probably choose not to have in their collections. While these make the total number of available books large, they may not make the total number of useful books large. Winnowing this set to one or more (probably more) wheat-filled collections could greatly increase the usability of this set of books. "While these make the total number of available books large, they may not make the total number of useful books large." A large "anything goes" set of documents is a real challenge for laws 2 and 4: every reader his or her book, and save the time of the reader. The more chaff you have the harder it is for a library user to find the wheat they are seeking. The larger the collection the more of the burden is placed on the user to formulate a targeted search query and to have the background to know which items to skip over. The larger the retrieved set, the less likely that any user will scroll through the entire display to find the best book for their purposes. This is the case for any large library catalog, but these libraries have built their collection around a particular set of goals. Those goals matter. 
Goals are developed to address a number of factors, like: What are the topics of interest to my readers and my institution? How representative must my collection be in each topic area? What are the essential works in each topic area? What depth of coverage is needed for each topic? [1] If we assume (and we absolutely must assume this) that the user entering the library is seeking information that he or she lacks, then we cannot expect users to approach the library as experts in the topic being researched. Although anyone can type in a simple query, fewer can assess the validity and the scope of the results. A search on "California history" in the National Emergency Library yields some interesting-looking books, but are these the best books on the topic? Are any key titles missing? These are the questions that librarians answer when developing collections. The creation of a well-rounded collection is a difficult task. There are actual measurements that can be run against library collections to determine if they have the coverage that can be expected compared to similar libraries. I don't know if any such statistical packages can look beyond quantitative measures to judge the quality of the collection; the ones I'm aware of look at call number ranges, not individual titles.  The Library Service The Archive's own documentation states that "The Internet Archive focuses on preservation and providing access to digital cultural artifacts. For assistance with research or appraisal, you are bound to find the information you seek elsewhere on the internet." It then advises people to get help through their local public library. Helping users find materials suited to their needs is a key service provided by libraries. When I began working in libraries in the dark ages of the 1960's, users generally entered the library and went directly to the reference desk to state the question that brought them to the institution. This changed when catalogs went online and were searchable by keyword, but prior to then the catalog in a public library was primarily a tool for librarians to use when helping patrons. Still, libraries have real or virtual reference desks because users are not expected to have the knowledge of libraries or of topics that would allow them to function entirely on their own. And while this is true for libraries it is also true, perhaps even more so, for archives whose collections can be difficult to navigate without specialized information. Admitting that you give no help to users seeking materials makes the use of the term "library" ... unfortunate. What is to be done? There are undoubtedly a lot of useful materials among the digital books at the Internet Archive. However, someone needing materials has no idea whether they can expect to find what they need in this amalgamation. The burden of determining whether the Archive's collection might suit their needs is left entirely up to the members of this very fuzzy set called "Internet users." That the collection lends at the rate of a public library serving a population of 30,000 shows that it is most likely under-utilized. Because the nature of the collection is unknown one can't approach, say, a teacher of middle-school biology and say: "they've got what you need." Yet the Archive cannot implement a policy to complete areas of the collection unless it knows what it has as compared to known needs. "...
these warehouses of potentially readable text will remain under-utilized until we can discover a way to make them useful in the ways that libraries have proved to be useful." I wish I could say that a solution would be simple - but it would not. For example, it would be great to extract from this collection works that are commonly held in specific topic areas in small, medium and large libraries. The statistical packages that analyze library holdings are all, AFAIK, proprietary. (If anyone knows of an open source package that does this, please shout it out!) It would also be great to be able to connect library collections of analog books to their digital equivalents. That too is more complex than one would expect, and would have to be much simpler to be offered openly. [2] While some organizations move forward with digitizing books and other hard copy materials, these warehouses of potentially readable text will remain under-utilized until we can discover a way to make them useful in the ways that libraries have proved to be useful. This will mean taking seriously what modern librarianship has developed over its roughly two centuries, and in particular those 5 laws that give us a philosophy to guide our vision of service to the users of libraries. ----- [0] Even if you are familiar with the 5 laws you may not know that Ranganathan was not as succinct as this short list may imply. The book in which he introduces these concepts is over 450 pages long, with extended definitions and many homey anecdotes and stories. [1] A search on "collection development policy" will yield many pages of policies that you can peruse. To make this a "one click" here are a few *non-representative* policies that you can take a peek at: Hennepin County (public) Lansing Community College (community college) Stanford University, Science Library (research library) [2] Dan Scott and I did a project of this nature with a Bay Area public library and it took a huge amount of human intervention to determine whether the items matched were really "equivalent". That's a discussion for another time, but, man, books are more complicated than they appear. Posted by Karen Coyle at 8:08 AM No comments: Labels: books, Digital libraries, OpenLibrary Monday, February 03, 2020 Use the Leader, Luke! If you learned the MARC format "on the job" or in some other library context you may have learned that the record is structured as fields with 3-digit tags, each with two numeric indicators, and that subfields have a subfield delimiter (often shown as "$" because it is a non-printable character) and a single character subfield code (a-z, 0-9). That is all true for the MARC records that libraries create and process, but the MAchine Readable Cataloging standard (Z39.2 or ISO 2709) has other possibilities that we are not using. Our "MARC" (currently MARC21) is a single selection from among those possibilities, in essence an application profile of the MARC standard. The key to the possibilities afforded by MARC is in the MARC Leader, and in particular in two positions that our systems generally ignore because they always contain the same values in our data: Leader byte 10 -- Indicator count Leader byte 11 -- Subfield code length In MARC21 records, Leader byte 10 is always "2" meaning that fields have 2-byte indicators, and Leader byte 11 is always "2" because the subfield code is always two characters in length.
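The post describes what these Leader bytes mean but not what "honoring" them would look like in code. The following is only a minimal sketch, not the way any existing library system works: an ISO 2709 reader that takes the indicator count, subfield code length, and directory entry map from the Leader instead of hard-coding the MARC21 values. It assumes UTF-8 data and skips error handling; a real application would more likely use an existing library such as pymarc.

SUBFIELD_DELIMITER = b"\x1f"

def parse_variable_field(field_bytes, indicator_count, subfield_code_length):
    """Split one variable field into its indicators and (code, value) subfields."""
    indicators = field_bytes[:indicator_count].decode("utf-8")
    subfields = []
    for chunk in field_bytes[indicator_count:].split(SUBFIELD_DELIMITER):
        if not chunk:
            continue
        # the subfield code length counted in the Leader includes the delimiter itself
        code = chunk[: subfield_code_length - 1].decode("utf-8")
        value = chunk[subfield_code_length - 1 :].decode("utf-8")
        subfields.append((code, value))
    return indicators, subfields

def parse_record(record_bytes):
    leader = record_bytes[:24]
    indicator_count = int(chr(leader[10]))        # MARC21: always 2
    subfield_code_length = int(chr(leader[11]))   # MARC21: always 2 (delimiter + 1 character)
    base_address = int(leader[12:17])
    len_of_length = int(chr(leader[20]))          # MARC21: 4, hence the 9999-byte field limit
    len_of_start = int(chr(leader[21]))           # MARC21: 5
    entry_size = 3 + len_of_length + len_of_start
    directory = record_bytes[24 : base_address - 1]   # the directory ends with a field terminator
    fields = []
    for i in range(0, len(directory), entry_size):
        entry = directory[i : i + entry_size]
        tag = entry[0:3].decode("ascii")
        length = int(entry[3 : 3 + len_of_length])
        start = int(entry[3 + len_of_length :])
        data = record_bytes[base_address + start : base_address + start + length - 1]
        if tag < "010":   # control fields have no indicators or subfields
            fields.append((tag, data.decode("utf-8")))
        else:
            fields.append((tag, parse_variable_field(data, indicator_count, subfield_code_length)))
    return fields

A parser written this way would handle a hypothetical record with, say, four indicator bytes or three-character subfield codes without any change to the code, which is exactly the flexibility the Leader was designed to signal.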
That was a decision made early on in the life of MARC records in libraries, and it's easy to forget that there were other options that were not taken. Let's take a short look at the possibilities the record format affords beyond our choice. Both of these Leader positions are single bytes that can take values from 0 to 9. An application could use the MARC record format and have zero indicators. It isn't hard to imagine an application that has no need of indicators or that has determined to make use of subfields in their stead. As an example, the provenance of vocabulary data for thesauri like LCSH or the Art and Architecture Thesaurus could always be coded in a subfield rather than in an indicator: 650 $a Religion and science $2 LCSH Another common use of indicators in MARC21 is to give a byte count for the non-filing initial articles on title strings. Instead of using an indicator value for this, some libraries outside of the US developed a non-printing code to mark the beginning and end of the non-filing portion. I'll use backslashes to represent these codes in this example: 245 $a \The \Birds of North America I am not saying that all indicators in MARC21 should or even could be eliminated, but that we shouldn't assume that our current practice is the only way to code data. In the other direction, what if you could have more than two indicators? The MARC record would allow you to have as many as nine. In addition, there is nothing to say that each byte in the indicator has to be a separate data element; you could have nine indicator positions that were defined as two data elements (4 + 5), or some other number (1 + 2 + 6). Expanding the number of indicators, or beginning with a larger number, could have prevented the split in provenance codes for subject vocabularies between one indicator value and the overflow subfield, $2, when the number exceeded the capability of a single numerical byte. Having three or four bytes for those codes in the indicator and expanding the values to include a-z would have been enough to include the full list of authorities for the data in the indicators. (Although I would still prefer putting them all in $2 using the mnemonic codes for ease of input.) In the first University of California union catalog in the early 1980's we expanded the MARC indicators to hold an additional two bytes (or was it four?) so that we could record, for each MARC field, which library had contributed it. Our union catalog record was a composite MARC record with fields from any and all of the over 300 libraries across the University of California system that contributed to the union catalog as a dozen or so separate record feeds from OCLC and RLIN. We treated the added indicator bytes as sets of bits, turning on bits to represent the catalog feeds from the libraries. If two or more libraries submitted exactly the same MARC field we stored the field once and turned on a bit for each separate library feed. If a library submitted a field that was new to the record, we added the field and turned on the appropriate bit. When we created a user display we selected fields from only one of the libraries. (The rules for that selection process were something of a secret so as not to hurt anyone's feelings, but there was a "best" record for display.) It was a multi-library MARC record, made possible by the ability to use more than two indicators.
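The bit-flag trick described above is easy to miss in prose, so here is a toy illustration of the idea only; it is not the actual UC implementation, and the feed names and numbering are invented for the example. Each distinct field value is stored once, with a bitmask recording which catalog feeds contributed it.

FEEDS = {"ucb_oclc": 0, "ucla_rlin": 1, "ucsd_oclc": 2}   # hypothetical feed names

def add_field(record, tag, field_value, feed_name):
    """Store a field once and turn on a bit for each feed that contributed it."""
    bit = 1 << FEEDS[feed_name]
    fields = record.setdefault(tag, {})          # maps field value -> bitmask of feeds
    fields[field_value] = fields.get(field_value, 0) | bit
    return record

def feeds_for(record, tag, field_value):
    """Which feeds contributed this exact field?"""
    mask = record.get(tag, {}).get(field_value, 0)
    return [name for name, pos in FEEDS.items() if mask & (1 << pos)]

record = {}
add_field(record, "245", "$a \\The \\Birds of North America", "ucb_oclc")
add_field(record, "245", "$a \\The \\Birds of North America", "ucla_rlin")
print(feeds_for(record, "245", "$a \\The \\Birds of North America"))
# ['ucb_oclc', 'ucla_rlin']

In the real union catalog the bitmask lived in the expanded indicator bytes of the stored MARC field rather than in a separate structure, but the logic of "store once, flag each contributor" is the same.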
Now on to the subfield code. The rule for MARC21 is that there is a single subfield code and that it is a lower case a-z or 0-9. The numeric codes have special meaning and do not vary by field; the alphabetic codes are a bit more flexible. That gives us 26 possible alphabetic subfields per tag, plus the 10 pre-defined numeric ones. The MARC21 standard has chosen to limit the alphabetic subfield codes to lower case characters. As the fields reached the limits of the available subfield codes (and many did over time), you might think that the easiest solution would be to allow upper case letters as subfield codes. Although the subfield code limitation was reached decades ago for some fields, I can personally attest to the fact that suggesting the expansion of subfield codes to upper case letters was met with horrified glares at the MARC standards meeting. While clearly in 1968 the range of a-z seemed ample, that has not been the case for nearly half of the life-span of MARC. The MARC Leader allows one to define up to 9 characters total for subfield codes. The value in this Leader position includes the subfield delimiter, so this means that you can have a subfield delimiter and up to 8 characters to encode a subfield. Even expanding from a-z to aa-zz provides vastly more possibilities, and allowing upper case as well gives you a dizzying array of choices. The other thing to mention is that there is no prescription that field tags must be numeric. They are limited to three characters in the MARC standard, but those could be a-z, A-Z, 0-9, not just 0-9, greatly expanding the possibilities for adding new tags. In fact, if you have been in the position to view internal system records in your vendor system you may have been able to see that non-numeric tags have been used for internal system purposes, like noting who made each edit, whether functions like automated authority control have been performed on the record, etc. Many of the "violations" of the MARC21 rules listed here have been exploited internally -- and since the early days of library systems. There are other modifiable Leader values, in particular the one that determines the maximum length of a field, Leader 20. MARC21 has Leader 20 set at "4", meaning that fields cannot be longer than 9999 bytes. That could be longer, although the record size itself is expressed in only 5 bytes, so a record cannot be longer than 99999 bytes. However, one could limit fields to 999 (Leader value 20 set at "3") for an application that does less pre-composing of data compared to MARC21 and therefore comfortably fits within a shorter field length.  The reason that has been given, over time, why none of these changes were made was always: it's too late, we can't change our systems now. This is, as Caesar might have said, cacas tauri. Systems have been able to absorb some pretty intense changes to the record format and its contents, and a change like adding more subfield codes would not be impossible. The problem is not really with the MARC21 record but with our inability (or refusal) to plan and execute the changes needed to evolve our systems. We could sit down today and develop a plan and a timeline. If you are skeptical, here's an example of how one could manage a change in length to the subfield codes. When a MARC21 record is retrieved for editing: read Leader 11 of the record; if the value is "2" and you need to add a new subfield that uses the subfield code plus two characters, convert all of the subfield codes in the record ($a becomes $aa, $b becomes $ba, etc.; $0 becomes $01, $1 becomes $11, etc.) and change the Leader 11 value to "3" (alternatively, convert all records opened for editing). When a MARC21 record is retrieved for display: read Leader 11 of the record; if the value is "2", use the internal table of subfield codes for records with the value "2"; if the value is "3", use the internal table of subfield codes for records with the value "3".
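As a rough sketch of the conversion step described above (assuming the application handles records as simple in-memory structures rather than raw ISO 2709, and using the alphabetic-plus-"a", numeric-plus-"1" scheme from the example), the record-level change is only a few lines:

def widen_subfield_codes(record):
    """
    Convert a record using 1-character subfield codes (Leader 11 = "2")
    to 2-character codes (Leader 11 = "3"): alphabetic codes get an "a"
    appended, numeric codes get a "1" appended.
    `record` is assumed to be a dict with a "leader" string and a "fields"
    list of (tag, indicators, subfields) entries -- an application-level
    structure invented for this sketch.
    """
    leader = record["leader"]
    if leader[11] != "2":
        return record   # already uses the wider codes
    for tag, indicators, subfields in record["fields"]:
        for i, (code, value) in enumerate(subfields):
            suffix = "a" if code.isalpha() else "1"
            subfields[i] = (code + suffix, value)
    record["leader"] = leader[:11] + "3" + leader[12:]
    return record

The display side is just a lookup keyed on the Leader 11 value, choosing the one-character or two-character subfield table accordingly.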
Sounds impossible? We moved from AACR to AACR2, and now from AACR2 to RDA, without going back and converting all of our records to the new content. We have added new fields to our records, such as the 336, 337, 338 for RDA values, without converting all of the earlier records in our files to have these fields. The same with new subfields, like $0, which has only been added in recent years. Our files have been using mixed record types for at least a couple of generations -- generations of systems and generations of catalogers. Alas, the time to make these kinds of changes was many years ago. Would it be worth doing today? That depends on whether we anticipate a change to BIBFRAME (or some other data format) in the near future. Changes do continue to be made to the MARC21 record; perhaps it would have a longer future if we could broach the subject of fixing some of the errors that were introduced in the past, in particular those that arose because of the limitations of MARC21 that could be rectified with an expansion of that record standard. That may also help us not carry over some of the problems in MARC21 that are caused by these limitations to a new record format that does not need to be limited in these ways. Epilogue Although the MARC record was incredibly advanced compared to other data formats of its time (the mid-1960's), it has some limitations that cannot be overcome within the standard itself. One obvious one is the limitation of the record length to 5 digits, a maximum of 99999 bytes. Another is the fact that there are only two levels of nesting of data: the field and the subfield. There are times when a sub-subfield would be useful, such as when adding information that relates to only one subfield, not the entire field (provenance, external URL link). I can't advocate for continuing the data format that is often called "binary MARC" simply because it has limitations that require work-arounds. MARCXML, as defined as a standard, gets around the field and record length limitations, but it is not allowed to vary from the MARC21 limitations on field and subfield coding. It would be incredibly logical to move to a "non-binary" record format (XML, JSON, etc.) beginning with the existing MARC21 and to allow expansions where needed. It is the stubborn adherence to the ISO 2709 format that has really limited library data, and it is all the more puzzling because other solutions that can keep the data itself intact have been available for many decades. Posted by Karen Coyle at 6:59 AM No comments: Labels: MARC Tuesday, January 28, 2020 Pamflets I was always a bit confused about the inclusion of "pamflets" in the subtitle of the Decimal System, such as this title page from the 1922 edition: Did libraries at the time collect numerous pamphlets? For them to be the second-named type of material after books was especially puzzling. I may have discovered an answer to my puzzlement, if not THE answer, in Andrea Costadoro's 1856 work: A "pamphlet" in 1856 was not (necessarily) what I had in mind, which was a flimsy publication of the type given out by businesses, tourist destinations, or public health offices.
In the 1800's it appears that a pamphlet was a literary type, not a physical format. Costadoro says: "It has been a matter of discussion what books should be considered pamphlets and what not. If this appellation is intended merely to refer to the SIZE of the book, the question can be scarecely worth considering ; but if it is meant to refer to the NATURE of a work, it may be considered to be of the same class and to stand in the same connexion with the word Treatise as the words Tract ; Hints ; Remarks ; &c, when these terms are descriptive of the nature of the books to which they are affixed." (p. 42) To be on the shelves of libraries, and cataloged, it is possible that these pamphlets were indeed bound, perhaps by the library itself.  The Library of Congress genre list today has a cross-reference from "pamphlet" to "Tract (ephemera)". While Costadoro's definition doesn't give any particular subject content to the type of work, LC's definition says that these are often issued by religious or political groups for proselytizing. So these are pamphlets in the sense of the political pamphlets of our revolutionary war. Today they would be blog posts, or articles in Buzzfeed or Slate or any one of hundreds of online sites that post such content. Churches I have visited often have short publications available near the entrance, and there is always the Watchtower, distributed by Jehovah's Witnesses at key locations throughout the world, and which is something between a pamphlet (in the modern sense) and a journal issue. These are probably not gathered in most libraries today. In Dewey's time the printing (and collecting by libraries) of sermons was quite common. In a world where many people either were not literate or did not have access to much reading material, the Sunday sermon was a "long form" work, read by a pastor who was probably not as eloquent as the published "stars" of the Sunday gatherings. Some sermons were brought together into collections and published, others were published (and seemingly bound) on their own.  Dewey is often criticized for the bias in his classification, but what you find in the early editions serves as a brief overview of the printed materials that the US (and mostly East Coast) culture of that time valued.  What now puzzles me is what took the place of these tracts between the time of Dewey and the Web. I can find archives of political and cultural pamphlets in various countries and they all seem to end around the 1920's-30's, although some specific collections, such as the Samizdat publications in the Soviet Union, exist in other time periods. Of course the other question now is: how many of today's tracts and treatises will survive if they are not published in book form? Posted by Karen Coyle at 1:15 PM No comments: Labels: classification, library history Saturday, November 23, 2019 The Work The word "work" generally means something brought about by human effort, and at times implies that this effort involves some level of creativity. We talk about "works of art" referring to paintings hanging on walls. The "works" of Beethoven are a large number of musical pieces that we may have heard. The "works" of Shakespeare are plays, in printed form but also performed. In these statements the "work" encompasses the whole of the thing referred to, from the intellectual content to the final presentation. This is not the same use of the term as is found in the Library Reference Model (LRM). 
If you are unfamiliar with the LRM, it is the successor to FRBR (which I am assuming you have heard of) and it includes the basic concepts of work, expression, manifestation and item that were first introduced in that previous study. "Work," as used in the LRM, is a concept designed for use in library cataloging data. It is narrower than the common use of the term illustrated in the previous paragraph and is defined thus: Class: Work Definition: An abstract notion of an artistic or intellectual creation. In this definition the term only includes the idea of a non-corporeal conceptual entity, not the totality that would be implied in the phrase "the works of Shakespeare." That totality is described when the work is realized through an LRM-defined "expression" which in turn is produced in an LRM-defined "manifestation" with an LRM-defined "item" as its instance.* These four entities are generally referred to as a group with the acronym WEMI. Because many in the library world are very familiar with the LRM definition of work, we have to use caution when using the word outside the specific LRM environment. In particular, we must not impose the LRM definition on uses of the word that do not intend that meaning. One should expect that the use of the LRM definition of work would rarely be found in any conversation that is not about the library cataloging model for which it was defined. However, it is harder to distinguish such uses within the library world, where one might expect usage to adhere to the LRM. To show this, I want to propose a particular use case. Let's say that a very large bibliographic database has many records of bibliographic description. The use case is that it is deemed to be easier for users to navigate that large database if they could get search results that cluster works rather than getting long lists of similar or nearly identical bibliographic items. Logically the cluster looks like this: [figure omitted: a work node linking a group of bibliographic records] In data design, it will have a form something like this: [figure omitted; a data sketch appears below] This is a great idea, and it does appear to have a similarity to the LRM definition of work: it is gathering those bibliographic entries that are judged to represent the same intellectual content. However, there are reasons why the LRM-defined work could not be used in this instance. The first is that there is only one WEMI relationship for work, and that is from LRM work to LRM expression. Clearly the bibliographic records in this large library catalog are not LRM expressions; they are full bibliographic descriptions including, potentially, all of the entities defined in the LRM. To this you might say: but there is expression data in the bibliographic record, so we can think of this work as linking to the expression data in that record. That leads us to the second reason: the entities of WEMI are defined as being disjoint. That means that no single "thing" can be more than one of those entities; nothing can be simultaneously a work and an expression, or any other combination of WEMI entities. So if the only link we have available in the model is from work to expression, unless we can somehow convince ourselves that the bibliographic record ONLY represents the expression (which it clearly does not since it has data elements from at least three of the LRM entities) any such link will violate the rule of disjointness. Therefore, the work in our library system can have much in common with the conceptual definition of the LRM work, but it is not the same work entity as is defined in that model.
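The diagrams for the clustering use case did not survive the page conversion, so here is a minimal sketch of the kind of data design the text describes: a cluster identifier that groups bibliographic record identifiers judged to represent the same intellectual content. The table and column names are hypothetical, not taken from any actual catalog system.

import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- hypothetical names; the shared cluster_id plays the role of the 'work'
    CREATE TABLE work_cluster (
        cluster_id      TEXT PRIMARY KEY,
        preferred_title TEXT
    );
    CREATE TABLE bib_record (
        bib_id     TEXT PRIMARY KEY,
        cluster_id TEXT REFERENCES work_cluster(cluster_id)
    );
""")
conn.execute("INSERT INTO work_cluster VALUES ('w1', 'Moby Dick')")
conn.executemany("INSERT INTO bib_record VALUES (?, 'w1')",
                 [("bib001",), ("bib002",), ("bib003",)])

# all bibliographic records clustered under the same 'work'
rows = conn.execute("""
    SELECT b.bib_id FROM bib_record b
    JOIN work_cluster w ON w.cluster_id = b.cluster_id
    WHERE w.cluster_id = 'w1'
""").fetchall()
print([r[0] for r in rows])   # ['bib001', 'bib002', 'bib003']

Note that nothing in this structure claims the bibliographic records are LRM expressions; the cluster simply groups whole records, which is exactly why the post argues the LRM-defined work, with its single work-to-expression link and disjointness rule, does not fit.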
This brings me back to my earlier blog post with a proposal for a generalized definition of WEMI-like entities for created works.  The WEMI concepts are useful in practice, but the LRM model has some constraints that prevent some desirable uses of those entities. Providing unconstrained entities would expand the utility of the WEMI concepts both within the library community, as evidenced by the use case here, and in the non-library communities that I highlight in that previous blog post and in a slide presentation. To be clear, "unconstrained" refers not only to the removal of the disjointness between entities, but also to allow the creation of links between the WEMI entities and non-WEMI entities, something that is not anticipated in the LRM. The work cluster of bibliographic records would need a general relationship, perhaps, as in the case of VIAF, linked through a shared cluster identifier and an entity type identifying the cluster as representing an unconstrained work. ---- * The other terms are defined in the LRM as: Class: Expression Definition: A realization of a single work usually in a physical form. Class: Manifestation Definition: The physical embodiment of one or more expressions. Class: Item Definition: An exemplar of a single manifestation. Posted by Karen Coyle at 4:13 AM No comments: Labels: FRBR, library catalogs, LRM, metadata Coyle's InFormation by Karen Coyle is licensed under a Creative Commons Attribution-Noncommercial 3.0 United States License.
About Me Karen Coyle BERKELEY, CA, United States Librarian, techie, social commentator, once called "public intellectual" by someone who couldn't think of a better title. labiks-org-3226 ---- Home - LABIKS LATIN AMERICAN BIKE KNOWLEDGE SHARING Bike Sharing Systems in Latin America: 2019 Report. Data Dashboard: The Latin American Systems, 2019. Latin American Map of Bike Sharing Systems. LABIKS was created with the mission of gathering, sharing, and amplifying knowledge about Latin America's public bike share systems. We truly believe in the value and contribution of research for achieving more sustainable cities and communities. Accordingly, we work for greater transparency and government accountability in Latin America. GET TO KNOW LABIKS OUR CHALLENGES Turning Knowledge into Action JOIN LABIKS! For more cities to benefit from quality bike sharing systems, it is important to build the capacity of all actors around the trends and good practices applied to the planning, financing, management, and monitoring of these systems. LABIKS therefore invites researchers, governments, industry, funders, and anyone else interested to become partners in this initiative. Join us!
Latin American Bike Knowledge Sharing. Site made by Liquefeito. lawlesst-github-io-5080 ---- Ted Lawless Ted Lawless Work notebook Datasette hosting costs I've been hosting a Datasette (https://baseballdb.lawlesst.net, aka baseballdb) of historical baseball data for a few years, and for the last year or so it has been hosted on Google Cloud Run. I thought I would share my hosting costs for 2020 as a point of reference for others who might be interested in running a Datasette but aren't sure how much it may cost. The total hosting cost on Google Cloud Run for 2020 for the baseballdb was $51.31, or a monthly average of about $4.28 USD. The monthly bill did vary a fair amount, from as high as $13 in May to as low as $2 in March. Since I did no deployments or updates to the site during this time, I assume the variation in costs is related to the number of queries the Datasette was serving. I don't have a good sense of how many total queries per month this instance is serving since I'm not using Google Analytics or similar. Google does report that it is subtracting $49.28 in credits for the year, but I don't expect those credits/promotions to expire anytime soon since my projected cost for 2021 is $59. This cost information is somewhat incomplete without knowing the number of queries served per month, but it is a benchmark. Connecting Python's RDFLib to AWS Neptune I've written previously about using Python's RDFLib to connect to various triple stores. For a current project, I'm using Amazon Neptune as a triple store, and the RDFLib SPARQLStore implementation did not work out of the box, so I thought I would share my solution. The problem Neptune returns ntriples by default, and RDFLib, by default in version 4.2.2, is expecting CONSTRUCT queries to return RDF/XML. The solution is to override RDFLib's SPARQLStore to explicitly request RDF/XML from Neptune via HTTP content negotiation. Once this is in place, you can query and update Neptune via SPARQL with RDFLib the same way that you would with other triple stores. Code If you are interested in working with Neptune using RDFLib, here's a "NeptuneStore" and "NeptuneUpdateStore" implementation that you can use. Usable sample researcher profile data I've published a small set of web harvesting scripts to fetch information about researchers and their activities from the NIH Intramural Research Program website. On various projects I've been involved with, it has been difficult to acquire usable sample, or test, data about researchers and their activities. You either need access to an HR system and a research information system (for the activities) or you have to create mock data. Mock, or fake, data doesn't work well when you want to start integrating information across systems or developing tools to find new publications. It's hard to build a publication harvesting tool without real author names and research interests. To that end, the scripts I've published crawl the NIH Intramural Research Program website and pull out profile information for the thousand or so researchers who are members of the program, including a name, email, photo, short biography, research interests, and the PubMed IDs for selected publications. A second script harvests the organizational structure of the program. Both types of data are outputted to a simple JSON structure that can then be mapped to your destination system.
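The "NeptuneStore" code referenced in the Neptune note above did not survive the page conversion, so the following is only a hedged sketch of the underlying idea: ask the SPARQL endpoint for RDF/XML explicitly via content negotiation so that RDFLib can parse the CONSTRUCT result. It uses requests and rdflib directly rather than subclassing SPARQLStore, and the endpoint URL is a placeholder.

import requests
from rdflib import Graph

# placeholder; a real Neptune cluster SPARQL endpoint would go here
NEPTUNE_SPARQL = "https://your-neptune-cluster:8182/sparql"

def construct_graph(query):
    """Run a CONSTRUCT query and request RDF/XML via the Accept header."""
    resp = requests.post(
        NEPTUNE_SPARQL,
        data={"query": query},
        headers={"Accept": "application/rdf+xml"},
    )
    resp.raise_for_status()
    g = Graph()
    g.parse(data=resp.text, format="xml")   # RDF/XML is what RDFLib expects here
    return g

g = construct_graph("CONSTRUCT { ?s ?p ?o } WHERE { ?s ?p ?o } LIMIT 10")
print(len(g))

A subclass of RDFLib's SPARQLStore that sets the same Accept header would accomplish the same thing while keeping the usual Graph-backed query and update workflow.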
Exploring 10 years of the New Yorker Fiction Podcast with Wikidata Note: The online Datasette that supported the sample queries below is no longer available. The raw data is at: https://github.com/lawlesst/new-yorker-fiction-podcast-data. The New Yorker Fiction Podcast recently celebrated its ten-year anniversary. For those of you not familiar, this is a monthly podcast hosted by New Yorker fiction editor Deborah Treisman where a writer who has published a short story in the New Yorker selects a favorite story from the magazine's archive and reads and discusses it on the podcast with Treisman.1 I've been a regular listener to the podcast since it started in 2007 and thought it would be fun to look a little deeper at who has been invited to read and which authors they selected to read and discuss. The New Yorker posts all episodes of the Fiction podcast on their website in nice, clean, browseable HTML pages. I wrote a Python script to step through the pages and pull out the basic details about each episode: title, url, summary, date published, writer, and reader. The reader and the writer for each story are embedded in the title, so a bit of text processing was required to cleanly identify each reader and writer. I also had to manually reconcile a few episodes that didn't follow the same pattern as the others. All code used here and the harvested data are available on Github. Matching to Wikidata I then took each of the writers and readers and matched them to Wikidata using the searchentities API. With the Wikidata ID, I'm able to retrieve many attributes for each reader and writer by querying the Wikidata SPARQL endpoint, such as gender, date of birth, awards received, Library of Congress identifier, etc. Publishing with Datasette I saved this harvested data to two CSV files - episodes.csv and people.csv - and then built a sqlite database to publish with Datasette using the built-in integration with Zeit Now.
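The matching step above relies on Wikidata's search API. As an illustration only (not the author's script), this is roughly what a name lookup against the public wbsearchentities endpoint looks like; the example name is arbitrary.

import requests

def wikidata_search(name):
    """Return candidate (QID, description) pairs for a name via wbsearchentities."""
    resp = requests.get(
        "https://www.wikidata.org/w/api.php",
        params={
            "action": "wbsearchentities",
            "search": name,
            "language": "en",
            "type": "item",
            "format": "json",
        },
        headers={"User-Agent": "example-script/0.1"},   # Wikimedia asks clients to identify themselves
    )
    resp.raise_for_status()
    return [(hit["id"], hit.get("description", "")) for hit in resp.json().get("search", [])]

print(wikidata_search("Deborah Treisman"))   # prints a list of (QID, description) candidates

With a QID in hand, the attributes mentioned in the post (gender, date of birth, awards, Library of Congress identifier) can then be pulled from the Wikidata SPARQL endpoint.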
For those of us interested in open data, an exciting new tool was released this month It's by Simon Willison and called Datasette Datasette allows you to very quickly convert CSV files to a sqlite database and publish on the web with an API Head over to Simon's site for more details SPARQL to Pandas Dataframes Update: See this Python module for converting SPARQL query results into Pandas dataframes. Using Pandas to explore data SPARQL Pandas is a Python based power tool for munging and analyzing data While working with data from SPARQL endpoints, you may prefer to explore and analyze it with pandas given its full feature set, strong documentation and large community of users. The code below is an example of issuing a query to the Wikidata SPARQL endpoint and loading the data into a pandas dataframe and running basic operations on the returned data. This is a modified version of code from Su Labs Here we remove the types returned by the SPARQL endpoint since they add noise and we will prefer to handle datatypes with Pandas. {% notebook sparql_dataframe.ipynb %} With a few lines of code, we can connect data stored in SPARQL endpoints with pandas, the powerful Python data munging and analysis library. See the Su Labs tutorial for more examples. You can also download the examples from this post as a Jupyter notebook. Querying Wikidata to Identify Globally Famous Baseball Players Earlier this year I had the pleasure of attending a lecture by Cesar Hidalgo of MIT's Media Lab One of the projects Hidalgo discussed was Pantheon Pantheon is a website and dataset that ranks "globally famous individuals" based on a metric the team created called the Historical Popularity Index (HPI) A key component of HPI is the number of Wikipedia pages an individual has in in various languages For a complete description of the project, see: Yu, A Python ETL and JSON-LD I've written an extension to petl, a Python ETL library, that applies JSON-LD contexts to data tables for transformation into RDF. The problem Converting existing data to RDF, such as for VIVO, often involves taking tabular data exported from a system of record, transforming or augmenting it in some way, and then mapping it to RDF for ingest into the platform The W3C maintains an extensive list of tools designed to map tabular data to RDF. General purpose CSV to RDF tools, however, almost always require some advanced preparation or cleaning of the data This means that developers and data wranglers often have to write custom code This code can quickly become verbose and difficult to maintain Using an ETL toolkit can help with this. ETL with Python One such ETL tool that I'm having good results with is petl, Python ETL OrgRef data as RDF Summary: Notes on mapping OrgRef to DBPedia and publishing with Linked Data Fragments . This past fall, Data Salon, a UK-based data services company, released an open dataset about academic and research organizations called OrgRef The data is available as a CSV and contains basic information about over 30,000 organizations. OrgRef was created with publishers in mind, and so its main focus is on institutions involved with academic content: universities, colleges, schools, hospitals, government agencies and companies involved in research. 
This announcement caught our attention at my place of work because we are compiling information about educational organizations in multiple systems, including a VIVO instance, and are looking for manageable ways to consume Linked Data that will enrich or augment our local systems. Since the OrgRef data has been curated and focuses on a useful subset of data that we are interested in, it seemed to be a good candidate for investigation, even though it isn't published as RDF. Due to its size, it is also easier to work with than attempting to consume and process something like VIAF or DBPedia itself.
Process
We downloaded the OrgRef CSV dataset and used the ever-helpful csvkit tool to get a handle on what data elements exist.
$ csvstat --unique orgref.csv
1. Name: 31149
2.
lawlesst-github-io-574 ---- Ted Lawless
I'm Ted Lawless, an application developer based in Ann Arbor, MI, working in higher education. I post brief articles or technical notes from time to time about working with metadata, Web APIs and data management tools. See the list below. I've also compiled a list of presentations and projects that I've been involved with. If any of this is of interest to you, please feel free to contact me via email (lawlesst at gmail), Github, LinkedIn, or Twitter.
Posts
Datasette hosting costs 1-16-21
Connecting Python's RDFLib to AWS Neptune 03-15-19
Usable sample researcher profile data 05-19-18
Exploring 10 years of the New Yorker Fiction Podcast with Wikidata 02-06-18
Now Publishing Complete Lahman Baseball Database with Datasette 12-03-17
Publishing the Lahman Baseball Database with Datasette 11-20-17
SPARQL to Pandas Dataframes 10-26-17
Querying Wikidata to Identify Globally Famous Baseball Players 10-18-16
Python ETL and JSON-LD 12-05-15
OrgRef data as RDF 01-10-2015
See a full list of posts or the RSS feed. Ted Lawless, 2021. lawlesst at gmail, Github, LinkedIn, Twitter.
legalhackers-org-6697 ---- Global Chapters | Legal Hackers Our Story Blog Global Chapters Press Videos Governance 2019 Summit Search
Global Chapters
Want to be a part of the largest grassroots legal innovation movement in the world? Join us! Legal Hackers is an open and collaborative community of individuals passionate about exploring and building creative solutions to some of the most pressing issues at the intersection of law and technology. Since 2012, we have used the hashtag #legalhack to share the activities of the global Legal Hackers community.
Legal Hackers Online Communities
Twitter: @legalhackers
Facebook: www.facebook.com/groups/legalhackers
Slack: legalhackers.slack.com (invitation here)
LinkedIn: https://www.linkedin.com/groups/4782208
Hashtag: #legalhack
Legal Hackers Local Chapters
Legal Hackers is the largest grassroots legal innovation community in the world, with chapters in many major cities. Check out the list below to find your local Legal Hackers community. If you don't see a community near you below, apply to start your own by clicking the appropriate link below:
I want to start a traditional Legal Hackers Chapter for my city or region
I want to start a student-only Legal Hackers Student Group for my university
Questions? Read more about the differences between Chapters and Student Groups here, or email us at: info [at] legalhackers [dot] org.
North AmericaEuropeAfricaAsiaAustralia & New ZealandLATAMStudent Groups Atlanta, Georgia Baltimore, Maryland Boston, Massachusetts Chicago, Illinois Cleveland, Ohio DFW (Dallas-Fort Worth), Texas Denver, Colorado Detroit, Michigan Houston, Texas Kansas City, Missouri London, Ontario Miami, Florida Minneapolis-St. Paul, Minnesota Montreal, Québec Nashville, Tennessee New Orleans, Louisiana New York, New York North Carolina Orlando, Florida Ottawa, Ontario Philadelphia, Pennsylvania Portland, Oregon Puerto Rico Salt Lake City, Utah San Diego, California San Francisco, California Seattle, Washington Toronto, Ontario Tulsa, Oklahoma Vancouver, British Columbia Washington, D.C. Amsterdam, Netherlands Asturias, Spain Athens, Greece Barcelona, Spain Bari, Italy Belfast, Northern Ireland Belgrade, Serbia Berlin, Germany Bern, Switzerland Bilbao, Spain Bologna, Italy Bristol, England Brno, Czech Republic Brussels, Belgium Bucharest, Romania Chișinău, Moldova Cologne/Bonn, Germany Copenhagen, Denmark Dublin, Ireland Estonia Firenze, Italy Frankfurt, Germany Geneva, Switzerland Genova, Italy Ghent, Belgium The Hague, Netherlands Hamburg, Germany Helsinki, Finland Istanbul, Turkey Kyiv, Ukraine Limassol, Cyprus Lisbon, Portugal Ljubljana, Slovenia London, England Luxembourg Lviv, Ukraine Madrid, Spain Malaga, Spain Manchester, England Mantova, Italy Milan, Italy Moscow, Russia Munich, Germany Napoli-Campania, Italy Novi Sad, Serbia Nürnberg, Germany Padova, Italy Paris, France Perugia, Italy Pescara, Italy Pisa, Italy Porto, Portugal Preston, England Roma, Italy Rijeka, Croatia Scotland Sheffield, England Skopje, Macedonia Sofia, Bulgaria St. Petersburg, Russia Stockholm, Sweden Timisoara, Romania Torino, Italy Toulouse, France Trieste, Italy Valencia, Spain Venezia, Italy Verona, Italy Vienna, Austria Vilnius, Lithuania Warsaw, Poland Zagreb, Croatia Zurich, Switzerland Abuja, Nigeria Accra, Ghana Alexandria, Egypt Cape Town, South Africa Casablanca, Morocco Douala, Cameroon Enugu, Nigeria Harare, Zimbabwe Imo, Nigeria Kampala, Uganda Lagos, Nigeria Luanda, Angola Nairobi, Kenya Almaty, Kazakhstan Ankara, Turkey Bhopal, India Chandigarh, India Delhi, India Goa, India Hong Kong Jakarta, Indonesia Jeddah, Saudi Arabia Kuala Lumpur, Malaysia Lahore, Pakistan Lucknow, India Manila, Philippines Patna, India Pune, India Seoul, South Korea Singapore Tokyo, Japan Melbourne, Australia Perth, Australia Sydney, Australia Wellington, New Zealand Aguascalientes, Mexico Arequipa, Peru Baja, Mexico Barranquilla, Colombia Belém, Brazil Belo Horizonte, Brazil Bogota, Colombia Brasília, Brazil Buenos Aires, Argentina Campinas, Brazil Cuiabá, Brazil Curitiba, Brazil Cusco, Peru Fortaleza, Brazil Goiânia, Brazil Guadalajara, Mexico Guatemala City, Guatemala Guayaquil, Ecuador Imperatriz. 
Brazil Jaraguá do Sul, Brazil Lavras, Brazil Lima, Peru Manaus, Brazil Manizales, Colombia Maringá, Brazil Medellin, Colombia Mexico City, Mexico Mogi das Cruzes, Brazil Monterrey, Mexico Montevideo, Uruguay Natal, Brazil Panama City, Panama Passo Fundo, Brazil Pereira, Colombia Petrolina, Brazil Porto Alegre, Brazil Porto Velho, Brazil Puebla, Mexico Querétaro, Mexico Quito, Ecuador Recife, Brazil Rio de Janiero, Brazil Salvador, Brazil Santa Cruz, Bolivia Santo André, Brazil São Paulo, Brazil San Salvador, El Salvador Sete Lagoas, Brazil Tegucigalpa, Hondouras Tepic, Mexico Kansas, USA – Kansas University New Brunswick, Canada – University of New Brunswick New York, USA – Brooklyn Law School North Carolina, USA – Wake Forest University School of Law Tennessee, USA – University of Tennessee College of Law Toronto, Canada – University of Toronto Coventry, England – University of Warwick Kyiv, Ukraine – National University of Kyiv-Mohyla Academy Kyiv, Ukraine – Taras Shevchenko National University of Kyiv London, England – University College London Sheffield, England – University of Sheffield Tarragona, Spain – Universitat Rovira i Virgili Kharagpur, India – Indian Institute of Technology Quito, Ecuador – Pontificia Universidad Católica del Ecuador (PUCE) Tweets by LegalHackers Our Story Blog Global Chapters Press Videos Governance 2019 Summit Type and Press “enter” to Search Our Story Global Chapters Governance Press librarian-aedileworks-com-5940 ---- Librarian of Things Skip to content Librarian of Things Weeknote 9 (2021) §1 Zotero PDF Reader A new look and functionality for Zotero’s PDF Reader is still in beta. I can’t wait for this version to be unleashed! §2 MIT D2O Earlier this week, MIT Press announced a new Open Access Monograph program. It appears that the transition of scholarly ebooks to another form of subscription product is continuing. §3 AI but Canadian I’m glad to see that the Federal Government has an Advisory Council on AI and I hope they are going to meaningfully fulfill their mandate. We are already late of the gate on this front. The city where I live is already trialing software that will suggest where road safety investments should be made based on an AI’s recommendations. §4 Discovering Science Misconduct via Image Integrity Not new but new to me. I’ve recently started following Elisabeth Bik on Twitter and it has been an eye-opening experience. Bik, a microbiologist from the Netherlands who moved to the United States almost two decades ago, is a widely lauded super-spotter of duplicated images in the scientific literature. On a typical day, she’ll scan dozens of biomedical papers by eye, looking for instances in which images are reused and reported as results from different experiments, or where parts of images are cloned, flipped, shifted or rotated to create ‘new’ data… Her skill and doggedness have earned her a worldwide following. “She has an uncommon ability to detect even the most complicated manipulation,” says Enrico Bucci, co-founder of the research-integrity firm Resis in Samone, Italy. Not every issue means a paper is fraudulent or wrong. But some do, which causes deep concern for many researchers. “It’s a terrible problem that we can’t rely on some aspects of the scientific literature,” says Ferric Fang, a microbiologist at the University of Washington, Seattle, who worked on a study with Bik in which she analysed more than 20,000 biomedical papers, finding problematic duplications in roughly 4% of them (E. M. Bik et al. mBio 7, e00809-16; 2016). 
“You have postdocs and students wasting months or years chasing things which turn out to not be valid,” he says. Nature 581, 132-136 (2020), doi: https://doi.org/10.1038/d41586-020-01363-z §5 And here’s one thing I did this week! Author Mita WilliamsPosted on March 5, 2021Categories weeknotesLeave a comment on Weeknote 9 (2021) Weeknote 8 (late) 2021 Last week I had a week that was more taxing than normal and I had nothing in the tank by Friday. So I’m putting together last week’s weeknotes today. Also, going forward each section heading has been anchor tagged for your link sharing needs. e.g. §1 §2 §3 §4 §5 and §6. I say this recognizing that the weeknote format resists social sharing which I consider a feature not a bug. §1 We Are Here From Library and Archives Canada: Over the past three years, We Are Here: Sharing Stories has digitized and described over 590,000 images of archival and published materials related to First Nations, Inuit and the Métis Nation. Digitized and described content includes textual documents, photographs, artworks and maps as well as numerous language publications. All items are searchable and linked in our Collection Search or Aurora databases. In order to make it easier to locate recently digitized Indigenous heritage content at LAC, we have created a searchable list of the collections and introduced a Google map feature – allowing users to browse archival materials by geographic region! Visit the We Are Here: Sharing Stories page to pick your destination and start your research! Those who know me, know that I’ve been advocating for more means of discovery via maps and location for a while now. While my own mapping has slowed down, I still bookmarked Georeferencing in QGIS 2.0 from The Programming Historian today. If used appropriately, maps hold a great deal of potential as a means to discover works related to indigenous peoples. Some forms of Indigenous Knowledge Organization such as the X̱wi7x̱wa Classification Scheme emphasize geographic grouping over alphabetical grouping. §2 Bookfeedme, Seymour! * Not every author has a newsletter that you can subscribe to in order to be informed when they have a new book out. You would think it would be easier to be notified otherwise, but with the mothballing of Amazon Alerts, the only other way I know to be notified is through Bookfeed.io which uses the Google Books API at its core. If don’t have a familiarity with RSS, see About Feeds for more help. * musical reference §3 Best article title in librarianship for 2021 Ain’t no party like a LibGuides Party / ’cause a LibGuides Party is mandatory ** ** musical reference §4 This is the time and this is the record of the time *** ScholComm librarians ask: Do we want a Version of Record or Record of Versions? *** musical reference §5 The 5000 Fingers of Dr. T **** A Hand With Many Fingers is a first-person investigative thriller. While searching through a dusty CIA archive you uncover a real Cold War conspiracy. Every document you find has new leads to research. 
But the archive might not be as empty as you think…   – Slowly unravel a thrilling historical conspiracy – Discover new clues through careful archival research – Assemble your theories using corkboard and twine – Experience a story of creeping paranoia **** musical reference / movie reference Hat tip: Errant Signal’s Bad Bosses, Beautiful Vistas, and Baffling Mysteries: Blips Episode 8 §6 Citational politics bibliography I’m not entirely sure how this bibliography on the politics of citation and references crossed my twitter stream, but I immediately bookmarked it. The bibliography is from a working group of CLEAR from Memorial University: Civic Laboratory for Environmental Action Research (CLEAR) is an interdisciplinary natural and social science lab space dedicated to good land relations directed by Dr. Max Liboiron at Memorial University, Canada. Equal parts research space, methods incubator, and social collective, CLEAR’s ways of doing things, from environmental monitoring of plastic pollution to how we run lab meetings, are based on values of humility, accountability, and anti-colonial research relations. We specialize in community-based and citizen science monitoring of plastic pollution, particularly in wild food webs, and the creation and use of anti-colonial research methodologies. To change science and research from its colonial, macho, and elitist norms, CLEAR works at the level of protocol. Rather than lead with good intentions, we work to ensure that every step of research and every moment of laboratory life exemplifies our values and commitments. To see more of how we do this, see the CLEAR Lab Book, our methodologies, and media coverage of the lab. About CLEAR I have no musical reference for this. Author Mita WilliamsPosted on March 1, 2021Categories citations, weeknotesLeave a comment on Weeknote 8 (late) 2021 Weeknote 7 (2021) Today the library is closed as is my place of work’s tradition on the last day of Reading Week. But as I have three events (helping in a workshop, giving a presentation, participating in a focus group) in my calendar, I’m just going to work the day and bank the time for later. §1 Barbara Fister in The Atlantic! We are experiencing a moment that is exposing a schism between two groups: those who have faith that there is a way to arrive at truth using epistemological practices that originated during the Enlightenment, and those who believe that events and experiences are portents to be interpreted in ways that align with their personal values. As the sociologist and media scholar Francesca Tripodi has demonstrated, many conservatives read the news using techniques learned through Bible study, shunning secular interpretations of events as biased and inconsistent with their exegesis of primary texts such as presidential speeches and the Constitution. The faithful can even acquire anthologies of Donald Trump’s infamous tweets to aid in their study of coded messages. While people using these literacy practices are not unaware of mainstream media narratives, they distrust them in favor of their own research, which is tied to personal experience and a high level of skepticism toward secular institutions of knowledge. This opens up opportunities for conservative and extremist political actors to exploit the strong ties between the Republican Party and white evangelical Christians. The conspiracy theory known as QAnon is a perfect—and worrisome—example of how this works. After all, QAnon is something of a syncretic religion. 
But its influence doesn't stop with religious communities. While at its core it's a 21st-century reboot of a medieval anti-Semitic trope (blood libel), it has shed some of its Christian vestments to gain significant traction among non-evangelical audiences. §2 New to me: Andromeda Yelton's course reading list dedicated to AI in the Library. Hat-tip to Beck Tench. §3 I recently suggested that MPOW's next Journal Club should deviate from looking at the library literature and reflect on personal knowledge management. I'm not sure how much uptake there will be on the topic, but I love reading about how other people deliberately set up systems to help them learn. Case in point: Cecily Walker's Thoughts Like A Runaway Train: Notes on Information Management with Zettelkasten. Fun fact: I first learned of Zettelkasten from Beck Tench. Author Mita Williams. Posted on February 19, 2021. Categories: weeknotes. 2 Comments on Weeknote 7 (2021)
Weeknote 6 (2021)
Another week in which I was doing a lot of behind-the-scenes work. §1 Duly noted: Here's the article in full. §2 Years ago, I gave a keynote called Libraries are for use. And by use, I mean copying, which featured the short and sad story of a person who was unable to donate their ebook to their local library. I thought of this slide this week when I learned that the DPLA is now offering an ebook creation service that allows a library to create an ebook collection — albeit of openly licensed or public domain works. I downloaded the SimplyE app for my iPad and I found it simple and well-designed. Having access to a good set of public domain works is great, although I was slightly disappointed that it wasn't possible to import my own collection of ebooks into the app. But if I was a library, that's what I could do. §3 I'm not ready to share my thoughts on this next matter yet, but I've recently been re-considering how much of our knowledge is socially constructed. As such, I am still mulling over Harold Jarche's Subject Matter Networks. It begins, We live in a networked world. Is it even possible for one person to have sufficient expertise to understand a complex situation such as this pandemic? So do we rely on one subject matter expert or rather a subject matter network? Author Mita Williams. Posted on February 12, 2021. Categories: weeknotes. Leave a comment on Weeknote 6 (2021)
Weeknote 5 (2021)
§1 Last Friday I was interviewed for the podcast The Grasscast — a game-themed podcast named after the book, The Grasshopper: Games, Life, and Utopia. I ramble a little bit in the episode as I tried to be more open and conversational than concise and correct. But I also spoke that way because, for some of the questions, no pat answer came immediately to mind. There was one question that stumped me, but in trying to answer it, I think I found something I had not considered before. The question was, What is one bad thing about games? And I tried to convey that, unlike video games where you can play with strangers, most tabletop games are generally constrained by the preferences of your social circles. In order to convince others to spend time on a game that they might think is too complicated for them or not for them, you need to be a successful evangelist. Also, the episode drifts into chatter about libraries, copyright and ebooks.
§2 This week, I reviewed and published another batch of works for our institutional repository from our department of History that was prepared by our library assistants at Leddy. At this point, we have reviewed and uploaded the works of half the faculty from this department. I'm hoping to finish the rest this month, but I think I have some outstanding H5P work that might push the end of this project until March. §3 This morning I assisted with an online workshop called Data Analysis and Visualization in R for Ecologists that was being led by a colleague of mine. R Version 4.0.3 ("Bunny-Wunnies Freak Out") was released on 2020-10-10. The release of R 4.0.4 ("Lost Library Book") is scheduled for Monday 2021-02-15. §4 On Sunday, I published a short response to "Windsor Works – An Economic Development Strategy", which is going to City Council on Monday. Why am I writing about this document here? I mention it here because the proposed strategy (L.I.F.T.) lists the following as a potential metric for measuring the strategy's success… Take it from me, someone who knows quite a bit about citations — the city should use another metric — perhaps one pertaining to local unemployment levels instead. §5 A viral post from 2019 resurfaced on my FB feed this week and, unlike most of the posts I read there, this one did spark joy: And it struck me how much I loved that the anti-prom was being held at the library. So I started doing some research! It appears to me that some anti-proms are technically better described as alternative proms. These proms have been established as an explicitly safe place where LGBTQ young people can enjoy prom. Other anti-proms are true morps. I now wonder what other anti-traditions should find a home at the public library. Author Mita Williams. Posted on February 5, 2021. Categories: weeknotes. Leave a comment on Weeknote 5 (2021)
Weeknote 4 (2021)
I don't have much that I can report in this week's note. You are just going to have to take my word that this week, a large amount of my time was spent at meetings pertaining to my library department, my union, and anti-black racism work. §1 Last year, around this same time, some colleagues from the University and I organized a speaking event called Safer Communities in a 'Smart Tech' World: We need to talk about Amazon Ring in Windsor. Windsor's Mayor proposes we be the first city in Canada to buy into the Ring Network. As residents of Windsor, we have concerns with this potential project. Seeing no venue for residents of Windsor to share their fears of surveillance and loss of privacy through this private partnership, we hosted an evening of talks on January 22nd, 2020 at The Performance Hall at the University of Windsor's School of Creative Arts Windsor Armories Building. Our keynote speaker was Chris Gilliard, heard recently on CBC's Spark. Since that evening, we have been in the media raising our concerns, asking questions, and encouraging others to do the same. The City of Windsor has yet to enter an agreement with Amazon Ring. This is good news. This week, the City of Windsor announced that it has entered a one-year partnership with Ford Mobility Canada to share data and insights via Ford's Safety Insights platform. I don't think this is good news, for reasons outlined in this post called Safety Insights, Data Privacy, and Spatial Justice. §2 This week I learned a neat Tweetdeck hack.
If set up a search as column, you can limit the results for that term using the number of ‘engagements’: §3 §4 I haven’t read this but I have it bookmarked for potential future reference: The weaponization of web archives: Data craft and COVID-19 publics: An unprecedented volume of harmful health misinformation linked to the coronavirus pandemic has led to the appearance of misinformation tactics that leverage web archives in order to evade content moderation on social media platforms. Here we present newly identified manipulation techniques designed to maximize the value, longevity, and spread of harmful and non-factual content across social media using provenance information from web archives and social media analytics. After identifying conspiracy content that has been archived by human actors with the Wayback Machine, we report on user patterns of “screensampling,” where images of archived misinformation are spread via social platforms. We argue that archived web resources from the Internet Archive’s Wayback Machine and subsequent screenshots contribute to the COVID-19 “misinfodemic” in platforms. Understanding these manipulation tactics that use sources from web archives reveals something vexing about information practices during pandemics—the desire to access reliable information even after it has been moderated and fact-checked, for some individuals, will give health misinformation and conspiracy theories more traction because it has been labeled as specious content by platforms. §5 I’m going to leave this tweet here because I might pick up this thread in the future: This reminds me of a talk given in 2018 by Data & Society Founder and President, danah boyd called You Think You Want Media Literacy… Do You? This essay still haunts me, largely because we still don’t have good answers for the questions that Dr. Boyd asks of us and the stakes have only gotten higher. Author Mita WilliamsPosted on January 29, 2021January 29, 2021Categories weeknotesLeave a comment on Weeknote 4 (2021) Weeknote 3 (2021) Hey. I missed last week’s weeknote. But we are here now. §1 This week I gave a class on searching scientific literature to a group of biology masters students. While I was making my slides comparing the Advanced Search capabilities of Web of Science and Scopus, I discovered this weird behaviour of Google Scholar: a phrase search generated more hits than not. I understand that Google Scholar performs ‘stemming’ instead of truncation in generating search results but this still makes no sense to me. §2 New to me: if you belong to an organization that is already a member of CrossRef, you are eligible to use a Similarity Check of documents for an additional fee. Perhaps this is a service we could provide to our OJS editors. §3 I’m still working through the Canadian Journal of Academic Librarianship special issue on Academic Libraries and the Irrational. Long time readers know that I have a fondness for the study of organizational culture and so it should not be too surprising that the first piece I wanted to read was The Digital Disease in Academic Libraries. It begins…. THOUGH several recent books and articles have been written about change and adaptation in contemporary academic libraries (Mossop 2013; Eden 2015; Lewis 2016), there are few critical examinations of change practices at the organizational level. 
One example, from which this paper draws its title, is Braden Cannon’s (2013) The Canadian Disease, where the term disease is used to explore the trend of amalgamating libraries, archives, and museums into monolithic organizations. Though it is centered on the impact of institutional convergence, Cannon’s analysis uses an ethical lens to critique the bureaucratic absurdity of combined library-archive-museum structures. This article follows in Cannon’s steps, using observations from organizational de-sign and management literature to critique a current trend in the strategic planning processes and structures of contemporary academic libraries. My target is our field’s ongoing obsession with digital transformation beyond the shift from paper-based to electronic resources, examined in a North American context and framed here as The Digital Disease. I don’t want to spoil the article but I do want to include this zinger of a symptom which is the first of several: If your library’s organizational chart highlights digital forms of existing functions, you might have The Digital Disease. Kris Joseph, The Digital Disease in Academic Libraries, Canadian Journal of Academic Librarianship, Vol 6 (2020) Ouch. That truth hurts almost as much as this tweet did: Author Mita WilliamsPosted on January 22, 2021January 22, 2021Categories weeknotesLeave a comment on Weeknote 3 (2021) Weeknote 1 (2021) This week’s post is not going to capture my ability to be productive while white supremacists appeared to be ushered in and out of the US Capitol building by complicit police and COVID-19 continued to ravage my community because our provincial government doesn’t want to spend money on the most vulnerable. Instead, I’m just going to share what I’ve learned this week that might prove useful to others. This week I added works to three faculty member’s ORCiD profiles using ORCiD’s trusted individual functionality. One of these professors was works in the field of Psychology and I found the most works for that researcher using BASE (Bielefeld Academic Search Engine) including APA datasets not found elsewhere. Similarly, I found obscure ERIC documents using The Lens.org. Unfortunately, you can’t directly import records into The Lens into an ORCiD profile unless you create a Lens profile for yourself. I’ve added The Lens to my list of free resources to consult when looking for research. This list already includes Google Scholar and Dimensions.ai. fin Author Mita WilliamsPosted on January 8, 2021February 4, 2021Categories weeknotesLeave a comment on Weeknote 1 (2021) Weeknote 50 (2020) §1 It looks like Andromeda Yelton is sharing weeknotes (“This week in AI“). I can’t wait to see what she shares with us all in 2021. §2 Earlier this fall, Clarivate Analytics announced that it was moving toward a future that calculated the Journal Impact Factor (JIF) based on the date of electronic publication and not the date of print publication… This discrepancy between how Clarivate treated traditional print versus online-only journals aroused skepticism among scientists, some of whom… cynically suggested that editors may be purposefully extending their lag in an attempt to artificially raise their scores. 
Changes to Journal Impact Factor Announced for 2021, Scholarly Kitchen, Phil Davis, Dec 7, 2020. I don't think there is anything cynical about the observation that journal publishers picked up a trick from those booksellers who actively engage in promoting pre-publication book sales, because those weeks of sales are accumulated and counted in the first week of publication, which results in a better chance of landing on the New York Times Bestseller list. §3 In 2020, a team at Georgia State University compiled a report on virtual learning best practices. While evidence in the field is "sparse" and "inconsistent," the report noted that logistical issues like accessing materials—and not content-specific problems like failures of comprehension—were often among the most significant obstacles to online learning. It wasn't that students didn't understand photosynthesis in a virtual setting, in other words—it was that they didn't find (or simply didn't access) the lesson on photosynthesis at all. That basic insight echoed a 2019 study that highlighted the crucial need to organize virtual classrooms even more intentionally than physical ones. Remote teachers should use a single, dedicated hub for important documents like assignments… The 10 Most Significant Education Studies of 2020, Edutopia, by Youki Terada and Stephen Merrill, December 4, 2020. §4 I'm pleased to say that, with some much-appreciated assistance, our OJS instances are now able to connect authors with their ORCiD profiles. This means that all authors who have articles accepted by these journals will receive an email asking if they would like to connect to ORCiD. I was curious how many authors from one of our existing journals had existing ORCiD profiles, so I did a quick check. This is how I did it. First, I used OJS's export function to download all the metadata available at an article level. Next, I used the information from that .csv file to create a new spreadsheet of full names. I then opened this file using OpenRefine. Then, through the generosity of Jeff Chiu, I was able to check these names against the ORCiD API using the OpenRefine Reconciliation Service and Chiu's SmartName server: http://refine.codefork.com/reconcile/orcid/smartnames. Using the smart name integration, I can limit the list to those names very likely to match. With this set of likely suspects in hand, I can locate the authors in the OJS backend and then send invitations from the OJS server from their author profile (via the published article's metadata page): §5 I can't wait to properly tuck into this issue of The Canadian Journal of Academic Librarianship with its Special Focus on Academic Libraries and the Irrational. §6 Happy Solstice, everyone. Author Mita Williams. Posted on December 21, 2020. Categories: weeknotes. Leave a comment on Weeknote 50 (2020)
Weeknote 49 (2020)
§1 I don't have much to report in regards to the work I've been doing this week. I tried to get our ORCiD-OJS plugin to work, but there is some small strange bug that needs to be squished. Luckily, next week I will have the benefit of assistance from the good people of CRKN and ORCiD-CA. What else? I uploaded a bunch of files into our IR. I set up a site for an online-only conference being planned for next year. And I finally got around to trying to update a manuscript for potential publication. But this writing has been very difficult, as my attention has been sent elsewhere many times this week.
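Related to the ORCiD name-checking workflow in Weeknote 50 above: the post used OpenRefine with Jeff Chiu's SmartNames reconciliation service, but a similar quick check can be scripted against ORCiD's public search API. The endpoint, query syntax, and field names below are my assumptions about that public API rather than anything taken from the post, and the author names are hypothetical.
import requests

ORCID_SEARCH = "https://pub.orcid.org/v3.0/search/"

def count_orcid_matches(given_names, family_name):
    """Return how many public ORCiD records match a given/family name pair.
    A count of exactly 1 is a reasonable signal of a likely match; 0 or
    many usually needs human review, as in the OpenRefine workflow."""
    query = f'given-names:"{given_names}" AND family-name:"{family_name}"'
    response = requests.get(
        ORCID_SEARCH,
        params={"q": query},
        headers={"Accept": "application/json"},
        timeout=30,
    )
    response.raise_for_status()
    return response.json().get("num-found", 0)

# Hypothetical rows from the spreadsheet of full names exported from OJS.
authors = [("Ada", "Lovelace"), ("Grace", "Hopper")]
for given, family in authors:
    print(given, family, count_orcid_matches(given, family))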
§2 Unfortunately I wasn’t able to catch the live Teach-In #AgainstSurveillance on Tuesday but luckily the talks have been captured and made available at http://againstsurveillance.net/ So many of our platforms are designed to extract user data. But not all of them are. Our institutions of higher education could choose to invest in free range ed-tech instead. §3 Bonus links! Making a hash out of knitting with data shannon_mattern’s Library | Zotero Mystery File! Author Mita WilliamsPosted on December 4, 2020December 4, 2020Categories weeknotesLeave a comment on Weeknote 49 (2020) Posts navigation Page 1 Page 2 … Page 6 Next page About me Librarian of Things is a blog by me, Mita Williams, who used to blog at New Jack Librarian until Blogger.com finally gave up the ghost. If you don’t have an RSS reader, you can subscribe for email delivery through mailchimp. You can learn more about my work at aedileworks.com as well as my other blogs and my weekly newsletter. If you are an editor of a scholarly journal and think that a post could be expanded into a more academic form, please let me know. Search for: Search Recent Posts Weeknote 9 (2021) Weeknote 8 (late) 2021 Weeknote 7 (2021) Weeknote 6 (2021) Weeknote 5 (2021) Archives March 2021 February 2021 January 2021 December 2020 November 2020 October 2020 September 2020 June 2020 May 2020 April 2020 October 2019 June 2019 May 2019 April 2019 March 2019 January 2019 July 2018 June 2018 May 2018 April 2018 December 2017 June 2017 May 2017 April 2017 November 2016 August 2016 July 2016 Meta Log in Entries feed Comments feed WordPress.org Librarian of Things Proudly powered by WordPress librarian-aedileworks-com-650 ---- Librarian of Things Librarian of Things Weeknote 9 (2021) §1 Zotero PDF Reader A new look and functionality for Zotero’s PDF Reader is still in beta. I can’t wait for this version to be unleashed! §2 MIT D2O Earlier this week, MIT Press announced a new Open Access Monograph program. It appears that the transition of scholarly ebooks to another form of subscription product … Continue reading "Weeknote 9 (2021)" Weeknote 8 (late) 2021 Last week I had a week that was more taxing than normal and I had nothing in the tank by Friday. So I’m putting together last week’s weeknotes today. Also, going forward each section heading has been anchor tagged for your link sharing needs. e.g. §1 §2 §3 §4 §5 and §6. I say this … Continue reading "Weeknote 8 (late) 2021" Weeknote 7 (2021) Today the library is closed as is my place of work’s tradition on the last day of Reading Week. But as I have three events (helping in a workshop, giving a presentation, participating in a focus group) in my calendar, I’m just going to work the day and bank the time for later. §1 Barbara … Continue reading "Weeknote 7 (2021)" Weeknote 6 (2021) Another week in which I was doing a lot of behind the scenes work. §1 Duly noted: Here’s the article in full. §2 Years ago, I gave a keynote called Libraries are for use. And by use, I mean copying that featured the short and sad story of a person who was unable to donate … Continue reading "Weeknote 6 (2021)" Weeknote 5 (2021) §1 Last Friday I was interviewed for the podcast The Grasscast — a game-themed podcast named after the book, The Grasshopper: Games, Life, and Utopia. I ramble a little bit in the episode as I tried to be more open and conversational than concise and correct. 
But I also spoke that way because for some … Continue reading "Weeknote 5 (2021)" Weeknote 4 (2021) I don’t have much that I can report in this week’s note. You are just going to have to take my word that this week, a large amount of my time was spent at meetings pertaining to my library department, my union, and anti-black racism work. §1 Last year, around this same time, some colleagues … Continue reading "Weeknote 4 (2021)" Weeknote 3 (2021) Hey. I missed last week’s weeknote. But we are here now. §1 This week I gave a class on searching scientific literature to a group of biology masters students. While I was making my slides comparing the Advanced Search capabilities of Web of Science and Scopus, I discovered this weird behaviour of Google Scholar: a … Continue reading "Weeknote 3 (2021)" Weeknote 1 (2021) This week’s post is not going to capture my ability to be productive while white supremacists appeared to be ushered in and out of the US Capitol building by complicit police and COVID-19 continued to ravage my community because our provincial government doesn’t want to spend money on the most vulnerable. Instead, I’m just going … Continue reading "Weeknote 1 (2021)" Weeknote 50 (2020) §1 It looks like Andromeda Yelton is sharing weeknotes (“This week in AI“). I can’t wait to see what she shares with us all in 2021. §2 Earlier this fall, Clarivate Analytics announced that it was moving toward a future that calculated the Journal Impact Factor (JIF) based on the date of electronic publication and not … Continue reading "Weeknote 50 (2020)" Weeknote 49 (2020) §1 I don’t have much to report in regards to the work I’ve been doing this week. I tried to get our ORCiD-OJS plugin to work but there is some small strange bug that needs to be squished. Luckily, next week I will have the benefit of assistance from the good people of CRKN and … Continue reading "Weeknote 49 (2020)" libraries-uc-edu-2824 ---- Libraries | University Of Cincinnati Skip to main content Use the form to search UC's web site for pages, programs, directory profiles and more. Libraries Online Library For Faculty For Graduate Students For Undergraduates For Staff Libraries Archives and Rare Books About the Archives and Rare Books Library Annual Summary Research Policies Staff FAQs Desiderata Collections Urban Studies Rare Books University Archives German Americana Local Government Records Search ARB Collections Records Management Disposal Submission Form Online Exhibits Special Projects Services Genealogy Research Image Reproduction and Use Archives and Rare Books Teaching Support Internship Program CCM CCM Catalog Search CCM Research About the CCM Library CCM Staff Directory CCML FAQs CCM Services CCM Special-Collections CEAS About CEAS Library History Floor Plan Guide for New Faculty Course Research Guides Research Resources Ask a Librarian Tutorial Videos Senior Design Reports Special Collections The Armstrong Collection The Cooperative Engineer The Strauss Collection Contact Us Ask a Librarian Reserve a Room CECH About Faculty & Staff Our Collections Borrowing Guidelines Find Us Services Poster Printing Reserves Instruction Resources MakerLab Technology for Checkout Study Spaces Info Commons Chemistry-Biology About Staff Oesper History of Chemistry Collection Services Getting Around the Library Ask a Librarian Classics About the Classics Library Classics Library Guide Snapshot of the Classics Collections Highlights of Classics Books Classics Collection Development Policy Why a Classics Library? 
Classics Book of the Month Classics Library's Open Access Link of the Month Classics Library Book Desiderata Virtual Tour of the Classics Library Usage Statistics Staff Directory Recent Book Acquisitions Classicizing Cincinnati Classics Library Policies Classics Library Collections German Classics Dissertations Modern Greek Journal Collection Classics Map Collection Greek Rare Book Collection Latin Rare Book Collection Classics Books with Author Signatures UC Department of Classics Archive Classics Library Services Group Study Room in Classics Scanners, Printers, Copiers in Classics Tours and Drop-Ins Classics Library Picture Gallery DAAP Collections Architecture Drawings Related Regional Libraries Exhibits Instruction Services Study Rooms Contact Us DAAP Library COVID-19 Updates Geology-Mathematics-Physics About the Library History Getting around the Library Help Ask-a-Librarian Help for Faculty Help for Students Help for Undergraduate Students Services New Books Special Collections Rare Book Collection Willis G. Meyer Map Collection Guidebook Collection Health Sciences Services Membership Room Reservations Borrow HSL-IT Research Help HSL History HSL Directions HSL Staff Directory Winkler Center About Cecil Striker Society & Lecture Resources Services Langsam Law UC Blue Ash About the UCBA Library UCBA Library Faculty & Staff UCBA Library Policies Vision and Core Values Annual Reports Student Employment at UCBA Library COVID-19 Services FAQs Borrowing Materials at UCBA Library Borrowing & Returning Reserves Equipment Lending Study Spaces Resources for UCBA Faculty and Staff Course Reserve Guidelines Collections Library Liaison Program Space Reservations Teaching Support Ask the UCBA Library UC Clermont Library About UC Clermont Library Student Employment Support the Library Collection Development Contact Us 2020-2021 FAQs Borrow Materials Technology & Equipment Textbook Reserves Study Spaces Policies and Guidelines Teaching Support Information Literacy Course Materials Course Reserves Ask UC Clermont Winkler Center Other Area Libraries Ask Find, Request, Borrow Search for Materials Call Number Locator (Langsam Library) Borrow Materials Borrow Equipment Renew Materials Request Materials Reserves E-Reserves Faculty Guidelines Traditional Course Reserves Reserves - Contacts Copyright Resources Textbook Affordability Help Finding and Using Materials Interlibrary Loan Special Collections FAQ Digital Collections Research & Teaching Support Research Data Services Lab Spaces Workshops and Education Meet the Team UC Data Day Data & Computational Science Series Data Tools Testimonials Data Visualization Showcase Digital Scholarship Center Citing Sources Copyright Repositories Subject Librarians UC Press Teaching Support Workshops & Trainings Ask a Librarian Online Reference Shelf Library Materials for Online Teaching Spaces & Tech Room Reservations Adaptive Technologies Library Media Space Student Technology Resources Center Borrow Equipment About Covid-19 Click & Collect Health and Safety Protocols Hours and Locations Contact Us Employment ohiolink-luminaries Staff Directory Giving Adopt-a-book Funding Donors Strategic Plan Tenets Pillars Ten Initiatives News and Events Policies Acceptable Use Gift Policy Source Library Faculty Resources for Library Faculty Library Faculty Directory Dean's Welcome Core Beliefs Login Off Campus Access Affiliate and Guest Access Help and Troubleshooting Tools VPN Interlibrary Loan Interlibrary Lending Policies My Library Record Pay Fines Fine Appeal Form Articles 
Books Journals Databases Search Summon to find articles, books, and more Advanced Summon Search | Find by DOI or PMID | More Search Options |Help Search the Library Catalog for books and more Advanced Catalog Search | Guest Access |  More Search Options | Help Find E-Journals or Print Journals E-Journals | Print Journals | Browzine | More Search Options | Help Search the A-Z Indexes databases list Browse Databases | Top 10 Databases | Academic Search Complete | More Search Options UC Online Library Whether onsite or online, we continue to connect students, faculty, researchers and scholars to dynamic data, information and resources. UC Online Library Service Updates Off Campus Access Contact Us Interlibrary Loan Research Guides Browse all guides Spring 2021 Return to Campus As we step into Spring Term 2021, our motto “Strength in Unity” continues to take on added meaning. Health and safety remain a top priority in an environment featuring virtual, hybrid, HyFlex and in-person classes, testing as a critical component toward a safer community as well as remote work options. Visit UC's Public Health Site Online Library Searching for a resource, have a question or simply browsing for fun? We've brought all online resources together in one place. Online Library Digital Technologies & Innovation UC Libraries creates and utilizes learning tools and research platforms that transform the user experience and the creation of new knowledge. Special Collections UC Libraries preserves and provides access to special collections and the scholarly and historical record of the university, including archival as well as born-digital content and datasets. View   With 4.3 million volumes and access to thousands of electronic resources available 24/7 through our online library catalog, UC Libraries' virtual and physical locations offer resources for everyone. UC Libraries includes the Walter C. Langsam Library, the Archives and Rare Books Library, the Donald C. Harrison Health Sciences Library, and eight college and departmental libraries serving constituents in applied science, architecture, art, biology, chemistry, classics, design, education, engineering, geology, mathematics, music, physics and planning.   Give to UC Libraries Library News "Off the Shelf and into the Lab" webinar May 6 April 14, 2021 Event: May 6, 2021 7:00 PM Join the Henry R. Winkler Center for the History of the Health Professions and the Cecil Striker Society for the History of Medicine at 7 p.m., Wednesday, May 6, for the third lecture in the Cecil Striker Webinar series. Faculty Awards 2021: Arlene Johnson April 6, 2021 Through her many roles in her 20 years at the University of Cincinnati, Arlene Johnson has served students, faculty and staff in the pursuit of knowledge — fitting for the recipient of the Faculty Senate Exemplary Service to the University Award. ‘CAN UC my mask’ canned food sculpture temporarily installed in... March 23, 2021 The masked Bearcat is showing school pride while reminding everyone to stay safe by wearing a mask. Debug Query for this More News Library Blog News from the Library Blog UCBA Library Needs You!  Now Hiring for Summer Semester  Fri, 23 Apr 2021 UCBA Library Needs You!  Now Hiring for Summer Semester  ARE YOU…  Friendly and welcoming?   Eager to help students, staff and faculty?   If so, consider joining the UCBA Library Team!   
Apply:  https://libraries.uc.edu/libraries/ucba/about/employment.html    April 20 Service Note: Access to library resources is currently down Tue, 20 Apr 2021 UPDATE: All access has been restored. ________________________________ All access to library resources through the proxy server is currently down. OCLC is working on the issue and we expect a resolution shortly. We apologize for the inconvenience. If you know the resource URL you are attempting to access, try this page: https://libapps.libraries.uc.edu/proxy/proxygoto.php. The URL for the […] The Preservation Lab celebrates Preservation Week 2021: Preservation in Action Mon, 19 Apr 2021 Join The Preservation Lab April 26-30 as they celebrate the American Library Association’s (ALA) Preservation Week, “Preservation in Action.” More information, including a schedule of the week’s events, is available on the Preservation’s blog. “Off the Shelf and into the Lab” May 6th webinar to highlight medical history, preservation and the UC Libraries’ Adopt-A-Book program Wed, 14 Apr 2021 Join the Henry R. Winkler Center for the History of the Health Professions and the Cecil Striker Society for the History of Medicine, Thursday, May 6 at 7:00 p.m. for the 3rd lecture in the Cecil Striker Webinar series. Off the Shelf and into the Lab: Medical History, Preservation and the University of Cincinnati Libraries’ […] Ending the HIV Epidemic, a panel discussion April 21 Mon, 12 Apr 2021 Join UC Libraries online Wednesday, April 21, 1:00 p.m. for “Ending the HIV Epidemic,” a panel discussion. Learn from various Cincinnati area HIV/AIDS service providers about how long-standing HIV prevention efforts combined with education on treatment, viral load suppression and concerted efforts by multiple agencies are being utilized to make HIV infection a thing of […] University of Cincinnati Libraries PO Box 210033 Cincinnati, Ohio 45221-0033 Contact Us | Staff Directory UC Tools Canopy & Canvas One Stop Email Catalyst Shuttle Tracker IT Help UC VPN Bearcats Landing About Us Maps & Directions Jobs News Diversity Governance & Policies Directory Events Calendar University of Cincinnati | 2600 Clifton Ave. | Cincinnati, OH 45221 | ph: 513-556-6000 Alerts | Clery and HEOA Notice | Notice of Non-Discrimination | eAccessibility Concern | Privacy Statement | Copyright Information © 2020 University of Cincinnati University of Cincinnati Libraries PO Box 210033 Cincinnati, Ohio 45221-0033 Contact Us | Staff Directory © 2020 University of Cincinnati chat loading... library-brown-edu-7056 ---- Brown University Library Digital Technologies Skip to content Find & Borrow Articles, Journals, & Databases Subject Support Hours & Locations Ask a Question Now Off-Campus Access Library A-Z Brown University Library Digital Technologies Menu and widgets Home BDR Blacklight Digital Preservation Drupal Josiah OCRA ORCID Researchers@Brown Web WordPress Search the DT Blog Search for: Authors Ben Cail (22) Hector Correa (9) Jean Rainwater (9) Kerri Hicks (6) Kevin Powell (6) Ted Lawless (3) Adam Bradley (2) Birkin Diana (1) Bundler 2.1.4 and homeless accounts This week we upgraded a couple of our applications to Ruby 2.7 and Bundler 2.1.4 and one of the changes that we noticed was that Bundler was complaining about not being able to write to the /opt/local directory. Turns out this problem shows up because the account that we use to run our application is a system account that does not have a home folder. 
This is how the problem shows up:
$ su - system_account
$ pwd
/opt/local
$ mkdir test_app
$ cd test_app
$ pwd
/opt/local/test_app
$ gem install bundler -v 2.1.4
$ bundler --version
`/opt/local` is not writable.
Bundler will use `/tmp/bundler20200731-59360-174h3lz59360' as your home directory temporarily.
Bundler version 2.1.4
Notice that Bundler complains about the /opt/local directory not being writable; that's because we don't have a home for this user. In fact, env $HOME outputs /opt/local rather than the typical /home/username. Although Bundler is smart enough to use a temporary folder instead and continue, the net result of this is that if we set a configuration value for Bundler in one execution and try to use that configuration value in the next execution, Bundler won't be able to find the value that we set in the first execution (my guess is because the value was saved in a temporary folder.) Below is an example of this. Notice how we set the path value to vendor/bundle in the first command, but then when we inspect the configuration in the second command the configuration does not report the value that we just set:
# First - set the path value
$ bundle config set path 'vendor/bundle'
`/opt/local` is not writable.
Bundler will use `/tmp/bundler20200731-60203-16okmcg60203' as your home directory temporarily.
# Then - inspect the configuration
$ bundle config
`/opt/local` is not writable.
Bundler will use `/tmp/bundler20200731-60292-1r50oed60292' as your home directory temporarily.
Settings are listed in order of priority. The top value will be used.
Ideally the call to bundle config would report the vendor/bundle path that we set, but it does not in this case. In fact, if we run bundle install next, Bundler will install the gems in $GEM_PATH rather than using the custom vendor/bundle directory that we indicated.
Working around the issue
One way to work around this issue is to tell Bundler that the HOME directory is the one from where we are running bundler (i.e. /opt/local/test_app in our case).
# First - set the path value
# (no warning is reported)
$ HOME=/opt/local/test_app/ bundle config set path 'vendor/bundle'
# Then - inspect the configuration
$ bundle config
`/opt/local` is not writable.
Bundler will use `/tmp/bundler20200731-63230-11dmgcb63230' as your home directory temporarily.
Settings are listed in order of priority. The top value will be used.
path
Set for your local app (/opt/local/test_app/.bundle/config): "vendor/bundle"
Notice that we didn't get a warning in the first command (since we indicated a HOME directory) and then, even though we didn't pass a HOME directory to the second command, our value was picked up and shows the correct value for the path setting (vendor/bundle). So it seems to me that when HOME is set to a non-writable directory (/opt/local in our case) Bundler picks up the values from ./bundle/config if it is available, even as it complains about /opt/local not being writable. If we were to run bundle install now, it would install the gems in our local vendor/bundle directory. This is good for us: Bundler is using the value that we configured for the path setting (even though it still complains that it cannot write to /opt/local.) We could avoid the warning in the second command if we pass the HOME value here too:
$ HOME=/opt/local/test-app/ bundle config
Settings are listed in order of priority. The top value will be used.
path
Set for your local app (/opt/local/test-app/.bundle/config): "vendor/bundle"
But the fact that Bundler picks up the correct values from ./bundle/config when HOME is set to a non-writable directory was important for us, because it meant that when the app runs under Apache/Passenger it will also work. This is more or less how the configuration for our apps in http.conf looks; notice that we are not setting the HOME value:
PassengerBaseURI /test-app
PassengerUser system_account
PassengerRuby /opt/local/rubies/ruby-2.7.1/bin/ruby
PassengerAppRoot /opt/local/test-app
SetEnv GEM_PATH /opt/local/.gem/ruby/2.7.1/
Some final thoughts
Perhaps a better solution would be to set a HOME directory for our system_account, but we have not tried that; we didn't want to make such a wide-reaching change to our environment just to please Bundler. Plus, this might be problematic in our development servers, where we share the same system_account for multiple applications (this is not a problem in our production servers). We have no idea when this change took effect in Bundler. We went from Bundler 1.17.1 (released in October 2018) to Bundler 2.1.4 (released in January 2020) and there were many releases in between. Perhaps this was documented somewhere and we missed it. In our particular situation we noticed this issue because one of our gems needed very specific parameters to be built during bundle install. We set those values via a call to bundle config build.mysql2 --with-mysql-dir=xxx mysql-lib=yyy and those values were lost by the time we ran bundle install and the installation kept failing. Luckily we found a workaround and were able to install the gem with the specific parameters.
Posted on July 31, 2020; updated February 2, 2021. Author hcorrea. Categories: Programming.
Upgrading from Solr 4 to Solr 7
A few weeks ago we upgraded the version of Solr that we use in our Discovery layer; we went from Solr 4.9 to Solr 7.5. Although we have been using Solr 7.x in other areas of the library, this was a significant upgrade for us because searching is the raison d'être of our Discovery layer and we wanted to make sure that the search results did not change in unexpected ways with the new field and server configurations in Solr. All in all the process went smoothly for our users. This blog post elaborates on some of the things that we had to do in order to upgrade.
Managed Schema
This is the first Solr instance that we set up to use the managed-schema feature in Solr. This allows us to define field types and fields via the Schema API rather than by editing XML files. All in all this was a good decision and it allows us to recreate our Solr instances by running a shell script rather than by copying XML files. This feature was very handy during testing when we needed to recreate our Solr core multiple times. You can see the script that we use to recreate our Solr core on GitHub. We are still tweaking how we manage updates to our schema. For now we are using a low-tech approach in which we create small scripts to add fields to the schema, conceptually similar to what Rails does with database migrations, but our approach is still very manual.
Default Field Definitions
The default field definitions in Solr 7 are different from the default field definitions in Solr 4. This is not surprising given that we skipped two major versions of Solr, but it was one of the hardest things to reconcile.
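Returning to the managed-schema workflow above: a minimal sketch of the kind of migration-like script that adds a field through the Schema API might look like the following. The core URL, field name, attributes, and use of Python's requests library are assumptions for illustration, not the library's actual scripts.
import requests

SOLR_CORE_URL = "http://localhost:8983/solr/mycore"  # assumed core URL

def add_field(name, field_type, multi_valued=False):
    """Add a field to the managed schema via Solr's Schema API."""
    command = {
        "add-field": {
            "name": name,
            "type": field_type,
            "stored": True,
            "indexed": True,
            "multiValued": multi_valued,
        }
    }
    # POSTing JSON to <core>/schema applies the change to the managed schema.
    response = requests.post(f"{SOLR_CORE_URL}/schema", json=command, timeout=30)
    response.raise_for_status()
    return response.json()

# Example "migration": add a multi-valued title search field.
print(add_field("title_search", "text_general", multi_valued=True))
Keeping each schema change in a small, re-runnable script like this is what makes it possible to rebuild the core from scratch with a shell script, as described above.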
Default Field Definitions
The default field definitions in Solr 7 are different from the default field definitions in Solr 4. This is not surprising given that we skipped two major versions of Solr, but it was one of the hardest things to reconcile. Our Solr 4 was set up and configured many years ago and the upgrade forced us to look very closely into exactly what kind of transformations we were doing to our data and decide what should be modified in Solr 7 to support the Solr 4 behavior versus what should be updated to use new Solr 7 features.

Our first approach was to manually inspect the "schema.xml" in Solr 4 and compare it with the "managed-schema" file in Solr 7, which is also an XML file. We soon found that this was too cumbersome and error prone. But we found the output of the LukeRequestHandler to be much more concise and easier to compare between the versions of Solr, and lucky for us, the output of the LukeRequestHandler is identical in both versions of Solr! Using the LukeRequestHandler we dumped our Solr schema to XML files and compared those files with a traditional file compare tool; we used the built-in file compare option in VS Code but any file compare tool would do. These are the commands that we used to dump the schema to XML files:

curl http://solr-4-url/admin/luke?numTerms=0 > luke4.xml
curl http://solr-7-url/admin/luke?numTerms=0 > luke7.xml

The output of the LukeRequestHandler includes both the type of each field (e.g. string) and its schema definition (single value vs multi-value, indexed, tokenized, et cetera), for example: string --SD------------l

Another benefit of using the LukeRequestHandler instead of going by the fields defined in schema.xml is that the LukeRequestHandler only outputs fields that are indeed used in the Solr core, whereas schema.xml lists fields that were used at one point even if we don't use them anymore.
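If you would rather script that comparison than eyeball a diff, a small sketch along these lines can report fields that exist in only one core or whose type changed. It is only an illustration: it assumes the JSON form of the Luke output (wt=json) and reuses the placeholder URLs from the curl commands above.

import json
import urllib.request

def luke_fields(solr_url):
    # Map each field name to its type as reported by the LukeRequestHandler.
    url = solr_url + "/admin/luke?numTerms=0&wt=json"
    with urllib.request.urlopen(url) as response:
        data = json.loads(response.read().decode("utf8"))
    return {name: info.get("type") for name, info in data["fields"].items()}

solr4 = luke_fields("http://solr-4-url")   # placeholder URLs, as above
solr7 = luke_fields("http://solr-7-url")

print("only in Solr 4:", sorted(set(solr4) - set(solr7)))
print("only in Solr 7:", sorted(set(solr7) - set(solr4)))
print("type changed:  ", sorted(f for f in set(solr4) & set(solr7) if solr4[f] != solr7[f]))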
ICUFoldingFilter
In Solr 4 a few of the default field types used the ICUFoldingFilter, which handles diacritics so that a word like "México" is equivalent to "Mexico". This filter used to be available by default in a Solr 4 installation but that is not the case anymore. In Solr 7 ICUFoldingFilter is not enabled by default and you must edit your solrconfig.xml as indicated in the documentation to enable it (see previous link), and then you can use it in a field type by adding it as a filter:

curl -X POST -H 'Content-type:application/json' --data-binary '{
  "add-field-type" : {
    "name":"text_search",
    "class":"solr.TextField",
    "analyzer" : {
      "tokenizer":{"class":"solr.StandardTokenizerFactory"},
      "filters":[
        {"class":"solr.ICUFoldingFilterFactory"},
        ...
      ]
    }
  }
}' $SOLR_CORE_URL/schema

Handle Select
HandleSelect is a parameter that is defined in the solrconfig.xml; in previous versions of Solr it used to default to true but starting in Solr 7 it defaults to false. The version of Blacklight that we are using (5.19) expects this value to be true. This parameter is what allows Blacklight to use a request handler like "search" (without a leading slash) instead of "/search". Enabling handleSelect is easy: just edit the requestDispatcher setting in the solrconfig.xml.

LocalParams and Dereferencing
Our current version of Blacklight uses LocalParams and Dereferencing heavily and support for these two features changed drastically in Solr 7.2. This is a good enhancement in Solr but it caught us by surprise. The gist of the problem is that if the solrconfig.xml sets the query parser to DisMax or eDisMax then Solr will not recognize a query like this:

{!qf=$title_qf}

We tried several workarounds and settled on setting the default parser (defType) in solrconfig.xml to Lucene and requesting eDisMax explicitly from the client application:

{!type=dismax qf=$title_qff}Coffee&df=id

It's worth noting that passing defType as a normal query string parameter to change the parser did not work for us for queries using LocalParams and Dereferencing.

Stop words
One of the settings that we changed in our new field definitions was the use of stop words. We are now not using stop words when indexing title fields. This was one of the benefits of doing a full review of each one of our field types and tweaking them during the upgrade. The result is that now searches for titles that are only stop words (like "There there") return the expected results.

Validating Results
To validate that our new field definitions and server-side configuration in Solr 7 were compatible with what we had in Solr 4 we did several kinds of tests, some of them manual and others automated. We have a small suite of unit tests that Jeanette Norris and Ted Lawless created years ago and that we still use to validate some well-known scenarios that we want to support. You can see those "relevancy" tests in our GitHub repository. We also captured thousands of live searches from our Discovery layer using Solr 4 and replayed them with Solr 7 to make sure that the results of both systems were compatible. To determine that results were compatible we counted how many of the top 10 results, top 5, and top 1 were included in the results of both Solr instances. The following picture shows an example of what the results look like. The code that we used to run the searches on both Solr instances and generate the table is on our GitHub repo.
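The comparison itself boils down to a small calculation like the following sketch (the result ids are made up; the real script also runs the searches against both Solr instances):

def overlap(ids_a, ids_b, k):
    # How many of the top-k ids from one system also show up in the top-k of the other.
    top_b = set(ids_b[:k])
    return sum(1 for doc_id in ids_a[:k] if doc_id in top_b)

solr4_ids = ["b100", "b200", "b300", "b400", "b500"]  # hypothetical result ids
solr7_ids = ["b200", "b100", "b900", "b300", "b600"]

for k in (1, 5, 10):
    print("top %d overlap: %d" % (k, overlap(solr4_ids, solr7_ids, k)))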
CJK Searches
The main reason for us to upgrade from Solr 4 to Solr 7 was to add support for Chinese, Japanese, and Korean (CJK) searches. The way our Solr 4 index was created, we did not support searches in these languages. In our Solr 7 core we are using the built-in CJK field definitions and our results are much better. This will be the subject of a future blog post. Stay tuned.

Posted on January 30, 2020 / February 2, 2021 | Author: hcorrea | Categories: Solr | Tags: Blacklight, Josiah, Solr

PyPI packages
Recently, we published two Python packages to PyPI: bdrxml and bdrcmodels. No one else is using those packages, as far as I know, and it takes some effort to put them up there, but there are benefits from publishing them. Putting a package on PyPI makes it easier for other code we package up to depend on bdrxml. For our indexing package, we can switch from this:

'bdrxml @ https://github.com/Brown-University-Library/bdrxml/archive/v1.0a1.zip#sha1=5802ed82ee80a9627657cbb222fe9c056f73ad2c',

to this:

'bdrxml>=1.0',

in setup.py, which is simpler. This also lets us use Python's package version checking to not pin bdrxml to just one version, which is helpful when we embed the indexing package in another project that may use a different version of bdrxml. Publishing these first two packages also gave us experience, which will help if we publish more packages to PyPI.

Posted on June 12, 2019 | Author: Ben Cail | Categories: Uncategorized

New RIAMCO website
A few days ago we released a new version of the Rhode Island Archival and Manuscript Collections Online (RIAMCO) website. The new version is a brand new codebase. This post describes a few of the new features that we implemented as part of the rewrite and how we designed the system to support them. The RIAMCO website hosts information about archival and manuscript collections in Rhode Island. These collections (also known as finding aids) are stored as XML files using the Encoded Archival Description (EAD) standard and indexed into Solr to allow for full text searching and filtering.

Look and feel
The overall look and feel of the RIAMCO site is heavily influenced by the work that the folks at the NYU Libraries did on their site. Like NYU's site and Brown's Discovery tool, the RIAMCO site uses the typical facets-on-the-left, content-on-the-right style that is common in many library and archive websites. Below is a screenshot of how the main search page looks:

Architecture
Our previous site was put together over many years and it involved many separate applications written in different languages: the frontend was written in PHP, the indexer in Java, and the admin tool in Python/Django. During this rewrite we bundled the code for the frontend and the indexer into a single application written in Ruby on Rails. [As of September 13th, 2019 the Rails application also provides the admin interface.] You can view a diagram of this architecture and a few more notes about it in this document.

Indexing
Like the previous version of the site, we are using Solr to power the search feature of the site. However, in the previous version each collection was indexed as a single Solr document, whereas in the new version we are splitting each collection into many Solr documents: one document to store the main collection information (scope, biographical info, call number, et cetera), plus one document for each item in the inventory of the collection. This new indexing strategy significantly increased the number of Solr documents that we store. We went from 1100+ Solr documents (one for each collection) to 300,000+ Solr documents (one for each item in the inventory of those collections). The advantage of this approach is that now we can search and find items at a much more granular level than we did before. For example, we can tell a user that we found a match on "Box HE-4 Folder 354" of the Harris Ephemera collection for their search on "blue moon" rather than just telling them that there is a match somewhere in the 25 boxes (3,000 folders) in the "Harris Ephemera" collection.

In order to keep the relationship between all the Solr documents for a given collection we are using an extra ead_id_s field to store the id of the collection that each document belongs to. If we have a collection "A" with three items in the inventory they will have the following information in Solr:

{id: "A", ead_id_s: "A"}     // the main collection record
{id: "A-1", ead_id_s: "A"}   // item 1 in the inventory
{id: "A-2", ead_id_s: "A"}   // item 2 in the inventory
{id: "A-3", ead_id_s: "A"}   // item 3 in the inventory

This structure allows us to use the Result Grouping feature in Solr to group results from a search into the appropriate collection. With this structure in place we can then show the results grouped by collection as you can see in the previous screenshot. The code to index our EAD files into Solr is on the Ead class. We had to add some extra logic to handle cases when a match is found only on a Solr document for an inventory item (but not on the main collection) so that we can also display the main collection information alongside the inventory information in the search results.
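For reference, a grouped query against this structure might look like the sketch below; the search term and core name are hypothetical, and the parameters are the standard Solr Result Grouping ones:

import json
import urllib.parse
import urllib.request

params = urllib.parse.urlencode({
    "q": "blue moon",           # hypothetical search
    "group": "true",            # turn on Result Grouping
    "group.field": "ead_id_s",  # one group per collection
    "group.limit": 3,           # show a few inventory matches per collection
    "wt": "json",
})
url = "http://localhost:8983/solr/riamco/select?" + params  # hypothetical core name
with urllib.request.urlopen(url) as response:
    grouped = json.loads(response.read().decode("utf8"))["grouped"]["ead_id_s"]
for group in grouped["groups"]:
    print(group["groupValue"], group["doclist"]["numFound"])

The application still needs the extra logic described above to pull in the parent collection record when only an inventory item matches a search.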
The code for this is on the search_grouped() function of the Search class. Hit highlighting Another feature that we implemented on the new site is hit highlighting. Although this is a feature that Solr supports out of the box there is some extra coding that we had to do to structure the information in a way that makes sense to our users. In particular things get tricky when the hit was found in a multi value field or when Solr only returns a snippet of the original value in the highlights results. The logic that we wrote to handle this is on the SearchItem class. Advanced Search We also did an overhaul to the Advanced Search feature. The layout of the page is very typical (it follows the style used in most Blacklight applications) but the code behind it allows us to implement several new features. For example, we allow the user to select any value from the facets (not only one of the first 10 values for that facet) and to select more than one value from those facets. We also added a “Check” button to show the user what kind of Boolean expression would be generated for the query that they have entered. Below is a screenshot of the results of the check syntax for a sample query. There are several tweaks and optimizations that we would like to do on this page, for example, opening the facet by Format is quite slow and it could be optimized. Also, the code to parse the expression could be written to use a more standard Tokenizer/Parser structure. We’ll get to that later on… hopefully : ) Individual finding aids Like on the previous version of the site, the rendering of individual finding aids is done by applying XSLT transformations to the XML with the finding aid data. We made a few tweaks to the XSLT to integrate them on the new site but the vast majority of the transformations came as-is from the previous site. You can see the XSLT files in our GitHub repo. It’s interesting that GitHub reports that half of the code for the new site is XSLT: 49% XSLT, 24% HTML, and 24% Ruby. Keep in mind that these numbers do not take into account the Ruby on Rails code (which is massive.) Source code The source code for the new application is available in GitHub. Acknowledgements Although I wrote the code for the new site, there were plenty of people that helped me along the way in this implementation, in particular Karen Eberhart and Joe Mancino. Karen provided the specs for the new site, answered my many questions about the structure of EAD files, and suggested many improvements and tweaks to make the site better. Joe helped me find the code for the original site and indexer, and setup the environment for the new one. Posted on June 5, 2019February 2, 2021Author hcorreaCategories Programming, RIAMCO, SolrTags Solr Deploying with shiv I recently watched a talk called “Containerless Django – Deploying without Docker”, by Peter Baumgartner. Peter lists some benefits of Docker: that it gives you a pipeline for getting code tested and deployed, the container adds some security to the app, state can be isolated in the container, and it lets you run the exact same code in development and production. Peter also lists some drawbacks to Docker: it’s a lot of code that could slow things down or have bugs, docker artifacts can be relatively large, and it adds extra abstractions to the system (eg. filesystem, network). He argues that an ideal deployment would include downloading a binary, creating a configuration file, and running it (like one can do with compiled C or Go programs). 
Peter describes a process of deploying Django apps by creating a zipapp using shiv and goodconf, and deploying it with systemd constraints that add to the security. He argues that this process achieves most of the benefits of Docker, but more simply, and that there's a sweet spot for application size where this type of deploy is a good solution.

I decided to try using shiv with our image server Loris. I ran the shiv command "shiv -o loris.pyz .", and I got the following error:

User "loris" and or group "loris" do(es) not exist. Please create this user, e.g.: `useradd -d /var/www/loris -s /sbin/false loris`

The issue is that in the Loris setup.py file, the install process not only checks for the loris user as shown in the error, but it also sets up directories on the filesystem (including setting the owner and permissions, which requires root permissions). I submitted a PR to remove the filesystem setup from the Python package installation (and put it in a script the user can run), and hopefully in the future it will be easier to package up Loris and deploy it in different ways.

Posted on May 17, 2019 | Author: Ben Cail | Categories: BDR, Programming

Checksums
In the BDR, we calculate checksums automatically on ingest (Fedora 3 provides that functionality for us), so all new content binaries going into the BDR get a checksum, which we can go back and check later as needed. We can also pass checksums into the BDR API, and then we verify that Fedora calculates the same checksum for the ingested file, which shows that the content wasn't modified since the first checksum was calculated. We have only been able to use MD5 checksums, but we want to be able to use more checksum types. This isn't a problem for Fedora, which can calculate multiple checksum types, such as MD5, SHA1, SHA256, and SHA512. However, there is a complicating factor – if Fedora gets a checksum mismatch, by default it returns a 500 response code with no message, so we can't tell whether it was a checksum mismatch or some other server error. Thanks to Ben Armintor, though, we found that we can update our Fedora configuration so it returns the Checksum Mismatch information.

Another issue in this process is that we use eulfedora (which doesn't seem to be maintained anymore). If a checksum mismatch happens, it raises a DigitalObjectSaveFailure, but we want to know that there was a checksum mismatch. We forked eulfedora and exposed the checksum mismatch information. Now we can remove some extra code that we had in our APIs, since more functionality is handled in Fedora/eulfedora, and we can use multiple checksum types.
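As a simple illustration of the client side of this, computing several checksum types for a file before handing it to an API could look like the sketch below (the file name is made up; this is not the BDR code itself):

import hashlib

def file_checksums(path, algorithms=("md5", "sha1", "sha256", "sha512")):
    # Read the file once and update every hash object as we go.
    hashes = {name: hashlib.new(name) for name in algorithms}
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(8192), b""):
            for h in hashes.values():
                h.update(chunk)
    return {name: h.hexdigest() for name, h in hashes.items()}

print(file_checksums("example.tif"))  # hypothetical file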
Posted on March 29, 2019 | Author: Ben Cail | Categories: BDR

Exporting Django data
We recently had a couple cases where we wanted to dump the data out of a Django database. In the first case ("tracker"), we were shutting down a legacy application, but needed to preserve the data in a different form for users. In the second case ("deposits"), we were backing up some obsolete data before removing it from the database. We handled the processes in two different ways.

Tracker
For the tracker, we used an export script to extract the data. Here's a modified version of the script:

import datetime
import os
# `models` is the Django app's models module, imported elsewhere in the script.

def export_data():
    now = datetime.datetime.now()
    dir_name = 'data_%s_%s_%s' % (now.year, now.month, now.day)
    d = os.mkdir(dir_name)
    file_name = os.path.join(dir_name, 'tracker_items.dat')
    with open(file_name, 'wb') as f:
        f.write(u'\u241f'.join([
            'project name',
            'container identifier',
            'container name',
            'identifier',
            'name',
            'dimensions',
            'note',
            'create digital surrogate',
            'qc digital surrogate',
            'create metadata record',
            'qc metadata record',
            'create submission package']).encode('utf8'))
        f.write('\u241e'.encode('utf8'))
        for project in models.Project.objects.all():
            for container in project.container_set.all():
                print(container)
                for item in container.item_set.all():
                    data = u'\u241f'.join([
                        project.name.strip(),
                        container.identifier.strip(),
                        container.name.strip(),
                        item.identifier.strip(),
                        item.name.strip(),
                        item.dimensions.strip(),
                        item.note.strip()
                    ])
                    item_actions = u'\u241f'.join([str(item_action) for item_action in item.itemaction_set.all().order_by('id')])
                    line_data = u'%s\u241f%s\u241e' % (data, item_actions)
                    f.write(line_data.encode('utf8'))

As you can see, we looped through different Django models and pulled out fields, writing everything to a file. We used the Unicode Record and Unit Separators as delimiters. One advantage of using those is that your data can have commas, tabs, newlines, … and it won't matter. You still don't have to quote or escape anything. Then we converted the data to a spreadsheet that users can view and search:

import openpyxl

workbook = openpyxl.Workbook()
worksheet = workbook.active
with open('tracker_items.dat', 'rb') as f:
    data = f.read()
lines = data.decode('utf8').split('\u241e')
print(len(lines))
print(lines[0])
print(lines[-1])
for line in lines:
    fields = line.split('\u241f')
    worksheet.append(fields)
workbook.save('tracker_items.xlsx')

Deposits
For the deposits project, we just used the built-in Django dumpdata command:

python manage.py dumpdata -o data_20180727.dat

That output file could be used to load data back into a database if needed.

Posted on March 25, 2019 | Author: Ben Cail | Categories: Uncategorized

Searching for hierarchical data in Solr
Recently I had to index a dataset into Solr in which the original items had a hierarchical relationship among them. In processing this data I took some time to look into the ancestor_path and descendent_path features that Solr provides out of the box and see if and how they could help to issue searches based on the hierarchy of the data. This post elaborates on what I learned in the process.

Let's start with some sample hierarchical data to illustrate the kind of relationship that I am describing in this post. Below is a short list of databases and programming languages organized by type.

Databases
├─ Relational
│  ├─ MySQL
│  └─ PostgreSQL
└─ Document
   ├─ Solr
   └─ MongoDB

Programming Languages
└─ Object Oriented
   ├─ Ruby
   └─ Python

For the purposes of this post I am going to index each individual item shown in the hierarchy, not just the children items. In other words I am going to create 11 Solr documents: one for "Databases", another for "Relational", another for "MySQL", and so on. Each document is saved with an id, a title, and a path.
For example, the document for “Databases” is saved as: { "id": "001", "title_s": "Databases", "x_ancestor_path": "db", "x_descendent_path": "db" } and the one for “MySQL” is saved as: { "id": "003", "title_s": "MySQL", "x_ancestor_path": "db/rel/mysql", "x_descendent_path": "db/rel/mysql" } The x_ancestor_path and x_descendent_path fields in the JSON data represent the path for each of these documents in the hierarcy. For example, the top level “Databases” document uses the path “db” where the lowest level document “MySQL” uses “db/rel/mysql”. I am storing the exact same value on both fields so that later on we can see how each of them provides different features and addresses different use cases. ancestor_path and descendent_path The ancestor_path and descendent_path field types come predefined in Solr. Below is the definition of the descendent_path in a standard Solr 7 core: $ curl http://localhost:8983/solr/your-core/schema/fieldtypes/descendent_path { ... "indexAnalyzer":{ "tokenizer":{ "class":"solr.PathHierarchyTokenizerFactory", "delimiter":"/"}}, "queryAnalyzer":{ "tokenizer":{ "class":"solr.KeywordTokenizerFactory"}}}} Notice how it uses the PathHierarchyTokenizerFactory tokenizer when indexing values of this type and that it sets the delimiter property to /. This means that when values are indexed they will be split into individual tokens by this delimiter. For example the value “db/rel/mysql” will be split into “db”, “db/rel”, and “db/rel/mysql”. You can validate this in the Analysis Screen in the Solr Admin tool. The ancestor_path field is the exact opposite, it uses the PathHierarchyTokenizerFactory at query time and the KeywordTokenizerFactory at index time. There are also two dynamic field definitions *_descendent_path and *_ancestor_path that automatically create fields with these types. Hence the wonky x_descendent_path and x_ancestor_path field names that I am using in this demo. Finding descendants The descendent_path field definition in Solr can be used to find all the descendant documents in the hierarchy for a given path. For example, if I query for all documents where the descendant path is “db” (q=x_descendent_path:db) I should get all document in the “Databases” hierarchy, but not the ones under “Programming Languages”. For example: $ curl "http://localhost:8983/solr/your-core/select?q=x_descendent_path:db&fl=id,title_s,x_descendent_path" { ... "response":{"numFound":7,"start":0,"docs":[ { "id":"001", "title_s":"Databases", "x_descendent_path":"db"}, { "id":"002", "title_s":"Relational", "x_descendent_path":"db/rel"}, { "id":"003", "title_s":"MySQL", "x_descendent_path":"db/rel/mysql"}, { "id":"004", "title_s":"PostgreSQL", "x_descendent_path":"db/rel/pg"}, { "id":"005", "title_s":"Document", "x_descendent_path":"db/doc"}, { "id":"006", "title_s":"MongoDB", "x_descendent_path":"db/doc/mongo"}, { "id":"007", "title_s":"Solr", "x_descendent_path":"db/doc/solr"}] }} Finding ancestors The ancestor_path not surprisingly can be used to achieve the reverse. Given the path of a given document we can query Solr to find all its ancestors in the hierarchy. For example if I query Solr for the documents where x_ancestor_path is “db/doc/solr” (q=x_ancestor_path:db/doc/solr) I should get “Databases”, “Document”, and “Solr” as shown below: $ curl "http://localhost:8983/solr/your-core/select?q=x_ancestor_path:db/doc/solr&fl=id,title_s,x_ancestor_path" { ... 
"response":{"numFound":3,"start":0,"docs":[ { "id":"001", "title_s":"Databases", "x_ancestor_path":"db"}, { "id":"005", "title_s":"Document", "x_ancestor_path":"db/doc"}, { "id":"007", "title_s":"Solr", "x_ancestor_path":"db/doc/solr"}] }} If you are curious how this works internally, you could issue a query with debugQuery=true and look at how the query value “db/doc/solr” was parsed. Notice how Solr splits the query value by the / delimiter and uses something called SynonymQuery() to handle the individual values as synonyms: $ curl "http://localhost:8983/solr/your-core/select?q=x_ancestor_path:db/doc/solr&debugQuery=true" { ... "debug":{ "rawquerystring":"x_ancestor_path:db/doc/solr", "parsedquery":"SynonymQuery(Synonym(x_ancestor_path:db x_ancestor_path:db/doc x_ancestor_path:db/doc/solr))", ... } One little gotcha Given that Solr is splitting the path values by the / delimiter and that we can see those values in the Analysis Screen (or when passing debugQuery=true) we might expect to be able to fetch those values from the document somehow. But that is not the case. The individual tokens are not stored in a way that you can fetch them, i.e. there is no way for us to fetch the individual “db”, “db/doc”, and “db/doc/solr” values when fetching document id “007”. In hindsight this is standard Solr behavior but something that threw me off initially. Posted on January 10, 2019February 2, 2021Author hcorreaCategories Programming, Solr Monitoring Passenger’s Requests in Queue over time As I mentioned in a previous post we use Phusion Passenger as the application server to host our Ruby applications. A while ago upon the recommendation of my coworker Ben Cail I created a cron job that calls passenger-status every 5 minutes to log the status of Passenger in our servers.  Below is a sample of the passenger-status output: Version : 5.1.12 Date : Mon Jul 30 10:42:54 -0400 2018 Instance: 8x6dq9uX (Apache/2.2.15 (Unix) DAV/2 Phusion_Passenger/5.1.12) ----------- General information ----------- Max pool size : 6 App groups : 1 Processes : 6 Requests in top-level queue : 0 ----------- Application groups ----------- /path/to/our/app: App root: /path/to/our/app Requests in queue: 3 * PID: 43810 Sessions: 1 Processed: 20472 Uptime: 1d 7h 31m 25s CPU: 0% Memory : 249M Last used: 1s ag * PID: 2628 Sessions: 1 Processed: 1059 Uptime: 4h 34m 39s CPU: 0% Memory : 138M Last used: 1s ago * PID: 2838 Sessions: 1 Processed: 634 Uptime: 4h 30m 47s CPU: 0% Memory : 134M Last used: 1s ago * PID: 16836 Sessions: 1 Processed: 262 Uptime: 2h 14m 46s CPU: 0% Memory : 160M Last used: 1s ago * PID: 27431 Sessions: 1 Processed: 49 Uptime: 25m 27s CPU: 0% Memory : 119M Last used: 0s ago * PID: 27476 Sessions: 1 Processed: 37 Uptime: 25m 0s CPU: 0% Memory : 117M Last used: 0s ago Our cron job to log this information over time is something like this: /path/to/.gem/gems/passenger-5.1.12/bin/passenger-status >> ./logs/passenger_status.log Last week we had some issues in which our production server was experiencing short outages. Upon review we noticed that we were having a unusual amount of traffic coming to our server (most of it from crawlers submitting bad requests.) One of the tools that we used to validate the status of our server was the passenger_status.log file created via the aforementioned cron job. The key piece of information that we use is the “Requests in queue” value highlighted above. We parsed this value of out the passenger_status.log file to see how it changed in the last 30 days. 
The result showed that although we have had a couple of outages recently the number of “requests in queue” dramatically increased about two weeks ago and it had stayed high ever since. The graph below shows what we found. Notice how after August 19th the value of “requests in queue” has been constantly high, whereas before August 19th it was almost always zero or below 10. We looked closely to our Apache and Rails logs and determined the traffic that was causing the problem. We took a few steps to handle it and now our servers are behaving as normal again. Notice how we are back to zero requests in queue on August 31st in the graph above. The Ruby code that we use to parse our passenger_status.log file is pretty simple, it just grabs the line with the date and the line with the number of requests in queue, parses their values, and outputs the result to a tab delimited file that then we can use to create a graph in Excel or RAWGraphs. Below is the Ruby code: require "date" log_file = "passenger_status.log" excel_date = true def date_from_line(line, excel_date) index = line.index(":") return nil if index == nil date_as_text = line[index+2..-1].strip # Thu Aug 30 14:00:01 -0400 2018 datetime = DateTime.parse(date_as_text).to_s # 2018-08-30T14:00:01-04:00 if excel_date return datetime[0..9] + " " + datetime[11..15] # 2018-08-30 14:00 end datetime end def count_from_line(line) return line.gsub("Requests in queue:", "").to_i end puts "timestamp\trequest_in_queue" date = "N/A" File.readlines(log_file).each do |line| if line.start_with?("Date ") date = date_from_line(line, excel_date) elsif line.include?("Requests in queue:") request_count = count_from_line(line) puts "\"#{date}\"\t#{request_count}" end end In this particular case the number of requests in queue was caused by bad/unwanted traffic. If the increase in traffic had been legitimate we would have taken a different route, like adding more processes to our Passenger instance to handle the traffic. Posted on September 4, 2018February 2, 2021Author hcorreaCategories Blacklight, Josiah, Programming, Web Looking at the Oxford Common Filesystem Layout (OCFL) Currently, the BDR contains about 34TB of content. The storage layer is Fedora 3, and the data is stored internally by Fedora (instead of being stored externally). However, Fedora 3 is end-of-life. This means that we either maintain it ourselves, or migrate to something else. However, we don’t want to migrate 34TB, and then have to migrate it again if we change software again. We’d like to be able to change our software, without migrating all our data. This is where the Oxford Common Filesystem Layout (OCFL) work is interesting. OCFL is an effort to define how repository objects should be laid out on the filesystem. OCFL is still very much a work-in-progress, but the “Need” section of the specification speaks directly to what I described above. If we set up our data using OCFL, hopefully we can upgrade and change our software as necessary without having to move all the data around. Another benefit of the OCFL effort is that it’s work being done by people from multiple institutions, building on other work and experience in this area, to define a good, well-thought-out layout for repository objects. Finally, using a common specification for the filesystem layout of our repository means that there’s a better chance that other software will understand how to interact with our files on disk. 
The more people using the same filesystem layout, the more potential collaborators and applications for implementing the OCFL specification – safely creating, updating, and serving out content for the repository.
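To give a sense of what that layout looks like, an OCFL object is essentially a directory of immutable version folders plus an inventory that records digests and logical paths for every file. A simplified sketch of a single object (illustrative only, based on the OCFL draft rather than on our own storage) is:

object_root/
├─ 0=ocfl_object_1.0          (Namaste file declaring the object and spec version)
├─ inventory.json             (ids, version history, digests, logical file paths)
├─ inventory.json.sha512
├─ v1/
│  ├─ inventory.json
│  ├─ inventory.json.sha512
│  └─ content/
│     └─ image.tif
└─ v2/
   └─ content/
      └─ metadata.xml

New versions only add content under a new vN/content directory, so earlier versions are never modified in place.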
Posted on July 24, 2018 | Author: Ben Cail | Categories: BDR

library-brown-edu-879 ---- Brown University Library Digital Technologies

libraryservices-jiscinvolve-org-1117 ---- What is "Plan M"? – Library services
Providing libraries with shared services to save time and money

What is "Plan M"?
By Neil Grindley, 17 December 2019

Plan M is a very wide-ranging discussion that has been going on throughout the second half of 2019 involving many different stakeholders across the library community. The 'M' stands for 'metadata'. In a nutshell, the way that metadata for academic and specialist libraries is created, sold, licensed, shared and re-used in the UK needs a re-think. Plan M is an initiative that is being facilitated by Jisc but is really a conversation between libraries, suppliers and intermediary organisations to streamline the metadata marketplace in the UK so that it is more coherent, transparent, robust and sustainable. The catalyst for this conversation has been a focus on aggregating and sharing library data at a new ambitious scale via the National Bibliographic Knowledgebase (NBK) and through Jisc Library Hub services.

Three new resources are available:
A concise description of Plan M objectives and next steps as of December 2019: a slide deck (7 slides) providing a Plan M summary
A fuller description of Plan M providing more context and definition: a word document (4 pages) – Plan M – Definition and Direction
A synthesis of discussions relating to Plan M during the period May – October 2019: a word document (7 pages) – Plan M – Review of Stakeholder Input Final

We will be in touch with all stakeholders in the New Year to take forward this plan and look forward to working with everybody.
Needless to say, if anyone has comments or queries about Plan M then drop us a line at nbk@jisc.ac.uk. Best wishes and a merry Xmas, The NBK Team.

Tags: NBK | By Neil Grindley, Director of Content & Discovery Services at Jisc, with oversight of Jisc's library and archival discovery services and content solutions for HE and FE.
librecatproject-wordpress-com-7211 ---- Catmandu

May 22, 2019 – Catmandu 1.20
On May 21st, 2019, Nicolas Steenlant (our main developer and guru of Catmandu) released version 1.20 of our Catmandu toolkit with some very interesting new features. The main addition is a brand new way to implement Catmandu Fix-es using the new Catmandu::Path implementation. This coding by Nicolas will make it much easier and more straightforward to implement any kind of fixes in Perl.

In the previous versions of Catmandu there were only two options to create new fixes:
Create a Perl package in the Catmandu::Fix namespace which implements a fix method. This was very easy: update the $data hash you got as first argument, return the updated $data and you were done. The disadvantage was that accessing fields in a deeply nested record was tricky and slow to code.
Create a Perl package in the Catmandu::Fix namespace which implemented emit functions. These were functions that generate Perl code on the fly. Using emit functions it was easier to get fast access to deeply nested data. But creating these Fix packages was pretty complex.

In Catmandu 1.20 there is now support for a third and easy way to create new Fixes using the Catmandu::Fix::Builder and Catmandu::Fix::Path class. Let me give a simple example of a skeleton Fix that does nothing:

package Catmandu::Fix::rot13;

use Catmandu::Sane;
use Moo;
use Catmandu::Util::Path qw(as_path);
use Catmandu::Fix::Has;

with 'Catmandu::Fix::Builder';

has path => (fix_arg => 1);

sub _build_fixer {
    my ($self) = @_;
    sub {
        my $data = $_[0];
        # ..do some magic here ...
        $data;
    }
}

1;

In the code above we start implementing a rot13(path) Fix that should read a string on a JSON path and encrypt it using the ROT13 algorithm. This Fix is only the skeleton which doesn't do anything. What we have is:
We import the as_path method to be able to easily access data on JSON paths.
We import Catmandu::Fix::Has to be able to use has path constructs to read in arguments for our Fix.
We import Catmandu::Fix::Builder to use the new Catmandu 1.20 builder class, which provides a _build_fixer method.

The builder is nothing more than a closure that reads the data, does some action on the data and returns the data. We can use this skeleton builder to implement our ROT13 algorithm. Add these lines instead of the # do some magic part:

# On the path update the string value...
as_path($self->path)->updater(
    if_string => sub {
        my $value = shift;
        $value =~ tr{N-ZA-Mn-za-m}{A-Za-z};
        $value;
    },
)->($data);

The as_path method receives a JSON path string and creates an object which you can use to manipulate data on that path. One can update the values found with the updater method, or read data at that path with the getter method, or create a new path with the creator method. In our example, we update the string found at the JSON path using the if_string condition.
The updaterhas many conditions: if_string needs a closure what should happen when a string is found on the JSON path. if_array_ref needs a closure what should happen when an array is found on the JSON path. if_hash_refneeds a closure what should happen when a hash is found on the JSON path. In our case we are only interested in transforming strings using our rot13(path) fix. The ROT13 algorithm is very easy and only switched the order of some characters. When we execute this fix on some sample data we get this result: $ catmandu -I lib convert Null to YAML --fix 'add_field(demo,hello);rot13v2(demo)' --- demo: uryyb ... In this case the Fix can be written much shorter when we know that every Catmandu::Path method return a closure (hint: look at the ->($data) in the code. The complete Fix can look like: package Catmandu::Fix::rot13; use Catmandu::Sane; use Moo; use Catmandu::Util::Path qw(as_path); use Catmandu::Fix::Has; with 'Catmandu::Fix::Builder'; has path => (fix_arg => 1); sub _build_fixer { my ($self) = @_; # On the path update the string value... as_path($self->path)->updater( if_string => sub { my $value = shift; $value =~ tr{N-ZA-Mn-za-m}{A-Za-z}; $value; }, ); } 1; This is as easy as it can get to manipulate deeply nested data with your own Perl tools. All the code is in Perl, there is no limit on the number of external CPAN packages one can include in these Builder fixes. We can’t wait what Catmandu extensions you will create. Written by hochstenbach Leave a comment Posted in Advanced, Updates Tagged with catmandu, fix language, perl April 8, 2019 LPW 2018: “Contrarian Perl” – Tom Hukins At 09:10, Tom Hukins shares his enthusiasm for Catmandu! Written by hochstenbach Leave a comment Posted in Uncategorized June 22, 2017 Introducing FileStores Catmandu is always our tool of choice when working with structured data. Using the Elasticsearch or MongoDB Catmandu::Store-s it is quite trivial to store and retrieve metadata records. Storing and retrieving a YAML, JSON (and by extension XML, MARC, CSV,…) files can be as easy as the commands below: $ catmandu import YAML to database < input.yml $ catmandu import JSON to database < input.json $ catmandu import MARC to database < marc.data $ catmandu export database to YAML > output.yml A catmandu.yml  configuration file is required with the connection parameters to the database: $ cat catmandu.yml --- store: database: package: ElasticSearch options: client: '1_0::Direct' index_name: catmandu ... Given these tools to import and export and even transform structured data, can this be extended to unstructured data? In institutional repositories like LibreCat we would like to manage metadata records and binary content (for example PDF files related to the metadata).  Catmandu 1.06 introduces the Catmandu::FileStore as an extension to the already existing Catmandu::Store to manage binary content. A Catmandu::FileStore is a Catmandu::Store where each Catmandu::Bag acts as a “container” or a “folder” that can contain zero or more records describing File content. The files records themselves contain pointers to a backend storage implementation capable of serialising and streaming binary files. Out of the box, one Catmandu::FileStore implementation is available Catmandu::Store::File::Simple, or short File::Simple, which stores files in a directory. Some examples. 
To add a file to a FileStore, the stream command needs to be executed: $ catmandu stream /tmp/myfile.pdf to File::Simple --root /data --bag 1234 --id myfile.pdf In the command above: /tmp/myfile.pdf is the file up be uploaded to the File::Store. File::Simple is the name of the File::Store implementation which requires one mandatory parameter, --root /data which is the root directory where all files are stored.  The--bag 1234 is the “container” or “folder” which contains the uploaded files (with a numeric identifier 1234). And the --id myfile.pdf is the identifier for the new created file record. To download the file from the File::Store, the stream command needs to be executed in opposite order: $ catmandu stream File::Simple --root /data --bag 1234 --id myfile.pdf to /tmp/file.pdf or $ catmandu stream File::Simple --root /data --bag 1234 --id myfile.pdf > /tmp/file.pdf On the file system the files are stored in some deep nested structure to be able to spread out the File::Store over many disks: /data `--/000 `--/001 `--/234 `--/myfile.pdf A listing of all “containers” can be retreived by requesting an export of the default (index) bag of the File::Store: $ catmandu export File::Simple --root /data to YAML _id: 1234 ... A listing of all files in the container “1234” can be done by adding the bag name to the export command: $ catmandu export File::Simple --root /data --bag 1234 to YAML _id: myfile.pdf _stream: !!perl/code '{ "DUMMY" }' content_type: application/pdf created: 1498125394 md5: '' modified: 1498125394 size: 883202 ... Each File::Store implementation supports at least the fields presented above: _id: the name of the file _stream: a callback function to retrieve the content of the file (requires an IO::Handle as input) content_type: the MIME-Type of the file created: a timestamp when the file was created modified: a timestamp when the file was last modified size: the byte length of the file md5: optional a MD5 checksum We envision in Catmandu that many implementations of FileStores can be created to be able to store files in GitHub, BagIts, Fedora Commons and more backends. Using the Catmandu::Plugin::SideCar  Catmandu::FileStore-s and Catmandu::Store-s can be combined as one endpoint. Using Catmandu::Store::Multi and Catmandu::Store::File::Multi many different implementations of Stores and FileStores can be combined. This is a short introduction, but I hope you will experiment a bit with the new functionality and provide feedback to our project. Written by hochstenbach Leave a comment Posted in Uncategorized March 24, 2017 Catmandu 1.04 Catmandu 1.04 has been released to with some nice new features. There are some new Fix routines that were asked by our community: error The “error” fix stops immediately the execution of the Fix script and throws an error. Use this to abort the processing of a data stream: $ cat myfix.fix unless exists(id)     error("no id found?!") end $ catmandu convert JSON --fix myfix.fix < data.json valid The “valid” fix condition can be used to validate a record (or part of a record) against a JSONSchema. For instance we can select only the valid records from a stream: $ catmandu convert JSON --fix 'select valid('', JSONSchema, schema:myschema.json)' < data.json Or, create some logging: $ cat myfix.fix unless valid(author, JSONSchema, schema:authors.json) log("errors in the author field") end $ catmandu convert JSON --fix myfix.fix < data.json rename The “rename” fix can be used to recursively change the names of fields in your documents. 
For example, when you have this JSON input: { "foo.bar": "123", "my.name": "Patrick" } you can transform all periods (.) in the key names to underscores with this fix: rename('','\.','_') The first parameter is the fields “rename” should work on (in our case it is an empty string, meaning the complete record). The second and third parameters are the regex search and replace parameters. The result of this fix is: { "foo_bar": "123", "my_name": "Patrick" } The “rename” fix will only work on the keys of JSON paths. For example, given the following path: my.deep.path.x.y.z The keys are: my deep path x y z The second and third argument search and replaces these seperate keys. When you want to change the paths as a whole take a look at the “collapse()” and “expand()” fixes in combination with the “rename” fix: collapse() rename('',"my\.deep","my.very.very.deep") expand() Now the generated path will be: my.very.very.deep.path.x.y.z Of course the example above could be written more simple as “move_field(my.deep,my.very.very.deep)”, but it serves as an example  that powerful renaming is possible. import_from_string This Fix is a generalisation of the “from_json” Fix. It can transform a serialised string field in your data into an array of data. For instance, take the following YAML record: --- foo: '{"name":"patrick"}' ... The field ‘foo’ contains a JSON fragment. You can transform this JSON into real data using the following fix: import_from_string(foo,JSON) Which creates a ‘foo’ array containing the deserialised JSON: --- foo: - name: patrick The “import_from_string” look very much like the “from_json” string, but you can use any Catmandu::Importer. It always created an array of hashes. For instance, given the following YAML record: --- foo: "name;hobby\nnicolas;drawing\npatrick;music" You can transform the CSV fragment in the ‘foo’ field into data by using this fix: import_from_string(foo,CSV,sep_char:";") Which gives as result: --- foo: - hobby: drawing name: nicolas - hobby: music name: patrick ... I the same way it can process MARC, XML, RDF, YAML or any other format supported by Catmandu. export_to_string The fix “export_to_string” is the opposite of “import_from_string” and is the generalisation of the “to_json” fix. Given the YAML from the previous example: --- foo: - hobby: drawing name: nicolas - hobby: music name: patrick ... You can create a CSV fragment in the ‘foo’ field with the following fix: export_to_string(foo,CSV,sep_char:";") Which gives as result: --- foo: "name;hobby\nnicolas;drawing\npatrick;music" search_in_store The fix “search_in_store” is a generalisation of the “lookup_in_store” fix. The latter is used to query the “_id” field in a Catmandu::Store and return the first hit. The former, “search_in_store” can query any field in a store and return all (or a subset) of the results. For instance, given the YAML record: --- foo: "(title:ABC OR author:dave) AND NOT year:2013" ... then the following fix will replace the ‘foo’ field with the result of the query in a Solr index: search_in_store('foo', store:Solr, url: 'http://localhost:8983/solr/catalog') As a result, the document will be updated like: --- foo: start: 0, limit: 0, hits: [...], total: 1000 ... where start: the starting index of the search result limit: the number of result per page hits: an array containing the data from the result page total: the total number of search results Every Catmandu::Solr can have another layout of the result page. 
Look at the documentation of the Catmandu::Solr implementations for the specific details. Thanks for all your support for Catmandu and keep on data converting 🙂 Written by hochstenbach Leave a comment Posted in Uncategorized June 16, 2016 Metadata Analysis at the Command-Line I was last week at the ELAG  2016 conference in Copenhagen and attended the excellent workshop by Christina Harlow  of Cornell University on migrating digital collections metadata to RDF and Fedora4. One of the important steps required to migrate and model data to RDF is understanding what your data is about. Probably old systems need to be converted for which little or no documentation is available. Instead of manually processing large XML or MARC dumps, tools like metadata breakers can be used to find out which fields are available in the legacy system and how they are used. Mark Phillips of the University of North Texas wrote recently in Code4Lib a very inspiring article how this could be done in Python. In this blog post I’ll demonstrate how this can be done using a new Catmandu tool: Catmandu::Breaker. To follow the examples below, you need to have a system with Catmandu installed. The Catmandu::Breaker tools can then be installed with the command: $ sudo cpan Catmandu::Breaker A breaker is a command that transforms data into a line format that can be easily processed with Unix command line tools such as grep, sort, uniq, cut and many more. If you need an introduction into Unix tools for data processing please follow the examples Johan Rolschewski of Berlin State Library and I presented as an ELAG bootcamp. As a simple example lets create a YAML file and demonstrate how this file can be analysed using Catmandu::Breaker: $ cat test.yaml --- name: John colors: - black - yellow - red institution: name: Acme years: - 1949 - 1950 - 1951 - 1952 This example has a combination of simple name/value pairs a list of colors and a deeply nested field. To transform this data into the breaker format execute the command: $ catmandu convert YAML to Breaker < test.yaml 1 colors[] black 1 colors[] yellow 1 colors[] red 1 institution.name Acme 1 institution.years[] 1949 1 institution.years[] 1950 1 institution.years[] 1951 1 institution.years[] 1952 1 name John The breaker format is a tab-delimited output with three columns: An record identifier: read from the _id field in the input data, or a counter when no such field is present. A field name. Nested fields are seperated by dots (.) and list are indicated by the square brackets ([]) A field value When you have a very large JSON or YAML field and need to find all the values of a deeply nested field you could do something like: $ catmandu convert YAML to Breaker < data.yaml | grep "institution.years" Using Catmandu you can do this analysis on input formats such as JSON, YAML, XML, CSV, XLS (Excell). Just replace the YAML by any of these formats and run the breaker command. Catmandu can also connect to OAI-PMH, Z39.50 or databases such as MongoDB, ElasticSearch, Solr or even relational databases such as MySQL, Postgres and Oracle. For instance to get a breaker format for an OAI-PMH repository issue a command like: $ catmandu convert OAI --url http://lib.ugent.be/oai to Breaker If your data is in a database you could issue an SQL query like: $ catmandu convert DBI --dsn 'dbi:Oracle' --query 'SELECT * from TABLE WHERE ...' --user 'user/password' to Breaker Some formats, such as MARC, doesn’t provide a great breaker format. In Catmandu, MARC files are parsed into a list of list. 
Running a breaker on a MARC input you get this: $ catmandu convert MARC to Breaker < t/camel.usmarc | head fol05731351 record[][] LDR fol05731351 record[][] _ fol05731351 record[][] 00755cam 22002414a 4500 fol05731351 record[][] 001 fol05731351 record[][] _ fol05731351 record[][] fol05731351 fol05731351 record[][] 082 fol05731351 record[][] 0 fol05731351 record[][] 0 fol05731351 record[][] a The MARC fields are part of the data, not part of the field name. This can be fixed by adding a special ‘marc’ handler to the breaker command: $ catmandu convert MARC to Breaker --handler marc < t/camel.usmarc | head fol05731351 LDR 00755cam 22002414a 4500 fol05731351 001 fol05731351 fol05731351 003 IMchF fol05731351 005 20000613133448.0 fol05731351 008 000107s2000 nyua 001 0 eng fol05731351 010a 00020737 fol05731351 020a 0471383147 (paper/cd-rom : alk. paper) fol05731351 040a DLC fol05731351 040c DLC fol05731351 040d DLC Now all the MARC subfields are visible in the output. You can use this format to find, for instance, all unique values in a MARC file. Lets try to find all unique 008 values: $ catmandu convert MARC to Breaker --handler marc < camel.usmarc | grep "\t008" | cut -f 3 | sort -u 000107s2000 nyua 001 0 eng 000203s2000 mau 001 0 eng 000315s1999 njua 001 0 eng 000318s1999 cau b 001 0 eng 000318s1999 caua 001 0 eng 000518s2000 mau 001 0 eng 000612s2000 mau 000 0 eng 000612s2000 mau 100 0 eng 000614s2000 mau 000 0 eng 000630s2000 cau 001 0 eng 00801nam 22002778a 4500 Catmandu::Breaker doesn’t only break input data in a easy format for command line processing, it can also do a statistical analysis on the breaker output. First process some data into the breaker format and save the result in a file: $ catmandu convert MARC to Breaker --handler marc < t/camel.usmarc > result.breaker Now, use this file as input for the ‘catmandu breaker’ command: $ catmandu breaker result.breaker | name | count | zeros | zeros% | min | max | mean | median | mode | variance | stdev | uniq | entropy | |------|-------|-------|--------|-----|-----|------|--------|--------|----------|-------|------|---------| | 001 | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 10 | 3.3/3.3 | | 003 | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0.0/3.3 | | 005 | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 10 | 3.3/3.3 | | 008 | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 10 | 3.3/3.3 | | 010a | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 10 | 3.3/3.3 | | 020a | 9 | 1 | 10.0 | 0 | 1 | 0.9 | 1 | 1 | 0.09 | 0.3 | 9 | 3.3/3.3 | | 040a | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0.0/3.3 | | 040c | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0.0/3.3 | | 040d | 5 | 5 | 50.0 | 0 | 1 | 0.5 | 0.5 | [0, 1] | 0.25 | 0.5 | 1 | 1.0/3.3 | | 042a | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0.0/3.3 | | 050a | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0.0/3.3 | | 050b | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 10 | 3.3/3.3 | | 0822 | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0.0/3.3 | | 082a | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 3 | 0.9/3.3 | | 100a | 9 | 1 | 10.0 | 0 | 1 | 0.9 | 1 | 1 | 0.09 | 0.3 | 8 | 3.1/3.3 | | 100d | 1 | 9 | 90.0 | 0 | 1 | 0.1 | 0 | 0 | 0.09 | 0.3 | 1 | 0.5/3.3 | | 100q | 1 | 9 | 90.0 | 0 | 1 | 0.1 | 0 | 0 | 0.09 | 0.3 | 1 | 0.5/3.3 | | 111a | 1 | 9 | 90.0 | 0 | 1 | 0.1 | 0 | 0 | 0.09 | 0.3 | 1 | 0.5/3.3 | | 111c | 1 | 9 | 90.0 | 0 | 1 | 0.1 | 0 | 0 | 0.09 | 0.3 | 1 | 0.5/3.3 | | 111d | 1 | 9 | 90.0 | 0 | 1 | 0.1 | 0 | 0 | 0.09 | 0.3 | 1 | 0.5/3.3 | | 245a | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 9 
| 3.1/3.3 | | 245b | 3 | 7 | 70.0 | 0 | 1 | 0.3 | 0 | 0 | 0.21 | 0.46 | 3 | 1.4/3.3 | | 245c | 9 | 1 | 10.0 | 0 | 1 | 0.9 | 1 | 1 | 0.09 | 0.3 | 8 | 3.1/3.3 | | 250a | 3 | 7 | 70.0 | 0 | 1 | 0.3 | 0 | 0 | 0.21 | 0.46 | 3 | 1.4/3.3 | | 260a | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 6 | 2.3/3.3 | | 260b | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 5 | 2.0/3.3 | | 260c | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 2 | 0.9/3.3 | | 263a | 6 | 4 | 40.0 | 0 | 1 | 0.6 | 1 | 1 | 0.24 | 0.49 | 4 | 2.0/3.3 | | 300a | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 5 | 1.8/3.3 | | 300b | 3 | 7 | 70.0 | 0 | 1 | 0.3 | 0 | 0 | 0.21 | 0.46 | 1 | 0.9/3.3 | | 300c | 4 | 6 | 60.0 | 0 | 1 | 0.4 | 0 | 0 | 0.24 | 0.49 | 4 | 1.8/3.3 | | 300e | 1 | 9 | 90.0 | 0 | 1 | 0.1 | 0 | 0 | 0.09 | 0.3 | 1 | 0.5/3.3 | | 500a | 2 | 8 | 80.0 | 0 | 1 | 0.2 | 0 | 0 | 0.16 | 0.4 | 2 | 0.9/3.3 | | 504a | 1 | 9 | 90.0 | 0 | 1 | 0.1 | 0 | 0 | 0.09 | 0.3 | 1 | 0.5/3.3 | | 630a | 2 | 9 | 90.0 | 0 | 2 | 0.2 | 0 | 0 | 0.36 | 0.6 | 2 | 0.9/3.5 | | 650a | 15 | 0 | 0.0 | 1 | 3 | 1.5 | 1 | 1 | 0.65 | 0.81 | 6 | 1.7/3.9 | | 650v | 1 | 9 | 90.0 | 0 | 1 | 0.1 | 0 | 0 | 0.09 | 0.3 | 1 | 0.5/3.3 | | 700a | 5 | 7 | 70.0 | 0 | 2 | 0.5 | 0 | 0 | 0.65 | 0.81 | 5 | 1.9/3.6 | | LDR | 10 | 0 | 0.0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 10 | 3.3/3.3 As a result you get a table listing the usage of subfields in all the input records. From this output we can learn: The ‘001’ field is available in 10 records (see: count) One record doesn’t contain a ‘020a’ subfield (see: zeros) The ‘650a’ is available in all records at least once at most 3 times (see: min, max) Only 8 out of 10 ‘100a’ subfields have unique values (see: uniq) The last column ‘entropy’ provides a number how interesting the field is for search engines. The higher the entropy, the more uniq content can be found. I hope this tools are of some use in your projects! Written by hochstenbach 8 Comments Posted in Uncategorized May 10, 2016 Catmandu 1.01 Catmandu 1.01 has been released today. There has been some speed improvements processing fixes due to switching from the Data::Util to the Ref::Util package which has better a support on many Perl platforms. For the command line there is now support for preprocessing  Fix scripts. This means, one can read in variables from the command line into a Fix script. For instance, when processing data you might want to keep some provenance data about your data sources in the output. This can be done with the following commands: $ catmandu convert MARC --fix myfixes.fix --var source=Publisher1 --var date=2014-2015 < data.mrc with a myfixes.fix like: add_field(my_source,{{source}}) add_field(my_data,{{date}}) marc_field(245,title) marc_field(022,issn) . . . etc . . Your JSON output will now contain the clean ‘title’ and ‘issn’ fields but also for each record a ‘my_source’ with value ‘Publisher1’ and a ‘my_date’ with value ‘2014-2015’. By using the Text::Hogan compiler full support of the mustache language is available. In this new Catmandu version there have been also some new fix functions you might want to try out, see our Fixes Cheat Sheet for a full overview.   Written by hochstenbach Leave a comment Posted in Updates April 20, 2016 Parallel Processing with Catmandu In this blog post I’ll show a technique to scale out your data processing with Catmandu. All catmandu scripts use a single process, in a single thread. This means that if you need to process 2 times as much data , you need 2 times at much time. 
Running a catmandu convert command with the -v option will show you the speed of a typical conversion: $ catmandu convert -v MARC to JSON --fix heavy_load.fix < input.marc > output.json added 100 (55/sec) added 200 (76/sec) added 300 (87/sec) added 400 (92/sec) added 500 (90/sec) added 600 (94/sec) added 700 (97/sec) added 800 (97/sec) added 900 (96/sec) added 1000 (97/sec) In the example above we process an ‘input.marc’ MARC file into an ‘output.json’ JSON file with some heavy data cleaning in the ‘heavy_load.fix’ Fix script. Using a single process we can reach about 97 records per second. It would take 2.8 hours to process one million records and 28 hours to process ten million records. Can we make this any faster? Every computer you buy nowadays is equipped with multiple processors. Using a single process, only one of these processors is used for calculations. One would get much more ‘bang for the buck’ if all the processors could be used. One technique to do that is called ‘parallel processing’. To check the number of processors available on your Linux machine, look at the file ‘/proc/cpuinfo’: $ cat /proc/cpuinfo | grep processor processor : 0 processor : 1 The example above shows two lines: I have two cores available to do processing on my laptop. In my library we have servers which contain 4, 8, 16 or more processors. This means that if we could do our calculations in a smart way then our processing could be 2, 4, 8 or 16 times as fast (in principle). To check if your computer is using all that calculating power, use the ‘uptime’ command: $ uptime 11:15:21 up 622 days, 1:53, 2 users, load average: 1.23, 1.70, 1.95 In the example above I ran ‘uptime’ on one of our servers with 4 processors. It shows a load average of about 1.23 to 1.95. This means that in the last 15 minutes between 1 and 2 processors were being used and the other two did nothing. If the load average is less than the number of cores (4 in our case) it means: the server is waiting for input. If the load average is equal to the number of cores it means: the server is using all the CPU power available. If the load is bigger than the number of cores, then there is more work available than can be executed by the machine, and some processes need to wait. Now that you know some Unix commands, we can start using the processing power available on your machine. In my examples I’m going to use a Unix tool called ‘GNU parallel’ to run Catmandu scripts on all the processors in my machine in the most efficient way possible. To do this you need to install GNU parallel: sudo yum install parallel The second ingredient we need is a way to cut our input data into many parts. For instance, if we have a 4-processor machine we would like to create 4 equal chunks of data to process in parallel. There are many ways to cut your data into parts. I’ll show you a trick we use at Ghent University Library with the help of a MongoDB installation. First install MongoDB and the MongoDB catmandu plugins (these examples are taken from our CentOS documentation): $ sudo cat > /etc/yum.repos.d/mongodb.repo < part1 $ catmandu export MongoDB --database_name -q '{"part.rand2":1}' > part2 We are going to use these catmandu commands in a Bash script which makes use of GNU parallel to run many conversions simultaneously. #!/bin/bash # file: parallel.sh CPU=$1 if [ "${CPU}" == "" ]; then /usr/bin/parallel -u $0 {} < result.${CPU}.json fi The example script above shows how a conversion process could run on a 2-processor machine.
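Written out in full, such a script could look roughly like the sketch below. This is only an illustration of the idea, not the exact script from the post: the database name ('catalog'), the partition field ('part.rand2') and the Fix script ('heavy_load.fix') are placeholders.

#!/bin/bash
# file: parallel.sh -- minimal sketch; database name, partition field and
# Fix script are placeholders
CPU=$1
if [ "${CPU}" == "" ]; then
    # no argument given: let GNU parallel call this script once per chunk number
    echo -e "0\n1" | /usr/bin/parallel -u $0 {}
else
    # called with a chunk number: export only that chunk from MongoDB and process it
    catmandu export MongoDB --database_name catalog -q "{\"part.rand2\":${CPU}}" | \
      catmandu convert JSON --fix heavy_load.fix to JSON > result.${CPU}.json
fi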
The lines with ‘/usr/bin/parallel’ show how GNU parallel is used to call this script with two arguments ‘0’ and ‘1’ (for the 2-processor example). The lines with ‘catmandu export’ show how chunks of data are read from the database and processed with the ‘heavy_load.fix’ Fix script. If you have a 32-processor machine, you would need to provide parallel with an input that contains the numbers 0 to 31 and change the query to ‘part.rand32’. GNU parallel is a very powerful command. It gives the opportunity to run many processes in parallel and even to spread out the load over many machines if you have a cluster. When all these machines have access to your MongoDB database then all of them can receive chunks of data to be processed. The only task left is to combine all results, which can be as easy as a simple ‘cat’ command: $ cat result.*.json > final_result.json Written by hochstenbach 4 Comments Posted in Advanced Tagged with catmandu, JSON Path, library, Linux, marc, parallel processing, perl February 25, 2016 Catmandu 1.00 After 4 years of programming and 88 minor releases we are finally there: the release of Catmandu 1.00! We have pushed the test coverage of the code to 93.97% and added and cleaned a lot of our documentation. For the new features read our Changes file. A few important changes should be noted. By default Catmandu will read and write valid JSON files. In previous versions the default input format was (new)line delimited JSON records as in: {"record":"1"} {"record":"2"} {"record":"3"} instead of the valid JSON array format: [{"record":"1"},{"record":"2"},{"record":"3"}] The old format can still be used as input but will be read much faster when using the --line_delimited option on the command line. Thus, write: # fast $ catmandu convert JSON --line_delimited 1 < lines.json.txt instead of: # slow $ catmandu convert JSON < lines.json.txt By default Catmandu will export in the valid JSON-array format. If you still need to use the old format, then provide the --line_delimited option on the command line: $ catmandu convert YAML to JSON --line_delimited 1 < data.yaml We thank all contributors for these wonderful four years of open source coding and we wish you all four new hacking years. Our thanks go to: Nicolas Steenlant Christian Pietsch Dave Sherohman Dries Moreels Friedrich Summann Jakob Voss Johann Rolschewski Jonas Smedegaard Jörgen Eriksson Magnus Enger Maria Hedberg Mathias Lösch Najko Jahn Nicolas Franck Patrick Hochstenbach Petra Kohorst Robin Sheat Snorri Briem Upasana Shukla Vitali Peil Deutsche Forschungsgemeinschaft for providing us the travel funds Lund University Library, Ghent University Library and Bielefeld University Library for providing us a very welcoming environment for open source collaboration. Written by hochstenbach Leave a comment Posted in Uncategorized June 19, 2015 Catmandu Chat On Friday June 26 2015 16:00 CEST, we’ll provide a one-hour introduction/demo on processing data with Catmandu. If you are interested, join us on the event page: https://plus.google.com/hangouts/_/event/c6jcknos8egjlthk658m1btha9o More instructions on the exact Google Hangout coordinates for this chat will follow on this web page on Friday June 26 at 15:45.
To enter the chat session, a working version of the Catmandu VirtualBox needs to be running on your system: https://librecatproject.wordpress.com/get-catmandu/ Written by hochstenbach Leave a comment Posted in Events June 3, 2015 Matching authors against VIAF identities At Ghent University Library we enrich catalog records with VIAF identities to enhance the search experience in the catalog. When searching for all the books about ‘Chekov’ we want to match all name variants of this author. Consult VIAF http://viaf.org/viaf/95216565/#Chekhov,_Anton_Pavlovich,_1860-1904 and you will see many of them. Chekhov Čehov Tsjechof Txékhov etc Any of these name variants can be present in the catalog data if authority control is not in place (or not maintained). Searching for any of these names should return results for all the variants. In the past it was a labor-intensive, manual job for catalogers to maintain an authority file. Using results from Linked Data Fragments research by Ruben Verborgh (iMinds) and the Catmandu-RDF tools created by Jakob Voss (GBV) and RDF-LDF by Patrick Hochstenbach, Ghent University started an experiment to automatically enrich authors with VIAF identities. In this blog post we will report on the setup and results of this experiment, which will also be reported at ELAG2015. Context Three ingredients are needed to create a web of data: A scalable way to produce data. The infrastructure to publish data. Clients accessing the data and reusing them in new contexts. On the production side there doesn’t seem to be any problem with libraries creating huge datasets. Any transformation of library data to linked data will quickly generate an enormous number of RDF triples. We see this in the size of publicly available datasets: UGent Academic Bibliography: 12.000.000 triples Libris catalog: 50.000.000 triples Gallica: 72.000.000 triples DBPedia: 500.000.000 triples VIAF: 600.000.000 triples Europeana: 900.000.000 triples The European Library: 3.500.000.000 triples PubChem: 60.000.000.000 triples Also for accessing data, from a consumer's perspective the “easy” part seems to be covered. Instead of thousands of available APIs and many document formats for every dataset, SPARQL and RDF provide the programmer with a single protocol and document model. The claim of the Linked Data Fragments researchers is that on the publication side, reliable queryable access to public Linked Data datasets largely remains problematic due to the low availability percentages of public SPARQL endpoints [Ref]. This is confirmed by a 2013 study by researchers from the Pontificia Universidad Católica in Chile and the National University of Ireland, in which more than half of the public SPARQL endpoints were found to be offline 1.5 days per month. This gives an availability rate of less than 95% [Ref]. The source of this high rate of unavailability can be traced back to the service model of Linked Data, where two extremes exist to publish data (see image below). From: http://www.slideshare.net/RubenVerborgh/dbpedias-triple-pattern-fragments On one side, data dumps (or dereferencing of URLs) can be made available, which requires a simple HTTP server and lots of processing power on the client side. On the other side, an open SPARQL endpoint can be provided, which requires a lot of processing power (hence, hardware investment) on the server side. With SPARQL endpoints, clients can demand the execution of arbitrarily complicated queries.
Furthermore, since each client requests unique, highly specific queries, regular caching mechanisms are ineffective, since they can only be optimized for repeated identical requests. This situation can be compared with providing end users either a database SQL dump or an open database connection on which any possible SQL statement can be executed. To a lesser extent, libraries are well aware of the different modes of operation between running OAI-PMH services and Z39.50/SRU services. The Linked Data Fragments researchers provide a third way, Triple Pattern Fragments, to publish data, which tries to provide the best of both worlds: access to a full dump of the dataset combined with a queryable and cacheable interface. For more information on the scalability of this solution I refer to the report presented at the 5th International USEWOD Workshop. The experiment VIAF doesn’t provide a public SPARQL endpoint, but a complete dump of the data is available at http://viaf.org/viaf/data/. In our experiments we used the VIAF (Virtual International Authority File) dump, which is made available under the ODC Attribution License. From this dump we created a HDT database. HDT provides a very efficient format to compress RDF data while maintaining browse and search functionality. Using command line tools, RDF/XML, Turtle and NTriples can be compressed into a HDT file with an index. This standalone file can be used to query huge datasets without the need for a database. A VIAF conversion to HDT results in a 7 GB file and a 4 GB index. Using the Linked Data Fragments server by Ruben Verborgh, available at https://github.com/LinkedDataFragments/Server.js, this HDT file can be published as a NodeJS application. For a demonstration of this server visit the iMinds experimental setup at: http://data.linkeddatafragments.org/viaf Using Triple Pattern Fragments, a simple REST protocol is available to query this dataset. For instance, it is possible to download the complete dataset using this query: $ curl -H "Accept: text/turtle" http://data.linkeddatafragments.org/viaf If we only want the triples concerning Chekhov (http://viaf.org/viaf/95216565) we can provide a query parameter: $ curl -H "Accept: text/turtle" http://data.linkeddatafragments.org/viaf?subject=http://viaf.org/viaf/95216565 Likewise, using the predicate and object parameters any combination of triples can be requested from the server. $ curl -H "Accept: text/turtle" http://data.linkeddatafragments.org/viaf?object="Chekhov" The memory requirements of this server are small enough to run a copy of the VIAF database on a MacBook Air laptop with 8GB RAM. Using specialised Triple Pattern Fragments clients, SPARQL queries can be executed against this server. For the Catmandu project we created a Perl client RDF::LDF which is integrated into Catmandu-RDF. To request all triples from the endpoint use: $ catmandu convert RDF --url http://data.linkeddatafragments.org/viaf --sparql 'SELECT * {?s ?p ?o}' Or, only those triples that are about “Chekhov”: $ catmandu convert RDF --url http://data.linkeddatafragments.org/viaf --sparql 'SELECT * {?s ?p "Chekhov"}' In the Ghent University experiment a more direct approach was taken to match authors to VIAF. First, a MARC dump from the catalog is streamed into a Perl program using a Catmandu iterator. Then we extract the 100 and 700 fields, which contain $a (name) and $d (date) subfields.
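In Catmandu Fix terms, that extraction step could look roughly like the sketch below. This is only an illustration, not the exact code from the gist mentioned further on; the 'authors' field name is arbitrary.

# collect 100/700 $a and $d as "Name, dates" strings in an 'authors' array
marc_map('100ad', 'authors.$append', join: ' ')
marc_map('700ad', 'authors.$append', join: ' ')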
These two fields are combined in a search query, as if we would search: Chekhov, Anton Pavlovich, 1860-1904 If there is exactly one hit in our local VIAF copy, then the result is reported. A complete script to process MARC files this way is available as a GitHub gist. To run the program against a MARC dump execute the import_viaf.pl command: $ ./import_viaf.pl --type USMARC file.mrc 000000089-2 7001 L $$aEdwards, Everett Eugene,$$d1900- http://viaf.org/viaf/110156902 000000122-8 1001 L $$aClelland, Marjorie Bolton,$$d1912- http://viaf.org/viaf/24253418 000000124-4 7001 L $$aSchein, Edgar H. 000000124-4 7001 L $$aKilbridge, Maurice D.,$$d1920- http://viaf.org/viaf/29125668 000000124-4 7001 L $$aWiseman, Frederick. 000000221-6 1001 L $$aMiller, Wilhelm,$$d1869- http://viaf.org/viaf/104464511 000000256-9 1001 L $$aHazlett, Thomas C.,$$d1928- http://viaf.org/viaf/65541341 [edit: 2017-05-18 an updated version of the code is available as a Git project https://github.com/LibreCat/MARC2RDF ] All the authors in the MARC dump will be exported. If there is exactly one match against VIAF, it will be added to the author field. We ran this command for one night in a single thread against 338.426 authors containing a date and found 135.257 exact matches in VIAF (=40%). In a quite recent follow-up of our experiments, we investigated how LDF clients can be used in a federated setup. When the LDF algorithm combines the triple results from many LDF servers, one SPARQL query can be run over many machines. These results are demonstrated at the iMinds demo site, where a single SPARQL query can be executed over the combined VIAF and DBPedia datasets. A Perl implementation of this federated search is available in the latest version of RDF-LDF at GitHub. We strongly believe in the success of this setup and the scalability of this solution, as demonstrated by Ruben Verborgh at the USEWOD Workshop. Using Linked Data Fragments, a range of solutions is available to publish data on the web. From simple data dumps to a full SPARQL endpoint, any service level can be provided given the available resources. For more than half a year DBPedia has been running an LDF server with 99.9994% availability on an 8 CPU, 15 GB RAM Amazon server, serving 4.5 million requests. Scaling out, services such as the LOD Laundromat clean 650.000 datasets and provide access to them using a single fat LDF server (256 GB RAM). For more information on federated searches with Linked Data Fragments, visit the blog post of Ruben Verborgh at: http://ruben.verborgh.org/blog/2015/06/09/federated-sparql-queries-in-your-browser/ Written by hochstenbach Leave a comment Posted in Advanced Tagged with LDF, Linked Data, marc, perl, RDF, SPARQL, Triple Pattern Fragments, VIAF
librecatproject-wordpress-com-7844 ---- Catmandu Catmandu 1.20 On May 21th 2019, Nicolas Steenlant (our main developer and guru of Catmandu) released version 1.20 of our Catmandu toolkit with some very interesting new features. The main addition is a brand new way how Catmandu Fix-es can be implemented using the new Catmandu::Path implementation. This coding by Nicolas will make it much easier and […] LPW 2018: “Contrarian Perl” – Tom Hukins At 09:10, Tom Hukins shares his enthusiasm for Catmandu! Introducing FileStores Catmandu is always our tool of choice when working with structured data. Using the Elasticsearch or MongoDB Catmandu::Store-s it is quite trivial to store and retrieve metadata records. Storing and retrieving a YAML, JSON (and by extension XML, MARC, CSV,…) files can be as easy as the commands below: $ catmandu import YAML to database […] Catmandu 1.04 Catmandu 1.04 has been released with some nice new features. There are some new Fix routines that were asked by our community: error The “error” fix stops immediately the execution of the Fix script and throws an error. Use this to abort the processing of a data stream: $ cat myfix.fix unless exists(id)     error("no […]
lisletters-fiander-info-7514 ---- Rapid Communications Rapid, but irregular, communications from the frontiers of Library Technology Wednesday, April 20, 2016 Mac OS vs Emacs: Getting on the right (exec) PATH One of the minor annoyances about using Emacs on Mac OS is that the PATH environment variable isn't set properly when you launch Emacs from the GUI (that is, the way we always do it). This is because the Mac OS GUI doesn't really care about the shell as a way to launch things, but if you are using brew, or other packages that install command line tools, you do. Apple has changed the way that the PATH is set over the years, and the old environment.plist method doesn't actually work anymore, for security reasons. For the past few releases, the official way to properly set up the PATH is to use the path_helper utility program. But again, that only really works if your shell profile or rc file is run before you launch Emacs. So, we need to put a bit of code into Emacs' site-start.el file to get things set up for us: (when (file-executable-p "/usr/libexec/path_helper") (let ((path (shell-command-to-string "eval `/usr/libexec/path_helper -s`; echo -n \"$PATH\""))) (setenv "PATH" path) (setq exec-path (append (parse-colon-path path) (list exec-directory))))) This code runs the path_helper utility, saves the output into a string, and then uses the string to set both the PATH environment variable and the Emacs exec-path lisp variable, which Emacs uses to run subprocesses when it doesn't need to launch a shell. If you are using the brew version of Emacs, put this code in /usr/local/share/emacs/site-lisp/site-start.el and restart Emacs. Posted by David J. Fiander at 9:35 am No comments: Tuesday, January 20, 2015 Finding ISBNs in the digits of π For some reason, a blog post from 2010 about searching for ISBNs in the first fifty million digits of π suddenly became popular on the net again at the end of last week (mid-January 2015). The only problem is that Geoff, the author, only looks for ISBN-13s, which all start with the sequence "978". There aren't many occurrences of "978" in even the first fifty million digits of π, so it's not hard to check them all to see if they are the beginning of a potential ISBN, and then find out if that potential ISBN was ever assigned to a book. But he completely ignores all of the ISBN-10s that might be hidden in π. So, since I already have code to validate ISBN checksums and to look up ISBNs in OCLC WorldCat, I decided to check for ISBN-10s myself.
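(A check-digit validator of the kind this post describes is only a few lines of Perl. The sketch below is my own illustration, not the author's actual code: an ISBN-10 is valid when the weighted sum 1*d1 + 2*d2 + ... + 10*d10 is divisible by 11, with a trailing 'X' counting as 10.)

#!/usr/bin/env perl
use strict;
use warnings;

# Return true if the string is a valid ISBN-10.
sub is_valid_isbn10 {
    my ($isbn) = @_;
    $isbn =~ s/[-\s]//g;                      # drop dashes and spaces
    return 0 unless $isbn =~ /^\d{9}[\dXx]$/; # nine digits plus a digit or X
    my @chars = split //, uc $isbn;
    my $sum = 0;
    for my $i (0 .. 9) {
        my $digit = $chars[$i] eq 'X' ? 10 : $chars[$i];
        $sum += ($i + 1) * $digit;            # position-weighted sum
    }
    return $sum % 11 == 0;
}

print is_valid_isbn10('0-13-152414-3') ? "valid\n" : "invalid\n";  # valid
print is_valid_isbn10('0-13-125414-3') ? "valid\n" : "invalid\n";  # invalid (swapped digits)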
I don't have easy access to the first fifty million digits of π, but I did manage to find the first million digits online without too much difficulty. An ISBN-10 is a ten character long string that uniquely identifies a book. An example is "0-13-152414-3". The dashes are optional and exist mostly to make it easier for humans, just like the dashes in a phone number. The first character of an ISBN-10 indicate the language in which the book is published: 0 and 1 are for English, 2 is for French, and so on. The last character of the ISBN is a "check digit", which is supposed to help systems figure out if the ISBN is correct or not. It will catch many common types of errors, like swapping two characters in the ISBN: "0-13-125414-3" is invalid. Here are the first one hundred digits of π: 3.141592653589793238462643383279502884197169399375 105820974944592307816406286208998628034825342117067 To search for "potential (English) ISBN-10s", all one needs to do is search for every 0 or 1 in the first 999,990 digits of π (there is a "1" three digits from the end, but then there aren't enough digits left over to find a full ISBN, so we can stop early) and check to see if the ten digit sequence of characters starting with that 0 or 1 has a valid check digit at the end. The sequence "1415926535", highlighted in red, fails the test, because "5" is not the correct check digit; but the sequence "0781640628" highlighted in green is a potential ISBN. There are approximately 200,000 zeros and ones in the first million digits of π, but "only" 18,273 of them appear at the beginning of a potential ISBN-10. Checking those 18,273 potentials against the WorldCat bibliographic database results in 1,168 valid ISBNs. The first one is at position 3,102: ISBN 0306803844, for the book The evolution of weapons and warfare by Trevor N. Dupuy. The last one is at position 996,919: ISBN 0415597234 for the book Exploring language assessment and testing : language in action by Anthony Green. Here's the full dataset. Posted by David J. Fiander at 5:27 pm 4 comments: Saturday, March 10, 2012 Software Upgrades and The Parable of the Windows A librarian friend of mine recently expressed some surprise at the fact that a library system would spend almost $140,000 to upgrade their ILS software, when the vendor is known to be hostile to its customers and not actually very good with new development on their products. The short answer is that it's easier to upgrade than to think. Especially when an "upgrade" will be seen as easier than a "migration" to a different vendor's system (note: open ILS platforms like Evergreen and Koha may be read as being different vendors for the sake of convenience). In fact, when an ILS vendor discontinues support for a product and tells its customers that they have to migrate to another product if they want to continue to purchase support, it is the rare library that will take this opportunity to re-examine all its options and decide to migrate to a different vendor's product. A simple demonstration of this thinking, on a scale that most of us can imagine, is what happened when my partner and I decided that it was time to replace the windows in our house several years ago. There are a couple of things you need to know about replacing the windows in your house, if you've never done this before: Most normal folks replace the windows in their house over the course of several years, doing two or three windows every year or two. 
If one is replacing the huge bay window in the living room, then that might be the only window that one does that year. Windows are expensive enough that one can't really afford to do them all at once. Windows are fungible. For the most part, one company's windows look exactly like another company's. Unless you're working hard at getting a particular colour of flashing on the outside of the window, nobody looking at your house from the sidewalk would notice that the master bedroom window and the livingroom window were made by different companies. Like any responsible homeowners, we called several local window places, got quotations from three or four of them for the windows we wanted replaced that year, made our decision about which vendor we were going to use for the first round of window replacements, and placed an order. A month or so later, on a day that the weather was going to be good, a crew from the company arrived, knocked big holes in the front of our house to take out the old windows and install the new ones. A couple of years went by, and we decided it was time to do the next couple of windows, so my partner, who was always far more organized about this sort of thing that me, called three or four window companies and asked them to come out to get quotations for the work. At least one of the vendors declined, and another vendor did come out and give us a quote but he was very surprised that we were going through this process again, because normally, once a householder has gone through the process once, they tend to use the same window company for all the windows, even if several years have passed, or if the type of work is very different from the earlier work (such as replacing the living room bay window after a couple of rounds of replacing bedroom windows). In general, once a decision has been made, people tend to stick with that plan. I think it's a matter of, "Well, I made this decision last year, and at the time, this company was good, so they're probably still good," combined, perhaps, with a bit of thinking that changing vendors in mid-stream implies that I didn't make a good decision earlier. And there is, of course, always the thought that it's better to stick with the devil you know that the one you don't. Posted by David J. Fiander at 5:21 pm 3 comments: Sunday, January 02, 2011 Using QR Codes in the Library This started out as a set of internal guidelines for the staff at MPOW, but some friends expressed interest in it, and it seems to have struck a nerve, so I'm posting it here, so it is easier for people to find and to link to. Using QR Codes in the Library QR codes are new to North American, but have been around for a while in Japan, where they originated, and where everybody has a cellphone that can read the codes. They make it simpler to take information from the real world and load it into your phone. As such, they should only be used when the information will be useful for somebody on the go, and shouldn't normally be used if the person accessing the information will probably be on a computer to begin with. Do Use QR Codes: On posters and display projectors to guide users to mobile-friendly websites. To share your contact information on posters, display projectors, or your business card. This makes it simpler for users to add you to their addressbook without having to type it all in. In display cabinets or art exhibits to link to supplementary information about the items on display. 
Don't use QR Codes: to record your contact information in your email signature. Somebody reading your email can easily copy the information from your signature to their addressbook. to share URLs for rich, or full-sized, websites. The only URLs you should be sharing via QR codes for are mobile-friendly sites. When Using QR Codes: Make sure to include a human readable URL, preferably one that's easy to remember, near the QR code for people without QR Code scanners to use. Posted by David J. Fiander at 7:07 pm No comments: Monday, April 06, 2009 A Manifesto for the Library Last week John Blyberg, Kathryn Greenhill, and Cindi Trainor spent some time together thinking about what the library is for and what its future might hold. The result of that deep thinking has now been published on John's blog under the title "The Darien Statements on the Library and Librarians." Opening with the ringing statement that The purpose of the Library is to preserve the integrity of civilization they then provide their own gloss on what this means for individual libraries, and for librarians. There is a lively discussion going on in the comments on John's blog, as well as less thoughtful sniping going on in more "annoying" blogs. I think that this is something that will engender quite a bit of conversation in the month's to come. Posted by David J. Fiander at 6:13 pm No comments: Sunday, April 05, 2009 I'm a Shover and Maker! Since only a few people can be named "Movers and Shakers" by Library Journal, Joshua Neff and Steven Lawson created the "Shovers and Makers" awards "for the rest of us," under the auspices of the not entirely serious Library Society of the World. I'm very pleased to report that I have been named a 2009 Shover and Maker (by myself, as are all the winners). The Shovers and Makers awards are a fun way to share what we've done over the past year or two and they're definitely a lot simpler than writing the annual performance review that HR wants. Think of this as practice for writing the speaker's bio for the conference keynote you dream of being invited to give. Posted by David J. Fiander at 8:22 am No comments: Sunday, January 25, 2009 LITA Tears Down the Walls At ALA Midwinter 2009, Jason Griffey and the LITA folks took advantage of the conference center's wireless network to provide quick and easy access to the Top Tech Trends panel for those of us that couldn't be there in person. The low-bandwidth option was a CoverItLive live-blogging feed of comments from attending that also included photos by Cindi Trainor, and a feed of twitters from attendees. The high-bandwidth option was a live (and recorded) video stream of the event that Jason captured using the webcam built into his laptop. Aside from the LITA planned events, the fact that we could all sit in meant that there were lots of virtual conversations in chat rooms and other forums that sprung up as people joined in from afar. Unfortunately, because my Sunday morning is filled with laundry and other domestic pleasures, I wasn't able to join in on the "live" chatter going on in parallel with the video or livebloggin. Owing to funding constraints and my own priorities, my participation at ALA is limited. I've been to LITA Forum once, and might go again, but I focus more on the OLA other regional events. This virtual option from LITA let me get a peek at what's going on and hear what the "big thinkers" at LITA have to say. 
I hope they can keep it up, and will definitely be talking to local folks about how we might be able to emulate LITA in our own events. Posted by David J. Fiander at 12:34 pm No comments: About Me David J. Fiander I'm a former software developer who's now the web services librarian at a university. The great thing about that job title is that nobody knows what I do. This work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 2.5 Canada License. litablog-org-4795 ---- LITA Blog Empowering libraries through technology Jobs in Information Technology: August 25, 2020 New This Week Coordinator of Digital Scholarship and Programs, Marquette University Libraries, Milwaukee WI Digital Scholarship Coordinator, UNC Charlotte, Charlotte, NC Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Jobs in Information Technology: August 13, 2020 New This Week Information Systems Manager (PDF), The Community Library Association, Ketchum, ID Children’s Librarian, Buhl Public Library, Buhl, ID Technology Integration Librarian, Drexel University Libraries, Philadelphia, PA Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Your Core Community Update Much has been happening behind-the-scenes to prepare for Core’s upcoming launch on September 1st, so we want to update you on the progress we’ve made. At the 2020 ALA Virtual Conference Council meetings, the ALA Council approved the creation of Core, so we’re official! It’s been a difficult summer for everyone given the global situation, but this was a milestone we’re excited to reach. What We’ve Been Doing In May, the Core Transition Committee (the 9 division presidents plus senior staff) formed 11 working groups of members from all 3 divisions to make recommendations about how to proceed with our awards/scholarships, budget/finance, committees, communications, conference programming, continuing education, fundraising/sponsorships, interest groups, member engagement, nominations for 2021 president-elect, publications, and standards. These groups have done an amazing amount of work in a very short time period, and we’re grateful to these members for their commitment and effort. We’re happy to report...
In this free 90-minute presentation, our presenters will share tips that might be helpful to other librarians before they reopen. The presenters will also talk about the evolution of the phased plan from the establishment of a temporary computer lab in the library as COVID-19 began to spread in March 2020, to the current phased approach for gradual reopening. Justin will also offer insight into managed access, technology and services, workflows, messaging,... Jobs in Information Technology: July 29, 2020 New This Week Library Director, Walpole Town Library, Walpole, NH Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Core Call for 2021 ALA Annual Program Proposals Submit an ALA 2021 Annual Conference program proposal for ALA’s newest division, Core: Leadership, Infrastructure, Futures, which will begin on September 1, 2020. Proposals are due September 30, 2020, and you don’t need to be a Core member to submit a proposal. Submit your idea using this proposal form. Core welcomes topics of interest to a wide range of library professionals in many different areas, including… 1. Access and Equity Advocacy in areas such as copyright, equity of access, open access, net neutrality, and privacy Preservation Week Equity, diversity, and inclusion, both within the division and the profession, as related to Core’s subject areas 2. Assessment Emphasizing the role of assessment in demonstrating the impacts of libraries or library services Assessment tools, methods, guidelines, standards, and policies and procedures 3. Leadership and Management Developing leaders at every level Best practices for inclusion by using an equity lens to examine leadership... Core Call for Webinar Proposals Submit a webinar proposal for ALA’s newest division, Core: Leadership, Infrastructure, Futures, which will begin on September 1, 2020. Proposals are due September 1, 2020, and you don’t need to be a Core member to submit a proposal. Early submissions are encouraged and will be considered for September and October presentations. Submit your idea using this proposal form. Core webinars reach a wide range of library professionals in many different areas, including… 1. Access and Equity Advocacy in areas such as copyright, equity of access, open access, net neutrality, and privacy Preservation Week Equity, diversity, and inclusion, both within the division and the profession, as related to Core’s subject areas 2. Assessment Emphasizing the role of assessment in demonstrating the impacts of libraries or library services Assessment tools, methods, guidelines, standards, and policies and procedures 3. Leadership Developing leaders at every level Best practices for inclusion by using an equity lens to examine... Core Virtual Forum is excited to announce our 2020 Keynote Speakers! Core Virtual Forum welcomes our 2020 Keynote speakers, Dr. Meredith D. Clark and Sofia Leung! Both speakers embody our theme in leading through their ideas and are catalysts for change to empower our community and move the library profession forward. Dr. Clark is a journalist and Assistant Professor in Media Studies at the University of Virginia. She is Academic Lead for Documenting the Now II, funded by the Andrew W. Mellon Foundation. Dr. Clark develops new scholarship on teaching students about digital archiving and community-based archives from a media studies perspective. She will be a 2020-2021 fellow with Data & Society. 
She is a faculty affiliate at the Center on Digital Culture and Society at the University of Pennsylvania. And, she sits on the advisory boards for Project Information Literacy, and for the Center for Critical Race and Digital Studies at New York University. Clark is an in-demand media consultant... Catch up on the June 2020 Issue of Information Technology and Libraries The June 2020 issue of Information Technology and Libraries (ITAL) was published on June 15. Editor Ken Varnum and LITA President Emily Morton-Owens reflect on the past three months in their Letter from the Editor, A Blank Page, and LITA President’s Message, A Framework for Member Success, respectively. Kevin Ford is the author of this issue’s “Editorial Board Thoughts” column, Seeing through Vocabularies. Rounding out our editorial section, the June “Public Libraries Leading the Way” section offers two items. Chuck McAndrew of the Lebanon (New Hampshire) Public Libraries describes his leadership in the IMLS-funded LibraryVPN project. Melody Friedenthal, of the Worcester (Massachusetts) Public Library talks about how she approached and teaches an Intro to Coding Using Python course. Peer-reviewed Content Virtual Reality as a Tool for Student Orientation in Distance Education Programs: A Study of New Library and Information Science Students Dr. Sandra Valenti, Brady Lund, Ting Wang Virtual reality... Jobs in Information Technology: July 8, 2020 New This Week Dean of Libraries, San Jose State University, San Jose, CA Deputy Library Director, City of Carlsbad, Carlsbad, CA Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Jobs in Information Technology: July 2, 2020 New This Week Web Services Librarian, Chester Fritz Library, University of North Dakota, Grand Forks, ND Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Jobs in Information Technology: June 26, 2020 New This Week Metadata Librarian, Librarian I or II, University of Northern British Columbia, Prince George, British Columbia, Canada Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Jobs in Information Technology: June 19, 2020 New This Week Information Technology Librarian, University of Maryland, Baltimore County, Baltimore, MD Associate University Librarian for Research and Learning, Columbia University Libraries, New York, NY Library Technology/Programmer Analyst III, Virginia Beach Public Library, Virginia Beach, VA Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Core Virtual Happy Hour Social ~ June 26 Our Joint Happy Hour social at Midwinter was such a success that next week we’re bringing Happy Hour to you online—and registration is free! We invite members of ALCTS, LITA, and LLAMA to join us on Friday, June 26, 5:00-7:00 pm Central Time for Virtual Happy Hour networking and/or play with your peers in a game of Scattergories. Wear your favorite pop culture T-shirt, bring your best Zoom background, grab a beverage, and meet us online for a great time! Attendees will automatically be entered to win free registration to attend the Core Virtual Forum. Winner must be present to redeem prize. Registration is required. 
Register now at: bit.ly/2NeNprH Michael Carroll Awarded 2020 LITA/Christian Larew Memorial Scholarship Michael Carroll has been selected to receive the 2020 LITA/Christian Larew Memorial Scholarship ($3,000) sponsored by the Library and Information Technology Association (LITA) and Baker & Taylor. This scholarship is for master’s level study, with an emphasis on library technology and/or automation, at a library school program accredited by the American Library Association. Criteria for the Scholarship includes previous academic excellence, evidence of leadership potential, and a commitment to a career in library automation and information technology. The Larew Scholarship Committee was impressed by what Michael has already accomplished and look forward to seeing what he will accomplish after graduation in 2021. Michael has already shown a strong interest in digitization projects. He currently manages a team of students working with digitization. Previously, he has scanned and cataloged many collections. He has also assisted the Presbyterian Historical Society in creating sustainable processes for digitization. Michael has also shown his willingness and ability to work with a wide variety of projects and technologies that span both technical and non-technical including... We are back on Twitter Friday for #LITAchat The fourth in this series of #LITAchats will start on Friday, June 12 from 12-1 Central Standard Time on Twitter. We will be asking you to chat with us about self-care. What are you doing to take care of yourselves during this time? How do you unplug without feeling guilty?  We hope you’ll join us for #LITAchat and chat about self-care techniques and figuring out how to better take care of ourselves during these tough times. We’re looking forward to hearing from you! Join LITA on Twitter Catch up on the last #LITAchat Join us for ALCTS/LITA/LLAMA e-Forum! Please join us for a joint ALCTS/LITA/LLAMA e-forum discussion. It’s free and open to everyone! Registration information is at the end of the message, along with subscription management options for existing listserv members. Continuing to Manage the Impact of COVID-19 on Libraries June 9-10, 2020 Moderated by Alyse Jordan, Steven Pryor, Nicole Lewis and Rebecca Uhl Please join us for an e-forum discussion. It’s free and open to everyone! Registration information is at the end of the message. Each day, discussion begins and ends at: Pacific: 7 a.m. – 3 p.m. Mountain: 8 a.m. – 4 p.m. Central: 9 a.m. – 5 p.m. Eastern: 10 a.m. – 6 p.m. Over the past several months, COVID-19 has significantly impacted libraries and library technical service units and departments, including requiring staff to work remotely and determining what services they can provide. As states begin to reopen, libraries face challenges as they determine... Together Against Racism ALA and Core are committed to dismantling racism and white supremacy. Along with the ALA Executive Board, we endorse the Black Caucus of the American Library Association (BCALA)’s May 28 statement condemning the brutal murder of George Floyd at the hands of Minneapolis Police Department officers. In their statement, BCALA cites Floyd’s death as “the latest in a long line of recent and historical violence against Black people in the United States.” Not only does Core support the sentiments of BCALA, we vow to align our values regarding equity, diversity, and inclusion with those of BCALA and other organizations that represent marginalized communities within ALA. 
We also stand strong with the Asian/Pacific American community, which has been the target of xenophobia and racism in the wake of the outbreak of COVID-19, and support the Asian/Pacific American Librarians Association (APALA) and their statement that, “There is no excuse for discriminatory sentiments and actions towards Asians... We are back on Twitter tomorrow for #LITAchat Are you ready for the next Twitter #LITAchat? Join the discussion on Friday, May 22, from 12-1pm Central Time. We will be asking you to tell us about challenges with working from home. Are there things you can’t do and wish you could? Are there issues with your home setup in general? Anne Pepitone will lead the discussion. We invite you to join us tomorrow to share your experiences and chat with your colleagues. Follow LITA on Twitter Catch up on the last #LITAchat We’re looking forward to hearing from you! -The LITA Membership Development Committee LITA Job Board Analysis Report – Laura Costello (Chair, Assessment & Research) LITA Assessment & Research and Diversity & Inclusion Committees Background & Data This report comes from a joint analysis conducted by LITA’s Assessment & Research and Diversity & Inclusion committees in Fall 2019. The analysis focused on the new and emerging trends in skills in library technology jobs and the types of positions that are currently in demand. It also touches on trends in diversity and inclusion in job postings and best practices for writing job ads that attract a diverse and talented candidate pool.  The committees were provided with a list of 678 job postings from the LITA job board between 2015-2019. Data included the employer information, the position title, the location (city/state) the posting date. Some postings also included a short description. The Assessment & Research Committee augmented the dataset with job description, responsibilities, qualifications, and salary information for a 25% sample of the postings from each year using archival job posting information. Committee members also assigned... Congratulations to Dr. Jian Qin, winner of the 2020 LITA/OCLC Kilgour Research Award Dr. Jian Qin has been selected as the recipient of the 2020 Frederick G. Kilgour Award for Research in Library and Information Technology, sponsored by OCLC and the Library and Information Technology Association (LITA). She is the Professor and Director at the iSchool, Syracuse University.  The Kilgour Award honors research relevant to the development of information technologies, especially work which shows promise of having a positive and substantive impact on any aspect(s) of the publication, storage, retrieval and dissemination of information, or the processes by which information and data are manipulated and managed. It recognizes a body of work probably spanning years, if not the majority of a career. The winner receives $2,000, and a citation. Dr. Qin’s recent research projects include metadata modeling for gravitational wave research data management and big metadata analytics using GenBank metadata records for DNA sequences, both with funding from NSF. She also collaborated with a colleague to develop a Capability Maturity Model... LITA/ALA Survey of Library Response to COVID-19 The Library and Information Technology Association (LITA) and its ALA partners are seeking a new round of feedback about the work of libraries as they respond to the COVID-19 crisis, releasing a survey and requesting feedback by 11:59 p.m. CDT, Monday, May 18, 2020. 
Please complete the survey by clicking on the following link: https://www.surveymonkey.com/r/libraries-respond-to-covid-19-may-2020. LITA and its ALA partners know that libraries across the United States are taking unprecedented steps to answer the needs of their communities, and this survey will help build a better understanding of those efforts. LITA and its ALA partners will use the results to advocate on behalf of libraries at the national level, communicate aggregated results with the public and media, create content and professional development opportunities to address library staff needs, and share some raw, anonymized data elements with state-level staff and library support organizations for their own advocacy needs. Additional information about... #CoreForum2020 is now a Virtual Event! Join your ALA colleagues from across divisions for the 2020 Forum, which is now a virtual event! WHERE: In light of the COVID-19 public health crisis, leadership within LITA, ALCTS, and LLAMA made the decision to move the conference online to create a safe, interactive environment accessible for all. WHAT: The call for proposals has been extended to Friday, June 12, 2020. WHEN: The Forum is scheduled for November 18 and 20, 2020. HOW: Share your ideas and experiences with library projects by submitting a talk for the inaugural event for Core: https://forum.lita.org/call-for-proposals For more information about the LITA, ALCTS, LLAMA (Core) Forum, please visit https://forum.lita.org Jobs in Information Technology: May 6, 2020 New This Week Web Services Librarian, Fairfield University, Fairfield, CT Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. WFH? Boost your skill set with LITA CE! Reserve your spot and learn new skills to enhance your career with LITA online continuing education offerings. Buying Strategies 101 for Information Technology, Wednesday, May 27, 2020, 1:00-2:30 pm Central Time. Presenter: Michael Rodriguez, Collections Strategist at the University of Connecticut. In this 90-minute webinar, you’ll learn best practices, terminology, and concepts for effectively negotiating contracts for the purchase of information technology (IT) products and services. View details and Register here. Using Images from the Internet in a Webpage: How to Find and Cite, Wednesday, June 3, 2020, 2:00-3:30 pm Central Time. Presenter: Lauren Bryant, Priority Associate Librarian of Ray W. Howard Library. In this 90-minute webinar, you’ll learn practical ways to quickly find and filter Creative Commons-licensed images online, how to hyperlink a citation for a website, how to use Creative Commons images for thumbnails in videos, and how to cite an image in unconventional situations like these. View details and Register here. Troublesome Technology Trends: Bridging the Learning Divide, Wednesday, June 17, 2020, 1:00-2:30 pm... May 5/1 Twitter #LITAchat Last week, Anne Pepitone kicked off the discussion with Zoom Virtual Backgrounds, shared her favorites, and provided tips on how to use them. The next Twitter #LITAchat will be on Friday, May 1, from 12-1pm Central Time, when we’ll talk about apps that help you work from home. What do you use to help with project management, time management, deadlines, or to just stay focused? We invite you to join us tomorrow to share, learn, and chat about it with your colleagues. Follow LITA on Twitter. We’re looking forward to hearing from you!
-The LITA Membership Development Committee Jobs in Information Technology: April 29, 2020 New This Week Two Associate Dean Positions, James Madison University Libraries, Harrisonburg, VA Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Data Privacy While Working From Home Today’s guest post is brought to you by our recent presenter, Becky Yoose. Special thanks to Becky for being willing to answer the questions we didn’t have time for during our webinar! Hello everyone from your friendly neighborhood library data privacy consultant! We covered a lot of material earlier this month in “A Crash Course in Protecting Library Data While Working From Home,” co-sponsored by LITA and OIF. We had a number of questions during the webinar, some of which were left unanswered at the end. Below are three questions in particular that we didn’t get to in the webinar. Enjoy! Working from home without a web-based ILS We don’t have a web-based version of our ILS and our County-based IT department says they can’t set up remote desktop (something to do with their firewall)… do you have any recommendations on how to advocate for remote desktop? If I have... Strategies for Surviving a Staffing Crisis Library staff are no strangers to budget and staffing reductions. Most of us have way too much experience doing more with less, covering unfilled positions, and rigging solutions out of the digital equivalent of chewing gum and baling wire, because we can’t afford to buy all the tools we need. In the last two years, my department at Northern Arizona University’s Cline Library operated with roughly half the usual number of staff. In this post, I’ll share a few strategies that helped us get through this challenging time. First, a quick introduction. My department, Content, Discovery & Delivery services, includes the digital services unit (formerly library technology services) as well as collection management (including electronic resources management), acquisitions, cataloging, physical processing, interlibrary loan and document delivery, and course reserves. We are a technology-intensive department, both as users and implementers/supporters of technology. Here are some of the strategies we used to... April 4/24 Twitter #LITAchat A lot has changed since our last Twitter #LITAchat: Core passed, and then COVID-19 happened. We are all navigating new territory in our jobs and life overall. So we wanted to bring you a weekly set of #LITAchats discussing our shared experiences during these strange times. The first in this series of #LITAchats will start on Friday, April 24, from 12-1pm Central Standard Time. We will be asking you to show us your Zoom Virtual Backgrounds! We know that Zoom conferencing has become popular in many workplaces, so we thought: what could be better than showcasing some of the creative backgrounds everyone has been using? If you don’t have a background, no worries; you can share the best backgrounds you have seen from colleagues. Don’t know how to turn on Zoom Virtual Backgrounds? We will cover that too! We hope you’ll join us on Twitter for...
Congratulations to Samantha Grabus, winner of the 2020 LITA/Ex Libris Student Writing Award Samantha Grabus has been selected as the winner of the 2020 Student Writing Award sponsored by Ex Libris Group and the Library and Information Technology Association (LITA) for her paper titled “Evaluating the Impact of the Long S upon 18th-Century Encyclopedia Britannica Automatic Subject Metadata Generation Results.” Grabus is a Research Assistant and PhD student at the Drexel University Metadata Research Center. “This valuable work of original research helps to quantify the scope of a problem that is of interest not only in the field of library and information science, but that also, as Grabus notes in her conclusion, could affect research in fields from the digital humanities to the sciences,” said Julia Bauder, the Chair of this year’s selection committee. When notified she had won, Grabus remarked, “I am thrilled and honored to receive the 2020 LITA/Ex Libris Student Writing Award. I would like to extend my gratitude to the award committee... Jobs in Information Technology: April 15, 2020 New This Week Web and Digital Scholarship Technologies Librarian, Marquette University Libraries, Milwaukee, WI CEO / Library Director, Orange County Library System, Orlando, FL Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. ALA LITA Emerging Leaders: Inventing a Sustainable Division In January 2020, the latest cohort of Emerging Leaders met at ALA Midwinter to begin their projects. LITA sponsored two Emerging Leaders this year: Kelsey Flynn, Adult Services Specialist at White Oak Library, and Paige Walker, Digital Collections & Preservation Librarian at Boston College. Kelsey and Paige are part of Emerging Leaders Group G, “Inventing a Sustainable Division,” in which they’ve been charged with identifying measures that LITA can take to improve its fiscal and environmental sustainability. As a first step in their assessment, the group distributed a survey to LITA members that will quantify interest in sustainable measures such as virtual conferences and webinars. Want to help? Complete the survey to give feedback that may shape the direction of our chapter. Group G is fortunate to have several other talented library workers on its team: Kristen Cooper, Plant Sciences Librarian at the University of Minnesota; Tonya Ferrell, OER Coordinator at... Latest in LITA eLearning So much has changed since COVID-19. Online learning is in greater demand, and we are working hard to provide you with resources and more professional development opportunities that strengthen the library community. We hope you are well and staying safe. There’s a seat waiting for you. Register today! Digital Inception: Building a digital scholarship/humanities curriculum as a subject librarian Wednesday, April 22, 2020 1:00 – 2:30 p.m. Central Time Presenter: Marcela Isuster, Education and Humanities Librarian, McGill University This presentation will guide attendees in building a digital scholarship curriculum from a subject librarian position. It will explore how to identify opportunities, reach out to faculty, and advertise your services. It will also showcase activities, lesson plans, and free tools for digital publication, data mining, text analysis, and mapping, along with a section on finding training opportunities and strategies to support colleagues and create capacity in your institutions. In this 90-minute webinar, you’ll learn:... Join us this Fall for #CoreForum2020 – Proposal Deadline Extended!
The call for proposals has now been extended to Friday, May 22, 2020. Share your ideas and experiences about library technology, leadership, collections, preservation, assessment, and metadata at the inaugural meeting of Core, a joining of LITA/ALCTS/LLAMA. We welcome your session proposal. For more information about the call for proposals and our theme of exploring ideas and making them reality, visit the 2020 Forum website: https://forum.lita.org Event Details November 19-21, 2020 Baltimore, MD Renaissance Baltimore Harborplace Hotel COVID-19 Planning The 2020 LITA/ALCTS/LLAMA Forum Planning Committee is currently evaluating a contingency plan, should the COVID-19 public health crisis impact the Forum in November. Core Is Approved! We’re thrilled to announce that Core: Leadership, Infrastructure, Futures is moving forward, thanks to our members. The three existing divisions’ members all voted to approve the bylaws change that will unite ALCTS, LITA, and LLAMA to form Core: ALCTS, 91% yes; LITA, 96% yes; LLAMA, 96% yes. The presidents of the three divisions, Jennifer Bowen (ALCTS), Emily Morton-Owens (LITA), and Anne Cooper Moore (LLAMA), shared the following statement: “We first want to thank our members for supporting Core. Their belief in this vision, that we can accomplish more together than we can separately, has inspired us, and we look forward to working with all members to build this new and sustainable ALA division. We also want to thank the Core Steering Committee, and all the members who were part of project teams, town halls and focus groups. We would not have reached this moment without their incredible work.” ALA Executive... Free LITA Webinar: Protect Library Data While Working From Home A Crash Course in Protecting Library Data While Working From Home Presenter: Becky Yoose, Founder / Library Data Privacy Consultant, LDH Consulting Services Thursday, April 9, 2020 1:00 – 2:00 pm Central Time There’s a seat waiting for you… Register for this free LITA webinar today! Libraries across the U.S. rapidly closed their doors to both public and staff in the last two weeks, leaving many staff to work from home. Several library workers might be working from home for the first time in their current positions, while many others were not fully prepared to switch over to remote work in a matter of days, or even hours, before the library closed. In the rush to migrate library workers to remote work and to migrate physical library programs and services to online, data privacy and security sometimes get lost in the mix. Unfamiliar settings, new routines, and increased reliance on vendor... Jobs in Information Technology: March 25, 2020 New This Week Head of Library Technology Services, East Carolina University, Greenville, NC Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. March 2020 ITAL Issue Now Available The March 2020 issue of Information Technology and Libraries (ITAL) is available now. In this issue, ITAL Editor Ken Varnum shares his support of LITA, ALCTS, and LLAMA merging to form a new ALA division, Core. Our content includes a message from LITA President Emily Morton-Owens. In “A Framework for Member Success,” Morton-Owens discusses the current challenges of LITA as a membership organization and argues that reinvention is the key to survival.
Also in this edition, Laurie Willis discusses the pros and cons of handling major projects in-house versus hiring a vendor in “Tackling Big Projects.” Sheryl Cormicle Knox and Trenton Smiley discuss using digital tactics as a cost-effective way to increase marketing reach in “Google Us!” Featured Articles: “User Experience Methods and Maturity in Academic Libraries,” Scott W. H. Young, Zoe Chao, and Adam Chandler This article presents a mixed-methods study of the methods and maturity of user experience (UX) practice in... Learn How to Build your own Digital Scholarship/Humanities Curriculum with this LITA webinar Are you a subject librarian interested in building digital scholarships? Join us for the upcoming webinar “Digital Inception: Building a digital scholarship/humanities curriculum as a subject librarian,” on Wednesday, April 22, from 1:00 – 2:30 pm CST.  Digital scholarship is gaining momentum in academia. What started as a humanities movement is now present in most disciplines. Introducing digital scholarship to students can benefit them in multiple ways: it helps them interact with new trends in scholarship, appeals to different kinds of learners, helps them develop new and emerging literacies, and gives them the opportunity to be creative. This 90-minute presentation will guide attendees in building a digital scholarship curriculum from a subject librarian position. It will explore how to identify opportunities, reach out to faculty, and advertise your services. It will also showcase activities, lesson plans, and free tools for digital publication, data mining, text analysis, mapping, etc. Finally, the presentation will... Jobs in Information Technology: March 11, 2020 New this week Project Manager for Resource Sharing Initiatives, Harvard University, Cambridge, MA Research Data Services Librarian, University of Kentucky Libraries, Lexington, KY Digital Archivist, Rice University, Fondren Library, Houston, TX Associate Director, Technical Services, Yale University, New Haven, CT Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Congratulations to Alison Macrina, winner of the 2020 LITA/Library Hi Tech Award The LITA/Library Hi Tech Awards Committee is pleased to select Alison Macrina as the 2020 recipient of the LITA/Library Hi-Tech Award. Macrina led the Tor Relay Initiative in New Hampshire, is the founder and executive director of the Library Freedom Project, and has written and taught extensively in the areas of digital privacy, surveillance, and user anonymity in the context of libraries and librarianship. In this role, Macrina was instrumental in creating the Library Freedom Institute, which trained its first cohort in 2018 and will train its third cohort in 2020. Macrina has also spoken on digital privacy and the work of the Library Freedom Project across the United States and published Anonymity, the first book in ALA’s Library Futures Series, in 2019. The committee was fortunate to receive several outstanding nominations for the 2020 award. Macrina stood out in this strong pool of candidates for the broad reach and impact... Nominate yourself or someone you know for the next LITA Top Tech Trends panel of speakers LITA is looking for dynamic speakers with knowledge about the top trends in technology and how they intersect with information security and privacy. Library technology is quickly evolving with trends such as VR, cloud computing and AI. 
As library technology continues to impact our profession and those that we serve, security and privacy are quickly becoming top concerns. We hope this panel will provide insight and information about these technology trends for you to discuss within your own organization. If you or someone you know would be a great fit for this exciting panel, please submit your nomination today. Submit your nominations – the deadline is April 17, 2020. The session is planned for Sunday, June 28, 2020, 2:30 – 3:30 pm, at the 2020 ALA Annual Conference in Chicago, IL. A moderator and several panelists will each discuss trends impacting libraries, ideas for use cases, and practical approaches for... Jobs in Information Technology: March 4, 2020 New this week Wilson Distinguished Professorship, University of North Carolina at Chapel Hill, Chapel Hill, NC Coordinator of Library Technical Services, Berea College, Berea, KY UI/UX Designer, University of Rochester Libraries, Rochester, NY Technical Support and Hardware Specialist – 2 Openings, St. Lawrence University, Canton, NY Software Engineer, Library Systems, Stanford Health Care, Palo Alto, CA Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Hebah Emara is our 2019-20 LITA/OCLC Spectrum Scholar LITA and OCLC are funding Hebah Emara’s participation in the ALA Spectrum Scholars program as part of their commitment to help diversify the library technology field. Emara is a second-year distance student in the University of Missouri – Columbia School of Information Science and Learning Technologies MLIS program. She is interested in the ways libraries and technology intersect. Her background in IT and love of learning about technology, computers, and programming drew her to working in library technology. Libraries’ ability to bridge the digital divide and their use of technology to provide opportunities to their communities and solve problems are also of particular interest to Emara. Her decision to apply to the Spectrum Scholarship was fueled by a desire to learn from a community of peers and mentors. Emara is currently the co-chair of a Tech UnConference to be held in April 2020 and organized by MentorNJ in collaboration with the... Share your ideas and library projects by submitting a session proposal for the 2020 Forum! 2020 Forum Call for Proposals Submission Deadline: March 30, 2020 November 19-21, 2020 Baltimore, Maryland Renaissance Baltimore Harborplace Hotel Do you have an idea or project that you would like to share? Does your library have a creative or inventive solution to a common problem? Submit a proposal for the 2020 LITA/ALCTS/LLAMA Forum! Submission deadline is March 30th. Our library community is rich in ideas and shared experiences. The 2020 Forum Theme embodies our purpose to share knowledge and gain new insights by exploring ideas through an interactive, hands-on experience. We hope that this Forum can be an inspiration to share, finish, and be a catalyst to implement ideas… together. We invite those who choose to lead through their ideas to submit proposals for sessions or preconference workshops, as well as nominate keynote speakers. This is an opportunity to share your ideas or unfinished work, inciting collaboration and advancing the library profession... Early-bird Registration for the Exchange Ends in Three Days! The March 1 early-bird registration deadline for the Exchange is approaching. Register today and save!
There’s still time to register for the Exchange at a discount, with early-bird registration rates at $199 for ALCTS, LITA, and LLAMA members; $255 for ALA individual members; $289 for non-members; $79 for student/retired members; $475 for groups; and $795 for institutions. Early-bird registration ends March 1. Taking place May 4, 6, and 8, the Exchange will engage a wide range of presenters and participants, facilitating enriching conversations and learning opportunities in a three-day, fully online, virtual forum. Programming includes keynote presentations from Emily Drabinski and Rebekkah Smith Aldrich, and sessions focusing on leadership and change management, continuity and sustainability, and collaborations and cooperative endeavors. In addition to these sessions, the Exchange will offer lightning rounds and virtual poster sessions. For up-to-date details on sessions, be sure to check the Exchange website as new information... Jobs in Information Technology: February 26, 2020 New this week Back End Drupal Web Developer, Multnomah County Library, Portland, OR Distance Education & Outreach Librarian, Winona State University, Winona, MN Senior Systems Specialist, PrairieCat, Library Consortium, Coal Valley, IL Training and Outreach Coordinator, PrairieCat, Library Consortium, Coal Valley, IL Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Deadline Extended to March 15 – Submit a Proposal to Teach for LITA The deadline to submit LITA education proposals has been extended to March 15th. We’re seeking instructors passionate about library technology topics to share their expertise and teach a webinar, webinar series, or online course for LITA this year. Instructors receive a $500 honorarium for an online course or $150 for a webinar, split among instructors. Check out our list of current and past course offerings to see what topics have been covered recently. Be part of another slate of compelling and useful online education programs this year! Submit your LITA education proposal today! For questions or comments related to teaching for LITA, contact us at lita@ala.org or (312) 280-4268. The 2020 Census Starts in Two Weeks — Are Your Computers Ready? Post courtesy of Gavin Baker, ALA Office of Public Policy and Advocacy, Deputy Director, Public Policy and Government Relations On March 12, millions of American households will begin receiving mailings inviting them to respond to the 2020 Census. To get an accurate count, everyone has to respond – if they don’t, our libraries and communities will lose needed funding. As the mailings arrive, patrons may come to your library with questions – and, with a new option to respond online, to complete the questionnaire using the library’s computers or internet. To help you prepare, ALA has a new, two-page tip sheet, “Libraries and the 2020 Census: Responding to the Census,” that provides key dates, options for responding, and advice for libraries preparing for the 2020 Census. For instance, the tip sheet explains these important facts: Ways to Respond: Households can respond to the Census online, by phone, or by mail... News Regarding the Future of LITA after the Core Vote Dear LITA members, We’re writing about the implications of LITA’s budget for the upcoming 2020-21 fiscal year, which starts September 1, 2020. We have reviewed the budget and affirmed that LITA will need to disband if the Core vote does not succeed. 
Since the Great Recession, membership in professional organizations has been declining consistently. LITA has followed the same pattern and as a result, has been running at a deficit for a number of years. Each year, LITA spends more on staff, events, equipment, software, and supplies than it takes in through memberships and event registrations. We were previously able to close our budgets through the use of our net asset balance which is, in effect, like a nest egg for the division. Of course, that could not continue indefinitely. Our path towards sustainability has culminated in the proposal to form Core: Leadership, Infrastructure, Futures. The new division would come with... Boards of ALCTS, LITA and LLAMA put Core on March 2020 ballot The Boards of the Association for Library Collections & Technical Services (ALCTS), Library Information Technology Association (LITA) and the Library Leadership & Management Association (LLAMA) have all voted unanimously to send to members their recommendation that the divisions form a new division, Core: Leadership, Infrastructure, Futures.  ALCTS, LITA and LLAMA will vote on the recommendation during the upcoming American Library Association (ALA) election. If approved by all three memberships, and the ALA Council, the three long-time divisions will end operations on August 31, 2020, and merge into Core on September 1. Members of the three Boards emphasized that Core will continue to support the groups in which members currently find their professional homes while also creating new opportunities to work across traditional division lines. It is also envisioned that Core would strengthen member engagement efforts and provide new career-support services. If one or more of the division memberships do not... Jobs in Information Technology: February 19, 2020 New this week Librarian (Emphasis in User Experience and Technology), Chabot College, Hayward, CA Librarian II (ILS Admin & Tech Services), Duluth Public Library, Duluth, MN Distance Education & Outreach Librarian, Winona State University, Winona, MN Head, Digital Initiatives – Tisch Library, Tufts University, Medford, MA Online Learning and User Experience Librarian, Ast or Asc Professor, SIU Edwardsville, Edwardsville, IL Discovery and Systems Librarian, Hamilton College, Clinton, NY Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Early-bird registration ends March 1st for the Exchange With stimulating programming, including discussion forums and virtual poster sessions, the Exchange will engage a wide range of presenters and participants, facilitating enriching conversations and learning opportunities in a three-day, fully online, virtual forum. Programming includes keynote presentations from Emily Drabinski and Rebekkah Smith Aldrich, and sessions focusing on leadership and change management, continuity and sustainability, and collaborations and cooperative endeavors. The Exchange will take place May 4, 6, and 8. In addition to these sessions, the Exchange will offer lightning rounds and virtual poster sessions. For up-to-date details on sessions, be sure to check the Exchange website as new information is being added regularly. Early-bird registration rates are $199 for ALCTS, LITA, and LLAMA members, $255 for ALA individual members, $289 for non-members, $79 for student members, $475 for groups, and $795 for institutions. Early-bird registration ends March 1. Want to register your group or institution? Groups watching the... 
Jobs in Information Technology: February 13, 2020 New this week Upper School Librarian (PDF), St. Christopher’s School, Richmond, VA Diversity and Engagement Librarian, Ast or Asc Professor, SIU Edwardsville, Edwardsville, IL Repository Services Manager, Washington University, Saint Louis, MO Information Technology Librarian, Albin O. Kuhn Library & Gallery (UMBC), Baltimore, MD Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. LITA Blog Call for Contributors We’re looking for new contributors for the LITA Blog! Do you have just a single idea for a post or a series of posts? No problem! We’re always looking for guest contributors with new ideas. Do you have thoughts and ideas about technology in libraries that you’d like to share with LITA members? Apply to be a regular contributor! If you’re a member of LITA, consider either becoming a regular contributor for the next year or submitting a post or two as a guest. Apply today! Learn the latest in Library UX with this LITA Webinar There’s a seat waiting for you… Register for this LITA webinar today! How to Talk About Library UX – Redux Presenter: Michael Schofield Librarian / Director of Engineering, WhereBy.Us Wednesday, March 11, 2020 12:00 – 1:00 pm Central Time The last time we did this webinar was in 2016 – and a lot’s changed. The goal then was to help establish some practical benchmarks for how to think about the user experience and UX design in libraries, which suffered from a lack of useful vocabulary and concepts: while we might be able to evangelize the importance of UX, LibUXers struggled with translating their championship into the kinds of bureaucratic goals that unlocked real budget for our initiatives. It’s one thing to say, “the patron experience is critical!” It’s another thing to say, “the experience is critical – so pay for OptimalWorkshop, or hire a UX Librarian, or give me a... Joint Working Group on eBooks and Digital Content in Libraries John Klima, the LITA Representative to the Working Group on eBooks and Digital Content, recently agreed to an interview about the latest update from ALA Midwinter 2020. Watch the blog for more updates from John about the Working Group in the coming months! What is the mission and purpose of the Working Group on eBooks and Digital Content? Quoting from the minutes of the ALA Executive Board Fall meeting in October of 2019: [The purpose of this working group is] to address library concerns with publishers and content providers specifically to develop a variety of digital content license models that will allow libraries to provide content more effectively, allowing options to choose between one-at-a-time, metered, and other options to be made at point of sale; to make all content available in print and for which digital variants have been created to make the digital content equally available to libraries without... 2020 Forum Call for Proposals LITA, ALCTS and LLAMA are now accepting proposals for the 2020 Forum, November 19-21 at the Renaissance Baltimore Harborplace Hotel in Baltimore, MD. Intention and Serendipity: Exploration of Ideas through Purposeful and Chance Connections Submission Deadline: March 30, 2020 Our library community is rich in ideas and shared experiences. The 2020 Forum Theme embodies our purpose to share knowledge and gain new insights by exploring ideas through an interactive, hands-on experience. We hope that this Forum can be an inspiration to share, finish, and be a catalyst to implement ideas…together. 
We invite those who choose to lead through their ideas to submit proposals for sessions or preconference workshops, as well as nominate keynote speakers. This is an opportunity to share your ideas or unfinished work, inciting collaboration and advancing the library profession through meaningful dialogue. We encourage diversity in presenters from a wide range of backgrounds, libraries, and experiences. We deliberately... LITA announces the 2020 Excellence in Children’s and Young Adult Science Fiction Notable Lists The LITA Committee Recognizing Excellence in Children’s and Young Adult Science Fiction presents the 2020 Excellence in Children’s and Young Adult Science Fiction Notable Lists. The lists are composed of notable children’s and young adult science fiction published between November 2018 and October 2019 and organized into three age-appropriate categories. The annotated lists will be posted on the website at www.sfnotables.org. The Golden Duck Notable Picture Books List is selected from books intended for pre-school children and very early readers, up to 6 years old. Recognition is given to the author and the illustrator: Field Trip to the Moon by John Hare (Margaret Ferguson Books); Hello by Aiko Ikegami (Creston Books); How to be on the Moon by Viviane Schwarz (Candlewick Press); Out There by Tom Sullivan (Balzer + Bray); The Babysitter From Another Planet by Stephen Savage (Neal Porter Books); The Space Walk by Brian Biggs (Dial Books for Young... Jobs in Information Technology: February 5, 2020 New this week (Tenure-Track) Senior Assistant Librarian, Sonoma State University, Rohnert Park, CA Data Services Librarian for the Sciences, Harvard University, Cambridge, MA Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Teach for LITA: Submit Proposals by February 16 Reminder: The deadline to submit LITA education proposals is February 16th. Please share our CFP with your colleagues. We are seeking instructors passionate about library technology topics to share their expertise and teach a webinar, webinar series, or online course for LITA this year. All topics related to the intersection of technology and libraries are welcomed, including: Machine Learning; IT Project Management; Data Visualization; Javascript, including jquery, json, d3.js; Library-related APIs; Change management in technology; Big Data, High Performance Computing; Python, R, GitHub, OpenRefine, and other programming/coding topics in a library context; Supporting Digital Scholarship/Humanities; Virtual and Augmented Reality; Linked Data; Implementation or Participation in Open Source Technologies or Communities; Open Educational Resources, Creating and Providing Access to Open Ebooks and Other Educational Materials; Managing Technology Training; Diversity/Inclusion and Technology; Accessibility Issues and Library Technology; Technology in Special Libraries; Ethics of Library Technology (e.g., Privacy Concerns, Social Justice Implications); Library/Learning Management... Jobs in Information Technology: January 29, 2020 New this week STEM, Instruction, and Assessment Librarian, McDaniel College, Westminster, MD Data Science/Analysis Research Librarian, Hamilton College, Clinton, NY Electronic Resources Librarian, Brown University, Providence, RI Systems Librarian, Brown University, Providence, RI Head, Technical Services, Brown University, Providence, RI Network and Systems Administrator, St.
Lawrence University, Canton, NY Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Emily Drabinski, Rebekkah Smith Aldrich to deliver keynotes at the Exchange Virtual Forum The Association for Library Collections and Technical Services (ALCTS), the Library Information Technology Association (LITA) and the Library Leadership and Management Association (LLAMA) have announced that Emily Drabinski and Rebekkah Smith Aldrich will deliver keynote addresses at the Exchange Virtual Forum. The theme for the Exchange is “Building the Future Together,” and it will take place on the afternoons of May 4, 6 and 8. Each day has a different focus, with day 1 exploring leadership and change management; day 2 examining continuity and sustainability; and day 3 focusing on collaborations and cooperative endeavors. Drabinski’s keynote will be on May 4, and Smith Aldrich’s will be on May 8.  Emily Drabinski is the Critical Pedagogy Librarian at Mina Rees Library, Graduate Center, City University of New York (CUNY). She is also the liaison to the School of Labor and Urban Studies and other CUNY masters and doctoral programs. Drabinski’s research includes... Jobs in Information Technology: January 22, 2020 New this week Information Technology and Web Services (ITWS) Department Head, Auraria Library, Denver, CO Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Advice for the New Systems Librarian – Building Relationships 2.0 Advice for the New Systems Librarian – Building Relationships, Part 2 Previous articles in this series: Building Relationships, Helpful Resources, A Day in the Life I am at the two-year mark of being in my role as systems librarian at Jacksonville University, and I continue to love what I do. I am working on larger-scale projects and continuing to learn new things every week. There has not been a challenge or new skill to learn yet that I have been afraid of. My first post in this series highlighted groups and departments that may be helpful in learning your new role. Now that I’m a little more seasoned, I have had the opportunity to work with even more departments and individuals at my institution on various projects. Some of these departments may be unique to me, but I would imagine you would find counterparts where you work. The Academic Technology... Jobs in Information Technology: January 15, 2020 New this week Performing and Visual Arts Librarian, Butler University, Indianapolis, IN Librarian, The College of Lake County, Grayslake, IL User Experience (UX) Librarian, UNC Charlotte, J. Murrey Atkins Library, Charlotte, NC Southeast Asia Digital Librarian, Cornell University, Ithaca, NY Head of Digital Infrastructure Services at UConn Library, University of Connecticut, Storrs, CT Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. LITA Education Call for Proposals for 2020 What library technology topics are you passionate about? Have something you can help others learn? LITA invites you to share your expertise with an international audience! Our courses and webinars are based on topics of interest to library technology workers and technology managers at all levels in all types of libraries. Taught by experts, they reach beyond physical conferences to bring high quality continuing education to the library world. 
We deliberately seek and strongly encourage submissions from underrepresented groups, such as women, people of color, the LGBTQA+ community, and people with disabilities. Submit a proposal by February 16th to teach a webinar, webinar series, or online course for Winter/Spring/Summer/Fall 2020. All topics related to the intersection of technology and libraries are welcomed, including: Machine Learning; IT Project Management; Data Visualization; Javascript, including jquery, json, d3.js; Library-related APIs; Change management in technology; Big Data, High Performance Computing; Python, R, GitHub, OpenRefine,... Jobs in Information Technology: January 8, 2020 New this week Web Services & Discovery Manager, American University Library, Washington, DC Senior Research Librarian, Finnegan, Washington, DC Electronic Resources and Discovery Librarian, Auburn University, AL Discovery & Systems Librarian, California State University, Dominguez Hills, Carson, CA Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. UX “don’ts” we still need from Erika Hall The second edition of Erika Hall’s Just Enough Research dropped in October 2019; although this excellent volume was previously unknown to me, I am taking the opportunity now to consume, embody, and evangelize Hall’s approach to user research. Or, as Hall might put it, I’m a willing convert to the gospel of “Enoughening”. Hall is a seasoned design consultant and co-founder of Mule Design Studio, but her commercial approach is tempered by a no-nonsense attitude that makes her solutions and suggestions palatable to a small UX team such as my own at Indiana University Bloomington Libraries. Rather than conduct a formulaic book review of Just Enough Research, I want to highlight some specific things Hall tells the reader not to do in their UX research. This list of five “don’ts” summarizes Hall’s tone, style, and approach. It will also highlight the thesis of the second edition’s brand new chapter on surveys.... Jobs in Information Technology: December 18, 2019 New this week Vice Provost and University Librarian, University of Oregon, Eugene, OR Data Migration Specialist (Telecommuting position), Bywater Solutions, Remote Position Research Librarian, Oak Ridge National Laboratory, Oak Ridge, TN Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Announcing the new LITA eLearning Coordinator We are proud to announce that Kira Litvin will be the new LITA eLearning Coordinator. Litvin has been the Continuing Education Coordinator at the Colorado School of Public Health for the past six months. She provides distance/online learning library services and instruction and works regularly with other librarians, instructional designers, faculty, and educators to collaborate on instructional delivery projects. “I am passionate about being a librarian and working with people in an online environment! For the past nine years I have worked with libraries that are exclusively online. My roles include administering and managing electronic library systems, including Springshare products, and providing virtual reference and instruction to students, faculty and staff. More recently I have transitioned to working as an eLearning Instructional Designer, which means I design and develop instructional content available for asynchronous learning and professional development. As online learning continues to grow, I believe that libraries need to...
Submit a Nomination for 2020 Awards and Scholarships Hugh C. Atkinson Memorial Award The award honors the life and accomplishments of Hugh C. Atkinson by soliciting nominations and recognizing the outstanding accomplishments of an academic librarian who has worked in the areas of library automation or library management and has made contributions (including risk-taking) toward the improvement of library services or to library development or research. Nomination deadline: January 9, 2020 The winner receives a cash award and a plaque. Learn more about the requirements for the Atkinson Memorial Award. Ex Libris Student Writing Award The LITA/Ex Libris Student Writing Award is given for the best unpublished manuscript on a topic in the area of libraries and information technology written by a student or students enrolled in an ALA-accredited library and information studies graduate program. Application deadline: February 28, 2020 The winner receives a $1,000 cash prize and a plaque. Learn more about the requirements for the Ex Libris Student... Submit a Nomination for the Hugh C. Atkinson Memorial Award LITA, ACRL, ALCTS, and LLAMA invite nominations for the 2020 Hugh C. Atkinson Memorial Award. Please submit your nominations by January 9, 2020. The award honors the life and accomplishments of Hugh C. Atkinson by recognizing the outstanding accomplishments of an academic librarian who has worked in the areas of library automation or library management and has made contributions (including risk-taking) toward the improvement of library services or to library development or research. Winners receive a cash award and a plaque. This award is funded by an endowment created by divisional, individual, and vendor contributions given in memory of Hugh C. Atkinson. The nominee must be a librarian employed in one of the following during the year prior to application for this award: a university, college, or community college library; or a non-profit consortium, or a consortium comprised of non-profits, that provides resources/services/support to academic libraries. The nominee must have a minimum... Core Update – 12/12/2019 Greetings again from the Steering Committee of Core: Leadership, Infrastructure, Futures, a proposed division of ALA. Coming up this Friday, December 13, is the last of four town halls we are holding this fall to share information and elicit your input. Please join us! Register for Town Hall 4 today. ALCTS, LITA, and LLAMA division staff will lead this town hall with a focus on Core’s mission, vision, and values; organizational benefits; benefits to members; and opportunities in the future. Our speakers will be Jenny Levine (LITA Executive Director), Julie Reese (ALCTS Deputy Executive Director), and Kerry Ward (LLAMA Executive Director and interim ALCTS Executive Director). We’re excited to share an updated Core proposal document for ALA member feedback and review, strengthened by your input. We invite further comments on this updated proposal through Sunday, December 15. Meanwhile, division staff will incorporate your comments and finalize this proposal document for... Jobs in Information Technology: December 11, 2019 New this week Senior Specialist – Makerspace, Middle Tennessee State University, Walker Library, Murfreesboro, TN User Experience Librarian, Auburn University, Auburn University, AL Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting.
Announcing the new LITA Blog Editor We are proud to announce that Jessica Gilbert Redman will be the new editor of the LITA Blog.  Gilbert Redman has been the web services librarian at the University of North Dakota for the past three years. She coordinates and writes for the library blog and maintains the library website. She has completed a post-graduate certificate in user experience and always seeks to ensure that end users are able to easily find the information they need to complete their research. Additionally, she realizes communication is the key component in any relationship, be it between libraries and their users or between colleagues, and she always strives to make communication easier for all involved. “I am excited to become more involved in LITA, and I think the position of LITA Blog Editor is an excellent way to meet more people within LITA and ALA, and to maintain a finger on the pulse of new... Jobs in Information Technology: December 4, 2019 New this week Digital Discovery Librarian/Assistant Librarian, Miami University, Oxford, OH Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Jobs in Information Technology: November 27, 2019 New this week Web and Digital Scholarship Technologies Librarian, Marquette University Libraries, Milwaukee, WI Digital Access and Metadata Librarian, Marquette University Libraries, Milwaukee, WI Librarian (San Ramon Campus), Contra Costa Community College District, San Ramon, CA Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Support LITA Scholarships this #GivingTuesday It’s almost #GivingTuesday, so we’re highlighting the difference that LITA scholarships can make, and inviting you to join us in increasing access to LITA events by donating to our scholarship fund today. You can help us to provide more scholarships to events like AvramCamp and LITA Forum, as well as sponsor Emerging Leaders, with your donation today! Your donation of $25 could open up untold opportunities for other library technology professionals. “The LITA scholarship afforded me the opportunity to present at the 2019 AvramCamp and ALA conference. It was an incredible opportunity to network with dozens of information professionals, build connections with people in the field, ask them all of my questions and exchange our technical acumen and job experiences. As a result, I have been offered two interviewing opportunities that were an incredibly valuable experience for my career development. I am very grateful to LITA for the opportunity to... Jobs in Information Technology: November 22, 2019 New This Week Metadata Specialist III, Metadata Services, The New York Public Library, New York, NY eResources Librarian, University of Maryland, Baltimore County, Baltimore, MD Multiple Librarian Positions, George Washington University, Washington DC INFORMATION TECHNOLOGY ANALYST, San Mateo County Libraries, San Mateo County, CA Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Call for Blog Coordinator for the Exchange: An ALCTS/LITA/LLAMA Collaboration The Exchange: An ALCTS/LITA/LLAMA Collaboration brings together experiences, ideas, expertise, and individuals from the three ALA divisions. Broadly organized around the theme of “Building the Future Together,” the Exchange will examine the topic in relation to collections, leadership, technology, innovation, sustainability, and collaborations. 
Participants from diverse areas of librarianship will find the three days of presentations, panels, and lightning rounds both thought-provoking and highly relevant to their current and future career paths. The Exchange will engage a wide range of presenters and participants, facilitating enriching conversations and learning opportunities. Divisional members and non-members alike are encouraged to register and bring their questions, experiences, and perspectives to the events. As part of the conference experience, the Exchange plans to host regular blog posts in advance of the conference. Blog posts will serve multiple purposes: generate excitement and interest in content, encourage participation outside of simply watching presentations, and provide an avenue... The Exchange Call for Proposals and Informational Webinar ALCTS, LITA, and LLAMA are now accepting proposals for the Exchange: Building the Future Together, a virtual forum scheduled for May 4, 6, and 8, 2020. The twelve hour virtual event will take place over three afternoons, featuring the following themes and topics: Day 1 – Leadership and Change Management Day 2 – Continuity and Sustainability Day 3 – Collaborations and Cooperative Endeavors Session Formats The Exchange will feature the following session formats: Full-session Proposals Presenters prepare content for a 35-minute session, with an additional 10-minute Q&A period for all presenters. Full-session proposals may include multiple presentations with content that is topically related. Lightning Round Each participant is given five minutes to give a presentation. At the end of the lightning round, there will be a 10-15-minute Q&A period for all presenters in the session. Topics for lightning rounds related to innovative projects or research are encouraged. Proposals will be... Registration is Now Open for the Exchange In May 2020, join ALCTS, LITA, and LLAMA for an exciting and engaging virtual forum. Registration is now open!   The Exchange: An ALCTS/LITA/LLAMA Collaboration brings together experiences, ideas, expertise, and individuals from the three ALA divisions. Broadly organized around the theme of “Building the Future Together,” the Exchange will examine the topic in relation to collections, leadership, technology, innovation, sustainability, and collaborations. Participants from diverse areas of librarianship will find the three days of presentations, panels, and lightning rounds both thought-provoking and highly relevant to their current and future career paths. The Exchange will engage a wide range of presenters and participants, facilitating enriching conversations and learning opportunities. Divisional members and non-members alike are encouraged to register and bring their questions, experiences, and perspectives to the events. “Building on the rich educational traditions of the three divisions, the Exchange provides the opportunity to break down silos and explore synergies... Core Call for Comment Greetings again from the Steering Committee of Core: Leadership, Infrastructure, Futures, a proposed division of ALA. The Steering Committee welcomes comments on the draft division proposal documentation through November 25th. Please join the conversation! Your perspectives and input are shaping the identity and priorities of the proposed division. We’re asking for you to respond to the documents with key questions in mind, including: Does this make sense to someone new to ALCTS/ LITA/ LLAMA? 
Does this piece of the plan reflect how members want the new division to function? Are there any points that are cause for concern? If you’re interested in helping us in the review process or other work ahead, please consider volunteering for Core. We’re eager to collaborate with you! We’re working hard to ensure everyone can participate in the Core conversation, so please let us know what could make Core a compelling and worthy division home for you. Keep the feedback and input coming! Full details for all our upcoming events are... LIS Students: Apply for the 2020 Larew Scholarship for Tuition Help The Library and Information Technology Association (LITA) and Baker & Taylor are accepting applications for the LITA/Christian (Chris) Larew Memorial Scholarship for those who plan to follow a career in library and information technology, demonstrate potential leadership, and hold a strong commitment to library automation. The winner will receive a $3,000 check and a citation. The application form is open through March 1, 2020. Criteria for the Scholarship include previous academic excellence, evidence of leadership potential, and a commitment to a career in library automation and information technology. Candidates should illustrate their qualifications for the scholarship with a statement indicating the nature of their library experience, letters of reference, and a personal statement of their view of what they can bring to the profession. Winners must have been accepted to a Master of Library Science (MLS) program recognized by the American Library Association. References, transcripts, and other documents must be postmarked no... Jobs in Information Technology: November 13, 2019 New This Week Full Time Faculty – Non Tenure Track, SJSU School of Information, San Jose, CA Digital Collections Librarian, Union College, Schenectady, NY Web Services Librarian, University of Oregon Libraries, Eugene, OR GALILEO Programmer/Analyst, University of Georgia Libraries, Athens, GA Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. LITA Opens Call for Innovative LIS Student Writing Award for 2020 The Library and Information Technology Association (LITA), a division of the American Library Association (ALA), is pleased to offer an award for the best unpublished manuscript submitted by a student or students enrolled in an ALA-accredited graduate program. Sponsored by LITA and Ex Libris, the award consists of $1,000, publication in LITA’s refereed journal, Information Technology and Libraries (ITAL), and a certificate. The deadline for submission of the manuscript is February 28, 2020. The award recognizes superior student writing and is intended to enhance the professional development of students. The manuscript can be written on any aspect of libraries and information technology. Examples include, but are not limited to, digital libraries, metadata, authorization and authentication, electronic journals and electronic publishing, open source software, distributed systems and networks, computer security, intellectual property rights, technical standards, desktop applications, online catalogs and bibliographic systems, universal access to technology, and library consortia. To be eligible, applicants must follow these guidelines and fill out the application form (PDF)....
Jobs in Information Technology: November 6, 2019 New This Week Open Educational Resources Production Manager, Oregon State University – Ecampus, Corvallis, OR User Experience Librarian, Northwestern University, Evanston, IL Institute for Clinical and Translational Research (ICTR) Librarian, University of Maryland, Baltimore, Baltimore, MD Director of Collections & Access, Wheaton College, Norton, MA Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Nominate a Colleague Doing Cutting Edge Work in Tech Education for the LITA Library Hi Tech Award Nominations are open for the 2020 LITA/Library Hi Tech Award, which is given each year to an individual or institution for outstanding achievement in educating the profession about cutting edge technology within the field of library and information technology. Sponsored by the Library and Information Technology Association (LITA) and Library Hi Tech, the award includes a citation of merit and a $1,000 stipend provided by Emerald Publishing, publishers of Library Hi Tech. The deadline for nominations is December 31, 2019. The award, given to either a living individual or an institution, may recognize a single seminal work or a body of work created during or continuing into the five years immediately preceding the award year. The body of work need not be limited to published texts but can include course plans or actual courses and/or non-print publications such as visual media. Awards are intended to recognize living persons rather than to honor the deceased; therefore,... Propose a Topic for the ITAL “Public Libraries Leading the Way” Column Information Technology and Libraries (ITAL), the quarterly open-access journal published by ALA’s Library and Information Technology Association, is looking for contributors for its regular “Public Libraries Leading the Way” column. This column highlights a technology-based innovation or approach to problem solving from a public library perspective. Topics we are interested in include the following, but proposals on any other technology topic are welcome: 3-D printing and makerspaces; civic technology; drones; diversity, equity, and inclusion and technology; privacy and cyber-security; virtual and augmented reality; artificial intelligence; big data; the internet of things; robotics; geographic information systems and mapping; library analytics and data-driven services; and anything else related to public libraries and innovations in technology. To propose a topic, use this brief form, which will ask you for three pieces of information: your name; your email address; and a brief (75-150 word) summary of your proposed column that describes your library, the technology you wish to... ALCTS, LITA and LLAMA collaborate for virtual forum The Association for Library Collections & Technical Services (ALCTS), the Library and Information Technology Association (LITA) and the Library Leadership & Management Association (LLAMA) have collaborated to create The Exchange, an interactive, virtual forum designed to bring together experiences, ideas, expertise and individuals from these American Library Association (ALA) divisions. Modeled after the 2017 ALCTS Exchange, the Exchange will be held May 4, May 6 and May 8 in 2020 with the theme “Building the Future Together.” As a fully online interactive forum, the Exchange will give participants the opportunity to share the latest research, trends and developments in collections, leadership, technology, innovation, sustainability and collaborations.
Participants from diverse areas of librarianship will find the three days of presentations, panels and activities both thought-provoking and highly relevant to their current and future career paths. The Exchange will engage an array of presenters and participants, facilitating enriching conversations and learning opportunities.... Submit your 2020 Annual Meeting Request by Feb 7 The LITA meeting request form is now open for the 2020 ALA Annual Conference in Chicago, IL. All LITA committee and interest group chairs should use it to let us know if you plan to meet at Annual. We’re looking forward to seeing what you have planned. The deadline to submit your meeting request is Friday, February 7, 2020. We’re going to change how we’ve listed meetings in the past. If you do NOT submit this form, your group will NOT be included in the list of LITA session on our website, the online scheduler, or the print program. While we’ll still hold the Joint Chairs meeting on Saturday from 8:30-10:00am and use that same room for committee and IG meetings from 10:30-11:30am, your group will only be listed if you submit this form. You should also use it if you want to request a meeting on a different day... Submit a Nomination for the Prestigious Kilgour Technology Research Award LITA and OCLC invite nominations for the 2020 Frederick G. Kilgour Award for Research in Library and Information Technology. Submit your nomination no later than December 31, 2019. The Kilgour Research Award recognizes research relevant to the development of information technologies, in particular research showing promise of having a positive and substantive impact on any aspect of the publication, storage, retrieval, and dissemination of information or how information and data are manipulated and managed. The winner receives $2,000 cash, an award citation, and an expense-paid trip (airfare and two nights lodging) to the 2020 ALA Annual Conference in Chicago, IL. Nominations will be accepted from any member of the American Library Association. Nominating letters must address how the research is relevant to libraries; is creative in its design or methodology; builds on existing research or enhances potential for future exploration; and/or solves an important current problem in the delivery of... Core Update – October 23, 2019 Greetings again from the Steering Committee of Core: Leadership, Infrastructure, Futures, a proposed division of ALA. Thank you for all of your questions and feedback about the proposed new division! The Steering Committee has been revising Core documents based on what we’ve heard from you so far in order to share draft bylaws and other information with you soon. We want you to know that we are continuing to listen and incorporate the feedback you’re providing via Town Halls, Twitter Chats, the Core feedback form, and more.  In our next Steering Committee meeting, we will be discussing how we can support the operational involvement of interested volunteers. If you have ideas on how members should be involved, please share them with us through the feedback form.  We’re working hard to ensure everyone can participate in the Core conversation, so please let us know what could make Core a compelling and worthy division home for... 
Jobs in Information Technology: October 23, 2019 New This Week Metadata & Research Support Specialist, Open Society Research Services, Open Society Foundations, New York, NY Head of Public Services in The Daniel Library, The Citadel, The Military College of South Carolina, Charleston, SC Engineering and Science Liaison, MIT, Cambridge, MA Head of Technical Services – Library, The Citadel, The Military College of South Carolina, Charleston, SC Analyst Programmer 3, Oregon State University Libraries and Press, Corvallis, OR Collection Information Specialist, Isabella Stewart Gardner Museum, Boston, MA Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. Jobs in Information Technology: October 16, 2019 New This Week Metadata Librarian for Distinctive Collections, MIT, Cambridge, MA Electronic Access Librarian, University of Rochester, Rochester, NY Dean, University Libraries, University of Northern Colorado, Greeley, CO Administrative/Metadata Specialist, ASR International Corp., Monterey, CA Core Systems Librarian, University of Oregon Libraries, Eugene, OR Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. September 2019 ITAL Issue Now Available The September 2019 issue of Information Technology and Libraries (ITAL) is available now. In this issue, ITAL Editor Ken Varnum announces six new members of the ITAL Editorial Board. Our content includes a recap of Emily Morton-Owens’ President’s Inaugural Message, “Sustaining LITA“, discussing the many ways LITA strives to provide a sustainable member organization. In this edition of our “Public Libraries Leading the Way” series, Thomas Lamanna discusses ways libraries can utilize their current resources and provide ideas on how to maximize effectiveness and roll new technologies into operations in “On Educating Patrons on Privacy and Maximizing Library Resources.“ Featured Articles: “Library-Authored Web Content and the Need for Content Strategy,” Courtney McDonald and Heidi Burkhardt Increasingly sophisticated content management systems (CMS) allow librarians to publish content via the web and within the private domain of institutional learning management systems. “Libraries as publishers”may bring to mind roles in scholarly communication and... Jobs in Information Technology: October 9, 2019 New This Week Information Research Specialist, Harvard Business School, Boston, MA 2020-2021 Library Residency Program (Provost’s Postdoctoral Fellowship), New York University, Division of Libraries, New York, NY Executive Director, Library Connection, Inc, Windsor, CT Associate University Librarian, Cornell University, Ithaca, NY Visit the LITA Jobs Site for additional job listings and information on submitting your own job posting. New vacancy listings are posted on Wednesday afternoons. Latest LITA Learnings There’s a seat waiting for you… Register today for a LITA webinar! Guiding Students through Digital Citizenship Presenter: Casey Davis Instructional Designer (IT), Arizona State University Wednesday, October 16, 2019 12:00 – 1:30 pm Central Time As academic librarians, we help build our students into digital citizens. It’s our duty to make sure students have the tools and resources to be savvy tech users, become information literate, and understand the permanence of their digital actions. 
In this 90-minute webinar, you’ll learn research-based best practices you can implement using the framework of the hero’s journey without creating an additional burden on faculty, staff, and students. Learning objectives for this program include: • An expanded understanding of digital citizenship within the context of college/university life • Examining areas where increased awareness and practice is needed within the college/university community • Creating authentic training for increasing digital citizenship within the college/university community View details and Register here. In-House vs.... litablog-org-6683 ---- LITA Blog – Empowering libraries through technology LITA Blog Empowering libraries through technology Toggle navigation About Regular Contributors Get Involved! Join LITA LITA Jobs Jobs in Information Technology: August 25, 2020 August 25, 2020August 28, 2020| Jenny Levine New This Week Coordinator of Digital Scholarship and Programs, Marquette University Libraries, Milwaukee WI Digital Scholarship Coordinator, UNC Charlotte, Charlotte, NC Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Continue Reading LITA Jobs Jobs in Information Technology: August 13, 2020 August 13, 2020August 13, 2020| Jenny Levine New This Week Information Systems Manager (PDF), The Community Library Association, Ketchum, ID Children’s Librarian, Buhl Public Library, Buhl, ID Technology Integration Librarian, Drexel University Libraries, Philadelphia, PA Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Continue Reading Core Update Your Core Community Update August 4, 2020| Chrishelle Thomas Much has been happening behind-the-scenes to prepare for Core’s upcoming launch on September 1st, so we want to update you on the progress we’ve made. At the 2020 ALA Virtual Conference Council meetings, the ALA Council approved the creation of Core, so we’re official! It’s been a difficult summer for everyone given the global situation, but this was a milestone we’re excited to reach. What We’ve Been Doing In May, the Core Transition Committee (the 9 division presidents plus senior staff) formed 11 working groups of members from all 3 divisions to make recommendations about how to proceed with our awards/scholarships, budget/finance, committees, communications, conference programming, continuing education, fundraising/sponsorships, interest groups, member engagement, nominations for 2021 president-elect, publications, and standards. These groups have done an amazing amount of work in a very short time period, and we’re grateful to these members for their commitment and effort. We’re happy to report… Continue Reading Education Free LITA Webinar ~ Library Tech Response to Covid-19 ~ August 5th July 31, 2020| Chrishelle Thomas Sign up for this free LITA webinar: Library Tech Response to Covid-19 Libraries are taking the necessary precautions to create a safe environment during the pandemic. Social distancing isn’t the only solution, but providing access to loanable technologies, including handling and quarantine of equipment, cleaning, and other safety and health concerns are just some of the measures put in place. With the ongoing disruption to library services caused by COVID-19, what reopening planning policies should be considered for usage? In this free 90-minute presentation, our presenters will share tips that might be helpful to other librarians before they reopen. 
The presenters will also talk about the evolution of the phased plan from the establishment of a temporary computer lab in the library as COVID-19 began to spread in March 2020, to the current phased approach for gradual reopening. Justin will also offer insight into managed access, technology and services, workflows, messaging,… Continue Reading LITA Jobs Jobs in Information Technology: July 29, 2020 July 29, 2020| Jenny Levine New This Week Library Director, Walpole Town Library, Walpole, NH Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Continue Reading Program Planning Core Call for 2021 ALA Annual Program Proposals July 24, 2020| Chrishelle Thomas Submit an ALA 2021 Annual Conference program proposal for ALA’s newest division, Core: Leadership, Infrastructure, Futures, which will begin on September 1, 2020. Proposals are due September 30, 2020, and you don’t need to be a Core member to submit a proposal. Submit your idea using this proposal form. Core welcomes topics of interest to a wide range of library professionals in many different areas, including… 1. Access and Equity Advocacy in areas such as copyright, equity of access, open access, net neutrality, and privacy Preservation Week Equity, diversity, and inclusion, both within the division and the profession, as related to Core’s subject areas 2. Assessment Emphasizing the role of assessment in demonstrating the impacts of libraries or library services Assessment tools, methods, guidelines, standards, and policies and procedures 3. Leadership and Management Developing leaders at every level Best practices for inclusion by using an equity lens to examine leadership… Continue Reading Education Core Call for Webinar Proposals July 16, 2020| Chrishelle Thomas Submit a webinar proposal for ALA’s newest division, Core: Leadership, Infrastructure, Futures, which will begin on September 1, 2020. Proposals are due September 1, 2020, and you don’t need to be a Core member to submit a proposal. Early submissions are encouraged and will be considered for September and October presentations. Submit your idea using this proposal form. Core webinars reach a wide range of library professionals in many different areas, including… 1. Access and Equity Advocacy in areas such as copyright, equity of access, open access, net neutrality, and privacy Preservation Week Equity, diversity, and inclusion, both within the division and the profession, as related to Core’s subject areas 2. Assessment Emphasizing the role of assessment in demonstrating the impacts of libraries or library services Assessment tools, methods, guidelines, standards, and policies and procedures 3. Leadership Developing leaders at every level Best practices for inclusion by using an equity lens to examine… Continue Reading Core Virtual Forum Core Virtual Forum is excited to announce our 2020 Keynote Speakers! July 13, 2020July 13, 2020| Chrishelle Thomas Core Virtual Forum welcomes our 2020 Keynote speakers, Dr. Meredith D. Clark and Sofia Leung! Both speakers embody our theme in leading through their ideas and are catalysts for change to empower our community and move the library profession forward. Dr. Clark is a journalist and Assistant Professor in Media Studies at the University of Virginia. She is Academic Lead for Documenting the Now II, funded by the Andrew W. Mellon Foundation. Dr. 
Clark develops new scholarship on teaching students about digital archiving and community-based archives from a media studies perspective. She will be a 2020-2021 fellow with Data & Society. She is a faculty affiliate at the Center on Digital Culture and Society at the University of Pennsylvania. And, she sits on the advisory boards for Project Information Literacy, and for the Center for Critical Race and Digital Studies at New York University. Clark is an in-demand media consultant… Continue Reading ITAL Catch up on the June 2020 Issue of Information Technology and Libraries July 8, 2020| Chrishelle Thomas The June 2020 issue of Information Technology and Libraries (ITAL) was published on June 15. Editor Ken Varnum and LITA President Emily Morton-Owens reflect on the past three months in their Letter from the Editor, A Blank Page, and LITA President’s Message, A Framework for Member Success, respectively. Kevin Ford is the author of this issue’s “Editorial Board Thoughts” column, Seeing through Vocabularies. Rounding out our editorial section, the June “Public Libraries Leading the Way” section offers two items. Chuck McAndrew of the Lebanon (New Hampshire) Public Libraries describes his leadership in the IMLS-funded LibraryVPN project. Melody Friedenthal, of the Worcester (Massachusetts) Public Library talks about how she approached and teaches an Intro to Coding Using Python course. Peer-reviewed Content Virtual Reality as a Tool for Student Orientation in Distance Education Programs: A Study of New Library and Information Science Students Dr. Sandra Valenti, Brady Lund, Ting Wang Virtual reality… Continue Reading LITA Jobs Jobs in Information Technology: July 8, 2020 July 8, 2020July 9, 2020| Jenny Levine New This Week Dean of Libraries, San Jose State University, San Jose, CA Deputy Library Director, City of Carlsbad, Carlsbad, CA Visit the LITA Jobs Site for additional job openings and information on submitting your own job posting. Continue Reading Posts navigation Older posts Upcoming Events Bibliometrics for Librarians Presented by Phillip Doehle and Clarke Lakovakis on July 9, 2020 – July 30, 2020 Virtual Reality, Augmented Reality, Mixed Reality and the Academic Library Presenters: Dr. Plamen Miltenoff and Mark Gill Offered: August 6, 2020 – August 27, 2020 Core Virtual Forum Visit our website for the latest updates on the Core Virtual Forum in Fall 2020. 
literarymachin-es-37 ---- literary machines - digital libraries, books, archives 05 Jul 2020 Archiviiify A short guide to downloading digitized books from the Internet Archive and rehosting them on your own infrastructure using IIIF with full-text search. 31 Jan 2018 pywb 2.0 - docker quickstart Four years have passed since I first wrote about pywb: it was a young tool at the time, but already usable and extremely simple to deploy. Since then a lot of work has been done by Ilya Kreymer (and others), resulting in all the new features available with the 2.0 release. Also, some very big web archiving initiatives have moved to and used pywb over these years: Webrecorder itself, Rhizome, Perma, Arquivo PT in Portugal, the Italian National Library in Florence (Italy), and others I’m missing. 05 Oct 2017 Anonymous webarchiving Web archiving activities, like any other activity where an HTTP client is involved, leave marks of their steps: the web server you are visiting or crawling will save your IP address in its logs (or, even worse, it can decide to ban your IP). This is usually not a problem; there are plenty of good reasons for a web server to keep logs of its visitors. But sometimes you may need to protect your own identity when you are visiting or saving something from a website, and there are a lot of sensitive professions that need this protection: activists, journalists, political dissidents. TOR was invented for this, and today it offers good protection for browsing the web anonymously. Can we also archive the web through TOR? 03 Sep 2016 Open BNI On 30 May 2016 the free release of the Bibliografia Nazionale Italiana (BNI) was announced. The opening of this catalogue is welcome (even with the limitation of PDF-only files), and as a layman in library science I also ask a question about the actual use case of the BNI. On 30 August 2016 the release of the 2015 and 2016 volumes in UNIMARC and MARCXML formats was announced as well. Intrigued by the catalogue, I start exploring it, to think about possible transformations (RDF triples) or enrichments with/towards other data (Wikidata). 03 Mar 2015 Epub linkrot Linkrot also affects epub files (who would have thought! :)). How to check the health of external links in epub books (required tools: a shell, atool, pup, GNU parallel). 26 Feb 2015 SKOS Nuovo Soggettario, API and autocomplete How to build an API for an autocomplete form using the terms of the Nuovo Soggettario, with Redis Sorted Sets and Nginx+Lua (see the sketch below). 23 Nov 2014 Serve deepzoom images from a zip archive with openseadragon vips is a fast image processing system. Versions higher than 7.40 can generate static tiles of big images in DeepZoom format, saving them directly into a zip archive.
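A minimal pyvips sketch of that tiles-into-a-zip step, assuming libvips 7.40 or newer with its Python binding installed; the file names are illustrative, not taken from the post:

```python
# sketch only: generate DeepZoom tiles and write them straight into a
# zip container rather than thousands of loose tile files on disk
import pyvips

image = pyvips.Image.new_from_file("scan.tif", access="sequential")
image.dzsave("scan_tiles", container="zip", suffix=".jpg")
```

How the zipped tiles are then served to OpenSeadragon is what the post itself goes on to cover.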
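Similarly, a minimal redis-py sketch of the Sorted Set prefix lookup behind the autocomplete entry a couple of items above; it illustrates only the Redis side (not the post's Nginx+Lua API), and the sample terms are hypothetical:

```python
# sketch only: load vocabulary terms into a sorted set with score 0,
# then use a lexicographic range query as a prefix search
import redis

r = redis.Redis()
r.zadd("soggettario:terms", {"biblioteca": 0, "biblioteconomia": 0, "bibliografia": 0})

def autocomplete(prefix: str, limit: int = 10):
    lo = b"[" + prefix.encode("utf-8")
    hi = b"[" + prefix.encode("utf-8") + b"\xff"  # 0xff sorts after any valid UTF-8 byte
    return r.zrangebylex("soggettario:terms", lo, hi, start=0, num=limit)

print(autocomplete("bibliot"))  # -> [b'biblioteca', b'biblioteconomia']
```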
23 Oct 2014 a wayback machine (pywb) on a cheap, shared host For a long time the only free implementation of web archival replay software (I’m unaware of commercial ones) has been the Wayback Machine (now OpenWayback). It’s a stable and mature piece of software, with a strong community behind it. To use it you need to be comfortable deploying a Java web application; not so difficult, and the documentation is exhaustive. But there is a new player in the game, pywb, developed by Ilya Kreymer, a former Internet Archive developer. Built in Python, relatively simpler than Wayback, and now used in a professional archiving project at Rhizome. 22 Sep 2014 Opendata dell’Anagrafe Biblioteche How to use the open data of the Anagrafe delle Biblioteche Italiane and plot library addresses on a web map. 05 Sep 2014 api json dell’opac SBN A few months ago ICCU released a mobile app for searching the OPAC SBN. Although not very attractive graphically, the app works well, and I find the ability to search for a book by scanning its barcode with the phone camera, and to bookmark favourites, very useful. Curious about how it works, I decided to analyse its HTTP traffic. literarymachin-es-7470 ---- literary machines - digital libraries, books, archives
lostrses-github-io-4979 ---- AHA! | An Arts & Humanities Adventure You are a researcher in the Classics department. As part of your current research project, you have become interested in the life of a woman called Fabrica Collaborare, who lived in Roman Britain. There’s not much written specifically about Fabrica, but you have seen her name mentioned in several texts from that time. You are not looking forward to the task of having to look at lots more texts to find out where Fabrica - and the Collaborare family - are mentioned. On your way out of the library to get a cup of coffee, you meet your colleague Priya, and tell her about your problem. She tells you about a group at the university who might be able to help. You haven’t heard of the RSE team before: Priya tells you that ‘RSE’ stands for Research Software Engineering, and that their office is in room 20.21. Go to room 20.21 maisonbisson-com-2871 ---- MaisonBisson a bunch of stuff I would have emailed you about Every journalist Ryu Spaeth on the dirty job of journalism: [E]very journalist […] at some point will have to face the morally indefensible way we go about our business: namely, using other people to tell a story about the world. Not everyone dupes their subjects into trusting them, but absolutely everyone robs other people of their stories to tell their own. Every journalist knows this flushed feeling, a mix of triumph and guilt, of securing the story that will redound glory unto them, not the subject. Some subjects who have no outlet, who are voiceless, approve of this arrangement, since they have no other way of getting their story heard. But even they will not wholly recognize their own depiction in the newspaper, by virtue of the fact that it was told by someone else with their own agenda. This is what Jonathan Franzen has called the “inescapable shame of being a storyteller”—that it involves stealing from another person, much in the way some people believe a photograph steals a bit of the sitter’s soul.
Casey Bisson on #journalism, #reporting, #storytelling, 1 Dec 2020The three tribes of the internet Authors Primavera De Filippi, Juan Ortiz Freuler, and Joshua Tan outline three competing narratives that have shaped the internet: libertarian, corporate, and nationalist. “[These narratives] emerged from a community of shared interests; each calls for a set of institutional arrangements; each endures in today’s politics.” » about 400 words Casey Bisson on #Internet, #Hyperspace, #Law, #Governance, #Libertarian, #Corporate, #Nationalist, #Berkman Klein Center, #Harvard Berkman Center, 30 Nov 2020Happy D.B. Cooper Day D.B. Cooper day is celebrated on this day, the Saturday following Thanksgiving, every year. Casey Bisson on #Agent Smith, #Aircraft hijacking, #Aviation accidents and incidents, #D.B. Cooper, #FBI, #Federal Bureau of Investigation, #festival, #Hijackers, #hijacking, #mysteries, #skyjacking, 28 Nov 2020Vitaminwater's #nophoneforayear contest Back in the before times, Vitaminwater invited applicants to a contest to go a full year without a smartphone or tablet. It was partly in response to rising concerns over the effect of all those alerts on our brains. Over 100,000 people clamored for the chance, but author Elana A. Mugdan’s entry stood out with an amusing video, and in February 2019 the company took away her iPhone 5s and handed her a Kyocera flip phone. » about 600 words Casey Bisson on #Vitaminwater, #nophoneforayear, #scrollfreeforayear, #smartphones, #ethical technology, #humane technology, 22 Nov 2020Membership-driven news media From The Membership Guide’s handbook/manifesto: Journalism is facing both a trust crisis and a sustainability crisis. Membership answers to both. It is a social contract between a news organization and its members in which members give their time, money, energy, expertise, and connections to support a cause that they believe in. In exchange, the news organization offers transparency and opportunities to meaningfully contribute to both the sustainability and impact of the organization. Elsewhere it continues: Membership is not subscription by another name, nor a brand campaign that can be toggled on and off. …and: Memberful routines are workflows that connect audience members to journalism and the people producing it. Routines are the basis for a strong membership strategy. Notice that audience members are specified here, which is likely a wider group than your members. Casey Bisson on #membership, #journalism, #monetization, #publishers, #news organizations, #media, 23 Oct 2020Political bias in social media algorithms and media monetization models New reports reveal yet more structural political biases in consumption and monetization models. » about 300 wordsCasey Bisson on #Politics, #Media, #Algorithms, #Monetization, #Bias, #Journalism, #Social media, #News organizations, 22 Oct 2020Media monetization vs. internet advertising Media face structural, regulatory, and technical hurdles to effectively monetizing with ads on the internet, but there are some solutions that are working. 
» about 1000 words Casey Bisson on #advertising, #ads, #media monetization, #monetization models, #media, #journalism, #news organizations, 14 Aug 2020The argument against likes: aim for deeper, more genuine interactions Sweet Pea on the state of social media and dating apps: “We are not creating a healthy society when we’re telling millions of young people that the key to happy relationships is photo worthy of an impulsive right swipe.” » about 800 words Casey Bisson on #likes, #social media, #dating apps, #social software, #signal, 8 Aug 2020Paid reactions: virtual awards and tipping Reddit and Twitch both allow members to pay for the privilege of reacting to other member's content with special awards or stickers. » about 600 words Casey Bisson on #social media, #reactions, #paid reactions, #virtual awards, #tipping, #revenue, #Reddit, #Twitch, 7 Aug 2020Reactions Facebook introduced reactions with an emphasis on both the nuance they enabled and the mobile convenience: “[I]f you are sharing something that is sad [...] it might not feel comfortable to Like that post.” Later: “Commenting might afford nuanced responses, but composing those responses on a [mobile] keypad takes too much time.” » about 800 words Casey Bisson on #reactions, #likes, #social media, #Facebook, #Instagram, 6 Aug 2020“Likes” vs. “Faves” Twitter switched from Faves to Likes in 2015. “You might like a lot of things, but not everything can be your *favorite*,” they explained. Weeks after the change, liking activity for existing users was up 6% and 9% for new users. » about 500 words Casey Bisson on #Likes, #Faves, #social media, #Twitter, #Facebook, #microcopy, 5 Aug 2020Honey cocktails: eau de lavender Liquor.com’s recipe for eau de lavender, from a larger collection of cocktails with honey. They all look and sound delightful, but I can vouch for the eau de lavender. Ingredients 1 1/2 oz Tequila 3/4 oz Fresh lemon juice 3/4 oz Honey syrup1 1 Egg white 1 dash Scrappy’s lavender bitters Garnish: Lavender sprig Steps Add all ingredients into a shaker and dry-shake (without ice). Add ice and shake again to emulsify thoroughly. Strain into a chilled coupe glass. Garnish with a lavender sprig. Honey syrup: Add 1/2 cup honey and 1/2 cup water to a small saucepan over medium heat. (You can experiment and decide how much of a honey flavor you want in your syrup. The more honey you use, the thicker the syrup and stronger in flavor it will be.) Stir until blended. Strain into a jar and seal tightly with a lid. Will keep for 1 month in the refrigerator. ↩︎ Casey Bisson on #cocktails, #mixology, #honey, 12 May 2020Satellite tracking If you’re not reading Skyriddles blog, then you’re not tracking the sky above. And you might have missed the re-discovery of a satellite launched in 1967 and lost for nearly 50 years. As it turns out, there’s a lot of stuff that’s been forgotten up there, and quite a bit that some are trying to hide. The blog is an entertaining view into the world satellites, including communication, spy, weather, research, and the occasional probe going further afield. Casey Bisson on #satellite tracking, #space, 19 Apr 2020I'm missing restaurants now @nakedlunchsf was notable for having both a strong contender for the best burger in the city, _and_... Casey Bisson on #photo, #photoblog, #stayhome, #supportlocalbusiness, 24 Mar 2020When unzip fails on macOS with UTF8 unzip can fail on macOS when UTF-8 chars are in the archive. The solution is to use ditto. 
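A minimal Python sketch of that workaround, wrapping the same ditto invocation the post quotes from a GitHub issue just below; the archive and destination names are the post's own placeholders:

```python
# sketch only: shell out to macOS's ditto, which handles UTF-8 member
# names that the stock unzip rejects
import subprocess

def extract_zip(archive: str, destination: str) -> None:
    subprocess.run(
        ["ditto", "-V", "-x", "-k", "--sequesterRsrc", "--rsrc",
         archive, destination],
        check=True,  # raise CalledProcessError if ditto fails
    )

extract_zip("FILENAME.ZIP", "DESTINATIONDIRECTORY")
```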
Via a Github issue: ditto -V -x -k --sequesterRsrc --rsrc FILENAME.ZIP DESTINATIONDIRECTORY Casey Bisson on #zip, #unzip, #macOS, #utf8, 4 Feb 2020TikTok vs. Instagram Zuckerberg describes TikTok as “almost like the Explore Tab that we have on Instagram,” but Connie Chan suggests he's missing the deeper value of AI, and TechCrunch's Josh Constantine suggests Zuck is missing the bigger difference in intent on TikTok. » about 400 words Casey Bisson on #TikTok, #Instagram, #social media, #social software, #social networks, #social signals, #artificial intelligence, #AI, 3 Jan 2020Swipegram template Benjamin Lee’s instructions and downloadable template to make panoramic carousel Instagrams (AKA #swipegram), as illustrated via his animation above. » about 100 words Casey Bisson on #instagram, #template, #swipegram, 29 Dec 2019“It is clear that the books owned the shop... “It is clear that the books owned the shop rather than the other way about. Everywhere they... Casey Bisson on #photo, #photoblog, #lovemaine, #portlandmaine, #mustbevancouver, #penderstreet, #downtownvancouver, 1 Dec 2019“Life is like riding a bicycle... “Life is like riding a bicycle. To keep your balance, you must keep moving.” —wisdom by Albert... Casey Bisson on #photo, #photoblog, #forahappymoment, #voreskbh, #visitcopenhagen, #buyfilmnotmegapixels, #ig_denmark, #fujipro400h, #ishootfilm, #travelog, #filmisnotdead, #visitdenmark, #mytinyatlas, #pro400h, #fuji, #believeinfilm, #københavn, #analoguepeople, #instapassport, #staybrokeshootfilm, #hasselblad, #igerscopenhagen, #flashesofdelight, #exploringtheglobe, 8 Nov 2019Notes about Spotify creator features Spotify often gets bashed by top creators. The service pays just $0.00397 per stream, but with 108 million users listening to an average of 25 hours per month, those streams can add up for creators who can get the listener’s attention. Spotify verifies artists who then get additional benefits on the platform. Some artists find success the traditional route, some optimize their work for the system, others work the system…and some really work it. Relevance to other network/aggregation platforms: tiny payments add up, and given a platform, creators will find a way to get and maximize value from it. The critical component is customers. Casey Bisson on #Spotify, #creators, #social networks, #revenue, #aggregation, 3 Nov 2019ExifTool examples I use for encoding analog camera details I’m a stickler for detail and love to add exif metadata for my film cameras to my scanned images. These are my notes to self about the data I use most often. I only wish exif had fields to record the film details too. » about 400 wordsCasey Bisson on #exiftool, #photography, #exif, #metadata, 3 Nov 2019Random notes on Instagram Delete your old photos, rebrand your page, and delete it entirely are all common advice. Plus some tools and traps to be aware of. » about 600 words Casey Bisson on #Instagram, #social media, #photography, 17 Oct 2019Every media has its tastemakers and influencers Every media, network, or platform has would-be influencers or promoters who can help connect consumers with creators. Don’t mistake the value of these tastemakers, and be sure to find a place for them to create new value for your platform. 
» about 400 wordsCasey Bisson on #Spotify, #Instagram, #social media, #social networks, #influencers, #tastemakers, 15 Oct 2019Storehouse: the most wonderful story sharing flop ever Storehouse shuttered in summer 2016, just a couple years after they launched, but the app and website introduced or made beautiful a few features that remain interesting now. » about 400 wordsCasey Bisson on #Storehouse, #photo sharing, #story sharing, #microblogging, #blogging, #social media, #user-generated content, #ugc, 13 Oct 2019Page 1 of 112 Older Posts →MaisonBisson managemetadata-com-4642 ---- Metadata Matters Metadata Matters It's all about the services It’s not just me that’s getting old Having just celebrated (?) another birthday at the tail end of 2015, the topics of age and change have been even more on my mind than usual. And then two events converged. First I had a chat with Ted Fons in a hallway at Midwinter, and he asked about using an older article I’d published […] Denying the Non-English Speaking World Not long ago I encountered the analysis of BibFrame published by Rob Sanderson with contributions by a group of well-known librarians. It’s a pretty impressive document–well organized and clearly referenced. But in fact there’s also a significant amount of personal opinion in it, the nature of which is somewhat masked by the references to others […] Review of: DRAFT Principles for Evaluating Metadata Standards Metadata standards is a huge topic and evaluation a difficult task, one I’ve been involved in for quite a while. So I was pretty excited when I saw the link for “DRAFT Principles for Evaluating Metadata Standards”, but after reading it? Not so much. If we’re talking about “principles” in the sense of ‘stating-the-obvious-as-a-first-step’, well, […] The Jane-athons continue! The Jane-athon series is alive, well, and expanding its original vision. I wrote about the first ‘official’ Jane-athon earlier this year, after the first event at Midwinter 2015. Since then the excitement generated at the first one has spawned others: the Ag-athon in the UK in May 2015, sponsored by CILIP the Maurice Dance in […] Separating ideology, politics and utility Those of you who pay attention to politics (no matter where you are) are very likely to be shaking your head over candidates, results or policy. It’s a never ending source of frustration and/or entertainment here in the U.S., and I’ve noticed that the commentators seem to be focusing in on issues of ideology and […] Semantic Versioning and Vocabularies A decade ago, when the Open Metadata Registry (OMR) was just being developed as the NSDL Registry, the vocabulary world was a very different place than it is today. At that point we were tightly focussed on SKOS (not fully cooked at that point, but Jon was on the WG that was developing it, so […] Five Star Vocabulary Use Most of us in the library and cultural heritage communities interested in metadata are well aware of Tim Berners-Lee’s five star ratings for linked open data (in fact, some of us actually have the mug). The five star rating for LOD, intended to encourage us to follow five basic rules for linked data is useful, […] What do we mean when we talk about ‘meaning’? Over the past weekend I participated in a Twitter conversation on the topic of meaning, data, transformation and packaging. The conversation is too long to repost here, but looking from July 11-12 for @metadata_maven should pick most of it up. 
Aside from my usual frustration at the message limitations in Twitter, there seemed to be […] Fresh From ALA, What’s New? In the old days, when I was on MARBI as liaison for AALL, I used to write a fairly detailed report, and after that wrote it up for my Cornell colleagues. The gist of those reports was to describe what happened, and if there might be implications to consider from the decisions. I don’t propose […] What’s up with this Jane-athon stuff? The RDA Development Team started talking about developing training for the ‘new’ RDA, with a focus on the vocabularies, in the fall of 2014. We had some notion of what we didn’t want to do: we didn’t want yet another ‘sage on the stage’ event, we wanted to re-purpose the ‘hackathon’ model from a software […] managemetadata-com-643 ---- Metadata Matters | It's all about the services Pagetitle: Metadata Matters It's all about the services Blog About Archives Log in Schnellnavigation: Jump to start of page | Jump to posts | Jump to navigation It’s not just me that’s getting old Having just celebrated (?) another birthday at the tail end of 2015, the topics of age and change have been even more on my mind than usual. And then two events converged. First I had a chat with Ted Fons in a hallway at Midwinter, and he asked about using an older article I’d published with Karen Coyle way back in early 2007 (“Resource Description and Access (RDA): Cataloging Rules for the 20th Century”). The second thing was a message from Research Gate that reported that the article in question was easily the most popular thing I’d ever published. My big worry in terms of having Ted use that article was that RDA had experienced several sea changes in the nine (!) years since the article was published (Jan./Feb. 2007), so I cautioned Ted about using it. Then I decided I needed to reread the article and see whether I had spoken too soon. The historic rationale holds up very well, but it’s important to note that at the time that article was written, the JSC (now the RSC) was foundering, reluctant to make the needed changes to cut ties to AACR2. The quotes from the CC:DA illustrate how deep the frustration was at that time. There was a real turning point looming for RDA, and I’d like to believe that the article pushed a lot of people to be less conservative and more emboldened to look beyond the cataloger tradition. In April of 2007, a mere few months from when this article came out, ALA Publishing arranged for the famous “London Meeting” that changed the course of RDA. Gordon Dunsire and I were at that meeting–in fact it was the first time we met. I didn’t even know much about him aside from his article in the same DLIB issue. As it turns out, the RDA article was elevated to the top spot, thus stealing some of his thunder, so he wasn’t very happy with me. The decision made in London to allow DCMI to participate by building the vocabularies was a game changer, and Gordon and I were named co-chairs of a Task Group to manage that process. So as I re-read the article, I realized that the most important bits at the time are probably mostly of historical interest at this point. I think the most important takeaway is that RDA has come a very long way since 2007, and in some significant ways is now leading the pack in terms of its model and vocabulary management policies (more about that to come). And I still like the title! …even though it’s no longer a true description of the 21st Century RDA. 
By Diane Hillmann, February 9, 2016, 9:19 am (UTC-5) RDA, Uncategorized Post a comment Denying the Non-English Speaking World Not long ago I encountered the analysis of BibFrame published by Rob Sanderson with contributions by a group of well-known librarians. It’s a pretty impressive document–well organized and clearly referenced. But in fact there’s also a significant amount of personal opinion in it, the nature of which is somewhat masked by the references to others holding the same opinion. I have a real concern about some of those points where an assertion of ‘best practices’ are particularly arguable. The one that sticks in my craw particularly shows up in section 2.2.5: 2.2.5 Use Natural Keys in URIs References: [manning], [ldbook], [gld-bp], [cooluris] Although the client must treat URIs as opaque strings, it is good practice to construct URIs in a systematic and human readable fashion for both instances and ontology terms. A natural key is one that appears in the information about the resource, such as some unique identifier for the resource, or the label of the property for ontology terms. While the machine does not care about structure, memorability or readability of URIs, the developers that write the code do. Completely random URIs introduce difficult to detect semantic and algorithmic errors in both publication and consumption of the data. Analysis: The use of natural keys is a strength of BIBFRAME, compared to similarly scoped efforts in similar communities such as the RDA and CIDOC-CRM vocabularies which use completely opaque numbers such as P10001 (hasRespondent) or E33 (Linguistic Entity). RDA further misses the target in this area by going on to define multiple URIs for each term with language tagged labels in the URI, such as rda:hasRespondent.en mapping to P10001. This is a different predicate from the numerical version, and using owl:sameAs to connect the two just makes everyone’s lives more difficult unnecessarily. In general, labels for the predicates and classes should be provided in the ontology document, along with thorough and understandable descriptions in multiple languages, not in the URI structure. This sounds fine so long as you accept the idea that ‘natural’ means English, because, of course, all developers, no matter their first language, must be fluent enough in English to work with English-only standards and applications. This mis-use of ‘natural’ reminds me of other problematic usages, such as the former practice in the adoption community (of which I have been a part for 40 years) where ‘natural’ was routinely used to refer to birth parents, thus relegating adoptive parents to the ‘un-natural’ realm. So in this case, if ‘natural’ means English, are all other languages inherently un-natural in the world of development? The library world has been dominated by the ‘Anglo-American’ notions of standard practice for a very long time, and happily, RDA is leading away from that, both in governance and in development of vocabularies and tools. The Multilingual strategy adopted by RDA is based on the following points: More than a decade of managing vocabularies has convinced us that opaque identifiers are extremely valuable for managing URIs, because they need not be changed as labels change (only as definitions change). The kinds of ‘churn’ we saw in the original version of RDA (2008-2013) convinced us that label-based URIs were a significant problem (and cost) that became worse as the vocabularies grew over time. 
We get the argument that opaque URIs are often difficult for humans to use–but the tools we’re building (the RDA Registry as case in point) are intended to give human developers what they want for their tasks (human readable URIs, in a variety of languages) but ensure that the URIs for properties and values are set up based on what machines need. In this way, changes in the lexical URIs (human-readable) can be maintained properly without costly change in the canonical URIs that travel with the data content itself. The multiple language translations (and distributed translation management by language communities) also enable humans to build discovery and display mechanisms for users that are speakers of a variety of languages. This has been a particularly important value for national libraries outside the US, but also potentially for libraries in the US meeting the needs of non-English language communities closer to home. It’s too easy for the English-first library development community to insist that URIs be readable in English and to turn a blind eye to the degree that this imposes understanding of the English language and Anglo-American library culture on the rest of the world. This is not automatically the intellectual gift that the distributors of that culture assume it to be. It shouldn’t be necessary for non-Anglo-American catalogers to learn and understand Anglo-American language and culture in order to express metadata for a non-Anglo audience. This is the rough equivalent of the Philadelphia cheese steak vendor who put up a sign reading “This is America. When ordering speak in English”. We understand that for English-speaking developers bibframe.org/vocab/title is initially easier to use than rdaregistry.info/Elements/w/P10088 or even (heaven forefend!) “130_0#$a” (in RDF: marc21rdf.info/elements/1XX/M1300_a). That’s why RDA provides rdaregistry.info/Elements/w/titleOfTheWork.en but also, eventually, rdaregistry.info/Elements/w/拥有该作品的标题.ch and rdaregistry.info/Elements/w/tieneTítuloDeLaObra.es, et al (you do understand Latin of course). These ‘unnatural’ Lexical Aliases will be provided by the ‘native’ language speakers of their respective national library communities. As one of the many thousands of librarians who ‘speak’ MARC to one another–despite our language differences–I am loathe to give up that international language to an English-only world. That seems like a step backwards. By Diane Hillmann, January 3, 2016, 5:05 pm (UTC-5) BibFrame, Linked data, RDA, Vocabularies 1 Comment (Show inline) Review of: DRAFT Principles for Evaluating Metadata Standards Metadata standards is a huge topic and evaluation a difficult task, one I’ve been involved in for quite a while. So I was pretty excited when I saw the link for “DRAFT Principles for Evaluating Metadata Standards”, but after reading it? Not so much. If we’re talking about “principles” in the sense of ‘stating-the-obvious-as-a-first-step’, well, okay—but I’m still not very excited. I do note that the earlier version link uses the title ‘draft checklist’, and I certainly think that’s a bit more real than ‘draft principles’ for this effort. But even taken as a draft, the text manages to use lots of terms without defining them—not a good thing in an environment where semantics is so important. Let’s start with a review of the document itself, then maybe I can suggest some alternative paths forward. 
First off, I have a problem with the preamble: “These principles are intended for use by libraries, archives and museum (LAM) communities for the development, maintenance, governance, selection, use and assessment of metadata standards. They apply to metadata structures (field lists, property definitions, etc.), but can also be used with content standards and value vocabularies”. Those tasks (“development, maintenance, governance, selection, use and assessment” are pretty all encompassing, but yet the connection between those tasks and the overall “evaluation” is unclear. And, of course, without definitions, it’s difficult to understand how ‘evaluation’ relates to ‘assessment’ in this context—are they they same thing? Moving on to the second part about what kind of metadata standards that might be evaluated, we have a very general term, ‘metadata structures’, with what look to be examples of such structures (field lists, property definitions, etc.). Some would argue (including me) that a field list is not a structure without a notion of connections between the fields; and although property definitions may be part of a ‘structure’ (as I understand it, at least), they are not a structure, per se. And what is meant by the term ‘content standards’, and how is that different from ‘metadata structures’? The term ’value vocabularies’ goes by many names, and is not something that can go without a definition. I say this as an author/co-author of a lot of papers that use this term, and we always define it within the context of the paper for just that reason. There are many more places in the text where fuzziness in terminology is a problem (maybe not a problem for a checklist, but certainly for principles). Some examples: 1. What is meant by ’network’? There are many different kinds, and if you mean to refer to the Internet, for goodness sakes say so. ‘Things’ rather than ‘strings’ is good, but it will take a while to make it happen in legacy data, which we’ll be dealing with for some time, most likely forever. Prospectively created data is a bit easier, but still not a cakewalk — if the ‘network’ is the global Internet, then “leveraging ‘by-reference’ models” present yet-to-be-solved problems of network latency, caching, provenance, security, persistence, and most importantly: stability. Metadata models for both properties and controlled values are an essential part of LAM systems and simply saying that metadata is “most efficient when connected with the broader network” doesn’t necessarily make it so. 2. ‘Open’ can mean many things. Are we talking specific kinds of licenses, or the lack of a license? What kind of re-use are you talking about? Extension? Wholesale adoption with namespace substitution? How does semantic mapping fit into this? (In lieu of a definition, see the paper at (1) below) 3. This principle seems to imply that “metadata creation” is the sole province of human practitioners and seriously muddies the meaning of the word creation by drawing a distinction between passive system-created metadata and human-created metadata. Metadata is metadata and standards apply regardless. What do you mean by ‘benefit user communities’? Whose communities? Please define what is meant by ‘value’ in this context? How would metadata practitioners ‘dictate the level of description provided based on the situation at hand’? 4. As an evaluative ‘principle’ this seems overly vague. How would you evaluate a metadata standard’s ability to ‘easily’ support ‘emerging’ research? 
What is meant by ‘exchange/access methods’ and what do they have to do with metadata standards for new kinds of research? 5. I agree totally with the sentence “Metadata standards are only as valuable and current as their communities of practice,” but the one following makes little sense to me. “ … metadata in LAM institutions have been very stable over the last 40 years …” Really? It could easily be argued that the reason for that perceived stability is the continual inability of implementers to “be a driving force for change” within a governance model that has at the same time been resistant to change. The existence of the DCMI usage board, MARBI, the various boards advising the RDA Steering Committee, all speak to the involvement of ‘implementers’. Yet there’s an implication in this ‘principle’ that stability is liable to no longer be the case and that implementers ‘driving’ will somehow make that inevitable lack of stability palatable. I would submit that stability of the standard should be the guiding principle rather than the democracy of its governance. 6. “Extensible, embeddable, and interoperable” sounds good, but each is more complex than this triumvirate seems. Interoperability in particular is something that we should all keep in mind, but although admirable, interoperability rarely succeeds in practice because of the practical incompatibility of different models. DC, MARC21, BibFrame, RDA, and Schema.org are examples of this — despite their ‘modularity’ they generally can’t simply be used as ‘modules’ because of differences in the thinking behind the model and their respective audiences. I would also argue that ‘lite style implementations’ make sense only if ‘lite’ means a dumbed-down core that can be mapped to by more detailed metadata. But stressing the ‘lite implementations’ as a specified part of an overall standard gives too much power to the creator of the standard, rather than the creator of the data. Instead we should encourage the use of application profiles, so that the particular choices and usages of the creating entity are well documented, and others can use the data in full or in part according to their needs. I predict that lossy data transfer will be less acceptable in the reality than it is in the abstract, and would suggest that dumb data is more expensive over the longer term (and certainly doesn’t support ‘new research methods’ at all). “Incorporation into local systems” really can only be accomplished by building local systems that adhere to their own local metadata model and are able to map that model in/out to more global models. Extensible and embeddable are very different from interoperable in that context. 7. The last section, after the inarguable first sentence, describes what the DCMI ‘dumb-down’ principle defined nearly twenty years ago, and that strategy still makes sense in a lot of situations. But ‘graceful degradation’ and ‘supporting new and unexpected uses’ requires smart data to start with. ‘Lite’ implementation choices (as in #6 above) preclude either of those options, IMO, and ‘adding value’ of any kind (much less by using ‘ontological inferencing’) is in no way easily achievable. I intend to be present at the session in Boston [9:00-10:00 Boston Conference and Exhibition Center, 107AB] and since I’ve asked most of my questions here I intend not to talk much. Let’s see how successful I can be at that! 
It may well be that a document this short and generalized isn’t yet ready to be a useful tool for metadata practitioners (especially without definitions!). That doesn’t mean that the topics that it’s trying to address aren’t important, just that the comprehensive goals in the preamble are not yet being met in this document. There are efforts going on in other arenas–the NISO Bibliography Roadmap work, for instance–that should have an important impact on many of these issues, which suggests that it might be wise for the Committee to pause and take another look around. Maybe a good glossary would be an important step? Dunsire, Gordon, et al. “A Reconsideration of Mapping in a Semantic World”, paper presented at the International Conference on Dublin Core and Metadata Applications, The Hague, 2011. Available at: dcpapers.dublincore.org/pubs/article/view/3622/1848 By Diane Hillmann, December 14, 2015, 4:59 pm (UTC-5) ALA Conferences, Systems, Vocabularies 1 Comment (Show inline) The Jane-athons continue! The Jane-athon series is alive, well, and expanding its original vision. I wrote about the first ‘official’ Jane-athon earlier this year, after the first event at Midwinter 2015. Since then the excitement generated at the first one has spawned others: * the Ag-athon in the UK (May 2015), sponsored by CILIP * the Maurice Dance in New Zealand (October 16, 2015 at the National Library of New Zealand in Wellington, focused on Maurice Gee) * the Jane-in (at ALA Annual 2015 in San Francisco) * the RLS-athon (November 9, 2015, Edinburgh, Scotland), following the JSC meeting there and focused on Robert Louis Stevenson Like good librarians we have an archive of the Jane-athon materials, for use by anyone who wants to look at or use the presentations or the data created at the Jane-athons. We’re still at it: the next Jane-athon in the series will be the Boston Thing-athon at Harvard University on January 7, 2016. Looking at the list of topics gives a good idea about how the Jane-athons are morphing to a broader focus than that of a creator, while training folks to create data with RIMMF. The first three topics (which may change–watch this space) focus not on specific creators, but on moving forward on topics identified as of interest to a broader community. * Strings vs things. A focus on replacing strings in metadata with URIs for things (a short sketch of the idea appears just below). * Institutional repositories, archives and scholarly communication. A focus on issues in relating and linking data in institutional repositories and archives with library catalogs. * Rare materials and RDA. A continuing discussion on the development of RDA and DCRM2 begun at the JSC meeting and the international seminar on RDA and rare materials held in November 2015. For beginners new to RDA and RIMMF but with an interest in creating data, we offer: * Digitization. A focus on how RDA relates metadata for digitized resources to the metadata for original resources, and how RIMMF can be used to improve the quality of MARC 21 records during digitization projects. * Undergraduate editions. A focus on issues of multiple editions that have little or no change in content vs. significant changes in content, and how RDA accommodates the different scenarios. Further on the horizon is a recently approved Jane-athon for the AALL conference in July 2016, focusing on Hugo Grotius (inevitably, a Hugo-athon, but there’s no link yet).
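The “strings vs things” topic is easiest to see in a few lines of data. Here is a minimal sketch (mine, not Thing-athon material) using Python’s rdflib: the same creator statement made first as a text string and then as a URI for the person. The work URI is hypothetical and the VIAF URI is only illustrative.

```python
# Minimal sketch of "strings vs things" (not from the Thing-athon materials):
# the same statement made with a name string, then with a URI for the person.
from rdflib import Graph, Literal, URIRef
from rdflib.namespace import DCTERMS

g = Graph()
work = URIRef("http://example.org/work/sense-and-sensibility")  # hypothetical local URI

# String version: a literal only humans can reliably interpret or match.
g.add((work, DCTERMS.creator, Literal("Austen, Jane, 1775-1817")))

# Thing version: an identifier other datasets can link to and machines can follow.
g.add((work, DCTERMS.creator, URIRef("http://viaf.org/viaf/102333412")))  # illustrative VIAF URI

print(g.serialize(format="turtle"))
```

The point of the exercise is that the second statement can be joined with anyone else’s data about the same person without string matching.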
NOTE: The Thing-a-thon coming up at ALA Midwinter is being held on Thursday rather than the traditional Friday to open the attendance to those who have other commitments on Friday. Another new wrinkle is the venue–an actual library away from the conference center! Whether you’re a cataloger or not-a-cataloger, there will be plenty of activities and discussions that should pique your interest. Do yourself a favor and register for a fun and informative day at the Thing-athon to begin your Midwinter experience! Instructions for registering (whether or not you plan to register for MW) can be found on the Toolkit Blog. By Diane Hillmann, December 7, 2015, 11:19 am (UTC-5) Uncategorized Post a comment Separating ideology, politics and utility Those of you who pay attention to politics (no matter where you are) are very likely to be shaking your head over candidates, results or policy. It’s a never ending source of frustration and/or entertainment here in the U.S., and I’ve noticed that the commentators seem to be focusing in on issues of ideology and faith, particularly where it bumps up against politics. The visit of Pope Francis seemed to be taking everyone’s attention while he was here, but though this event has added some ‘green’ to the discussion, it hasn’t pushed much off the political plate. Politics and faith bump up against each other in the metadata world, too. What with traditionalists still thinking in MARC tags and AACR2, to the technical types rolling their eyes at any mention of MARC and trying to push the conversation towards RDA, RDF, BibFrame, schema.org, etc., there are plenty of metadata politics available to flavor the discussion. The good news for us is that the conflicts and differences we confront in the metadata world are much more amenable to useful solution than the politics crowding our news feeds. I remember well the days when the choice of metadata schema was critical to projects and libraries. Unfortunately, we’re all still behaving as if the proliferation of ‘new’ schemas makes the whole business more complicated–that’s because we’re still thinking we need to choose one or another, ignoring the commonality at the core of the new metadata effort. But times have changed, and we don’t all need to use the same schema to be interoperable (just like we don’t all need to speak English or Esperanto to communicate). But what we do need to think about is what the needs of our organization are at all stages of the workflow: from creating, publishing, consuming, through integrating our metadata to make it useful in the various efforts in which we engage. One thing we do need to consider as we talk about creating new metadata is whether it will need to work with other data that already exists in our institution. If MARC is what we have, then one requirement may be to be able to maintain the level of richness we’ve built up in the past and still move that rich data forward with us. This suggests to me that RDA, which RIMMF has demonstrated can be losslessly mapped to and from MARC, might be the best choice for the creation of new metadata. Back in the day, when Dublin Core was the shiny new thing, the notion of ‘dumb-down’ was hatched, and though not an elegantly named principle, it still works. It says that rich metadata can be mapped fairly easily into a less-rich schema (‘dumbed down’), but once transformed in a lossy way, it can’t easily be ‘smartened up’. 
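For anyone who hasn’t bumped into ‘dumb-down’ before, here is a toy sketch of the idea in Python; the field names are invented for illustration and aren’t drawn from any particular standard.

```python
# Toy illustration of 'dumb-down': a record with typed relationships flattened
# into generic simple-DC-style elements. Field names here are invented.

rich_record = {
    "title_of_work": "Sense and Sensibility",
    "author_of_work": ["Austen, Jane"],
    "illustrator_of_expression": ["Thomson, Hugh"],
    "publisher_of_manifestation": ["Macmillan"],
}

def dumb_down(record):
    """Map specific roles onto generic elements -- easy, but lossy."""
    return {
        "title": record["title_of_work"],
        "creator": list(record["author_of_work"]),
        # the illustrator's specific role disappears into a generic bucket here
        "contributor": list(record["illustrator_of_expression"]),
        "publisher": list(record["publisher_of_manifestation"]),
    }

simple_dc = dumb_down(rich_record)
# There is no reliable dumb_up(): nothing in simple_dc records *how* Thomson
# contributed, so the richer description can only be restored from the source.
```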
But in a world of many publishers of linked data, and many consumers of that data, the notion of transforming rich metadata into any number of other schemas and letting the consumer chose what they want, is fairly straightforward, and does not require firm knowledge (or correct assumptions) of what the consumers actually need. This is a strategy well-tested with OAI-PMH which established a floor of Simple Dublin Core but encouraged the provision of any number of other formats as well, including MARC. As consumers, libraries and other cultural institutions are also better served by choices. Depending on the services they’re trying to support, they can choose what flavor of data meets their needs best, instead of being offered only what the provider assumes they want. This strategy leaves open the possibility of serving MARC as one of the choices, allowing those institutions still nursing an aged ILS to continue to participate. Of course, the consumers of data need to think about how they aggregate and integrate the data they consume, how to improve that data, and how to make their data services coherent. That’s the part of the new create, publish, consume, integrate cycle that scares many librarians, but it shouldn’t–really! So, it’s not about choosing the ‘right’ metadata format, it’s about having a fuller and more expansive notion about sharing data and learning some new skills. Let’s kiss the politics goodbye, and get on with it. By Diane Hillmann, October 12, 2015, 10:08 am (UTC-5) Linked data, RDA, Vocabularies 1 Comment (Show inline) Semantic Versioning and Vocabularies A decade ago, when the Open Metadata Registry (OMR) was just being developed as the NSDL Registry, the vocabulary world was a very different place than it is today. At that point we were tightly focussed on SKOS (not fully cooked at that point, but Jon was on the WG that was developing it, so we felt pretty secure diving in). But we were thinking about versioning in the Open World of RDF even then. The NSDL Registry kept careful track of all changes to a vocabulary (who, what, when) and the only way to get data in was through the user interface. We ran an early experiment in making versions based on dynamic, timestamp-based snapshots (we called them ‘time slices’, Git calls them ‘commit snapshots’) available for value vocabularies, but this failed to gain any traction. This seemed to be partly because, well, it was a decade ago for one, and while it attempted to solve an Open World problem with versioned URIs, it created a new set of problems for Closed World experimenters. Ultimately, we left the versions issue to sit and stew for a bit (6 years!). All that started to change in 2008 as we started working with RDA, and needed to move past value vocabularies into properties and classes, and beyond that into issues around uploading data into the OMR. Lately, Git and GitHub have started taking off and provide a way for us to make some important jumps in functionality that have culminated in the OMR/GitHub-based RDA Registry. Sounds easy and intuitive now, but it sure wasn’t at the time, and what most people don’t know is that the OMR is still where RDA/RDF data originates — it wasn’t supplanted by Git/Github, but is chugging along in the background. The OMR’s RDF CMS is still visible and usable by all, but folks managing larger vocabularies now have more options. One important aspect of the use of Git and GitHub was the ability to rethink versioning. 
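To make the ‘rethink versioning’ point a bit more concrete, here is a rough sketch of how vocabulary changes might be classified into semantic-versioning-style bumps. The change categories and their severities are my assumptions for illustration, not the rules proposed in the paper discussed next.

```python
# Rough sketch of semantic versioning applied to a vocabulary change log.
# The change categories and severities are assumptions for illustration only.

BREAKING = {"property removed", "definition changed", "domain or range changed"}
ADDITIVE = {"property added", "label added", "translation added"}

def bump(version, changes):
    """version is a (major, minor, patch) tuple; changes is a list of strings."""
    major, minor, patch = version
    if any(c in BREAKING for c in changes):
        return (major + 1, 0, 0)      # existing data may no longer mean the same thing
    if any(c in ADDITIVE for c in changes):
        return (major, minor + 1, 0)  # old data still valid, new capability available
    return (major, minor, patch + 1)  # editorial tidying only

print(bump((2, 4, 1), ["translation added"]))   # -> (2, 5, 0)
print(bump((2, 5, 0), ["definition changed"]))  # -> (3, 0, 0)
```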
Just about a year ago our paper on this topic (Versioning Vocabularies in a Linked Data World, by Diane Hillmann, Gordon Dunsire and Jon Phipps) was presented to the IFLA Satellite meeting in Paris. We used as our model the way software on our various devices and systems is updated–more and more these changes happen without much (if any) interaction with us. In the world of vocabularies defining the properties and values in linked data, most updating is still very manual (if done at all), and the important information about what has changed and when is often hidden behind web pages or downloadable files that provide no machine-understandable connections identifying changes. And just solving the change management issue does little to solve the inevitable ‘vocabulary rot’ that can make published ‘linked data’ less and less meaningful, accurate, and useful over time. Building stable change management practices is a very critical missing piece of the linked data publishing puzzle. The problem will grow exponentially as language versions and inter-vocabulary mappings start to show up as well — and it won’t be too long before that happens. Please take a look at the paper and join in the conversation! By Diane Hillmann, September 20, 2015, 6:41 pm (UTC-5) RDA, Tools, Vocabularies Post a comment Five Star Vocabulary Use Most of us in the library and cultural heritage communities interested in metadata are well aware of Tim Berners-Lee’s five star ratings for linked open data (in fact, some of us actually have the mug). The five star rating for LOD, intended to encourage us to follow five basic rules for linked data is useful, but, as we’ve discussed it over the years, a basic question rises up: What good is linked data without (property) vocabularies? Vocabulary manager types like me and my peeps are always thinking like this, and recently we came across solid evidence that we are not alone in the universe. Check out: “Five Stars of Linked Data Vocabulary Use”, published last year as part of the Semantic Web Journal. The five authors posit that TBL’s five star linked data is just the precondition to what we really need: vocabularies. They point out that the original 5 star rating says nothing about vocabularies, but that Linked Data without vocabularies is not useful at all: “Just converting a CSV file to a set of RDF triples and linking them to another set of triples does not necessarily make the data more (re)usable to humans or machines.” Needless to say, we share this viewpoint! I’m not going to steal their thunder and list here all five star categories–you really should read the article (it’s short), but only note that the lowest level is a zero star rating that covers LD with no vocabularies. The five star rating is reserved for vocabularies that are linked to other vocabularies, which is pretty cool, and not easy to accomplish by the original publisher as a soloist. These five star ratings are a terrific start to good practices documentation for vocabularies used in LOD, which we’ve had in our minds for some time. Stay tuned. By Diane Hillmann, August 7, 2015, 1:50 pm (UTC-5) Linked data, Vocabularies Post a comment What do we mean when we talk about ‘meaning’? Over the past weekend I participated in a Twitter conversation on the topic of meaning, data, transformation and packaging. The conversation is too long to repost here, but looking from July 11-12 for @metadata_maven should pick most of it up. 
Aside from my usual frustration at the message limitations in Twitter, there seemed to be a lot of confusion about what exactly we mean about ‘meaning’ and how it gets expressed in data. I had a skype conversation with @jonphipps about it, and thought I could reproduce that here, in a way that could add to the original conversation, perhaps clarifying a few things. [Probably good to read the Twitter conversation ahead of reading the rest of this.] Jon Phipps: I think the problem that the people in that conversation are trying to address is that MARC has done triple duty as a local and global serialization (format) for storage, supporting indexing and display; a global data interchange format; and a focal point for creating agreement about the rules everyone is expected to follow to populate the data (AACR2, RDA). If you walk away from that, even if you don’t kill it, nothing else is going to be able to serve that particular set of functions. But that’s the way everyone chooses to discuss bibframe, or schema.org, or any other ‘marc replacement’. Diane Hillmann: Yeah, but how does ‘meaning’ merely expressed on a wiki page help in any way? Isn’t the idea to have meaning expressed with the data itself? Jon Phipps: It depends on whether you see RDF as a meaning transport mechanism or a data transport mechanism. That’s the difference between semantic data and linked data. Diane Hillmann: It’s both, don’t you think? Jon Phipps: Semantic data is the smart subset of linked data. Diane Hillmann: Nice tagline Jon Phipps: Zepheira, and now DC, seem to be increasingly looking at RDF as merely linked data. I should say a transport mechanism for ‘linked’ data. Diane Hillmann: It’s easier that way. Jon Phipps: Exactly. Basically what they’re saying is that meaning is up to the receiver’s system to determine. Dc:title of ‘Mr.’ is fine in that world–it even validates according to the ‘new’ AP thinking. It’s all easier for the data producers if they don’t have to care about vocabularies. But the value of RDF is that it’s brilliantly designed to transport knowledge, not just data. RDF data is intended to live in a world where any Thing can be described by any Thing, and all of those descriptions can be aggregated over time to form a more complete description of the Thing Being Described. Knowledge transfer really benefits from Semantic Web concepts like inferences and entailments and even truthiness (in addition to just validation). If you discount and even reject those concepts in a linked data world than you might as well ship your data around as CSV or even SQL files and be done with it. One of the things about MARC is that it’s incredibly semantically rich (marc21rdf.info) and has also been brilliantly designed by a lot of people over a lot of years to convey an equally rich body of bibliographic knowledge. But throwing away even a small portion of that knowledge in pursuit of a far dumber linked data holy grail is a lot like saying that since most people only use a relatively limited number of words (especially when they’re texting) we have no need for a 50,000 word, or even a 5,000 word, dictionary. MARC makes knowledge transfer look relatively easy because the knowledge is embedded in a vocabulary every cataloger learns and speaks fairly fluently. It looks like it’s just a (truly limiting) data format so it’s easy to think that replacing it is just a matter of coming up with a fresh new format, like RDF. 
But it’s going to be a lot harder than that, which is tacitly acknowledged by the many-faceted effort to permanently dumb-down bibliographic metadata, and it’s one of the reasons why I think bibframe.org, bibfra.me, and schema.org might end up being very destructive, given the way they’re being promoted (be sure to Park Your MARC somewhere). [That’s why we’re so focused on the RDA data model (which can actually be semantically richer than MARC), why we helped create marc21rdf.info, and why we’re working at building out our RDF vocabulary management services.] Diane Hillmann: This would be a great conversation to record for a podcast 😉 Jon Phipps: I’m not saying proper vocabulary management is easy. Look at us for instance, we haven’t bothered to publish the OMR vocabs and only one person has noticed (so far). But they’re in active use in every OMR-generated vocab. The point I was making was that we were no better, as publishers of theoretically semantic metadata, at making sure the data was ‘meaningful’ by making sure that the vocabs resolved, had definitions, etc. [P.S. We’re now working on publishing our registry vocabularies.] By Diane Hillmann, July 16, 2015, 9:35 pm (UTC-5) Linked data, RDA, Vocabularies 1 Comment (Show inline) Fresh From ALA, What’s New? In the old days, when I was on MARBI as liaison for AALL, I used to write a fairly detailed report, and after that wrote it up for my Cornell colleagues. The gist of those reports was to describe what happened, and whether there might be implications to consider from the decisions. I don’t propose to do that here, but it does feel as if I’m acting in a familiar ‘reporting’ mode. In an early Saturday presentation sponsored by the Linked Library Data IG, we heard about BibFrame and VIVO. I was very interested to see how VIVO has grown (having seen it as an infant), but was puzzled by the suggestion that it or FOAF could substitute for the functionality embedded in authority records. For one thing, auth records are about disambiguating names, and not describing people–much as some believe that’s where authority control should be going. Even when we stop using text strings as identifiers, we’ll still need that function and should be thinking carefully about whether adding other functions makes good sense. Later on Saturday, at the Cataloging Norms IG meeting, Nancy Fallgren spoke on the NLM collaboration with Zepheira, GW, and others on BibFrame Lite. They’re now testing the Kuali OLE cataloging module for use with BF Lite, which will include a triple store. An important quote from Nancy: “Legacy data should not drive development.” So true, but neither should we be starting over, or discarding data, just to simplify data creation, thus losing the ability to respond to the more complex needs in cataloging, which aren’t going away (a point demonstrated usefully in the recent Jane-athons). I was the last speaker on that program, and spoke on the topic of “What Can We Do About Our Legacy Data?” I was primarily asking questions and discussing options, not providing answers. The one thing I am adamant about is that nobody should be throwing away their MARC records. I even came up with a simple rule: “Park the MARC”. After all, storage is cheap, and nobody really knows how the current situation will settle out. Data is easy to dumb down, but not so easy to smarten up, and there may be do-overs in store for some down the road, after the experimentation is done and the tradeoffs clearer.
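“Parking the MARC” needs nothing fancier than a copy you refuse to touch. A minimal sketch, assuming pymarc is available and using hypothetical file names:

```python
# "Park the MARC," sketched with pymarc: copy the records into an archive file
# untouched, and keep a count, so the originals outlive later experiments.
# The file names are hypothetical.
from pymarc import MARCReader, MARCWriter

count = 0
with open("catalog_export.mrc", "rb") as src, open("parked_marc.mrc", "wb") as parked:
    writer = MARCWriter(parked)
    for record in MARCReader(src):
        if record is None:        # skip anything the reader could not parse (defensive)
            continue
        writer.write(record)      # no edits, no mapping, no 'dumb-down' on the way through
        count += 1
    writer.close()

print(f"parked {count} MARC records")
```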
I also attended the BibFrame Update, and noted that there’s still no open discussion about the ‘classic’ (as in ‘Classic Coke’) BibFrame version used by LC, and the ‘new’ (as in ‘New Coke’) BibFrame Lite version being developed by Zepheira, which is apparently the vocabulary they’re using in their projects and training. It seems like it could be a useful discussion, but somebody’s got to start it. It’s not gonna be me. The most interesting part of that update from my point of view was hearing Sally McCallum talk about the testing of BibFrame by LC’s catalogers. The tool they’re planning on using (in development, I believe) will use RDA labels and include rule numbers from the RDA Toolkit. Now, there’s a test I really want to hear about at Midwinter! But of course all of that RDA ‘testing’ they insisted on several years ago to determine if the RDA rules could be applied to MARC21 doesn’t (can’t) apply to BibFrame Classic so … Will there be a new round of much publicized and eagerly anticipated shared institutional testing of this new tool and its assumptions? Just askin’. By Diane Hillmann, July 10, 2015, 10:10 am (UTC-5) ALA Conferences, BibFrame, RDA, Vocabularies Post a comment What’s up with this Jane-athon stuff? The RDA Development Team started talking about developing training for the ‘new’ RDA, with a focus on the vocabularies, in the fall of 2014. We had some notion of what we didn’t want to do: we didn’t want yet another ‘sage on the stage’ event, we wanted to re-purpose the ‘hackathon’ model from a software focus to data creation (including a major hands-on aspect), and we wanted to demonstrate what RDA looked like (and could do) in a native RDA environment, without reference to MARC. This was a tall order. Using RIMMF for the data creation was a no-brainer: the developers had been using the RDA Registry to feed new vocabulary elements into their software (effectively becoming the RDA Registry’s first client), and were fully committed to FRBR. Deborah Fritz had been training librarians and others on RIMMF for years, gathering feedback and building enthusiasm. It was Deborah who came up with the Jane-athon idea, and the RDA Development group took it and ran with it. Using the Jane Austen theme was a brilliant part of Deborah’s idea. Everybody knows about JA, and the number of spin-offs, rip-offs and re-tellings of the novels (in many media formats) made her work a natural for examining why RDA and FRBR make sense. One goal stated everywhere in the marketing materials for our first Jane outing was that we wanted people to have fun. All of us have been part of the audience and on the dais for many information sessions, for RDA and other issues, and neither position has ever been much fun, useful as the sessions might have been. The same goes for webinars, which, as they’ve developed in library-land, tend to be dry, boring, and completely bereft of human interaction. And there was a lot of fun at that first Jane-athon–I venture to say that 90% of the folks in the room left with smiles and thanks. We got an amazing response to our evaluation survey, and the preponderance of responses were expansive, positive, and clearly designed to help the organizers to do better the next time. The various folks from ALA Publishing who stood at the back and watched the fun were absolutely amazed at the noise, the laughter, and the collaboration in evidence.
No small part of the success of Jane-athon 1 rested with the team leaders at each table, and the coaches going from table to table helping out with puzzling issues, ensuring that participants were able to create data using RIMMF that could be aggregated for examination later in the day. From the beginning we thought of Jane 1 as the first of many. In the first flush of success as participants signed up and enthusiasm built, we talked publicly about making it possible to do local Jane-athons, but we realized that our small group would have difficulty doing smaller events with less expertise on site to the same standard we set at Jane-athon 1. We had to do a better job in thinking through the local expansion and how to ensure that local participants get the same (or similar) value from the experience before responding to requests. As a step in that direction CILIP in the UK is planning an Ag-athon on May 22, 2015 which will add much to the collective experience as well as to the data store that began with the first Jane-athon and will be an increasingly important factor as we work through the issues of sharing data. The collection and storage of the Jane-athon data was envisioned prior to the first event, and the R-Balls site was designed as a place to store and share RIMMF-based information. Though a valuable step towards shareable RDA data, rballs have their limits. The data itself can be curated by human experts or available with warts, depending on the needs of the user of the data. For the longer term, RIMMF can output RDF statements based on the rball info, and a triple store is in development for experimentation and exploration. There are plans to improve the visualization of this data and demonstrate its use at Jane-athon 2 in San Francisco, which will include more about RDA and linked data, as well as what the created data can be used for, in particular, for new and improved services. So, what are the implications of the first Jane-athon’s success for libraries interested in linked data? One of the biggest misunderstandings floating around libraryland in linked data conversations is that it’s necessary to make one and only one choice of format, and eschew all others (kind of like saying that everyone has to speak English to participate in LOD). This is not just incorrect, it’s also dangerous. In the MARC era, there was truly no choice for libraries–to participate in record sharing they had to use MARC. But the technology has changed, and rapidly evolving semantic mapping strategies [see: dcpapers.dublincore.org/pubs/article/view/3622] will enable libraries to use the most appropriate schemas and tools for creating data to be used in their local context, and others for distributing that data to partners, collaborators, or the larger world. Another widely circulated meme is that RDA/FRBR is ‘too complicated’ for what libraries need; we’re encouraged to ‘simplify, simplify’ and assured that we’ll still be able to do what we need. Hmm, well, simplification is an attractive idea, until one remembers that the environment we work in, with evolving carriers, versions, and creative ideas for marketing materials to libraries is getting more complex than ever. Without the specificity to describe what we have (or have access to), we push the problem out to our users to figure out on their own. Libraries have always tried to be smarter than that, and that requires “smart” , not “dumb”, metadata. 
Of course the corollary to the ‘too complicated’ argument lies the notion that a) we’re not smart enough to figure out how to do RDA and FRBR right, and b) complex means more expensive. I refuse to give space to a), but b) is an important consideration. I urge you to take a look at the Jane-athon data and consider the fact that Jane Austen wrote very few novels, but they’ve been re-published with various editions, versions and commentaries for almost two centuries. Once you add the ‘based on’, ‘inspired by’ and the enormous trail created by those trying to use Jane’s popularity to sell stuff (“Sense and Sensibility and Sea Monsters” is a favorite of mine), you can see the problem. Think of a pyramid with a very expansive base, and a very sharp point, and consider that the works that everything at the bottom wants to link to don’t require repeating the description of each novel every time in RDA. And we’re not adding notes to descriptions that are based on the outdated notion that the only use for information about the relationship between “Sense and Sensibility and Sea Monsters” and Jane’s “Sense and Sensibility” is a human being who looks far enough into the description to read the note. One of the big revelations for most Jane-athon participants was to see how well RIMMF translated legacy MARC records into RDA, with links between the WEM levels and others to the named agents in the record. It’s very slick, and most importantly, not lossy. Consider that RIMMF also outputs in both MARC and RDF–and you see something of a missing link (if not the Golden Gate Bridge :-). Not to say there aren’t issues to be considered with RDA as with other options. There are certainly those, and they’ll be discussed at the Jane-In in San Francisco as well as at the RDA Forum on the following day, which will focus on current RDA upgrades and the future of RDA and cataloging. (More detailed information on the Forum will be available shortly). Don’t miss the fun, take a look at the details and then go ahead and register. And catalogers, try your best to entice your developers to come too. We’ll set up a table for them, and you’ll improve the conversation level at home considerably! 
By Diane Hillmann, May 18, 2015, 10:13 am (UTC-5) Linked data, RDA, Uncategorized 1 Comment (Show inline) marcedit-reeset-net-8985 ---- 7.5.27 Updated: 4/26/2021 * Enhancement: MarcEditor: Added a button to provide quick access to the available task list. * Enhancement: MarcEditor: code is in place to begin allowing users to show/hide menu/toolbar buttons. This should be available in a near term update. 7.5.25 Updated: 4/25/2021 * Bug Fix: Internet Archive => HathiTrust plugin updates to correct debug link generation. * Update: File Assoc. updates * Update: Installer - file extensions will now assign to 7.5.x 7.5.20 Updated: 4/19/2021 * Enhancement: Z39.50 -- users can add more than 2 criteria. * Update: Plugin -- Internet Archive => HathiTrust Plugin updated to allow for multiple date type searches. * Update: Z39.50 UI changes to make it easier to prevent data from being hidden on high zoom * Update: In the Preferences, the Task location can now allow Environment variables in the file path (example: %APPDATA%) * Update: Updated JSON/RDF Components * Bug Fix: Validate Headings window was freezing when using some of the new linked data rule options. * Enhancement: Custom Reports -- added a UI validation to ensure required data is provided (this wasn't previously the case). * Enhancement: MarcValidator -- added some updated language in the error changing. * Bug Fix: MarcValidator -- make sure that all file handles are closed (there was a case where one of the handles was remaining opened and could, potentially, result in a locked process).
7.5.8 Updated: 4/3/2021 * Enhancement: MarcEditor global Edit functions -- a new Preview option has been added (Replace All, Add Field, Delete Field, Copy Field, Edit Indicators, Edit Field, Edit Subfield, Swap Field) * Enhancement: UI enhancement to ensure that a status message is present so users know the process is running (Replace All, Add Field, Delete Field, Copy Field, Edit Indicators, Edit Field, Edit Subfield, Swap Field) * Enhancement: MarcEngine -- added JSON => XML translation * Enhancement: XML/JSON Profile Wizard - added support for JSON-LD formatted data. * Enhancement: XSLT -- including XSLT for the Homosaurus vocabulary * Enhancement: OCLC API -- surfacing more debugging information to make it easier to see when an issue is occuring * Bug Fix: MarcValidator -- Ensured all file handles are closing and released * Behavior Change: KBART 2 MARC Plugin - tool will preference ISBN 13 if present (currently, it selects the last ISBN if multiples of the same type are present) * Bug Fix: Installer -- cleaned up some old files * Behavior Change: OCLC has discontinued providing work id information in worldcat.org. I've shifted to using the classify api till a better option is found. * Clean-up: UI Clean up in the migration wizard * Clean-up: UI clean up of the main window * Bug Fix/Clean-up: Corrected UI to add back missing icons (for example, in the Extract Selected Records form) 7.5.2 Updated: 2/7/2021 * Enhancement: Updated Plugin Manager * Enhancement: OCLC Connexion Plugin Added/Converted * Enhancement: Internet Archive => HathiTrust Packager Added/Converted * Enhancement: MARC => KBART Converter Added/Converted * Enhancement: Make Check Digit Added/Converted * Enhancement: Microlif => Mnemonic Converted Added/Converted * RIS => MARC Plugin Added/Converted * Enhancement: Installer evaluates for the 64-bit Access Database Engine (2016) on 64 bit systems * Enhancement: Installer evaluates for the 2015 C++ Runtime required by the Access Database Engine on 64 bit systems * Behavior Change: Restart as 32-bit program has been hidden * Enhancement: MARC SQL Explorer has been folded into the primary MarcEdit Application [results in a reduction of dependencies] * Bug Fix: Clustering Tools -- Beta build wasn't allowing the clustering tools to function correctly. * Enhancement: OCLC Search -- Batch Searching has been allowed * Enhancement: OCLC Integration -- New Session Diagnostics option added for debugging processes * Bug Fix: Integration Settings Import -- If no settings have ever been set and the initial file hasn’t been created, import will say it’s completed, but it won’t. * Bug Fix: OCLC Integration -- if the expires_at element is null or fails to parse, it can throw an error. This is now trapped and will attempt to reauthorize. * Bug Fix: Console: Added process to consume event processing for validate and split tasks. 7.5.1 Updated: 2/2/2021 * Bug Fix: Installer throws an error when attempting to install per user * Bug Fix: MarcEditor -- MarcEdit will be deprecating legacy page loading. This option is now ignored if set and will be removed entirely in future builds. 7.5.0 Updated: 2/1/2021 * Change: Allow OS to manage supported supported Security Protocol types. 
* Change: Remove com.sun dependency related to dns and httpserver * Change: Changed AppData Path * Change: First install automatically imports settings from MarcEdit 7.0-3.x * Change: Field Count - simplify UI (consolidate elements) * Change: 008 Windows -- update help urls to oclc * Change: Generate FAST Headings -- update help urls * Change: .NET changes thread stats queuing. Updating thread processing on forms: * Generate FAST Headings * Batch Process Records * Build Links * Main Window * RDA Helper * Delete Selected Records * MARC Tools * Check URL Tools * MARCValidator * MARCEngine * task manager * Z39.50 * ILS Integration Processing * Character Conversions * Format Handing (delimited text, openrefine, etc.) * Change: XML Function List -- update process for opening URLs * Change: Z39.50 Preferences Window - update process for opening URLs * Change: About Windows -- new information, updated how version information is calculated. * Change: Catalog Calculator Window -- update process for opening URLs * Change: Generate Call Numbers -- update process for opening URLs * Change: Generate Material Formats -- update process for opening URLs * Change: Tab Delimiter -- remove context windows * Change: Tab Delimiter -- new options UI * Change: Tab Delimiter -- normalization changes * Change: Remove Old Help HTML Page * Change: Remove old Hex Editor Page * Change: Updated Hex Editor to integrate into main program * Change: Main Window -- remove custom scheduler dependency * Change: UI Update to allow more items * Change: Main Window -- new icon * Change: Main Window -- update process for opening URLs * Change: Main Window -- removed context menus * Change: Main Window -- Upgrade changes to new executable name * Change: Main Window -- Updated the following menu Items: * Edit Linked Data Tools * Removed old help menu item * Added new application shortcut * Change: OCLC Bulk Downloader -- new UI elements to correspond to new OCLC API * Change: OCLC Search Page -- new UI elements to correspond to new OCLC API * Change: Preferences -- Updates related to various preference changes: * Hex Editor * Integrations * Editor * Other * Change: RDA Helper -- update process for opening URLs * Change: RDA Helper -- Opening files for editing * Change: Removed the Script Maker * Change: Templates for Perl and vbscripts includes * Change: Removed Find/Search XML in the XML Editor and consolidated in existing windows * Change: Delete Selected Records: Exposed the form and controls to the MarcEditor * Change: Sparql Browser -- update process for opening URLs * Change: Sparql Browser -- removed context menus * Change: TroubleShooting Wizard -- Added more error codes and kb information to the Wizard * Change: UNIMARC Utility -- controls change, configurable transform selections * Change: MARC Utilities -- removed the context menu * Change: First Run Wizard -- new options, new agent images * Change: XML Editor -- Delete Block Addition * Change: XML Editor -- XQuery transform support * Change: XML Profile Wizard -- option to process attributes * Change: MarcEditor -- Status Bar control doesn't exist in NET 5.0. Control has changed. 
* Change: MarcEditor -- Improved Page Loading * Change: MarcEditor -- File Tracking updated to handle times when the file opened is a temp record * Change: MarcEditor -- removed ~7k of old code * Change: MarcEditor -- Added Delete Selected Records Option * Change: Removed helper code used by Installer * Change: Removed Office2007 menu formatting code * Change: Consolidated Extensions into new class (removed 3 files) * Change: Removed calls Marshalled to the Windows API -- replaced with Managed Code * Change: OpenRefine Format handler updated to capture changes between OpenRefine versions * Change: MarcEngine -- namespace update to 75 * Change: Wizard -- missing unicode font options more obvious * Change: Wizard install puts font in program directory so that additional users can simply copy (not download) the font on use * Change: checkurls: removed support for insecure crypto-types * Change: checkurls: additional heuristics to respond dynamically to http status codes * Change: All Components -- .NET 5.0 includes a new codepages library that allows for extended codepage support beyond the default framework. Added across the project. * Change: MarcValidator -- new rules process that attempts to determine if records are too long for processing when validating rules or structure. * Change: Command-line -- batch process switch has been added to the tasks processing function * Change: Options -- Allow user path to be reset. * Bug Fix: Main Window -- corrects process for determining version for update * Bug Fix: Main Window -- Updated image * Bug Fix: When doing first run, wizard not showing in some cases. * Bug Fix: Main Window -- Last Tool used sometimes shows duplicates * Bug Fix: RDA Helper -- $e processing * Bug Fix: RDA Helper -- punctuation in the $e * Bug Fix: XML Profile Wizard -- When the top element is selected, it's not viewed for processing (which means not seeing element data or attribute data) * Bug Fix: MarcEditor -- Page Processing corrected to handle invalidly formatted data better * Bug Fix: Installation Wizard -- if a unicode font was installed during the first run process, it wouldn't be recognized. * Bug Fix: MarcValidator fails when attempting to process a .mrk file from outside the MarcEditor * Bug Fix: Linked Data Processing: When processing services with multiple redirects -- process may stop prematurely. (Example: LC's id.loc.gov 3xx processing) * Bug Fix: Edit Field -- Find fields with just spaces are trimmed, causing the field data to process improperly. * Bug Fix: RDA Helper will fail if LDR length is incorrect when attempting to determine character encoding mashable-com-9799 ---- The 10 Founding Fathers of the Web
By Christina Warren, 2010-07-04 14:17:28 UTC While the phrase "founding fathers" is often used in conjunction with men like Benjamin Franklin, Thomas Jefferson and George Washington, we wanted to think about the phrase on the global level. And what is more global than the world wide web? Thus, this holiday, we're taking a look at 10 individuals who have been instrumental in helping to shape the world wide web and the culture of the Internet as we know it today. Check out our round up below to learn about some of the most influential people in the creation and development of the ideas and technologies that have led to today's web experience. Let us know in the comments if you think we've missed anyone! 1. Tim Berners-Lee Why He Matters: Tim Berners-Lee is credited as the inventor of the World Wide Web. A physicist, Berners-Lee and his team built the world's very first web browser, WorldWideWeb, the first web server and the HyperText-based markup language HTML. Berners-Lee founded and is the current director of the World Wide Web Consortium (W3C), a standards body that oversees the development of the web as a whole. While the Internet itself dates back to 1969, it was Berners-Lee who was able to bring together the concept of the Internet and hypertext, which set the foundation for the Internet as we know it today. Because CERN (the European Organization for Nuclear Research) didn't make the World Wide Web proprietary and never charged for dues, its protocols were widely adopted. 2. Marc Andreessen Why He Matters: Marc Andreessen co-authored Mosaic, the first widely-used web browser, and he founded Netscape Communications. While Mosaic wasn't the first graphical web browser, it was the first to garner significant attention. It was also the first browser to display images inline with text. After designing and programming Mosaic, Andreessen went on to co-found Netscape Communications.
Netscape's flagship product, Netscape Navigator, had an enormous impact, by helping to bring the web to mainstream users. In 1998, Netscape released the code base for Netscape Communicator under an open source license. That project, known as Mozilla, became the basis of what we now know as Firefox. 3. Brian Behlendorf Why He Matters: Brian Behlendorf was the primary developer of the Apache Web Server and one of the founding members of the Apache Group. While working as the webmaster for Wired Magazine's HotWired web site, Behlendorf found himself making changes and patches to the HTTP server first developed at NCSA at the University of Illinois at Urbana-Champaign. After realizing that others were also adding their own patches, he put together an electronic mailing list to help coordinate the work. By February 1995, the project had been given a name - Apache - and the entire codebase from the original NCSA server was rewritten and re-optimized. The real genius with Apache, other than its free and open source nature, was that it was built to be extensible. That meant that ISPs could easily add their own extensions or plugins to better optimize the server, allowing hundreds of sites to be hosted from just one computer server. Apache remains the most popular web server on the Internet. 4, 5, 6. Rasmus Lerdorf, Andi Gutmans and Zeev Suraski Why They Matter: Lerdorf, Gutmans and Suraski are all responsible for what we know as PHP, the scripting language that remains one of the most used web languages for creating dynamic web pages. Rasmus Lerdorf first created PHP in 1995 and he was the main developer of the project for its first two versions. In 1997, Gutmans and Suraski decided to extend PHP, rewriting the parser and creating what became known as PHP 3. The two then went on to rewrite the core of PHP, naming it the Zend Engine, and using that to power PHP 4. Gutmans and Suraski further went on to found Zend Technologies, which continues to do much of the development of PHP. While Larry Wall's Perl was one of the first general-purpose scripting languages to really take off on the web, the ease of use and embeddability of PHP is what has made it take over as the de facto "P" in the LAMP stack (LAMP being a default set of components on which many web applications are based). 7. Brad Fitzpatrick Why He Matters: Creator of LiveJournal, in many ways the proto-social network, the original author of memcached and the original authentication protocol for OpenID. Fitzpatrick created LiveJournal in college, as a way for him and his friends to keep one another up to date with what they were doing. It evolved into a larger blogging community and implemented many features, like Friends Lists, the ability to create user polls, support for blog clients, the ability to send text messages to users, the ability to post by phone, post by e-mail, create group blogs and more that have become a standard part of communities like Facebook, Tumblr, MySpace, WordPress.com and Posterous today. As LiveJournal grew and started to use more and more resources, Fitzpatrick started the memcached project as a way to speed up dynamic web applications and alleviate database load. It does this by pooling together the free memory from across your web servers and then allocating it out as needed. This makes it easy for large projects to scale. Memcached is in use by Wikipedia, Flickr, Facebook, WordPress, Twitter, Craigslist and more.
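The pattern Fitzpatrick built is easy to sketch. The snippet below is a generic cache-aside example using the pymemcache client, not LiveJournal's code; the server addresses and the database lookup are stand-ins, and in practice a serializer would be configured so values round-trip as the same type.

```python
# Generic cache-aside sketch with pymemcache (illustrative, not LiveJournal's code).
# Several memcached nodes are pooled and consulted before hitting the database.
from pymemcache.client.hash import HashClient

cache = HashClient([("10.0.0.1", 11211), ("10.0.0.2", 11211)])  # hypothetical nodes

def expensive_database_query(user_id):
    # stand-in for the real lookup against the relational database
    return f"profile-record-for-{user_id}"

def fetch_profile(user_id):
    key = f"profile:{user_id}"
    cached = cache.get(key)
    if cached is not None:
        return cached                         # served from pooled RAM, database untouched
    profile = expensive_database_query(user_id)
    cache.set(key, profile, expire=300)       # keep it hot for five minutes
    return profile
```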
8. Brendan Eich Why He Matters: He created JavaScript and now serves as the CTO of the Mozilla Corporation. Eich created JavaScript while at Netscape, first under the name Mocha, then under the name LiveScript, and finally as JavaScript. JavaScript made its official debut in December of 1995. JavaScript quickly became one of the most popular web programming languages, even if its use cases in the early days were often visual abominations. However, as time has progressed, the advent of JavaScript libraries and frameworks, coupled with the power of Ajax, has made JavaScript an integral part of the standards-based web. 9. John Resig Why He Matters: John Resig is the creator and lead developer of jQuery, the most popular JavaScript library on the web. While other JavaScript libraries, such as Sam Stephenson's Prototype, preceded jQuery, jQuery's goal of being compatible across web browsers is what really sets it apart. In the last two years especially, the momentum around jQuery has exploded and it is now reportedly in use by 31% of the top 10,000 most visited websites. Its extensibility and the jQuery UI toolkit have also made it a popular adoption target in enterprise application development. Any JavaScript library that can make the leap from web developers to enterprise app builders is the real deal. JavaScript continues to be one of the big forces within the standards-based web and jQuery is helping to lead the charge. 10. Jonathan Gay Why He Matters: He co-founded FutureWave Software and for more than a decade was the main programmer and visionary behind Flash. While not everyone is a fan of Adobe Flash, it's important to remember how influential and instrumental the technology has been over the course of the last 15 years. Gay wrote a vector drawing program called SmartSketch back in 1993 for the PenPoint operating system, and after PenPoint was discontinued, the technology in SmartSketch was repurposed as a tool that could create animation that could be played back on web pages. This product, FutureSplash Animator, was acquired by Macromedia in 1996 and renamed Flash. After the acquisition, Gay became Vice President of Engineering at Macromedia and he led the Flash engineering team. Over the years, his team implemented new elements to Flash, like ActionScript. However, perhaps Gay's pinnacle achievement with Flash was in the team he spearheaded to create what was then known as the Flash Communication Server (it's now the Flash Media Server), which let Flash Player use the RTMP protocol to stream audio and video over the web. In essence, this technology is what allowed YouTube to be, well, YouTube. More development and design resources from Mashable: - Top 10 Resources for Design Inspiration - HOW TO: Get Up-to-Date on WordPress 3.0 - 7 Hackathons Around the World and the Web - 10 Web Design Bloggers You Should Follow - Top 10 Beautiful Minimalist Icon Sets [img credits: European Parliament, Marc Andreessen, Ilya Schurov, chrys/Sebastian Bergmann, crucially, jsconf, badubadu] Topics: brendan eich, Dev & Design, founding fathers, john resig, marc andreessen, rasmus lerdorf, Social Media, web development, World Wide Web
matienzo-org-2040 ---- Posts | Mark A. Matienzo IAH Forecast - Disquiet Junto Project 0476 Publish date: February 15, 2021 Tags: music black tent by Mark A. Matienzo An experiment with recording a new single using VCV Rack and REAPER based on a compositional prompt. I ended up recording two tracks. Perfecting a favorite: oatmeal chocolate chip cookies Publish date: November 29, 2020 Tags: recipes food by Mark A. Matienzo I have a horrible sweet tooth, and I absolutely love oatmeal chocolate chip cookies. I tend to bake as a means to cope with stress, and of course, more often than not, that means making these cookies. After making many iterations, I’ve settled upon this recipe as the ultimate version to which all compare. (Read more …) In Memoriam and Appreciation of Rob Casson (1974-2020) Publish date: October 1, 2020 Tags: code4lib Personal by Mark A. Matienzo The world lost one of its brightest and most charming lights earlier this week, Rob Casson. Many of us knew Rob through the Code4Lib community and conferences and his work at Miami University Libraries. We miss his generosity, patience, sense of humor, and genuine kindness. Those of us who got the chance to socialize with him also remember his passion for music, and some of us were even lucky to see live shows in the evenings between conference sessions and other social activities. On Sunday, October 4 at 1:30 PM Pacific/4:30 PM Eastern, those of us who knew him through Code4Lib and the world of libraries are encouraged to gather to share our memories of him and to appreciate his life and work. Please join me and my co-organizers, Mike Giarlo and Declan Fleming on Zoom (registration required). Robert Casson (robcaSSon), 30 Jan 1974 - 29 Sep 2020. Photo: Declan Fleming. (Read more …) First SOTA activation Publish date: September 29, 2020 Tags: ham radio by Mark A. Matienzo About a month ago, I got my ham radio license, and soon after I got pretty curious about Summits on the Air (SOTA), an award scheme focused on safe and low impact portable operation from mountaintops. While I like to hike, I’m arguably a pretty casual hiker, and living in California provides a surprising number of options within 45 minutes driving time for SOTA newbies. (Read more …) Optimizing friction Publish date: August 10, 2020 Tags: indieweb music plan9 food by Mark A. Matienzo Over and in response to the last few months, I’ve been reflecting about intentionality, and how I spend my time creating things. I have tried to improve the indiewebbiness of my site, and understanding what it means to “scratch my own itch”. This resonates particularly lately because it’s leading me to mull over which parts should be hard and easy. Unsurprisingly, much of that is personal preference, and figuring out how I want to optimize from the perspective of user experience. Friction in UX can be a powerful tool, part of what I’m trying to find is where I want to retain friction as it helps me remain intentional.
(Read more …) A Hugo shortcode for embedding Mirador Publish date: July 25, 2020 Tags: iiif hugo by Mark A. Matienzo I spent a little time over the last day or so trying to bodge together a shortcode for Hugo to embed an instance of Mirador. While it’s not quite as simple (or full-featured) as I’d like, it’s nonetheless a starting point. The shortcode generates a snippet of HTML that gets loaded into Hugo pages, but (unfortunately) most of the heavy lifting is done by a separate static page that gets included as an